Zero Signup ToolsFree browser tools

Developer Tools

Error Budget Calculator

Compute SLO error budget remaining, burn rate, and time-to-exhaustion for event-based or time-based SLOs. Includes the SRE multi-window burn rate alert table.

SLO type

Pick an SLO style

Most teams pick one or the other. Event-based SLOs count successful requests; time-based SLOs count uptime.

Examples

Load a sample SLO to see how the budget math reacts, then edit any field with your own numbers.

Event-based SLO

Your service-level numbers

Event-based SLOs treat each request, query, or job as a good or bad event. Paste the totals you see in your dashboard.

Common targets: 99% (web internal), 99.9% (consumer API), 99.95% (cloud SLA), 99.99% (four nines).

Window length

For a 30-day window, day 15 is 50%. For a 7-day window, three full days is about 43%. Enter 100 if the window has just closed.

Multi-window burn rate alerts

When to page on a burn rate

The Google SRE workbook recommends pairing a long detection window with a short verification window so spikes do not page on-call and slow burns still wake someone. Thresholds below are calibrated to a 30-day budget.

SeverityLong windowShort windowBurn rateBudget consumed
Page on-call1 hour5 minutes14.4x2% of 30d budget
Page on-call6 hours30 minutes6x5% of 30d budget
Ticket3 days6 hours1x10% of 30d budget

Rows highlighted in red would fire at your current burn rate. A 14.4x burn rate means a 1-hour outage would consume 2% of a 30-day budget; a 6x rate would consume 5% in 6 hours.

Method

Error budget

For an SLO target T (as a fraction, so 99.9% becomes 0.999), the budget is 1 minus T of the window: events allowed to fail, or seconds allowed to be down.

Burn rate

Observed error fraction divided by the allowed error fraction. 1x means the service will spend exactly the whole budget by end of window. 2x means it will spend the whole budget in half the window.

Time to exhaustion

Remaining budget divided by the current consumption rate. Assumes the rate stays constant. Useful for projecting when a freeze conversation is due.

How to use

  1. Pick the SLO style: event-based (success ratio) when each request, query, or job counts as a good or bad event, or time-based (availability) when uptime in seconds is the metric.
  2. Enter the SLO target as a percentage (99.9, 99.95, 99.99). Pick a window length: 24 hours, 7 days, 28 days, 30 days, or 90 days.
  3. Type the share of the window already elapsed (50 for half a 30-day window, 100 for a window that just closed) so the budget can be denominated against the full window.
  4. For event-based: paste the total events and bad events observed so far. For time-based: paste the observed downtime and pick a unit (seconds, minutes, hours, or days).
  5. Read the burn rate, the share of budget consumed, the time to exhaustion at the current rate, and the multi-window alert table. Rows that would fire at your burn rate are highlighted in red.
  6. Click Copy report to grab a plain-text summary for an incident review, an SLO policy doc, a release readiness checklist, or a status page footnote.

About this tool

Error Budget Calculator turns an SLO target, an observation window, and the numbers you already see in your dashboard into the four things SRE conversations actually need: the window's total error budget, how much of it has been spent, the current burn rate, and a projection of when the budget runs out. Two SLO styles are supported. Event-based SLOs measure the share of successful events; paste the total event count, the bad event count, and the share of the window that has elapsed, and the tool reports the event budget for the full window, the events already burned, the share consumed, and the time to exhaustion at the current rate. Time-based SLOs measure availability; paste an observed downtime (with a unit picker for seconds, minutes, hours, or days) and the tool reports the downtime budget for the window, the share burned, the projected exhaustion time, and the current achieved availability. The window picker covers the spans teams actually report on: 24 hours for intra-day alerts, 7 days for a weekly rolling SLO, 28 days (the Google SRE workbook default), 30 days for AWS billing-month basis, and 90 days for a quarter. A multi-window burn rate alert table renders the Google SRE workbook recommended thresholds (1 hour + 5 minute window at 14.4x, 6 hours + 30 minute window at 6x, 3 days + 6 hour window at 1x) and highlights in red any row that would be tripped at the current burn rate, so a team can decide which thresholds page on-call and which only file a ticket. A live progress bar shows the share of budget burned, color-coded against the SLO target. The math follows the standard error-budget definitions: budget equals (1 minus target) times window, burn rate equals observed error fraction divided by allowed error fraction, and exhaustion time equals remaining budget divided by the current consumption rate. A copy-ready report summarises every number for pasting into an incident review, an error budget policy doc, a postmortem, a quarterly SLO review, or a release readiness checklist. Useful for SRE teams running error budget policies, platform engineers writing freeze rules, on-call engineers deciding whether a burn rate is page-worthy, product managers tracking against an SLO commitment, and anyone managing a service against a measurable reliability target. All math runs in your browser, so traffic counts, error counts, and downtime values never leave your device.

Free to use. Works in your browser. No signup, no login.

Related tools

You may also like

All tools
All toolsDeveloper Tools