Incidents

Track outages, communicate with your users, and resolve issues efficiently.

Overview

Incidents represent periods of degraded service or downtime. They can be created automatically by monitors when downtime is detected, or manually by your team for planned communications.

Each incident is associated with one or more components and progresses through a series of status updates until resolved.

Incident Statuses

Incidents progress through four statuses:

  1. Investigating — the team is aware and looking into the issue
  2. Identified — the root cause has been found
  3. Monitoring — a fix has been applied and the team is watching for recurrence
  4. Resolved — the incident is over and services are restored

Impact Levels

Each incident has an impact level that determines its severity on your status page:

  • None — informational, no impact on services
  • Minor — small impact, most users unaffected
  • Major — significant impact on some services
  • Critical — widespread impact across services

Creating an Incident Manually

  1. Go to Incidents in your dashboard
  2. Click Create Incident
  3. Enter a title describing the issue
  4. Select the affected component(s)
  5. Choose the impact level and initial status
  6. Add an update message explaining the situation
  7. Subscribers will be notified immediately

Posting Updates

As you investigate and resolve an incident, post updates to keep your users informed. Each update includes a status change and a message. Subscribers receive email notifications for every update.

Good incident updates are concise, honest, and include an estimated time to resolution when possible.