Overall
—
Healthy
—
Degraded
—
Down
—
Last sweep
—
Dependencies
Probed every 5 min; this view refreshes every 30s.

| Service | Tier | Status | Latency | Message | Breaks if down | Checked | |
|---|---|---|---|---|---|---|---|
| Loading… | |||||||
Probes run on a 5-minute cadence in the Python backend. State transitions (healthy → down, degraded → down, etc.) emit org_signals to the existing bus, so the SRE agent picks up vendor outages without a separate integration. A single failed probe holds the status at "degraded"; two consecutive failures are required to flip it to "down", so transient blips don't page.
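
The two-failure rule described above could be sketched roughly as follows. This is a minimal illustration, not the backend's actual code: `DependencyState`, `record_probe`, and the `emit` callback are hypothetical names, and the callback stands in for whatever publishes org_signals to the bus. The sketch also assumes every status change emits a signal and that a single successful probe returns a service to "healthy"; the real backend may handle recovery differently.

```python
from dataclasses import dataclass
from typing import Callable

HEALTHY, DEGRADED, DOWN = "healthy", "degraded", "down"
FAILURES_FOR_DOWN = 2  # two consecutive failed probes flip a service to "down"


@dataclass
class DependencyState:
    """Tracks one dependency's status across probe sweeps (hypothetical type)."""

    service: str
    status: str = HEALTHY
    consecutive_failures: int = 0

    def record_probe(self, ok: bool, emit: Callable[[str, str, str], None]) -> str:
        """Fold one probe result into the status; call emit() only on a change."""
        if ok:
            # Assumption: one successful probe is enough to recover.
            self.consecutive_failures = 0
            new_status = HEALTHY
        else:
            self.consecutive_failures += 1
            if self.consecutive_failures >= FAILURES_FOR_DOWN:
                new_status = DOWN
            else:
                new_status = DEGRADED  # a single failure only degrades

        previous = self.status
        self.status = new_status
        if new_status != previous:
            # Stand-in for publishing an org_signal to the existing bus.
            emit(self.service, previous, new_status)
        return new_status


if __name__ == "__main__":
    state = DependencyState("example-vendor")  # placeholder service name
    for ok in (True, False, False, False, True):
        state.record_probe(ok, lambda svc, old, new: print(f"{svc}: {old} -> {new}"))
```

Keeping the failure counter alongside the status means a lone failed probe can only ever produce "degraded", and a sustained outage produces one transition to "down" rather than a signal on every sweep.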