Health Checks
Health checks emit 2 events
check.passedcheck.failed
notification.yamlapiVersion: mission-control.flanksource.com/v1
kind: Notification
metadata:
name: api-http-fail-alert
namespace: default
spec:
events:
- check.failed
filter: check.type == 'http'
title: API HTTP Check {{.check.name}} failing
body: |
## Check Failed
Error: {{.status.error}}
Failed at {{.status.created_at}}
to:
email: alerts@acme.com

Default Templates
check.passed
Title
{{ if ne channel "slack"}}Check {{.check.name}} has passed{{end}}
Template
Default content is generated from the notification payload in modules/mission-control/notification/message.go.
check.failed
Title
{{ if ne channel "slack"}}Check {{.check.name}} has failed{{end}}
Template
Default content is generated from the notification payload in modules/mission-control/notification/message.go.
Template Variables
| Field | Description | Scheme |
|---|---|---|
agent | Details of the agent | |
canary | canary | |
channel | The notification channel, e.g. |
|
check | Check | |
groupedResources | A list of grouped resource names. Only available when notifications are grouped using |
|
permalink | A link to the resource in Mission Control |
|
silenceURL | A link to silence this notification |
|
status | check status |
Agent
| Field | Description | Scheme |
|---|---|---|
description | Short description of the agent |
|
id | The id of the agent |
|
name | The name of the agent |
|
Canary
| Field | Description | Scheme |
|---|---|---|
created_at | The created at of the canary |
|
deleted_at | The deleted at of the canary |
|
id | The id of the canary |
|
labels | The labels of the canary |
|
name | The name of the canary |
|
namespace | The namespace of the canary |
|
source | The source of the canary |
|
updated_at | The updated at of the canary |
|
Check
| Field | Description | Scheme |
|---|---|---|
created_at | The created at of the check |
|
deleted_at | The deleted at of the check |
|
description | The description of the check |
|
id | The id of the check |
|
labels | The labels of the check |
|
last_runtime | The last runtime of the check |
|
last_transition_time | The last transition time of the check |
|
latency | The past 1 hour latency summary | |
name | The name of the check |
|
next_runtime | The next runtime of the check |
|
severity | The severity of the check |
|
status | Check status details |
|
transformed | Whether the check has been transformed |
|
type | The type of the check |
|
updated_at | The updated at of the check |
|
uptime | The past 1 hour uptime summary |
CheckStatus
| Field | Description | Scheme |
|---|---|---|
check_id | The id of the check associated with this status |
|
created_at | The created at of the check |
|
duration | The duration of the check |
|
error | The error of the check in case of failure |
|
invalid | Whether the check errored out |
|
message | The success message of the check |
|
status | The status of the check |
|
time | The time of the check |
|
Uptime
| Field | Description | Scheme |
|---|---|---|
failed | The number of checks that failed |
|
last_fail | The last time a check failed |
|
last_pass | The last time a check passed |
|
p100 | The percentage of checks that passed |
|
passed | The number of checks that passed |
|
Latency
| Field | Description | Scheme |
|---|---|---|
p95 | The latency of the check |
|
p97 | The latency of the check |
|
p99 | The latency of the check |
|
rolling1h | The latency of the check |
|