mirror of https://github.com/grafana/grafana.git synced 2024-11-26 02:40:26 -06:00

2020-10-22 13:22:34 -07:00


+++
title = "Alerting"
type = "docs"
[menu.docs]
identifier = "alerting"
parent = "features"
weight = 100
+++

Alerts overview

Alerts allow you to identify problems in your system moments after they occur. By quickly identifying unintended changes in your system, you can minimize disruptions to your services.

Alerts consist of two parts:

Alert rules - When the alert is triggered. Alert rules are defined by one or more conditions that Grafana evaluates at regular intervals.
Notification channel - How the alert is delivered. When the conditions of an alert rule are met, Grafana notifies the channels configured for that alert.
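The relationship between the two parts can be pictured with a small model. This is a minimal sketch, not Grafana's implementation: `AlertRule`, `Condition`, and `Channel` are hypothetical names introduced for illustration, and the sketch assumes a rule fires only when all of its conditions hold.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical types for illustration only; these are not Grafana APIs.
Condition = Callable[[Dict[str, float]], bool]
Channel = Callable[[str], None]


@dataclass
class AlertRule:
    name: str
    conditions: List[Condition]
    channels: List[Channel] = field(default_factory=list)

    def evaluate(self, metrics: Dict[str, float]) -> bool:
        """Return True and notify every channel when all conditions are met."""
        # Assumption: conditions are combined with AND.
        firing = all(cond(metrics) for cond in self.conditions)
        if firing:
            for notify in self.channels:
                notify(f"[Alerting] {self.name}")
        return firing


# A list stands in for a real notification channel such as email or chat.
sent: List[str] = []
rule = AlertRule(
    name="High CPU",
    conditions=[lambda m: m["cpu_percent"] > 90],
    channels=[sent.append],
)

rule.evaluate({"cpu_percent": 50.0})  # conditions not met, nothing is sent
rule.evaluate({"cpu_percent": 95.0})  # conditions met, the channel is notified
```

In this model the rule decides when to alert and the channel decides how the message is delivered, so the same channel could be attached to several rules.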

Currently only the graph panel visualization supports alerts.

Alert tasks

You can perform the following tasks for alerts:

[Add or edit an alert notification channel]({{< relref "notifications.md" >}})
[Create an alert rule]({{< relref "create-alerts.md" >}})
[View existing alert rules and their current state]({{< relref "view-alerts.md" >}})
[Test alert rules and troubleshoot]({{< relref "troubleshoot-alerts.md" >}})

Clustering

Currently, alerting supports a limited form of high availability. Since Grafana v4.2.0, alert notifications are deduplicated when running multiple servers. This means every server executes every alert, but the deduplication logic prevents duplicate alert notifications from being sent. Proper load balancing of alerts will be introduced in the future.
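One way to picture this behavior is a shared record of the last notified state per alert, which each server consults before sending. The following is a minimal sketch under that assumption and does not describe Grafana's actual deduplication logic: `NotificationState`, `claim`, and the server names are hypothetical, and an in-memory dictionary stands in for whatever shared storage a real deployment would use.

```python
import threading
from typing import Dict, List, Tuple


class NotificationState:
    """Hypothetical shared store that remembers the last notified state per alert."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._last_sent: Dict[int, str] = {}

    def claim(self, alert_id: int, state: str) -> bool:
        """Return True only for the first caller that reports this state."""
        with self._lock:
            if self._last_sent.get(alert_id) == state:
                return False  # another server already notified for this state
            self._last_sent[alert_id] = state
            return True


def run_server(
    server: str,
    store: NotificationState,
    alert_id: int,
    state: str,
    outbox: List[Tuple[str, int, str]],
) -> None:
    # Every server executes the alert, but only the server that
    # claims the state change sends the notification.
    if store.claim(alert_id, state):
        outbox.append((server, alert_id, state))


store = NotificationState()
outbox: List[Tuple[str, int, str]] = []
servers = ("grafana-a", "grafana-b", "grafana-c")

# All three servers see alert 1 enter the "alerting" state.
for server in servers:
    run_server(server, store, alert_id=1, state="alerting", outbox=outbox)

# Later, all three servers see the same alert recover.
for server in servers:
    run_server(server, store, alert_id=1, state="ok", outbox=outbox)
```

The work of evaluating the alert is still repeated on every server in this sketch, which matches the limitation described above: notifications are deduplicated, but the load is not balanced.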

Notifications

You can also set alert rule notifications along with a detailed message about the alert rule. The message can contain anything: information about how you might solve the issue, a link to a runbook, and so on.

The actual notifications are configured and shared between multiple alerts.

Alert execution

Alert rules are evaluated in the Grafana backend by a scheduler and query execution engine that is part of core Grafana. Only some data sources are currently supported: Graphite, Prometheus, InfluxDB, Elasticsearch, Google Cloud Monitoring, CloudWatch, Azure Monitor, MySQL, PostgreSQL, MSSQL, OpenTSDB, Oracle, and Azure Data Explorer.

Metrics from the alert engine

The alert engine publishes some internal metrics about itself. You can read more about how Grafana publishes [internal metrics]({{< relref "../administration/metrics/" >}}).

| Description | Type | Metric name |
| --- | --- | --- |
| Total number of alerts | counter | `alerting.active_alerts` |
| Alert execution result | counter | `alerting.result` |
| Notifications sent counter | counter | `alerting.notifications_sent` |
| Alert execution timer | timer | `alerting.execution_time` |
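The difference between the counter and timer types can be illustrated with a small in-process model. This is a minimal sketch and not how Grafana records or exposes its metrics: the `Counter` and `Timer` classes and the `execute_alert` helper are hypothetical, and only the two metric names in the comments come from the table above.

```python
import time
from typing import Callable, List


class Counter:
    """Hypothetical counter: a value that only ever increases."""

    def __init__(self) -> None:
        self.value = 0

    def inc(self, amount: int = 1) -> None:
        self.value += amount


class Timer:
    """Hypothetical timer: records one duration sample per observation."""

    def __init__(self) -> None:
        self.samples: List[float] = []

    def observe(self, seconds: float) -> None:
        self.samples.append(seconds)


notifications_sent = Counter()  # models `alerting.notifications_sent`
execution_time = Timer()        # models `alerting.execution_time`


def execute_alert(evaluate: Callable[[], bool]) -> bool:
    """Run one alert evaluation, timing it and counting any notification."""
    start = time.perf_counter()
    firing = evaluate()
    execution_time.observe(time.perf_counter() - start)
    if firing:
        notifications_sent.inc()  # assumption: one notification per firing alert
    return firing


execute_alert(lambda: True)   # fires: timed and counted
execute_alert(lambda: False)  # does not fire: timed only
```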
