mirror of https://github.com/grafana/grafana.git synced 2024-12-02 13:39:19 -06:00

achatterjee-grafana adbe383456

* Restructured and fixed broken relrefs.

* Added relrefs.

* created shared alerts section.

2021-05-25 10:51:28 -04:00

3.5 KiB

Raw Blame History

+++ title = "Alerts" aliases = ["/docs/grafana/latest/alerting/rules/", "/docs/grafana/latest/alerting/metrics/"] weight = 110 +++

Alerts overview

Alerts allow you to know about problems in your systems moments after they occur. Robust and actionable alerts help you identify and resolve issues quickly, minimizing disruption to your services.

Note: In Grafana 8.0, you can optionally choose the beta version of our new [unified altering]({{< relref "./unified-alerting/_index.md" >}}) system. Out of the box, Grafana still supports the [old alters]({{< relref "./old-alerting/_index.md" >}}) .

Alerts have four main components:

Alert rule - One or more conditions, the frequency of evaluation, and the (optional) duration that a condition must be met before notifying.
Contact point - A channel for sending notifications when the conditions of an alert rule are met. (Available in unified alerting only.)
Notification policy - A set of matching and grouping criteria used to determine where, and how frequently, to send notifications.
Silences - Date and matching criteria used to silence notifications. (Available in unified alerting only.)

Alert tasks

You can perform the following tasks for alerts:

[Create an alert rule]({{< relref "./old-alerting/create-alerts.md" >}})
[View existing alert rules and their current state]({{< relref "./old-alerting/view-alerts.md" >}})
[Test alert rules and troubleshoot]({{< relref "./old-alerting/troubleshoot-alerts.md" >}})
[Add or edit an alert contact point]({{< relref "./old-alerting/notifications.md" >}})

Clustering

Currently alerting supports a limited form of high availability. Since v4.2.0 of Grafana, alert notifications are deduped when running multiple servers. This means all alerts are executed on every server but no duplicate alert notifications are sent due to the deduping logic. Proper load balancing of alerts will be introduced in the future.

Alert evaluation

Grafana managed alerts are evaluated by the Grafana backend. Rule evaluations are scheduled, according to the alert rule configuration, and queries are evaluated by an engine that is part of core Grafana.

Alert rules can only query backend data sources with alerting enabled:

builtin or developed and maintained by grafana: Graphite, Prometheus, Loki, InfluxDB, Elasticsearch, Google Cloud Monitoring, Cloudwatch, Azure Monitor, MySQL, PostgreSQL, MSSQL, OpenTSDB, Oracle, and Azure Data Explorer
any community backend data sources with alerting enabled (backend and alerting properties are set in the [plugin.json]({{< relref "../developers/plugins/metadata.md" >}}))

Metrics from the alert engine

The alert engine publishes some internal metrics about itself. You can read more about how Grafana publishes [internal metrics]({{< relref "../administration/view-server/internal-metrics.md" >}}).

Metric Name	Type	Description
`alerting.alerts`	gauge	How many alerts by state
`alerting.request_duration_seconds`	histogram	Histogram of requests to the Alerting API
`alerting.active_configurations`	gauge	The number of active, non default alertmanager configurations for grafana managed alerts
`alerting.rule_evaluations_total`	counter	The total number of rule evaluations
`alerting.rule_evaluation_failures_total`	counter	The total number of rule evaluation failures
`alerting.rule_evaluation_duration_seconds`	summary	The duration for a rule to execute
`alerting.rule_group_rules`	gauge	The number of rules

3.5 KiB Raw Blame History