grafana

mirror of https://github.com/grafana/grafana.git synced 2024-11-25 18:30:41 -06:00

Author	SHA1	Message	Date
Ieva	58efa49933	Chore: remove `IsDisabled` method for access control (#74340 ) remove IsDisabled method for access control, clean up tests	2023-09-05 11:04:39 +01:00
Yuri Tseretyan	baea7a7556	Alerting: Fix provisioning of contact points when contact point is renamed (#74238 ) * add test that demonstrates the bug * fix renaming provisioning contact points when it is the last in the group	2023-09-04 13:30:15 -04:00
Serge Zaitsev	58f6648505	Chore: capitalise messages for alerting (#74335 )	2023-09-04 18:46:34 +02:00
github-actions[bot]	eb93ebe0d0	Alerting: Update Swagger spec (#74300 ) chore: update alerting swagger spec Co-authored-by: rwwiv <rwwiv@users.noreply.github.com>	2023-09-04 16:17:49 +00:00
George Robinson	439270f6cb	Rename Google Hangouts to Google Chat (#74162 ) * Rename Google Hangouts to Google Chat * Fix prettier	2023-08-31 16:09:22 +03:00
Ryan McKinley	025b2f3011	Chore: use any rather than interface{} (#74066 )	2023-08-30 18:46:47 +03:00
linoman	1b8e9b51b2	Replace signed in user for identity.requester (#74048 ) * Make identity.Requester available at Context * Clean pkg/services/guardian/guardian.go * Clean guardian provider and guardian AC * Clean pkg/api/team.go * Clean ctxhandler, datasources, plugin and live * Clean dashboards and guardian * Implement NewUserDisplayDTOFromRequester * Change status code numbers for http constants * Upgrade signature of ngalert services * log parsing errors instead of throwing error	2023-08-30 16:51:18 +02:00
github-actions[bot]	42efd13062	Alerting: Update Swagger spec (#73877 ) chore: update alerting swagger spec Co-authored-by: rwwiv <rwwiv@users.noreply.github.com>	2023-08-30 14:00:13 +00:00
Alexander Weaver	dfba94e052	Alerting: Limit redis pool size to 5 and make configurable (#74057 ) * Limit redis pool size to 5 and expose it in config ini * Coerce negative pool sizes to the default	2023-08-29 14:59:12 -05:00
Carl Bergquist	10a82e30ba	Alerting: add route owner middleware (#73869 ) alerting: add route owner middleware Signed-off-by: bergquist <carl.bergquist@gmail.com>	2023-08-29 12:43:33 +02:00
Jo	a307582212	Revert "Replace signed in user for identity.requester (#73750 )" (#73962 ) This reverts commit `9b9c9e83dc`.	2023-08-28 21:05:59 +02:00
linoman	9b9c9e83dc	Replace signed in user for identity.requester (#73750 ) * Make identity.Requester available at Context * Clean pkg/services/guardian/guardian.go * Clean guardian provider and guardian AC * Clean pkg/api/team.go * Clean ctxhandler, datasources, plugin and live * Question: what to do with the UserDisplayDTO? * Clean dashboards and guardian * Remove identity.Requester from ReqContext * Implement NewUserDisplayDTOFromRequester * Fix tests * Change status code numbers for http constants * Upgrade signature of ngalert services * log parsing errors instead of throwing error * Fix tests and add logs * linting	2023-08-28 12:04:36 -05:00
Torkel Ödegaard	3ee26df41e	PublicDashboards: Variables refactor (#73476 ) Co-authored-by: Juan Cabanas <juan.cabanas@grafana.com> Co-authored-by: Ezequiel Victorero <ezequiel.victorero@grafana.com> Co-authored-by: Ryan McKinley <ryantxu@gmail.com>	2023-08-25 13:56:02 -05:00
George Robinson	bbef000202	Alerting: Add contact point for Grafana OnCall (#73733 ) Add contact point for Grafana OnCall	2023-08-24 10:45:12 +02:00
github-actions[bot]	69267cd28b	Alerting: Update Swagger spec (#72568 ) chore: update alerting swagger spec Co-authored-by: rwwiv <rwwiv@users.noreply.github.com>	2023-08-22 14:35:48 -04:00
Misi	d7166f5f96	RBAC: Remove unused scope from alert.instances:read fixed role (#73268 ) Fix alert.instances:read scope for fixed role	2023-08-16 09:55:49 +02:00
Yuri Tseretyan	938e26b59f	Alerting: Add new metrics and tracings to state manager and scheduler (#71398 ) * add metrics and tracing to state manager * propagate tracer to state manager * add scheduler metrics * fix backtesting * add test for state metrics * remove StateUpdateCount * update docs * metrics can be null * add tracer to new tests	2023-08-16 09:04:18 +02:00
Yuri Tseretyan	90e3f516ff	Alerting: Update Discord settings to treat 'url' as a secure setting (#69588 ) * make discord url secure * support migrating unsecure settings to secure settings * Update public/app/features/alerting/unified/utils/receiver-form.ts Co-authored-by: William Wernert <william.wernert@grafana.com> --------- Co-authored-by: Gilles De Mey <gilles.de.mey@gmail.com> Co-authored-by: William Wernert <william.wernert@grafana.com>	2023-08-16 09:03:56 +02:00
Yuri Tseretyan	0717ec11d6	Alerting: Update state manager to change all current states in the case when Error\NoData is executed as Ok\Nomal (#68142 )	2023-08-15 10:27:15 -04:00
Jean-Philippe Quéméner	2266e09f94	Alerting: optimize rules with multiple loki range queries (#73103 )	2023-08-09 19:00:51 +02:00
Yuri Tseretyan	69c8200fc9	Alerting: Add more tests for state manager ProcessEvalResults (#73019 ) Co-authored-by: Matthew Jacobson <matthew.jacobson@grafana.com>	2023-08-09 12:21:12 -04:00
Jo	97ba611e4c	Chore: Fix ngalert Evaluate signature change (#73084 ) fix ngalert Evaluate sig change	2023-08-09 11:27:14 +02:00
Yuri Tseretyan	6b4a9d73d7	Alerting: Export contact points to check access control action instead legacy role (#71990 ) * introduce a new action "alert.provisioning.secrets:read" and role "fixed:alerting.provisioning.secrets:reader" * update alerting API authorization layer to let the user read provisioning with the new action * let new action use decrypt flag * add action and role to docs	2023-08-08 19:29:34 +03:00
Jean-Philippe Quéméner	2c6cf66741	Alerting: Optimize external Loki queries (#73014 )	2023-08-08 15:13:41 +02:00
Yuri Tseretyan	0053b07885	Alerting: Refactor of state manager tests (#72849 ) * calculate cacheID instead of literals * use mocked clocks * advance clocks with the eval results * use clearer timestamp aliases * make expected state labels be more clear to read Co-authored-by: Matthew Jacobson <matthew.jacobson@grafana.com>	2023-08-04 13:39:49 -04:00
Serge Zaitsev	7767ab6f43	Chore: Add folder data migration, fix unique index (#72602 ) * add folder data migration, fix unique index * fix unique index * pass a fake store in tests * pass store into other providers in tests * and now with alerting!	2023-08-01 09:36:37 +02:00
Yuri Tseretyan	c7598cc6fb	Alerting: Add ability to control scheduler tick interval via config (#71980 ) * add ability to control scheduler interval via config * add feature flag `configurableSchedulerTick`	2023-07-26 12:44:12 -04:00
Yuri Tseretyan	5ba164d92b	Alerting: Exclude expression refIDs from NoData state (#72219 )	2023-07-26 11:42:04 -04:00
Yuri Tseretyan	78fc3bcdf4	Alerting: Fix state manager to not keep datasource_uid and ref_id labels in state after Error (#72216 )	2023-07-26 11:41:46 -04:00
Matthew Jacobson	d31d175109	Alerting: Fix contact point testing with secure settings (#72235 ) * Alerting: Fix contact point testing with secure settings Fixes double encryption of secure settings during contact point testing and removes code duplication that helped cause the drift between alertmanager and test endpoint. Also adds integration tests to cover the regression. Note: provisioningStore is created to remove cycle and the unnecessary dependency.	2023-07-25 10:04:27 -04:00
Arati R	20ffbbc41e	NestedFolders: Add library panels counting and deletion to folder registry (#69149 ) * Expose library element service's folder service * Register library panels, add count implementation * Expand folder counts test * Update registry deletion method interface * Allow getting library elements from any folder * Add test for library panel deletion * Add test for library panel counting	2023-07-25 13:05:53 +02:00
github-actions[bot]	24872370b5	Alerting: Update Swagger spec (#72177 ) chore: update alerting swagger spec Co-authored-by: rwwiv <rwwiv@users.noreply.github.com>	2023-07-25 11:34:00 +02:00
Alexander Weaver	8c8b3ecb5b	Alerting: Add dashboardUID and panelID query parameters for loki state history (#72119 ) * read query parameters * Generate loki query from params	2023-07-24 23:46:46 -05:00
Matthew Jacobson	cfb1656968	Alerting: Add notification policy provisioning file export (#70009 ) * Alerting: Add notification policy provisioning file export - Add provisioning API endpoint for exporting notification policies. - Add option in notification policy view ellipsis dropdown for exporting. - Update various provisioning documentation.	2023-07-24 17:56:53 -04:00
Kyle Brandt	1df4d332c9	SSE: Use errutil to show better error messages in prod (#71658 ) - include public message - propagate data source query errors so they are shown as well to which fixes #70026	2023-07-21 06:38:29 -04:00
Alexander Weaver	ff48a145cc	Alerting: Add exported getters for PanelKey fields (#72064 ) Add getters	2023-07-20 15:47:20 -05:00
Yuri Tseretyan	cbbbe2e6f6	SSE: DSNode to update result with names to make each value identifiable by labels (only Graphite and TestData) (#71246 ) * introduce a function checkIfSeriesNeedToBeFixed to scan all value fields in the response and provide a function that updates Series so they can be uniquely identifiable. Only Graphite and TestData are checked. * update `convertDataFramesToResults` to run this function and provide it to WideToMany * update WideToMany to run the fix function if it is not nil	2023-07-20 14:44:12 -04:00
Matthew Jacobson	13121d3234	Alerting: Add contact point provisioning file export (#71692 ) * Add contact point provisioning file export apis * Regenerate api * docs * frontend * add mock to tests * Fix missing row-level export button on viewer role w/ prov. read * Address review comments --------- Co-authored-by: Gilles De Mey <gilles.de.mey@gmail.com>	2023-07-20 14:35:56 -04:00
George Robinson	8dd3eb856d	Alerting: Improve performance of matching captures (#71828 ) This commit updates eval.go to improve the performance of matching captures in the general case. In some cases we have reduced the runtime of the function from 10s of minutes to a couple 100ms. In the case where no capture matches the exact labels, we revert to the current subset/superset match, but with a reduced search space due to grouping captures.	2023-07-20 09:07:00 +01:00
George Robinson	f1af0502db	Alerting: Add tests for matching captures (#71928 ) This commit adds tests for matching captures, which we do not have at present.	2023-07-19 12:52:26 +01:00
George Robinson	89dcaaf049	Alerting: Sort NumberCaptureValues in EvaluationString (#71927 ) This commit changes extractEvalString to sort NumberCaptureValues in ascending order of Var before building the output string. This means that users will see EvaluationString in a consistent order, but also make it possible to assert its output in tests.	2023-07-19 12:09:21 +01:00
Alexander Weaver	d6db9a5b3c	Alerting: Add exported constructor for panelKey (#71872 ) Exported constructor for panelKey	2023-07-18 13:37:43 -05:00
Alexander Weaver	18b910e654	Alerting: Refactor annotation historian to isolate dashboard service dependency (#71689 ) * Refactor annotation historian to isolate dashboard service dependency * Export PanelKey * Don't export parsePanelKey * Remove commented out code	2023-07-18 08:18:55 -05:00
Will Browne	a8577c21ba	Plugins: Migrate PluginStore mock to pre-existing fakes package (#71664 ) * migrate to existing fakes package * fix imports	2023-07-17 10:21:44 +00:00
Yuri Tseretyan	541bfe636d	SSE: Support for ML query node (#69963 ) * introduce a new node-type ML and implement a command outlier that uses ML plugin as a source of data. * add feature flag mlExpressions that guards the feature	2023-07-13 20:37:50 +03:00
Yuri Tseretyan	64aa5465ac	Alerting: do not expand template for labels\annotations if value is not a template (#71492 )	2023-07-12 14:53:40 -04:00
Kyle Brandt	f6a28cadbc	Alerting: (Chore/Instrumentation) Add traceID to logs with contextual logger (#71289 ) Alerting: (Chore) Add traceID to logs with contextual logger	2023-07-11 10:59:52 +02:00
Matthew Jacobson	e3787de470	Alerting: Fix Alertmanager change detection for receivers with secure settings (#71307 ) * Alerting: Make ApplyAlertmanagerConfiguration only decrypt/encrypt new/changed secure settings Previously, ApplyAlertmanagerConfiguration would decrypt and re-encrypt all secure settings. However, this caused re-encrypted secure settings to be included in the raw configuration when applied to the embedded alertmanager, resulting in changes to the hash. Consequently, even if no actual modifications were made, saving any alertmanager configuration triggered an apply/restart and created a new historical entry in the database. To address the issue, this modifies ApplyAlertmanagerConfiguration, which is called by POST `api/alertmanager/grafana/config/api/v1/alerts`, to decrypt and re-encrypt only new and updated secure settings. Unchanged secure settings are loaded directly from the database without alteration. We determine whether secure settings have changed based on the following (already in-use) assumption: Only new or updated secure settings are provided via the POST `api/alertmanager/grafana/config/api/v1/alerts` request, while existing unchanged settings are omitted. * Ensure saving a grafana-managed contact point will only send new/changed secure settings Previously, when saving a grafana-managed contact point, empty string values were transmitted for all unset secure settings. This led to potential backend issues, as it assumed that only newly added or updated secure settings would be provided. To address this, we now exclude empty ('', null, undefined) secure settings, unless there was a pre-existing entry in secureFields for that specific setting. In essence, this means we only transmit an empty secure setting if a previously configured value was cleared. * Fix linting * refactor omitEmptyUnlessExisting * fixup --------- Co-authored-by: Gilles De Mey <gilles.de.mey@gmail.com>	2023-07-11 08:23:07 +02:00
Yuri Tseretyan	30fc075cd7	Alerting: Fix panic in backtesting API when the testing interval is not times of evaluation interval (#68727 ) * add test for the bug * update backtesting evaluators to accept a number of evaluations instead of `to` to have control over the number evaluations in one place	2023-07-06 11:21:03 -04:00
Yuri Tseretyan	ada325de2a	Alerting: Use unsafe.Slice for hashing a string during rule fingerprint calculation (#71000 )	2023-06-30 14:58:23 -04:00
Alexander Weaver	f94fb765b5	Alerting: Add limit query parameter to Loki-based ASH api, drop default limit from 5000 to 1000, extend visible time range for new ASH UI (#70769 ) * Add limit query parameter * Drop copy paste comment * Extend history query limit to 30 days and 250 entries * Fix history log entries ordering * Update no history message, add empty history test --------- Co-authored-by: Konrad Lalik <konrad.lalik@grafana.com>	2023-06-28 13:32:28 -05:00
George Robinson	594c851d4b	Alerting: Add duration to saving alert states done (#70844 )	2023-06-28 15:19:21 +01:00
Steve Simpson	21ac224c45	Alerting: Make ImageService public in NGAlert. (#70737 )	2023-06-27 13:11:22 +02:00
João Calisto	1d68f5ba77	Alerting: Fix HA alerting membership sync (#70607 ) * Alerting: Fix HA alerting membership sync * Added comment about filtering duplicates	2023-06-26 17:12:10 +01:00
William Wernert	4aa477f48f	Alerting: Move rule UID from Loki stream labels into log lines (#70637 ) Move rule uid into log line to reduce cardinality	2023-06-26 09:57:45 -04:00
George Robinson	7edbe72483	Alerting: Support concurrent queries for saving alert instances (#70525 ) This commit adds support for concurrent queries when saving alert instances to the database. This is an experimental feature in response to some customers experiencing delays between rule evaluation and sending alerts to Alertmanager, resulting in flapping. It is disabled by default.	2023-06-23 11:36:07 +01:00
guangwu	bbe4b0d3de	chore: remove refs to deprecated io/ioutil (#70300 )	2023-06-22 12:19:23 +02:00
Andreas Deininger	95b1f3c875	Fixing typos (#70487 )	2023-06-22 09:43:38 +01:00
Santiago	d3bb9fbbaf	Alerting: Use only token for images in notifications (#70196 ) * Alerting: Use only tokens for images in notifications * update tests * make linter and modfile validator happy	2023-06-21 20:53:45 -03:00
Santiago	ff9eff49bd	Alerting: Bump grafana/alerting and refactor the ImageStore/Provider to provide image URL/bytes (#70182 ) * implement alerting.images.Provider interface in our ImageStore * add URLExists() method to fakeConfigStore * make linter happy * update integration tests	2023-06-21 20:53:30 -03:00
Alexander Weaver	ce6f73bd32	Alerting: Add two missing tests which cover missing URLs for Loki state history (#70460 ) Add two missing tests which cover individual missing URLs	2023-06-21 12:58:37 -05:00
George Robinson	8a13ee3cd4	Alerting: Add debug logs when saving instances is finished (#70447 )	2023-06-21 14:19:04 +02:00
George Robinson	a1cb7319d5	Alerting: Update in app documentation for customizing message and subject (#70367 )	2023-06-20 12:20:01 +02:00
George Robinson	815e98ed95	Alerting: Add debug logs for EndsAt timestamp (#70336 ) This commit adds debug logs for previous_ends_at and next_ends_at to state.go to help us debug issues where alerts are resolved in Alertmanager due to expiration. This change is in response to a support escalation where this information was needed but unavailable.	2023-06-20 12:13:38 +03:00
SatVeer Singh	1bfa3a0f1e	Chore: Replace go-multierror with errors package (#66432 ) * code refactor and type assertions added to tests * no-lint rule added for specific line	2023-06-19 12:29:45 +03:00
Jean-Philippe Quéméner	934ba1aaa1	Alerting: Rewrite range to instant queries if possible (#69976 )	2023-06-16 19:55:49 +02:00
Yuri Tseretyan	842f33580e	SSE: Add functions that determine NodeType by UID and construct a data source struct from NodeType (#70106 ) * add NodeTypeFromDatasourceUID and DataSourceModelFromNodeType() * deprecate expr.DataSourceModel * replace usages of IsDataSource to NodeTypeFromDatasourceUID * replace usages of DataSourceModel to DataSourceModelFromNodeType()	2023-06-16 13:05:06 -04:00
Yuri Tseretyan	f1d47d18a8	Alerting: Sort RefIDs in error message returned by api.validateCondition (#70198 ) sort RefIDs in error message	2023-06-15 18:37:30 -03:00
Yuri Tseretyan	b963defa44	Alerting: update rules POST API to validate query and condition only for rules that changed. (#68667 ) * replace condition validation with just structural validation * validate conditions of only new and updated rules * add integration tests for rule update\delete API Co-authored-by: George Robinson <george.robinson@grafana.com>	2023-06-15 13:33:42 -04:00
Yuri Tseretyan	baffe83da6	Alerting: Improve performance of cache.getOrCreate (#63909 ) * move expansion of labels and annotations outside of mutex lock * propagate struct but not pointer	2023-06-15 09:37:47 -04:00
George Robinson	f085e99d3c	Alerting: Add matchers metrics to Alertmanager (#69855 )	2023-06-15 09:18:01 +01:00
Santiago	ff3e028a85	Alerting: Add image URI annotation only when there's an image (#69825 ) * Alerting: Add image URI annotation only when there's an image * fix function name (changed on main branch)	2023-06-09 10:59:24 -03:00
Matthew Jacobson	ba3994d338	Alerting: Repurpose rule testing endpoint to return potential alerts (#69755 ) * Alerting: Repurpose rule testing endpoint to return potential alerts This feature replaces the existing no-longer in-use grafana ruler testing API endpoint /api/v1/rule/test/grafana. The new endpoint returns a list of potential alerts created by the given alert rule, including built-in + interpolated labels and annotations. The key priority of this endpoint is that it is intended to be as true as possible to what would be generated by the ruler except that the resulting alerts are not filtered to only Resolved / Firing and ready to be sent. This means that the endpoint will, among other things: - Attach static annotations and labels from the rule configuration to the alert instances. - Attach dynamic annotations from the datasource to the alert instances. - Attach built-in labels and annotations created by the Grafana Ruler (such as alertname and grafana_folder) to the alert instances. - Interpolate templated annotations / labels and accept allowed template functions.	2023-06-08 18:59:54 -04:00
Matthew Jacobson	0c688190f7	Alerting: Fix unique violation when updating rule group with title chains/cycles (#67868 ) * Alerting: Fix unique violation when updating rule group with title chains/cycles The uniqueness constraint for titles within an org+folder is enforced on every update within a transaction instead of on commit (deferred constraint). This means that there could be a set of updates that will throw a unique constraint violation in an intermediate step even though the final state is valid. For example, a chain of updates RuleA -> RuleB -> RuleC could fail if not executed in the correct order, or a swap of titles RuleA <-> RuleB cannot be executed in any order without violating the constraint. The exact solution to this is complex and requires determining directed paths and cycles in the update graph, adding in temporary updates to break cycles, and then executing the updates in reverse topological order (see first commit in PR if curious). This is not implemented here. Instead, we choose a simpler solution that works in all cases but might perform more updates than necessary. This simpler solution makes a determination of whether an intermediate collision could occur and if so, adds a temporary title on all updated rules to break any cycles and remove the need for specific ordering. In addition, we make sure diffs are executed in the following order: DELETES, UPDATES, INSERTS.	2023-06-08 18:51:50 -04:00
Will Browne	624777258b	Plugins: Refactor creation of plugin context to dedicated service (#66451 ) * first pass * fix tests * return errs * change signature * tidy * delete unnecessary fields from test * tidy * fix tests * simplify * separate error check in API * apply nits	2023-06-08 13:59:51 +02:00
Horst Gutmann	f4c04d4055	Alerting: Update patch for #865 after #68898 (#890 )	2023-06-06 13:38:37 +02:00
dsotirakis	f9c310dbaf	Require alert.notifications:write permissions to test receivers and templates (#865 ) # Conflicts: # pkg/services/ngalert/api/authorization.go	2023-06-06 13:33:56 +02:00
Matthew Jacobson	c16f1f5e99	Alerting: Fix provisioned templates being ignored by alertmanager (#69485 ) * Alerting: Fix provisioned templates being ignored by alertmanager Template provisioning sets the template in cfg.TemplateFiles while a recent change made it so that alertmanager reads cfg.AlertmanagerConfig.Templates instead. This change fixes the issue on both ends, by having provisioning set boths fields and reverts the change on the alertmanager side so that it uses cfg.TemplateFiles.	2023-06-02 15:47:43 -04:00
Arati R	6cb1a5e368	Nested folders: Add alert rule counts and deletion to folder registry (#67259 ) * Let alert rule service implement registry service * Add count method to RuleStore interface * Add implementation for deletion of alert rules * Rename uid to folderUID in registry methods * Check forceDeleteRule value for registry deletion * Register alerting store with folder service * Move folder test functions to separate package * Add testing for alert rule counting, deletion * Remove redundant count method * Fix deleteChildrenInFolder signature * Update pkg/services/ngalert/store/alert_rule.go Co-authored-by: Sofia Papagiannaki <1632407+papagian@users.noreply.github.com> * Add tests for nested folder deletion * Refactor TestIntegrationNestedFolderService * Add rules store as parameter for alertng provider --------- Co-authored-by: Sofia Papagiannaki <1632407+papagian@users.noreply.github.com>	2023-06-02 16:38:02 +02:00
Ieva	d8b66d5c4b	RBAC: remove some `IsDisabled` checks (#69272 ) * remove some access contorl IsDisabled() checks * cleaning up tests * update tests * linting	2023-05-31 09:58:57 +01:00
Alexander Weaver	0ed5d3bdf2	Revert "Alerting: Refactor the ImageStore/Provider to provide image URL/bytes" (#69265 ) Revert "Alerting: Refactor the ImageStore/Provider to provide image URL/bytes (#67693)" This reverts commit `72a187b0be`.	2023-05-30 11:33:33 -05:00
Alexander Weaver	0f88b117dc	Alerting: Skip flaky test TestRouteGetRuleStatuses (#69258 ) Skip TestRouteGetRuleStatuses	2023-05-30 09:48:02 -05:00
Santiago	72a187b0be	Alerting: Refactor the ImageStore/Provider to provide image URL/bytes (#67693 ) * (WIP) Refactor the ImageStore interface to work with our latest alerting repository * update alerting package * refactor, new URLExists method in ImageProvider * tests for the new methods * fix linter warnings * use alertingImages as an alias for grafana/alerting/images * logs about image uris and not found images * nerf image not found logs * extract duplicated code to getImageFromURI() method * refactor getImageFromURI() * add index on url * add comment about migration log * sync generated files	2023-05-30 11:25:55 -03:00
Ieva	d98813796c	RBAC: Remove legacy AC from HasAccess permission check (#68995 ) * remove unused HasAdmin and HasEdit permission methods * remove legacy AC from HasAccess method * remove unused function * update alerting tests to work with RBAC	2023-05-30 14:39:09 +01:00
Matthew Jacobson	97ae6ae6ef	Alerting: Fix flaky TestIntegrationUpdateAlertRules (#69106 ) Prevents duplicate alert rule ids and 0 value for BaseInterval and IntervalSeconds	2023-05-25 16:00:06 -04:00
Yuri Tseretyan	b57ef1f2c7	Alerting: Fix TestIntegration_GetAlertRulesForScheduling to make sure rules are created in different org (#69088 ) make sure rules are created in different org	2023-05-25 13:51:38 -04:00
Sladyn	a06a5a7393	Alerting: Improve log messages (#67688 ) * Rename base logger and capatilize messages * Remove cflogger from config.go	2023-05-25 18:55:01 +03:00
Yuri Tseretyan	e00260465b	Alerting: Fix provenance guard checks for Alertmanager configuration to not cause panic when compared nested objects (#69009 ) * fix current settings parsed as new * replace map comparison with cmp.Diff and log the diff	2023-05-25 11:41:11 -04:00
Jean-Philippe Quéméner	5717d8954f	Alerting: Return empty list on export if no rules exist (#69023 )	2023-05-25 14:12:18 +02:00
Ieva	4980b64274	RBAC: Remove legacy ac from authorization middleware (#68898 ) remove legacy AC fallback from RBAC middleware, and some unused auth logic	2023-05-24 09:49:42 +01:00
Yuri Tseretyan	ab5a3820d5	Alerting: Fix status code of successful response POST /api/alertmanager/grafana/api/v2/silences in swagger specs (#67951 ) * update status code to reflect reality * update docs	2023-05-15 11:23:30 -04:00
Emil Tullstedt	23a9963507	Chore: Upgrade Prometheus to 2.43.0 (#67853 ) - github.com/prometheus/prometheus => 2.43.0 (aka 0.43.0) - github.com/prometheus/client_golang => 1.15.1	2023-05-10 14:09:49 +02:00
Virginia Cepeda	e1ff434917	Alerting: Change text on cloud AM email addresses for contact points (#68143 )	2023-05-10 10:44:17 +02:00
Matthew Jacobson	5422609fb1	Alerting: Fix broken integration test (#68140 ) From https://github.com/grafana/grafana/pull/68122	2023-05-09 22:27:40 +03:00
Jean-Philippe Quéméner	8bb62a8316	Alerting: Add option for memberlist label (#67982 )	2023-05-09 10:32:23 +02:00
Matthew Jacobson	91471ac7ae	Alerting: Template Testing API (#67450 )	2023-04-28 15:56:59 +01:00
Yuri Tseretyan	9eb10bee1f	Alerting: Scheduler use rule fingerprint instead of version (#66531 ) * implement calculation of fingerprint for ruleWithFolder * update scheduler to use fingerprint instead of rule's version	2023-04-28 10:42:16 -04:00
Uwe Sommerlatt	dfc99cdd19	Alerting: Fix misleading status code in provisioning API (#67331 ) Fixes #66249	2023-04-27 09:25:34 +01:00
Santiago	b0881daf23	Alerting: Use URLs in image annotations (#66804 ) * use tokens or urls in image annotations * improve tests, fix some comments * fix empty tokens * code review changes, check for url before checking for token (support old token formats)	2023-04-26 13:06:18 -03:00
Yuri Tseretyan	a8b4a4bb45	Alerting: Update alerting module to 20230418161049-5f374e58cb32 + refactoring (#66622 ) * update to alerting 20230418161049-5f374e58cb32 * rename renamed structs in https://github.com/grafana/alerting/pull/73 * update ValidateContactPoint to use BuildReceiverConfiguration * update logger factory according to changes * rewrite integration builder Co-authored-by: Santiago <santiagohernandez.1997@gmail.com>	2023-04-25 13:39:46 -04:00
Alexander Weaver	117636e8ca	Alerting: Fix panic when reparenting receivers to groups following an attempted rename via Provisioning (#67167 )	2023-04-24 21:23:23 -04:00
Steve Simpson	9effb9a708	Alerting: Allow hooking into request handler functions. (#67000 ) * Alerting: Allow hooking into request handler functions. Adds a facility to AlertNG for hooking into API handlers, allowing the replacement of request handlers for specific paths. One of goals of this approach was to allow hooking as late as possible in the request, e.g. after all middleware has been applied, to simplfiy usage. * Update pkg/services/ngalert/api/hooks.go Co-authored-by: gotjosh <josue.abreu@gmail.com> * Update pkg/services/ngalert/api/hooks.go Co-authored-by: gotjosh <josue.abreu@gmail.com> * Update pkg/services/ngalert/ngalert.go Co-authored-by: gotjosh <josue.abreu@gmail.com> * Fixes to review comments * Fix passing logger in --------- Co-authored-by: gotjosh <josue.abreu@gmail.com>	2023-04-24 18:18:44 +02:00
Jean-Philippe Quéméner	bbce69f295	Alerting: Use configured headers for external alertmanager (#63819 )	2023-04-21 16:16:27 +02:00
Matthew Jacobson	eddd4f4508	Alerting: Add totalsFiltered to RuleResponse for hidden by filters count (#66883 ) Alerting: Add totalsFiltered to RuleResponse to facilitate hidden by filters count Currently, when both a limit_alerts and a matcher/state filter is applied, there is not enough information to determine how many alert instances were hidden by the filters. Only enough to determine the total hidden by the limit and filter combined. This change adds a separate totalsFiltered field alongside the AlertRule totals that will contain the count of instances after filters but before limits.	2023-04-21 09:35:12 +01:00
George Robinson	35342a3c76	Alerting: Fix DatasourceUID and RefID missing for DatasourceNoData alerts (#66733 ) This commit fixes a bug where DatasourceUID and RefID annotations are missing for DatasourceNoData alerts in Grafana 9.5. This bug affects datasource plugins that have moved to using the data plane contract.	2023-04-20 14:38:20 +01:00
George Robinson	883dcc81c0	Alerting: Add tests for Evaluate (#66739 )	2023-04-20 11:24:40 +01:00
Alexander Weaver	3634079b8f	Alerting: Attach hash of instance labels to state history log lines (#65968 ) * Add instanceID which is hash of labels * Rename field to fingerprint * Move to prometheus style signature * Appease linter	2023-04-19 14:22:19 -05:00
Jean-Philippe Quéméner	bc11a484ed	Alerting: Add support for running HA using Redis (#65267 ) Co-authored-by: Steve Simpson <steve.simpson@grafana.com>	2023-04-19 17:05:26 +02:00
Alexander Weaver	a384194e15	Alerting: Use default page size of 5000 when querying Loki for state history (#66315 ) Always specify limit of 5000	2023-04-18 14:31:29 -05:00
Alexander Weaver	cf7157f683	Alerting: Capture refID of rule's condition expression in Loki state history entries (#66419 ) * Capture condition from rule * Add test	2023-04-18 14:21:28 -05:00
Alex Moreno	f64a89727e	Alerting: Allow provenance disable in alerting provisioning API (#63650 ) * Allow provenance None in alert rule update and rule group replace * Allow provenance None in contact point update * Allow updating policies to none by sending x-disable-provenance header * Allow mute timings to disable provenance with x-disable-provenance header * Allow disabling provenance by using x-disable-provenance header * Add provenance helper to lower the cyclomatic complexity * Do not downgrade provenance except un ReplaceRuleGroup * Add function explanation and change error handling * Add docs for x-disable-provenance changes (#66300) * Add docs for x-disable-provenance changes * Apply suggestions from code review Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update _index.md --------- Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> * Update docs/sources/alerting/set-up/provision-alerting-resources/_index.md Co-authored-by: George Robinson <george.robinson@grafana.com> * Add error message check in tests * Change docs --------- Co-authored-by: brendamuir <100768211+brendamuir@users.noreply.github.com> Co-authored-by: George Robinson <george.robinson@grafana.com>	2023-04-18 15:10:36 +02:00
Kyle Brandt	840fb32ad8	SSE: (Instrumentation) Add Tracing (#66700 ) spans are prefixed `SSE.`	2023-04-18 08:04:51 -04:00
Kyle Brandt	2f13c851e4	SSE: (Chore/Instrumentation) Add ds_queries_total metric and move met… (#66695 ) * SSE: (Chore/Instrumentation) Add ds_queries_total metric and move metrics to service	2023-04-17 16:12:44 -07:00
George Robinson	19ebb079ba	Alerting: Add limits and filters to Prometheus Rules API (#66627 ) This commit adds support for limits and filters to the Prometheus Rules API. Limits: It adds a number of limits to the Grafana flavour of the Prometheus Rules API: - `limit` limits the maximum number of Rule Groups returned - `limit_rules` limits the maximum number of rules per Rule Group - `limit_alerts` limits the maximum number of alerts per rule It sorts Rule Groups and rules within Rule Groups such that data in the response is stable across requests. It also returns summaries (totals) for all Rule Groups, individual Rule Groups and rules. Filters: Alerts can be filtered by state with the `state` query string. An example of an HTTP request asking for just firing alerts might be `/api/prometheus/grafana/api/v1/rules?state=alerting`. A request can filter by two or more states by adding additional `state` query strings to the URL. For example `?state=alerting&state=normal`. Like the alert list panel, the `firing`, `pending` and `normal` state are first compared against the state of each alert rule. All other states are ignored. If the alert rule matches then its alert instances are filtered against states once more. Alerts can also be filtered by labels using the `matcher` query string. Like `state`, multiple matchers can be provided by adding additional `matcher` query strings to the URL. The match expression should be parsed using existing regular expression and sent to the API as URL-encoded JSON in the format: { "name": "test", "value": "value1", "isRegex": false, "isEqual": true } The `isRegex` and `isEqual` options work as follows: \| IsEqual \| IsRegex \| Operator \| \| ------- \| -------- \| -------- \| \| true \| false \| = \| \| true \| true \| =~ \| \| false \| true \| !~ \| \| false \| false \| != \|	2023-04-17 17:45:06 +01:00
Arati R	cab3ba519a	NestedFolders: Add folder service registry with dashboard service implementation (#65033 ) * Delete folders, dashboards with registry service Co-authored-by: Serge Zaitsev <hello@zserge.com> * Update signature of ProvideDashboardServiceImpl * Regenerate mockery file * Add test for DeleteInFolder * Add test for DeleteDashboardsInFolder * Delete child dashboard associations via registry * Add validation of folder uid and org id --------- Co-authored-by: Serge Zaitsev <hello@zserge.com>	2023-04-14 11:17:23 +02:00
Yuri Tseretyan	afd52d0866	Alerting: use alerting GrafanaReceiver and BuildReceiverConfiguration in Grafana (#65224 ) * replace receiver errors with one from alerting * add the converter to alerting models * update buildReceiverIntegration to accept GrafanaReceiver --------- Co-authored-by: George Robinson <george.robinson@grafana.com>	2023-04-13 12:25:32 -04:00
gotjosh	2bbf0c9de4	Alerting: Allow Rules to Schedule to be filtered by Rule Group (#59990 ) * Alerting: Allow Rules to Schedule to be filtered by Rule Group	2023-04-13 12:55:42 +01:00
Michael Mandrus	5626461b3c	Caching: Refactor enterprise query caching middleware to a wire service (#65616 ) * define initial service and add to wire * update caching service interface * add skipQueryCache header handler and update metrics query function to use it * add caching service as a dependency to query service * working caching impl * propagate cache status to frontend in response * beginning of improvements suggested by Lean - separate caching logic from query logic. * more changes to simplify query function * Decided to revert renaming of function * Remove error status from cache request * add extra documentation * Move query caching duration metric to query package * add a little bit of documentation * wip: convert resource caching * Change return type of query service QueryData to a QueryDataResponse with Headers * update codeowners * change X-Cache value to const * use resource caching in endpoint handlers * write resource headers to response even if it's not a cache hit * fix panic caused by lack of nil check * update unit test * remove NONE header - shouldn't show up in OSS * Convert everything to use the plugin middleware * revert a few more things * clean up unused vars * start reverting resource caching, start to implement in plugin middleware * revert more, fix typo * Update caching interfaces - resource caching now has a separate cache method * continue wiring up new resource caching conventions - still in progress * add more safety to implementation * remove some unused objects * remove some code that I left in by accident * add some comments, fix codeowners, fix duplicate registration * fix source of panic in resource middleware * Update client decorator test to provide an empty response object * create tests for caching middleware * fix unit test * Update pkg/services/caching/service.go Co-authored-by: Arati R. <33031346+suntala@users.noreply.github.com> * improve error message in error log * quick docs update * Remove use of mockery. Update return signature to return an explicit hit/miss bool * create unit test for empty request context * rename caching metrics to make it clear they pertain to caching * Update pkg/services/pluginsintegration/clientmiddleware/caching_middleware.go Co-authored-by: Marcus Efraimsson <marcus.efraimsson@gmail.com> * Add clarifying comments to cache skip middleware func * Add comment pointing to the resource cache update call * fix unit tests (missing dependency) * try to fix mystery syntax error * fix a panic * Caching: Introduce feature toggle to caching service refactor (#66323) * introduce new feature toggle * hide calls to new service behind a feature flag * remove licensing flag from toggle (misunderstood what it was for) * fix unit tests * rerun toggle gen --------- Co-authored-by: Arati R. <33031346+suntala@users.noreply.github.com> Co-authored-by: Marcus Efraimsson <marcus.efraimsson@gmail.com>	2023-04-12 12:30:33 -04:00
Kyle Brandt	e78be44e1a	SSE: Dataplane Compliance (#65927 ) Takes a specific code path for data that identifies itself as dataplane instead of "guessing" what the data is. The data must identify itself by being in the dataplane by having both the following frame metadata properties: - TypeVersion property that is greater than 0.0 - 'Type' property The flag is disableSSEDataplane and disables this functionality and uses the old code for all queries regardless. See https://github.com/grafana/grafana-plugin-sdk-go/blob/main/data/contract_docs/contract.md for dataplane details.	2023-04-12 12:24:34 -04:00
Matthew Jacobson	63187fae0c	Alerting: Remove and revert flag alertingBigTransactions (#65976 ) * Alerting: Remove and revert flag alertingBigTransactions This is a partial revert of #56575 and a removal of the `alertingBigTransactions` flag. Real-word use has seen no clear performance incentive to maintain this flag. Lowered db connection count came at the cost of significant increase in CPU usage and query latency. * Fix lint backend * Removed last bits of alertingBigTransactions --------- Co-authored-by: Armand Grillet <2117580+armandgrillet@users.noreply.github.com>	2023-04-06 18:06:25 +02:00
gotjosh	1c3ce0735f	Alerting: Tiny refactor on the eval and schedule packages (#66130 ) * Alerting: Tiny refactor on the eval and schedule packages two very small things: - We had a constructor on something called a `Context` which is not a `context.Context` so let's just name that constructor `NewContext` - The user that we use to run query evaluations is the same (with some variation) abstract it to a function so that it can be re-used when necessary. * Update pkg/services/ngalert/schedule/schedule.go Co-authored-by: Alexander Weaver <weaver.alex.d@gmail.com> * Update pkg/services/ngalert/schedule/schedule.go Co-authored-by: Alexander Weaver <weaver.alex.d@gmail.com> --------- Co-authored-by: Alexander Weaver <weaver.alex.d@gmail.com>	2023-04-06 16:02:28 +01:00
Matthew Jacobson	85f738cdf9	Alerting: Add endpoint to revert to a previous alertmanager configuration (#65751 ) * Alerting: Add endpoint to revert to a previous alertmanager configuration This endpoint is meant to be used in conjunction with /api/alertmanager/grafana/config/history to revert to a previously applied alertmanager configuration. This is done by ID instead of raw config string in order to avoid secure field complications.	2023-04-05 14:10:03 -04:00
Alexander Weaver	fb520edd72	Alerting: Use a completely isolated context for state history writes (#64989 ) * Add fresh context with timeout and same log properties, re-derive logger * Unify timeout constants * Move ctx after shortcut that got added through rebasing * Unify timeouts * Port opentracing's SpanFromContext and ContextFromSpan to the grafana tracing package * Support both opentracing and otel variants * Better document why we're creating a new ctx * Add new func to FakeSpan which was added after rebase * Support grafana-specific traceID key in both tracer implementations	2023-04-04 16:41:46 -05:00
George Robinson	bd29071a0d	Revert "Alerting: Add limits to the Prometheus Rules API" (#65842 )	2023-04-03 15:20:37 +00:00
George Robinson	d96b0a71d3	Alerting: Add limits to the Prometheus Rules API (#65169 ) This commit adds a number of limits to the Grafana flavor of the Prometheus Rules API: 1. `limit` limits the maximum number of Rule Groups returned 2. `limit_rules` limits the maximum number of rules per Rule Group 3. `limit_alerts` limits the maximum number of alerts per rule It sorts Rule Groups and rules within Rule Groups such that data in the response is stable across requests. It also returns summaries (totals) for all Rule Groups, individual Rule Groups and rules.	2023-04-03 10:17:02 +01:00
Santiago	aba91d3053	Alerting: Fetch all applied alerting configurations (#65728 ) * WIP * skip invalid historic configurations instead of erroring * add warning log when bad historic config is found * remove unused custom marshaller for GettableHistoricUserConfig * add id to historic user config, move limit check to store, fix typo * swagger spec	2023-03-31 17:43:04 -03:00
Alexander Weaver	da4832724e	Alerting: Delete stub for SQL alert state history backend (#65667 ) Delete stub for SQL backend	2023-03-31 11:15:56 -05:00
Matthew Jacobson	b9dc04139a	Alerting: Respect "For" Duration for NoData alerts (#65574 ) * Alerting: Respect "For" Duration for NoData alerts This change modifies `resultNoData` to be more inline with the logic of the other state handlers. The main effects of this are: 1) NoData states with NoDataState config set to Alerting will respect "For" duration. 2) Prevents zero value in StartsAt and EndsAt for alerts that have only even been in normal state. This includes state transitions from NoDataState=OK and ExecErrState=OK. 3) Better state transition logging.	2023-03-31 19:05:15 +03:00
Steve Simpson	04336d53a9	Alerting: Update prometheus version (#65688 )	2023-03-31 16:34:35 +02:00
Yuri Tseretyan	622c23716a	Alerting: Use logger with context in the state cache (#65663 )	2023-03-31 10:11:30 -04:00
Alexander Weaver	b2abb63286	Alerting: Introduce proper feature toggles for common state history backend combinations (#65497 ) * define 3 feature toggles for rollout phases * Pass feature toggles along * Implement first feature toggle * Try a different strategy with fall-throughs to specific configurations * Apply toggle overrides once outside of backend composition * Emit log messages when we coerce backends * Run code generator for feature toggle files * Improve wording in flag descs * Re-run generator * Use code-generated constants instead of plain strings * Use converted enum values rather than strings for pre-parsing	2023-03-30 13:53:21 -05:00
Alexander Weaver	5e87ea745d	Alerting: Fix and re-enable `filters instance labels in log line` test (#65618 ) Fix and reenable test	2023-03-30 09:02:18 -05:00
Dimitris Sotirakis	e758b017d0	Alerting: Disable `filters instance labels in log line` test (#65610 ) * Disable filters instance labels in log line test * Add drone reference	2023-03-30 16:04:29 +03:00
Yuri Tseretyan	9eaffdf5a8	Alerting: Remove dependency on alerting package in definitions (#65390 ) * move export rules to definitions package * move provisioning contact point methods to provisioning package * move AlertRuleGroupWithFolderTitle to ngalert models and adapter functions to api's compat * move rule_types files back to where they were before.	2023-03-29 13:34:59 -04:00
Alexander Weaver	a416100abc	Alerting: No longer index state history log streams by instance labels (#65474 ) * Remove private labels * No longer index by instance labels * Labels are now invariant, only build them once * Remove bucketing since everything is in a single stream * Refactor statesToStreams to only return a single unified log stream * Don't query on labels that no longer exist * Move selector logic to loki layer, genericize client to work in terms of straight logQL * Add support for line-level label filters in query * Combine existing selector tests for better parallelism * Tests for logQL construction * Underscore instead of dot for unwrapping labels in logql	2023-03-29 11:52:11 -05:00
Santiago	7b92849508	Alerting: Add CustomDetails field in PagerDuty contact point (#64860 ) * Alerting: Add CustomDetails for PagerDuty * fix default value for 'severity' from 'error' to 'critical' * minimal docs for notifiers, specifying config for PagerDuty * replace notifier -> integration * replace notifier -> integration	2023-03-29 10:35:01 -03:00
Alexander Weaver	de1637afe5	Alerting: Add alert instance labels to Loki log lines in addition to stream labels (#65403 ) Add instance labels to log line	2023-03-28 08:57:51 -05:00
Alexander Weaver	dd04757fc9	Alerting: Add "backend" label to state history writes metrics (#65395 ) * Add backend label to state history writes metrics * Update test expectations	2023-03-28 08:49:51 -05:00
Giuseppe Guerra	a89202eab2	Plugins: Improve instrumentation by adding metrics and tracing (#61035 ) * WIP: Plugins tracing * Trace ID middleware * Add prometheus metrics and tracing to plugins updater * Add TODOs * Add instrumented http client * Add tracing to grafana update checker * Goimports * Moved plugins tracing to middleware * goimports, fix tests * Removed X-Trace-Id header * Fix comment in NewTracingHeaderMiddleware * Add metrics to instrumented http client * Add instrumented http client options * Removed unused function * Switch to contextual logger * Refactoring, fix tests * Moved InstrumentedHTTPClient and PrometheusMetrics to their own package * Tracing middleware: handle errors * Report span status codes when recording errors * Add tests for tracing middleware * Moved fakeSpan and fakeTracer to pkg/infra/tracing * Add TestHTTPClientTracing * Lint * Changes after PR review * Tests: Made "ended" in FakeSpan private, allow calling End only once * Testing: panic in FakeSpan if span already ended * Refactoring: Simplify Grafana updater checks * Refactoring: Simplify plugins updater error checks and logs * Fix wrong call to checkForUpdates -> instrumentedCheckForUpdates * Tests: Fix wrong call to checkForUpdates -> instrumentedCheckForUpdates * Log update checks duration, use Info log level for check succeeded logs * Add plugin context span attributes in tracing_middleware * Refactor prometheus metrics as httpclient middleware * Fix call to ProvidePluginsService in plugins_test.go * Propagate context to update checker outgoing http requests * Plugin client tracing middleware: Removed operation name in status * Fix tests * Goimports tracing_middleware.go * Goimports * Fix imports * Changed span name to plugins client middleware * Add span name assertion in TestTracingMiddleware * Removed Prometheus metrics middleware from grafana and plugins updatechecker * Add span attributes for ds name, type, uid, panel and dashboard ids * Fix http header reading in tracing middlewares * Use contexthandler.FromContext, add X-Query-Group-Id * Add test for RunStream * Fix imports * Changes from PR review * TestTracingMiddleware: Changed assert to require for didPanic assertion * Lint * Fix imports	2023-03-28 11:01:06 +02:00
Serge Zaitsev	0beb768427	Chore: Remove result fields from ngalert (#65410 ) * remove result fields from ngalert * remove duplicate imports	2023-03-28 10:34:35 +02:00
Yuri Tseretyan	ec4152c7e5	Alerting: Remove dependency on secrets in definitions package (#65391 )	2023-03-27 16:35:54 -04:00
Yuri Tseretyan	52a0f59706	Alerting: introduce AlertQuery in definitions package (#63825 ) * copy AlertQuery from ngmodels to the definition package * replaces usages of ngmodels.AlertQuery in API models * create a converter between models of AlertQuery --------- Co-authored-by: Alex Moreno <alexander.moreno@grafana.com>	2023-03-27 11:55:13 -04:00
Alexander Weaver	07368dec74	Alerting: Fix attachment of external labels to Loki state history log streams (#65140 ) Fix attachment of external labels, add tests	2023-03-21 18:00:59 -05:00
Alexander Weaver	bf54f2672e	Alerting: Switch to snappy-compressed-protobuf for outgoing push requests to Loki (#65077 ) * Encode with snappy, always * JSON encoder type * Headers * Copy labels formatter from promtail * Implement snappy-proto encoding * Create encoder interface, test both encoders, choose snappy-proto by default * Make encoder configurable at the LokiCfg level * Export both encoders * Touch up comment and tests * Drop unnecessary conversions after move to plain strings to appease linter	2023-03-21 13:38:42 -05:00
Alexander Weaver	cc7e5ce62e	Alerting: Fix ambiguous handling of equals in labels when bucketing Loki state history streams (#65013 ) * Use JSON instead of data.Labels string format as label repr * Drop debug log line	2023-03-21 12:33:27 -05:00
Alexander Weaver	e39d7f44c9	Alerting: Elide requests to Loki if nothing should be recorded (#65011 ) Exit early if no log streams or annotations	2023-03-21 09:30:56 -05:00
Alexander Weaver	40c5713cbd	Vendor errors.Join from Go standard library to avoid version incompatibilities (#64985 ) Vendor errors.Join from std lib	2023-03-17 14:07:58 -05:00
Alexander Weaver	a31672fa40	Alerting: Create new state history "fanout" backend that dispatches to multiple other backends at once (#64774 ) * Rename RecordStatesAsync to Record * Rename QueryStates to Query * Implement fanout writes * Implement primary queries * Simplify error joining * Add test for query path * Add tests for writes and error propagation * Allow fanout backend to be configured * Touch up log messages and config validation * Consistent documentation for all backend structs * Parse and normalize backend names more consistently against an enum * Touch-ups to documentation * Improve clarity around multi-record blocking * Keep primary and secondaries more distinct * Rename fanout backend to multiple backend * Simplify config keys for multi backend mode	2023-03-17 12:41:18 -05:00
Alexander Weaver	9bcf8819d3	Alerting: Handful of small adjustments to log levels and parameters (#64572 ) Calculate duration earlier in scheduler	2023-03-17 12:15:49 +00:00
gotjosh	02a8f62021	Alerting: Fix stats that display alert count when using unified alerting (#64852 ) * Alerting: Fix stats when using unified alerting	2023-03-17 11:19:18 +00:00
George Robinson	0b506b4ccc	Alerting: Update github.com/grafana/alerting (#64882 )	2023-03-16 13:59:35 +00:00
Yuri Tseretyan	85a954cd81	Alerting: Update scheduler to get updates only from database (#64635 ) * stop using the scheduler's Update and Delete methods all communication must be via the database * update scheduler's registry to calculate diff before re-setting the cache * update fetcher to return the diff generated by registry * update processTick to update rule eval routine if the rule was updated and it is not going to be evaluated at this tick. * remove references to the scheduler from api package * remove unused methods in the scheduler	2023-03-14 18:02:51 -04:00
Alexander Weaver	faef3a8258	Alerting: Log error but don't fail initialization if state history connection test fails (#64699 ) Don't return init error if ping fails, add tests	2023-03-13 15:54:46 -05:00
Andrej Ocenas	6647217208	Phlare: Use enum config to send deduplicated func and filenames (#64435 )	2023-03-13 11:06:04 +01:00
Jean-Philippe Quéméner	fb5ed0b0b3	Alerting: fix flaky cache test (#64499 )	2023-03-09 06:08:05 -05:00
Ryan McKinley	42e7ec9fe4	Chore: cleanup dashboard service names (#64442 )	2023-03-08 14:37:45 -05:00
George Robinson	0c8876c3a2	Alerting: Return errors when expanding templates (#63662 ) This commit changes the state package so that errors encountered while expanding templates for custom labels and annotations are returned from the function. This is not used at present, but will be used in the future as we look at how to offer better feedback to users who don't have access to logs, for example our customers who use Hosted Grafana.	2023-03-08 12:25:02 +00:00
Alexander Weaver	4a1c18abf6	Alerting: Fix intermittency when seeding database in rule store tests (#64322 ) Force unique IDs when seeding database	2023-03-07 09:40:55 -05:00
George Robinson	ed71012ced	Alerting: Fix Classic Conditions $values variable (#64243 ) This commit fixes a bug in the $values variable in notification templates when using Classic Conditions. Since Classic Conditions are not multi-dimensional, the values of each series that exceeded the condition should be available as a RefID and offset. For example, B0, B1, etc. However, this bug meant that instead just a single condition would be printed as B, not B0.	2023-03-06 12:08:00 -05:00
Alexander Weaver	19d01dff91	Alerting: Expose Prometheus metrics for persisting state history (#63157 ) * Create historian metrics and dependency inject * Record counter for total number of state transitions logged * Track write failures * Track current number of active write goroutines * Record histogram of how long it takes to write history data * Don't copy the registerer * Adjust naming of write failures metric * Introduce WritesTotal to complement WritesFailedTotal * Measure TransitionsFailedTotal to complement TransitionsTotal * Rename all to state_history * Remove redundant Total suffix * Increment totals all the time, not just on success * Drop ActiveWriteGoroutines * Drop PersistDuration in favor of WriteDuration * Drop unused gauge * Make writes and writesFailed per org * Add metric indicating backend and a spot for future metadata * Drop _batch_ from names and update help * Add metric for bytes written * Better pairing of total + failure metric updates * Few tweaks to wording and naming * Record info metric during composition * Create fakeRequester and simple happy path test using it * Blocking test for the full historian and test for happy path metrics * Add tests for failure case metrics * Smoke test for full annotation persistence * Create test for metrics on annotation persistence, both happy and failing paths * Address linter complaints * More linter complaints * Remove unnecessary whitespace * Consistency improvements to help texts * Update tests to match new descs	2023-03-06 10:40:37 -06:00
gotjosh	5422f7cf56	Alerting: Add metrics for active receiver and integrations (#64050 ) * Alerting: Add metrics for active receiver and integrations Introduces metrics that allows us to track the number of configured receivers and integration in the Alertmanager for all orgs. As a bonus, I realised that the alert reception metrics where not being exported nor collected. This does that too.	2023-03-06 16:37:07 +00:00
Serge Zaitsev	0bdb105df2	Chore: Remove xorcare/pointer dependency (#63900 ) * Chore: remove pointer dependency * fix type casts * deprecate xorcare/pointer library in linter * rooky mistake	2023-03-06 05:23:15 -05:00
Ieva	a52999a886	Access Control: revert to using folder store from the scope resolvers (#64132 ) * revert to using folder store from the resolvers * fixing tests after revert * api test fixes --------- Co-authored-by: Kristin Laemmert <mildwonkey@users.noreply.github.com>	2023-03-03 10:56:33 -05:00
Yuri Tseretyan	e760f22402	Alerting: Use background context for maintenance function (#64065 )	2023-03-02 14:19:52 -05:00
Emil Tullstedt	10ee900beb	Errors: Remove direct dependencies on github.com/pkg/errors (#64026 ) Co-authored-by: Sofia Papagiannaki <1632407+papagian@users.noreply.github.com>	2023-03-02 16:28:10 +01:00
Kristin Laemmert	bb798e24f3	chore(services): replace dependencies on dashboard store with dashboard service (#63937 ) * chore(services): replace dependencies on dashboard store with dashboard service This continues the backend service/store split by replacing dashboard store dependencies with service dependencies. the folder service remains the single exception for now; otherwise we'd have a dependency cycle between the folder and dashboard services. I have some ideas for that, but I'll take care of all the easy parts first. While doing this, I identified and removed a number of unused arguments from the following functions: NewFolderNameScopeResolver NewFolderIDScopeResolver NewFolderUIDScopeResolver NewDashboardIDScopeResolver NewDashboardUIDScopeResolver resolveDashboardScope I have a small enterprise PR to support this commit. * lingering fmt	2023-03-02 08:09:57 -05:00
Yuri Tseretyan	5e2a661dec	Alerting: update API models to user NoDataState and ExecutionErrorState from definitions instead of models (#63824 )	2023-02-28 16:21:41 -05:00
Yuri Tseretyan	f561e71de8	Alerting: decouple api models from domain\dto models: separate Provenance status + converters (#63594 ) * move conversions of domain models to api models and reverse from definition package to api package	2023-02-27 17:57:15 -05:00
Alexander Weaver	e77621649d	Alerting: Instrument outgoing state history requests using weaveworks/common (#63600 ) * Loki backend and client depend on a requester * Instrument all requests to loki using weaveworks TimedClient * Construct collector in metrics package	2023-02-23 17:52:02 -06:00
Yuri Tseretyan	98e1aeaebd	Alerting: Fix client to external Alertmanager to correctly build URL for Mimir Alertmanager (#63676 )	2023-02-23 13:55:26 -05:00
Emil Tullstedt	3abaf32cf2	Chore: Upgrade golangci-lint to v1.51.2 (#63630 ) Co-authored-by: Sofia Papagiannaki <1632407+papagian@users.noreply.github.com>	2023-02-23 15:10:03 +01:00
Alex Moreno	f60dc4441f	Alerting: Add status label to GroupRules metric (#63454 ) * Add status label to GroupRules metric * Add state (active and paused) label to GrouRules * Add active/paused metrics tests	2023-02-23 12:38:27 +01:00
George Robinson	f93a9c794d	Alerting: Fix incorrect comment in eval.go (#63510 ) This commit fixes an incorrect comment in the Result struct in eval.go that I had written some time ago. The comment now documents the actual behaviour and content of this field.	2023-02-21 15:42:04 +00:00
bla2ej	56c8661929	Alerting: Get alert rules on faults (#61248 ) (#63051 ) * Alerting: get alert rules on faults (#61248) Two functions used to fetch alert rules from DB are updated: - GetAlertRulesForScheduling - ListAlertRules Rows are scanned one by one so good ones are returned. Common Error is logged with indication how many rules failed on deserialization. Resolved: #61248 * updates from review comments	2023-02-21 08:54:20 -06:00
George Robinson	9f2fb3fa27	Alerting: Add filter and remove funcs for custom labels and annotations (#63437 ) This commit adds filterLabels, filterLabelsRe, removeLabels, and removeLabelsRe functions to templates for custom labels and annotations. It allows for use cases such as removing all private labels.	2023-02-20 14:40:26 +00:00
George Robinson	c637a5543e	Alerting: Rename caps to captures as cap is a reserved word (#63432 )	2023-02-20 10:08:36 +00:00
George Robinson	aacf9da969	Alerting: Change Data to use Labels instead of map[string]string (#63431 ) This commit changes the Data struct in template.go to use Labels instead of map[string]string. It changes how labels are printed when using {{ .Labels }} from map[foo:bar bar:baz] to foo=bar, bar=baz.	2023-02-20 10:08:23 +00:00
George Robinson	0a01391ebe	Alerting: Small readability improvements to template.go (#63422 ) * Alerting: Small readability improvements to template.go * Fix lint	2023-02-20 09:24:11 +00:00
George Robinson	0659134793	Alerting: Better printing of labels (#63348 ) This commit changes how labels are printed in templates for custom annotations and labels from map[foo:bar bar:baz] to foo=bar, bar=baz. Labels are comma separated, and sorted in increasing order.	2023-02-16 12:04:15 -05:00
George Robinson	9e86916d48	Alerting: Move templating to template package (#63347 ) This commit moves templating from the state package to a sub-package called template. This sub-package will be the logical package for future ease-of-use improvements to templating custom annotations and labels.	2023-02-16 17:16:36 +01:00
Alexander Weaver	958fb2c50a	Alerting: Unify structs in Loki client and make them more consistent with Prometheus (#63055 ) * Use existing row struct instead of [2]string, add deserialization helper * Replace Stream struct with stream struct which is exactly the same * Drop unused status field * Don't export queryRes and queryData * Tests for custom marshalling * Rename row fields to T and V for consistency with prometheus samples * Rename row to sample	2023-02-11 05:17:44 -06:00
George Robinson	1f984409a2	Alerting: Fix a bug taking screenshots with Dashboard UID (#63220 ) This commit fixes a bug where Grafana would fail to take a screenshot if the same Dashboard UID was present across two or more different orgs.	2023-02-09 15:23:01 -05:00
Steve Simpson	4d1a2c3370	Alerting: Move `rule_groups_rules` metric from State to Scheduler. (#63144 ) The `rule_groups_rules` metric is currently defined and computed by `State`. It makes more sense for this metric to be computed off of the configured rule set, not based on the rule evaluation state. There could be an edge condition where a rule does not have a state yet, and so is uncounted. Additionally, we would like this metric (and others), to have a `rule_group` label, and this is much easier to achieve if the metric is produced from the `Scheduler` package.	2023-02-09 17:05:19 +01:00
suntala	49b3027049	Chore: Remove Result field from datasources (#63048 ) * Remove Result field from AddDataSourceCommand * Remove DatasourcesPermissionFilterQuery Result * Remove GetDataSourceQuery Result * Remove GetDataSourcesByTypeQuery Result * Remove GetDataSourcesQuery Result * Remove GetDefaultDataSourceQuery Result * Remove UpdateDataSourceCommand Result	2023-02-09 15:49:44 +01:00
Alexander Weaver	f80bf11782	Alerting: Make time range query parameters not required when querying Loki (#62985 ) * Make from and to not required * Move default range calculation up to loki.go	2023-02-07 14:26:43 -06:00
Santiago	955c7b13ea	Alerting: Change error log to warning and apply correct format when updating historic config (#62973 ) * change error to warning, apply correct format * Update pkg/services/ngalert/store/alertmanager.go Co-authored-by: George Robinson <george.robinson@grafana.com> * different wording and data --------- Co-authored-by: George Robinson <george.robinson@grafana.com>	2023-02-07 11:04:05 -03:00
Joey Tawadrous	121260e0dd	Parca: Use data query schema (#62840 ) * Parca data query schema * Remove groupBy	2023-02-07 09:56:21 +00:00
Alexander Weaver	0efb84617e	Alerting: Create benchmarking test for state.ProcessEvalResults (#62041 ) * Create benchmark for ProcessEvalResults * Simplify the test	2023-02-03 15:38:08 -06:00
Yuri Tseretyan	f066e8cdcd	Alerting: Update to alerting 20230203015918-0e4e2675d7aa (after refactoring) (#62823 ) * add alerting prefix to some packages from alerting that have similar names in prometheus alertmanager	2023-02-03 11:36:49 -05:00
idafurjes	982939111b	Rename Id to ID for annotation models (#62886 ) * Rename Id to ID for annotation models * Add xorm tags * Rename Id to ID for API key models * Add xorm tags	2023-02-03 17:23:09 +01:00
Alexander Weaver	9eeea8f5ea	Alerting: Add label query parameters to state history endpoint (#62831 ) * Allow equality-only matching of arbitrary labels via query params * Pre-initialize map	2023-02-02 16:52:08 -06:00
Jean-Philippe Quéméner	1694a35e0c	Alerting: implement loki query for alert state history (#61992 ) * Alerting: implement loki query for alert state history * extract selector building * add unit tests for selector creation * backup * give selectors their own type * build dataframe * add some tests * small changes after manual testing * use struct client * golint * more golint * Make RuleUID optional for Loki implementation * Drop initial assumption that we only have one series * Pare down to three columns, fix timestamp overflows, improve failure cases in loki responses * Embed structred log lines in the dataframe as objects rather than json strings * Include state history label filter * Remove dead code --------- Co-authored-by: Alex Weaver <weaver.alex.d@gmail.com>	2023-02-02 16:31:51 -06:00
Matthew Jacobson	f9ec16e74f	Alerting: Fix template validation in provisioning api (#62530 ) * Alerting: Fix template validation in provisioning api Fix issue where provisioning API accepts a malformed template having extra text outside of definition block and template name matching definition name.	2023-02-02 15:26:39 -05:00
Alexander Weaver	647f73ddc5	Alerting: Add static label to all state history entries (#62817 ) * Add static label to all state history entries * Separate label and value visually	2023-02-02 13:25:26 -06:00
Santiago	ba731f7865	Alerting: Mark AM configuration as applied (#61330 ) * Mark AM configuration as applied * add missing checks, make linter happy * fix deadlock, mark as valid on save and on load * mark configurations only if needed * check error after applyConfig() * code review comments * code review changes * more code review changes * clean HistoricConfigFromAlertConfig function	2023-02-02 14:45:17 -03:00
Alexander Weaver	6ad1cfef38	Alerting: Add endpoint for querying state history (#62166 ) * Define endpoint and generate * Wire up and register endpoint * Cleanup, define authorization * Forgot the leading slash * Wire up query and SignedInUser * Wire up timerange query params * Add todo for label queries * Drop comment * Update path to rules subtree	2023-02-02 11:34:00 -06:00
Alexander Weaver	9fa28c11c5	Alerting: Usability adjustments to Loki representation of state history values (#62643 ) * Extract label merge, add test file * Extract error/NoData to first class fields, remove a layer from values * Include dashUID and panelID as line-level fields * Drop unnecessary object receiver * Add tests for stream building * Drop NoData field from log lines	2023-02-02 10:54:10 -06:00
idafurjes	23c27cffb3	Chore: Rename Id to ID in alerting models (#62777 ) * Chore: Rename Id to ID in alerting models * Add xorm tags for datasource * Add xorm tag for uid	2023-02-02 17:22:43 +01:00
Sonia Aguilar	753c84f825	Alerting: Pass yaml as a query param in export request (#62751 ) * Set YAML as default value for exporting alert rules * use YAML format for rule list export Co-authored-by: Sonia Aguilar <33540275+soniaAguilarPeiron@users.noreply.github.com> * lint * Add new format query param to swagger+docs * Fix broken test --------- Co-authored-by: Gilles De Mey <gilles.de.mey@gmail.com> Co-authored-by: Matt Jacobson <matthew.jacobson@grafana.com>	2023-02-02 16:10:02 +00:00
Steve Simpson	c44e9f6b71	Alerting: Add metrics around notification delivery. (#62778 ) This change exposes more metrics from the embedded Alertmanager, which are valuable for troubleshooting Alertmanager operation particularly in HA setups. ``` grafana_alerting_notifications_total grafana_alerting_notifications_failed_total grafana_alerting_notification_requests_total grafana_alerting_notification_requests_failed_total grafana_alerting_notification_latency_seconds grafana_alerting_nflog_gc_duration_seconds grafana_alerting_nflog_snapshot_duration_seconds grafana_alerting_nflog_snapshot_size_bytes grafana_alerting_nflog_queries_total grafana_alerting_nflog_query_errors_total grafana_alerting_nflog_query_duration_seconds grafana_alerting_nflog_gossip_messages_propagated_total grafana_alerting_dispatcher_aggregation_groups grafana_alerting_dispatcher_alert_processing_duration_seconds ``` Note that `alertmanager_dispatcher_aggregation_group_limit_reached_total` is explicitly not exposed, as the group limit metrics are not enabled.	2023-02-02 14:44:20 +01:00
Alexander Weaver	fcecf4d3cb	Alerting: Refactor away a layer of indirection around the goroutine in Loki state history (#62644 ) Inline recordStreamsAsync in loki backend	2023-02-01 11:57:29 -06:00
Gilles De Mey	26866953c1	Alerting: hide "silence" button for external AM setups (#62133 )	2023-02-01 15:51:05 +01:00
Sofia Papagiannaki	f143b0a5b2	Chore: Move folder store interface, implementation and test under pkg/services/folder (#62586 ) * Chore: Move folder store into folder service package * Split folder and dashboard store implementations	2023-02-01 15:43:21 +02:00
Alex Moreno	53945afedf	Alerting: Allow alert rule pausing from API (#62326 ) * Add is_paused attr to the POST alert rule group endpoint * Add is_paused to alerting API POST alert rule group * Fixed tests * Add is_paused to alerting gettable endpoints * Fix integration tests * Alerting: allow to pause existing rules (#62401) * Display Pause Rule switch in Editing Rule form * add isPaused property to form interface and dto * map isPaused prop with is_paused value from DTO Also update test snapshots * Append '(Paused)' text on alert list state column when appropriate * Change Switch styles according to discussion with UX Also adding a tooltip with info what this means * Adjust styles * Fix alignment and isPaused type definition Co-authored-by: gillesdemey <gilles.de.mey@gmail.com> * Fix test * Fix test * Fix RuleList test --------- Co-authored-by: gillesdemey <gilles.de.mey@gmail.com> * wip * Fix tests and add comments to clarify AlertRuleWithOptionals * Fix one more test * Fix tests * Fix typo in comment * Fix alert rule(s) cannot be paused via API * Add integration tests for alerting api pausing flow * Remove duplicated integration test --------- Co-authored-by: Virginia Cepeda <virginia.cepeda@grafana.com> Co-authored-by: gillesdemey <gilles.de.mey@gmail.com> Co-authored-by: George Robinson <george.robinson@grafana.com>	2023-02-01 13:15:03 +01:00
Alexander Weaver	03f3fbec0d	Alerting: Fix handling of special floating-point cases when writing observed values to annotations (#61074 ) * Fix json serialization of state values * Simplify two of the tests * Fix linter complaint * Don't return error if we fail to look up dashboard, just log it and move on * Address linter complaint	2023-01-31 15:27:51 -06:00
gotjosh	55e7cf1aed	Alerting: Introduce Metric Aggregation starting with Silences (#62512 ) * Alerting: Introduce Metric Aggregation starting with Silences --------- Co-authored-by: Alexander Weaver <weaver.alex.d@gmail.com>	2023-01-31 19:54:38 +00:00
gotjosh	178f290f0c	Update dskit to the latest main (#62616 ) * Update dskit to the latest main * Break free from a cortex depedency	2023-01-31 19:05:49 +00:00
ismail simsek	91221bc436	Expressions: Fixes the issue showing expressions editor (#62510 ) * Use suggested value for uid * update the snapshot * use __expr__ * replace all -100 with __expr__ * update snapshot * more changes * revert redundant change * Use expr.DatasourceUID where it's possible * generate files	2023-01-31 18:50:10 +01:00
Alexander Weaver	e7ace4ed62	Alerting: Allow separate read and write path URLs for Loki state history (#62268 ) Extract config parsing and add tests	2023-01-30 16:30:05 -06:00
Alexander Weaver	b4682fe3cb	Alerting: Configurable externalLabels for Loki state history (#62404 ) * Add config option for external labels * Remove redundant nilcheck	2023-01-30 14:24:45 -06:00
Alex Moreno	7a465f42a6	Alerting: Allow pausing alerts from provisioning (#62263 ) * Allow pausing alerts from provisioning * Update swagger * Add IsPaused to provision export endpoints * Add pause field in sample.yml * Add exception for reset state in first loop iteration of scheduler if rule is paused * Update provision definition and swagger docs * Fix provisioning export tests * Suggestion: Simplify if condition * Add more context to a comment	2023-01-30 16:29:05 +01:00
Ieva	ee3d742c7d	RBAC: inherit folder permissions when resolving managed permissions (#62244 ) * add nested folder scope inheritance to managed permission services * add a more specific erorr * remove circular dependencies * use errutil for returning erorr * fix tests * fix tests * define a new error in ac package	2023-01-30 14:19:42 +00:00
idafurjes	3bda112c5f	Chore: Move search model from models package to search service (#62215 ) * Chore: Move search model from models package to search service * Remove unused imports * Cleanup after merge	2023-01-30 15:17:53 +01:00
Serge Zaitsev	d6d4097567	Chore: Fix goimports grouping in alerting (#62424 ) * fix goimports * fix goimports order	2023-01-30 09:55:35 +01:00
Yuri Tseretyan	0c4671e31f	Alerting: Update historian to ignore transitions from Normal Paused and Updated (#62267 )	2023-01-27 16:26:22 -05:00
gotjosh	3c616da83f	Alerting: Refactor metrics/ngalert.go into seperate files (#62362 ) * Alerting: Refactor metrics/ngalert.go into seperate files	2023-01-27 18:49:49 +00:00
Matthew Jacobson	c006df375a	Alerting: Create endpoints for exporting in provisioning file format (#58623 ) This adds provisioning endpoints for downloading alert rules and alert rule groups in a format that is compatible with file provisioning. Each endpoint supports both json and yaml response types via Accept header as well as a query parameter download=true/false that will set Content-Disposition to recommend initiating a download or inline display. This also makes some package changes to keep structs with potential to drift closer together. Eventually, other alerting file structs should also move into this new file package, but the rest require some refactoring that is out of scope for this PR.	2023-01-27 11:39:16 -05:00
George Robinson	b6db1ed524	Revert "Alerting: Add is_paused attr to the POST alert rule group endpoint" (#62310 ) Revert "Alerting: Add is_paused attr to the POST alert rule group endpoint (#62253)" This reverts commit `3ccafe3a5a`.	2023-01-27 13:41:36 +01:00
Alex Moreno	3ccafe3a5a	Alerting: Add is_paused attr to the POST alert rule group endpoint (#62253 ) Add is_paused attr to the POST alert rule group endpoint	2023-01-27 10:50:06 +01:00
Yuri Tseretyan	05bf241952	Alerting: Update state manager to return StateTransitions when Delete or Reset (#62264 ) * update Delete and Reset methods to return state transitions this will be used by notifier code to decide whether alert needs to be sent or not. * update scheduler to provide reason to delete states and use transitions * update FromAlertsStateToStoppedAlert to accept StateTransition and filter by old state * fixup * fix tests	2023-01-27 09:46:21 +01:00
idafurjes	6c5a573772	Chore: Move ReqContext to contexthandler service (#62102 ) * Chore: Move ReqContext to contexthandler service * Rename package to contextmodel * Generate ngalert files * Remove unused imports	2023-01-27 08:50:36 +01:00
Alex Moreno	531b439cf1	Alerting: Add alert pausing feature (#60734 ) * Add field in alert_rule model, add state to alert_instance model, and state to eval * Remove paused state from eval package * Skip paused alert rules in scheduler * Add migration to add is_paused field to alert_rule table * Convert to postable alerts only if not normal, pernding, or paused * Handle paused eval results in state manager * Add Paused state to eval package * Add paused alerts logic in scheduler * Skip alert on scheduler * Remove paused status from eval package * Apply suggestions from code review Co-authored-by: George Robinson <george.robinson@grafana.com> * Remove state * Rethink schedule and manager for paused alerts * Change return to continue * Remove unused var * Rethink alert pausing * Paused alerts storing annotations * Only add one state transition * Revert boolean method renaming refactor * Revert take image refactor * Make registry errors public * Revert method extraction for getting a folder title * Revert variable renaming refactor * Undo unnecessary changes * Revert changes in test * Remove IsPause check in PatchPartiLAlertRule function * Use SetNormal to set state * Fix text by returning to old behaviour on alert rule deletion * Add test in schedule_unit_test.go to test ticks with paused alerts * Add coment to clarify usage of context.Background() * Add comment to clarify resetStateByRuleUID method usage * Move rule get to a more limited scope * Update pkg/services/ngalert/schedule/schedule.go Co-authored-by: George Robinson <george.robinson@grafana.com> * rum gofmt on pkg/services/ngalert/schedule/schedule.go * Remove defer cancel for context * Update pkg/services/ngalert/models/instance_test.go Co-authored-by: Santiago <santiagohernandez.1997@gmail.com> * Update pkg/services/ngalert/models/testing.go Co-authored-by: Santiago <santiagohernandez.1997@gmail.com> * Update pkg/services/ngalert/schedule/schedule_unit_test.go Co-authored-by: Santiago <santiagohernandez.1997@gmail.com> * Update pkg/services/ngalert/schedule/schedule_unit_test.go Co-authored-by: Santiago <santiagohernandez.1997@gmail.com> * Update pkg/services/ngalert/models/instance_test.go Co-authored-by: Santiago <santiagohernandez.1997@gmail.com> * skip scheduler rule state clean up on paused alert rule * Update pkg/services/ngalert/schedule/schedule.go Co-authored-by: Santiago <santiagohernandez.1997@gmail.com> * Fix mock in test * Add (hopefully) final suggestions * Use error channel from recordAnnotationsSync to cancel context * Run make gen-cue * Place pause alert check in channel update after version check * Reduce branching un update channel select * Add if for error and move code inside if in state manager ResetStateByRuleUID * Add reason to logs * Update pkg/services/ngalert/schedule/schedule.go Co-authored-by: George Robinson <george.robinson@grafana.com> * Do not delete alert rule routine, just exit on eval if is paused * Reduce branching and create-close a channel to avoid deadlocks * Separate state deletion and state reset (includes history saving) * Add current pause state in rule route in scheduler * Split clearState and bring errCh closer to RecordStatesAsync call * Change rule to ruleMeta in RecordStatesAsync * copy state to be able to modify it * Add timeout to context creation * Shorten the timeout * Use resetState is rule is paused and deleteState if rule is not paused * Remove Empty state reason * Save every rule change in historian * Add tests for DeleteStateByRuleUID and ResetStateByRuleUID * Remove useless line * Remove outdated comment Co-authored-by: George Robinson <george.robinson@grafana.com> Co-authored-by: Santiago <santiagohernandez.1997@gmail.com> Co-authored-by: Armand Grillet <2117580+armandgrillet@users.noreply.github.com>	2023-01-26 18:29:10 +01:00
Kristin Laemmert	e8b8a9e276	chore: move dashboard_acl models into dashboard service (#62151 )	2023-01-26 08:46:30 -05:00
gotjosh	0bfe150928	Alerting: Fix Test Receivers when settings are non-strings (#62156 ) * Alerting: Fix Test Receivers when settings are non-strings As part of the Alerting extraction, we want to make sure we don't have circular depedencies. As such, I had to move `PostableGrafanaReceiver` to a new struct in `grafana/alerting` called `GrafanaReceiver`. `PostableGrafanaReceiver` has an attribute called `Settings` that uses a Grafana-propietary struct called `RawMessage`, this struct shadows `json.RawMessage`. When I created `GrafanaReceiver`, I turned settings into a `map[string]string` thinking all settings would end up as strings. This was a mistake, and this test proves that it doesn't work, and breaks the API.	2023-01-26 12:54:03 +00:00
George Robinson	a7eab8e46e	Alerting: Support context.Context in Loki interface (#61979 ) This commit adds support for canceleable contexts in the Loki interface.	2023-01-26 09:31:20 +00:00
Sofia Papagiannaki	cd27562c76	Access control: Modify dashboard/folder resolvers so that return also the inherited scopes (#62025 ) * Access Control: Add folder service dependency to the dashboard/folder resolvers * Expose the function fetching parents to folder interface * Add generic prepend utility * Modify dashboard resolvers to return inherited scopes	2023-01-26 10:21:10 +02:00
Alexander Weaver	eb1293ebb1	Alerting: Re-generate swagger definitions (#62154 ) Regenerate swagger, add body binding so parameters work	2023-01-25 15:01:42 -06:00
Alexander Weaver	046a9bb7c1	Alerting: Copy rule definitions into state history (#62032 ) * Copy rules instead of accepting pointer * Deep-copy the rule, for even more guarantees * Create struct just for needed fields * Move RuleMeta to historian/model package, iron out package dependencies * Move tests for dash ID parsing to model package along with code	2023-01-25 11:29:57 -06:00
idafurjes	b54b80f473	Chore: Remove Result from dashboard models (#61997 ) * Chore: Remove Result from dashboard models * Fix lint tests * Fix dashboard service tests * Fix API tests * Remove commented out code * Chore: Merge main - cleanup	2023-01-25 10:36:26 +01:00
idafurjes	421976e919	Chore: Remove folders from models pkg (#61853 )	2023-01-25 09:14:32 +01:00
Santiago	e5920c211e	Chore: Fix random indices for slices in test files (#61884 ) * Fix random indices for slices in test files * Empty commit	2023-01-24 15:07:37 -03:00
George Robinson	239d94205a	Alerting: Return chan <-error for #61811 (#61858 )	2023-01-24 15:41:38 +00:00
Alexander Weaver	7ccc845187	Alerting: Push state history entries to Loki (#61724 ) * Implement push endpoint * Drop duplicated struct * Genericize auth/tenant headers and improve logging in error case * Flesh out the data model * Drop dead code * Drop log line entirely * Drop unused arg * Rename a few type manipulation functions * Extract label keys as constants * Improve logs when loki responds with error * Inline lokiRepresentation function	2023-01-23 16:31:03 -06:00
Sofia Papagiannaki	c7a7ebd3e0	Chore: Drop search service dependency from folder service (#61789 ) * Chore: Drop search service dependency from folder service	2023-01-23 14:09:09 +02:00
gotjosh	0be920e61c	Alerting: Remove unused code after importing from grafana/alerting (#61869 ) * Alerting: Remove unused code after importing from grafana/alerting	2023-01-23 10:30:10 +00:00
gotjosh	511dab3b4b	Update `grafana/alerting` to the latest main (#61810 ) * Update `grafana/alerting` to the latest main Also updates prometheus-alertmanager since we use that one directly for some structs.	2023-01-19 20:44:49 +00:00
Sofia Papagiannaki	c104cc7020	Chore: Split folder store and dashboard store interfaces (#61655 ) * update folder store mock * Split folder store and dashboard store interfaces	2023-01-19 18:38:07 +02:00
Alexander Weaver	c10713ea76	Alerting: Create query interface for state history along with annotation-based implementation (#61646 )	2023-01-19 10:45:31 +01:00
Yuri Tseretyan	2c46f46d37	Alerting: Rule evaluator to get cached data source info (#61305 ) do not skip cache when get data source info	2023-01-18 14:25:11 -05:00
Jean-Philippe Quéméner	44b11d3228	Alerting: support basic auth for the state history loki client (#61696 )	2023-01-18 20:24:40 +01:00
George Robinson	d4256b352d	Docs: Rename Message templates to Notification templates (#59477 ) This commit renames "Message templates" to "Notification templates" in the user interface as it suggests that these templates cannot be used to template anything other than the message. However, message templates are much more general and can be used to template other fields too such as the subject of an email, or the title of a Slack message.	2023-01-18 17:26:34 +00:00
Sofia Papagiannaki	b80c9bb974	Chore: Drop dashboard service dependency from folder service (#61614 ) * Chore: Drop dashboard dependency from folder service	2023-01-18 17:47:59 +02:00
Santiago	b5fa9e3501	Chore: Fix "manger" typo (#61649 ) fix mangers -> managers	2023-01-17 23:13:27 +00:00
Alexander Weaver	1ac89ea040	Alerting: Add client configuration for remote Loki historian backend and test connection (#61114 ) * Create loki client type and ping method * Expose TestConnection on client * Configure and ping Loki URL * Close response body reader if present * Add 30 second timeout * Remove duplicate close	2023-01-17 13:58:52 -06:00
Kristin Laemmert	f6e3252c00	chore: move notifications models into notifications service (#61638 )	2023-01-17 14:47:31 -05:00
Matthew Jacobson	23e05373a7	Alerting: Fix flaky TestIntegrationUpdateAlertRules (#61641 ) Prevents random OrgID=0 in test alert generation causing invalid alert rule.	2023-01-17 19:09:46 +00:00
Alexander Weaver	4f1bdc0607	Alerting: Skip flaky test in TestIntegrationUpdateAlertRules (#61627 ) * Skip flaky test * Add comment	2023-01-17 10:39:16 -06:00
Denis Limarev	e6dee8a723	Perfomance: Preallocate slices (#61580 )	2023-01-17 11:50:17 +00:00
idafurjes	7c2522c477	Chore: Move dashboard models to dashboard pkg (#61458 ) * Copy dashboard models to dashboard pkg * Use some models from current pkg instead of models * Adjust api pkg * Adjust pkg services * Fix lint	2023-01-16 16:33:55 +01:00
Jo	dcfeab2c73	AuthN: User Quota (#61540 ) * remove reqContext from quota checks in login * add guards for nil ScopeParams	2023-01-16 11:54:15 +01:00

... 3 4 5 6 7 ...

1313 Commits