feat(anomaly detection): Cron to cleanup disabled alerts #1455

aayush-se · 2024-11-19T18:43:55Z

Currently, cleanup only happens when detection is called, so stale timeseries data for disabled alerts accumulates.
Created a cron that runs weekly to clean up (delete) disabled alerts. An alert is treated as disabled if it was last queued for detection more than 28 days ago; old alerts whose last_queued_at is null are deleted as well.
Deleting an alert also cleans up the timeseries associated with it because of the parent-child relationship.

ram-senth · 2024-11-21T00:27:14Z

src/seer/anomaly_detection/tasks.py

+@sentry_sdk.trace
+def cleanup_disabled_alerts():
+
+    logger.info("Cleaning up timeseries data for alerts that have been inactive for over 28 days")


By inactive we mean that anomaly detection was not called for the alert, right? Do we want to make that explicit here?

ram-senth · 2024-11-21T00:28:40Z

src/seer/anomaly_detection/tasks.py

+        # Get all alerts that haven't been queued in the last 28 days indicating that they are disabled and are safe to cleanup
+        alerts = (
+            session.query(DbDynamicAlert)
+            .filter(DbDynamicAlert.last_queued_at < date_threshold)


Do we need to handle the edge case where the last_queued_at is null, as will be the case until the first detection happens?

The default value for last_queued_at is the current time, so that value should be filled in upon creation of the alert regardless of detection.

Actually, I will include a null check here, because old alerts that were created before the data pruning logic was implemented likely have some null values.
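
To illustrate why the null check matters: in SQL, a comparison such as `last_queued_at < threshold` never matches rows where the column is NULL, so legacy alerts would be skipped silently. The sketch below is not the code from this PR. It uses the standard-library `sqlite3` module in place of the SQLAlchemy models, and the `alerts` table and `find_stale_alert_ids` helper are hypothetical names chosen for the example.

```python
import sqlite3
from datetime import datetime, timedelta


def find_stale_alert_ids(conn, date_threshold, include_null=True):
    """Return ids of alerts last queued before date_threshold.

    Hypothetical helper: when include_null is True, rows with a NULL
    last_queued_at (legacy alerts) are also treated as stale.
    """
    query = "SELECT id FROM alerts WHERE last_queued_at < ?"
    if include_null:
        query += " OR last_queued_at IS NULL"
    query += " ORDER BY id"
    # ISO-8601 strings sort chronologically, so a text comparison is enough here.
    rows = conn.execute(query, (date_threshold.isoformat(),)).fetchall()
    return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE alerts (id INTEGER PRIMARY KEY, last_queued_at TEXT)")

now = datetime(2024, 11, 21)
conn.executemany(
    "INSERT INTO alerts VALUES (?, ?)",
    [
        (1, (now - timedelta(days=29)).isoformat()),  # inactive for over 28 days
        (2, (now - timedelta(days=1)).isoformat()),   # recently queued
        (3, None),                                    # legacy row, never populated
    ],
)

date_threshold = now - timedelta(days=28)
without_null_check = find_stale_alert_ids(conn, date_threshold, include_null=False)
with_null_check = find_stale_alert_ids(conn, date_threshold)
```

Without the extra `IS NULL` clause, alert 3 is never selected, which is the case the null check is meant to cover.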

ram-senth · 2024-11-21T00:29:50Z

src/seer/anomaly_detection/tasks.py

+
+        deleted_count = 0
+        for alert in alerts:
+            deleted_count += delete_old_timeseries_points(alert, date_threshold.timestamp())


So we just delete the time series points but the actual entry in the alert table will still remain? Should we delete that as well?

I left it in case they re-enable the alert. For example, if they go from dynamic --> static --> dynamic, the alert entry should still be in the table, right? But if deleting the alert and then re-creating it is fine, I can also delete the alert itself within this cron.

Yes, it will recreate the alert if revived. So we should delete it as well.

The cron now deletes the alerts directly, which in turn also deletes the associated timeseries data from DbDynamicAlertTimeSeries.
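
As a rough illustration of the parent-child behaviour described above, the sketch below shows a delete on the parent table cascading to its child rows. It is an assumption-laden stand-in, not the PR's implementation: it uses `sqlite3` with an `ON DELETE CASCADE` foreign key rather than the actual `DbDynamicAlert` and `DbDynamicAlertTimeSeries` models, and the table and column names are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# SQLite only enforces foreign keys when this is enabled on the connection.
conn.execute("PRAGMA foreign_keys = ON")

conn.execute("CREATE TABLE alerts (id INTEGER PRIMARY KEY)")
conn.execute(
    """
    CREATE TABLE timeseries (
        id INTEGER PRIMARY KEY,
        alert_id INTEGER NOT NULL REFERENCES alerts(id) ON DELETE CASCADE,
        value REAL
    )
    """
)

conn.executemany("INSERT INTO alerts (id) VALUES (?)", [(1,), (2,)])
conn.executemany(
    "INSERT INTO timeseries (alert_id, value) VALUES (?, ?)",
    [(1, 0.5), (1, 0.7), (2, 1.2)],
)

# Deleting the parent alert removes its timeseries points as well.
deleted_alerts = conn.execute("DELETE FROM alerts WHERE id = ?", (1,)).rowcount
remaining_points = conn.execute("SELECT alert_id FROM timeseries").fetchall()
```

Only the points belonging to alert 2 remain after the delete, so no separate pass over the timeseries table is needed in this setup.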

ram-senth · 2024-11-21T00:31:07Z

tests/seer/anomaly_detection/test_cleanup_tasks.py

+                    .one_or_none()
+                )
+                assert alert is not None
+                alert.last_queued_at = datetime.now() - timedelta(days=29)


We should add a unit test that includes one alert with last_queued_at as null

ram-senth · 2024-11-21T20:11:53Z

src/seer/anomaly_detection/tasks.py

+def cleanup_disabled_alerts():
+
+    logger.info(
+        "Cleaning up timeseries data for alerts that have been inactive (detection has not been run) for over 28 days"


Minor nit - add date_threshold in this info message
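
One possible shape for that message is sketched below. The `build_cleanup_message` helper and the exact wording are assumptions made for illustration; the merged code may phrase it differently.

```python
import logging
from datetime import datetime, timedelta

logger = logging.getLogger(__name__)


def build_cleanup_message(date_threshold: datetime) -> str:
    """Hypothetical helper: format the cleanup log line with the cutoff included."""
    return (
        "Cleaning up alerts that have been inactive (detection has not been run) "
        f"since {date_threshold.isoformat()}"
    )


# 28-day window, as described in the PR; the reference date is fixed for the example.
date_threshold = datetime(2024, 11, 21) - timedelta(days=28)
message = build_cleanup_message(date_threshold)
logger.info(message)
```

Including the cutoff makes it possible to tell from the logs alone which alerts a given run considered inactive.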

Create cron to delete timestamps from disabled alerts

83fb731

aayush-se requested a review from a team as a code owner November 19, 2024 18:43

aayush-se requested a review from ram-senth November 19, 2024 18:44

Fix celery tests

a8bf43c

ram-senth reviewed Nov 21, 2024


Delete disabled alerts and update tests

e15f312

aayush-se changed the title ~~feat(anomaly detection): Cron to cleanup timeseries from disabled alerts~~ feat(anomaly detection): Cron to cleanup disabled alerts Nov 21, 2024

Fully ensure timeseries for deleted alerts are also purged

a52053d

ram-senth approved these changes Nov 21, 2024


Include date_threshold in log

da00d16

aayush-se merged commit f0f7c7f into main Nov 21, 2024
11 checks passed

aayush-se deleted the anomaly-detection/cron-disabled-alerts branch November 21, 2024 21:21
