feat(anomaly detection): Dynamic Window for Matrix Profiling #1451

aayush-se · 2024-11-19T00:22:57Z

Use 2 windows (1 determined by the SuSS algorithm, and a "fixed" window at size 10) to perform detection reducing the recovery time for the algorithm ultimately making alerts persist for a shorter period of time after anomalous behavior is over
Performing detection twice for each window to maintain the matrix profiles
Updated the anomaly_algo_data to store matrix profile related information for both windows for a given alert
Created MPTimeSeriesAnomaliesSingleWindow class and updated the MPTimeSeriesAnomalies class to effectively manage and store the proper matrix profile related information and the boolean for determining which window logic to use
Recalculates both matrix profiles and update the respective rows if they are not present to ensure backwards compatibility with the older MPTimeSeriesAnomalies class
Updated current tests to match the new classes and their fields along with shapes for anomaly_algo_data

aayush-se · 2024-11-19T00:51:44Z

Waiting for codecov to give report -- will update test coverage upon receiving

ram-senth · 2024-11-19T16:56:54Z

src/seer/anomaly_detection/accessors.py

-                if original_flag is None:
-                    original_flag = "none"
-                original_flags.append(original_flag)
+                algo_data = MPTimeSeriesAnomalies.extract_algo_data(point.anomaly_algo_data)


Seems like this is not backward compatible as both mp_suss and mp_fixed are not there. So detection for existing alerts in production will fail, right?

This was one thing I wanted to ask about -- if we were to use this class for prod, could we rerun calculation to populate the objects appropriately?

Because to get the algo data for both suss and fixed, we would need to store the MP as a field in the object

I think it will be best to just let the recalculation happen when data pruning happens. So this code should be backward compatible. We will need an unit test to check that. Also, test it locally with an alert pre-populated.

Sure, will include those tests and confirm through local testing

ram-senth · 2024-11-19T17:38:49Z

src/seer/anomaly_detection/anomaly_detection.py

+            convert_external_ts_to_internal(timeseries), config, window_size=window_size
+        )
+        anomalies_fixed = batch_detector.detect(
+            convert_external_ts_to_internal(timeseries), config, window_size=10


We should add this 10 as a constant to the MPConfig class?

ram-senth · 2024-11-21T19:36:53Z

src/seer/anomaly_detection/accessors.py

+            and "mp_suss" not in timeseries[-1].anomaly_algo_data
+            and "mp_fixed" not in timeseries[-1].anomaly_algo_data
+        ):
+            timeseries = self._recalculate_batch_detection(db_alert)


So we recalculate but we do not save to the database here. However, the new time step that will be stored in the db will have these additional information. So next detect call will actually fail, right?

Within _recalculate_batch_detection, another function update_timeseries is called that directly updates the DbDynamicAlertTimeSeries with the new anomaly_algo_data

…pruning

ram-senth · 2024-11-24T15:05:44Z

I'm good with the changes. Just wanted to confirm that you tested locally with a saved alert that does not have the fixed window history to confirm the backward compatibility. We do not want production failures when we deploy. Also, seems like the unit test coverage needs increasing. There are a few low hanging fruits like testing for length mismatch etc that should take it over the goal.

aayush-se requested a review from a team as a code owner November 19, 2024 00:22

aayush-se force-pushed the anomaly-detection/dynamic-window branch from cc46826 to bf302c1 Compare November 19, 2024 00:41

aayush-se requested a review from ram-senth November 19, 2024 00:51

ram-senth reviewed Nov 19, 2024

View reviewed changes

aayush-se added 16 commits November 21, 2024 09:56

Update structure of algo_data

e616467

conflicts

bf8e40d

Update params for timeseries object

25420df

fix duplciate variable

d1ccf37

Resolve

84d8ce0

WIP fix bugs

fd23ed1

Ensure shapes are correct during detection

b418b67

Resolve

7f1e62e

mypy fixes

bf0d5f9

Resolve

15d3bd3

Ensure backward compatibility with anomaly detection classes

2ae5223

Recalculate batch

7ad8970

test print

ed630d3

Resolve

41d6aa6

Use MPConfig and increase test coverage

a8c86fa

Remove extra comments

29d9139

aayush-se force-pushed the anomaly-detection/dynamic-window branch from 66e45c6 to 29d9139 Compare November 21, 2024 18:02

Test recalculation

a32f3bf

ram-senth reviewed Nov 21, 2024

View reviewed changes

aayush-se added 2 commits November 21, 2024 16:17

Use dep injection for config and ensure fixed window is recalced for …

f3839de

…pruning

Skip fixed window for backwards compatibility and update tests

23633f3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(anomaly detection): Dynamic Window for Matrix Profiling #1451

feat(anomaly detection): Dynamic Window for Matrix Profiling #1451

aayush-se commented Nov 19, 2024 •

edited

Loading

aayush-se commented Nov 19, 2024

ram-senth Nov 19, 2024

aayush-se Nov 19, 2024 •

edited

Loading

aayush-se Nov 19, 2024

ram-senth Nov 19, 2024 •

edited

Loading

aayush-se Nov 19, 2024

ram-senth Nov 19, 2024

ram-senth Nov 21, 2024

aayush-se Nov 21, 2024

ram-senth commented Nov 24, 2024

feat(anomaly detection): Dynamic Window for Matrix Profiling #1451

Are you sure you want to change the base?

feat(anomaly detection): Dynamic Window for Matrix Profiling #1451

Conversation

aayush-se commented Nov 19, 2024 • edited Loading

aayush-se commented Nov 19, 2024

ram-senth Nov 19, 2024

Choose a reason for hiding this comment

aayush-se Nov 19, 2024 • edited Loading

Choose a reason for hiding this comment

aayush-se Nov 19, 2024

Choose a reason for hiding this comment

ram-senth Nov 19, 2024 • edited Loading

Choose a reason for hiding this comment

aayush-se Nov 19, 2024

Choose a reason for hiding this comment

ram-senth Nov 19, 2024

Choose a reason for hiding this comment

ram-senth Nov 21, 2024

Choose a reason for hiding this comment

aayush-se Nov 21, 2024

Choose a reason for hiding this comment

ram-senth commented Nov 24, 2024

aayush-se commented Nov 19, 2024 •

edited

Loading

aayush-se Nov 19, 2024 •

edited

Loading

ram-senth Nov 19, 2024 •

edited

Loading