Model Monitoring Inference Aggregator block #818

robiscoding · 2024-11-15T19:28:04Z

Description

Currently Model Monitoring does not support for inferences made by InferencePipeline. To add support, this Model Monitoring Inference Aggregator block allows users to arbitrarily send an aggregated and consolidated set of inference results to MM at a defined frequency (in seconds)

Type of change

Please delete options that are not relevant.

Bug fix (non-breaking change which fixes an issue)
New feature (non-breaking change which adds functionality)
This change requires a documentation update

How has this change been tested, please provide a testcase or example of how you tested the change?

Tested using local workflow run when frequency is and is not in range for reporting

Any specific deployment considerations

Depends on https://github.com/roboflow/roboflow/pull/4918

Docs

Docs updated? What were the changes:

robiscoding · 2024-11-15T19:30:21Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+        description="Reference data to extract property from",
+        examples=["$steps.my_step.predictions"],
+    )
+    frequency: Union[


I considered using the RateLimiter block, but this export block would always require that as a dependency. I'm not sure if there's a pattern for making another block required to run, so I just added this to the params for now, open to doing it another way

name is misleading, as it suggests compaction of predictions to be sent in batches, which does not take place

Yeah maybe something like SamplePredictions, ReportPredictionsSample, ModelMonitoringPredictionsSampler

robiscoding · 2024-11-15T19:32:10Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+                "model_type": detections.data.get("prediction_type", [""])[i],
+            }
+            results.append(formatted_det)
+    elif isinstance(detections, dict):


What case would predictions be returned as a dict? I saw in other Blocks that predictions will either be of type sv.Detections or dict

PawelPeczek-Roboflow

I have general concern with this PR - what is the quality of data acquired in that way? Is that meaningful monitoring at the end of the day? What do we want to display as the result of running monitoring about a week-long stream? How would people understand that something is wrong with the model?

I mean, we have stream running and we sub sample of predictions based on time, suggesting to update once per few seconds. We do it as this is basically unfeasible to push through the wire all predictions - but maybe - in this scenario it makes sense to compute aggregates of predictions on the client end and submit compacted results once per interval?

PawelPeczek-Roboflow · 2024-11-18T10:52:47Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+    def get_manifest(cls) -> Type[WorkflowBlockManifest]:
+        return BlockManifest
+
+    def is_in_reporting_range(self, frequency: int) -> bool:


please move into usage first, then declaration (run(...) method first

PawelPeczek-Roboflow · 2024-11-18T10:52:56Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+    def is_in_reporting_range(self, frequency: int) -> bool:
+        now = datetime.now()
+        last_report_time_str = self._cache.get(LAST_REPORT_TIME_CACHE_KEY)
+        print(


please remove print statements

PawelPeczek-Roboflow · 2024-11-18T10:53:27Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+        self,
+        fire_and_forget: bool,
+        predictions: Union[sv.Detections, dict],
+        frequency: int = 3,


defaults should not be set here, but in manifest

PawelPeczek-Roboflow · 2024-11-18T10:53:50Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+BLOCK_NAME = "Roboflow Model Monitoring Exporter"
+
+
+class BlockManifest(WorkflowBlockManifest):


please follow migration guide: https://inference.roboflow.com/workflows/execution_engine_changelog/#execution-engine-v130-inference-v0270

PawelPeczek-Roboflow · 2024-11-18T10:54:10Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+    )
+    type: Literal[
+        "roboflow_core/roboflow_model_monitoring_exporter@v1",
+        BLOCK_NAME,


please remove name aliasing, this is not needed for new blocks

PawelPeczek-Roboflow · 2024-11-18T10:55:00Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+                "error_status": False,
+                "message": "Not in reporting range, skipping report. (Ok)",
+            }
+        if self._api_key is None:


should be validated first - fail fast approach - clearer error handling

PawelPeczek-Roboflow · 2024-11-18T10:56:30Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+
+
+# TODO: maybe make this a helper or decorator, it's used in multiple places
+def get_workspace_name(


I will remember about this and if I see re-ocurring pattern of usage, I will move to common inference utils

PawelPeczek-Roboflow · 2024-11-18T10:57:27Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+        description="Reference data to extract property from",
+        examples=["$steps.my_step.predictions"],
+    )
+    frequency: Union[


name is misleading, as it suggests compaction of predictions to be sent in batches, which does not take place

PawelPeczek-Roboflow · 2024-11-18T10:59:36Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_exporter/v1.py

+
+    def is_in_reporting_range(self, frequency: int) -> bool:
+        now = datetime.now()
+        last_report_time_str = self._cache.get(LAST_REPORT_TIME_CACHE_KEY)


global key? how would that work on hosted platform?

robiscoding · 2024-11-18T15:12:24Z

I have general concern with this PR - what is the quality of data acquired in that way? Is that meaningful monitoring at the end of the day? What do we want to display as the result of running monitoring about a week-long stream? How would people understand that something is wrong with the model?

This is particularly meant for inferences made by InferencePipeline in edge deployments. Model Monitoring currently does not get any data from InferencePipeline. This is because of the large volume of requests made by IP would overload our system. So the goal here is not to have comprehensive inference results data or an aggregation of it (that's coming later), but more of a health/status check at a regular interval that IP is running and making inferences. Is there a recommended way of doing this?

I mean, we have stream running and we sub sample of predictions based on time, suggesting to update once per few seconds. We do it as this is basically unfeasible to push through the wire all predictions - but maybe - in this scenario it makes sense to compute aggregates of predictions on the client end and submit compacted results once per interval?

Aggregating is fine, but we were trying to defer doing aggregations until we had a better idea of what type of aggregation makes sense for this use case. I thought the Aggregator block could be used in conjunction with this but maybe not?

PawelPeczek-Roboflow · 2024-11-19T09:48:31Z

is model monitoring in given form even acceptable to run on video?
I would say we should disable it by default given the performance penalty

PawelPeczek-Roboflow · 2024-11-21T12:55:59Z

Today we need to close the topic, otherwise release 0.28.0 label will be removed

PawelPeczek-Roboflow · 2024-11-21T13:54:30Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_inference_aggregator/v1.py

+        self._cache = cache
+        self._background_tasks = background_tasks
+        self._thread_pool_executor = thread_pool_executor
+        self._last_report_time_cache_key = "roboflow_model_monitoring_last_report_time"


that would have a side effect for all blocks of this type to have shared state of last report time - is that desired end?

Oh no, the intent was for each instance of the block to have it's own last_report_time. Should I add a random string to the key name to make that happen?

if the expectation is that it works at hosted, I would use redis with the following structure of keys workflows:steps_cache:roboflow_core/model_monitoring_inference_aggregator@v1:{uuid4()}:last_report_time

PawelPeczek-Roboflow · 2024-11-21T13:55:12Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_inference_aggregator/v1.py

+            last_report_time = now
+        else:
+            last_report_time = datetime.fromisoformat(last_report_time_str)
+        time_elapsed = int((now - last_report_time).total_seconds())


type conversion to int seems not to be needed

PawelPeczek-Roboflow · 2024-11-21T13:56:48Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_inference_aggregator/v1.py

+        )
+
+
+def format_sv_detections_for_model_monitoring(


please change the name to reflect that sv.Detections and cls results are handled

PawelPeczek-Roboflow · 2024-11-21T13:58:48Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_inference_aggregator/v1.py

+) -> List[Prediction]:
+    results = []
+    if isinstance(detections, sv.Detections):
+        num_detections = len(detections.data.get("detection_id", []))


there is an iter in sv.Detections

for detection in detections: would be easier and less error-prone with getting the list of size 1, that seems to fail with strange error if other blocks break the contract

PawelPeczek-Roboflow · 2024-11-21T14:03:26Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_inference_aggregator/v1.py

+        if system_info:
+            for key, value in system_info.items():
+                inference_data[key] = value
+        inference_data["inference_results"] = predictions


I have a strange feeling it may fail on serialisation - predictions is List of dataclasses which does not get automatically converted into json afaik

PawelPeczek-Roboflow · 2024-11-21T14:04:29Z

inference/core/workflows/core_steps/sinks/roboflow/model_monitoring_inference_aggregator/v1.py

+LONG_DESCRIPTION = """
+This block periodically reports an aggregated sample of inference results to Roboflow Model Monitoring.
+
+It aggregates predictions in memory between reports and then sends a representative sample of predictions at a regular interval specified by the `frequency` parameter.


I would elaborate on what we understand by representative - seems like "most confident for given class"

…d bump version

PawelPeczek-Roboflow

Approving under following conditions (also reported in docs):

🚨 Limitations

The block is should not be relied on when running Workflow in inference server or via HTTP request to Roboflow
hosted platform, as the internal state is not persisted in a memory that would be accessible for all requests to
the server, causing aggregation to only have a scope of single request. We will solve that problem in future
releases if proven to be serious limitation for clients.
This block do not have ability to separate aggregations for multiple videos processed by InferencePipeline -
effectively aggregating data for all video feeds connected to single process running InferencePipeline.

@robiscoding in the interest of having the feature I will push the PR, also fixing tests that were broken, but I will hold you responsible for any follow-up changes if there are bug / problem reports.

robiscoding · 2024-11-22T13:55:29Z

Approving under following conditions (also reported in docs):

🚨 Limitations

The block is should not be relied on when running Workflow in inference server or via HTTP request to Roboflow
hosted platform, as the internal state is not persisted in a memory that would be accessible for all requests to
the server, causing aggregation to only have a scope of single request. We will solve that problem in future
releases if proven to be serious limitation for clients.

This block do not have ability to separate aggregations for multiple videos processed by InferencePipeline -
effectively aggregating data for all video feeds connected to single process running InferencePipeline.

@robiscoding in the interest of having the feature I will push the PR, also fixing tests that were broken, but I will hold you responsible for any follow-up changes if there are bug / problem reports.

We can work with those current limitations as the block will still provide us with some observability of inferencepipeline running in edge environments. And yes we'll be iterating on this functionality so let me know if any issues come up in the interim, thanks for the thorough review!

robiscoding added 6 commits November 14, 2024 10:40

WIP

a7412fd

Merge branch 'main' into inference-pipline-mm-support

2a2b0ad

Merge branch 'main' into inference-pipline-mm-support

d210a43

Add core logic to report to MM

ae9d09d

Add unit tests

bf94c8a

Merge branch 'main' into inference-pipline-mm-support

0493001

robiscoding requested review from PawelPeczek-Roboflow, grzegorz-roboflow, yeldarby, probicheaux and hansent as code owners November 15, 2024 19:28

Lint

98dba82

robiscoding commented Nov 15, 2024

View reviewed changes

PawelPeczek-Roboflow requested changes Nov 18, 2024

View reviewed changes

robiscoding marked this pull request as draft November 18, 2024 15:42

PawelPeczek-Roboflow added the release 0.28.0 label Nov 21, 2024

robiscoding added 2 commits November 21, 2024 08:34

Refactor as MM Inference Aggregator

b0dabc9

Merge branch 'main' into model-monitoring-export-block

0b8f4b3

robiscoding marked this pull request as ready for review November 21, 2024 13:35

Minor copy change and remove additional api key check

6cfaafe

robiscoding changed the title ~~Model monitoring export block~~ Model Monitoring Inference Aggregator block Nov 21, 2024

Make tests not flaky

38cee36

PawelPeczek-Roboflow reviewed Nov 21, 2024

View reviewed changes

robiscoding and others added 2 commits November 21, 2024 11:46

Add unique aggregator key

4417c2f

Merge branch 'main' into model-monitoring-export-block

a3584c3

robiscoding requested a review from PawelPeczek-Roboflow November 21, 2024 19:30

PawelPeczek-Roboflow added 2 commits November 22, 2024 11:28

Merge branch 'main' into model-monitoring-export-block

b55927d

Apply changes into block description, fix tests and styling issues an…

96817a5

…d bump version

PawelPeczek-Roboflow approved these changes Nov 22, 2024

View reviewed changes

Merge branch 'main' into model-monitoring-export-block

538eda1

grzegorz-roboflow approved these changes Nov 22, 2024

View reviewed changes

PawelPeczek-Roboflow merged commit f3378be into main Nov 22, 2024
71 checks passed

PawelPeczek-Roboflow deleted the model-monitoring-export-block branch November 22, 2024 11:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model Monitoring Inference Aggregator block #818

Model Monitoring Inference Aggregator block #818

robiscoding commented Nov 15, 2024 •

edited

Loading

robiscoding Nov 15, 2024

PawelPeczek-Roboflow Nov 18, 2024

robiscoding Nov 18, 2024

robiscoding Nov 15, 2024

PawelPeczek-Roboflow left a comment

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

PawelPeczek-Roboflow Nov 18, 2024

robiscoding commented Nov 18, 2024

PawelPeczek-Roboflow commented Nov 19, 2024

PawelPeczek-Roboflow commented Nov 21, 2024

PawelPeczek-Roboflow Nov 21, 2024

robiscoding Nov 21, 2024

PawelPeczek-Roboflow Nov 21, 2024

PawelPeczek-Roboflow Nov 21, 2024

PawelPeczek-Roboflow Nov 21, 2024

PawelPeczek-Roboflow Nov 21, 2024

PawelPeczek-Roboflow Nov 21, 2024

PawelPeczek-Roboflow Nov 21, 2024

PawelPeczek-Roboflow left a comment

robiscoding commented Nov 22, 2024

🚨 Limitations

		BLOCK_NAME = "Roboflow Model Monitoring Exporter"


		class BlockManifest(WorkflowBlockManifest):



		# TODO: maybe make this a helper or decorator, it's used in multiple places
		def get_workspace_name(

Model Monitoring Inference Aggregator block #818

Model Monitoring Inference Aggregator block #818

Conversation

robiscoding commented Nov 15, 2024 • edited Loading

Description

Type of change

How has this change been tested, please provide a testcase or example of how you tested the change?

Any specific deployment considerations

Docs

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PawelPeczek-Roboflow left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

robiscoding commented Nov 18, 2024

PawelPeczek-Roboflow commented Nov 19, 2024

PawelPeczek-Roboflow commented Nov 21, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PawelPeczek-Roboflow left a comment

Choose a reason for hiding this comment

🚨 Limitations

robiscoding commented Nov 22, 2024

🚨 Limitations

robiscoding commented Nov 15, 2024 •

edited

Loading