[rush] Differentiate remote and local execution in telemetry. #4755

aramissennyeydd · 2024-05-30T21:00:39Z

Summary

Fixes #4737. My goal is to address the data skew questions before we go ahead with #4680, which only adjusts the data skew.

Details

There is currently no good way to determine whether telemetry for an operation was generated on the current machine or on a remote machine. This is likely to cause data skew, depending on how you ingest the Rush telemetry. Either:

You take the duration from nonCachedDurationMs, which produces multiple events with nearly the same duration (+/- a few milliseconds) if you emit events from each cobuild agent. That skews averages and other aggregates.
You calculate the duration from startTimestampMs and endTimestampMs, which causes massive spikes in the durations collected across your agents, as every agent except the primary reports 0.05s and the primary agent reports 15.00s. That also skews averages during aggregation.
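To make the two failure modes concrete, here is a minimal sketch with invented numbers. The `OperationTelemetry` interface is a simplified, hypothetical stand-in that only carries the three fields named above; it is not the actual Rush type, and the three reports are assumed to come from three cobuild agents describing the same operation.

```typescript
// Hypothetical, simplified per-agent telemetry record for one operation.
// Only the fields mentioned in the description are modeled here.
interface OperationTelemetry {
  startTimestampMs: number;
  endTimestampMs: number;
  nonCachedDurationMs: number;
}

function mean(values: number[]): number {
  return values.reduce((sum, value) => sum + value, 0) / values.length;
}

// Invented data: the first agent actually builds the operation (15 s),
// the other two only restore its state (0.05 s wall clock each).
const reportsForSameOperation: OperationTelemetry[] = [
  { startTimestampMs: 0, endTimestampMs: 15000, nonCachedDurationMs: 15000 },
  { startTimestampMs: 0, endTimestampMs: 50, nonCachedDurationMs: 15002 },
  { startTimestampMs: 0, endTimestampMs: 50, nonCachedDurationMs: 14998 }
];

// Option 2: durations from timestamps mix one real build with two restores.
const wallClockDurationsMs: number[] = reportsForSameOperation.map(
  (report) => report.endTimestampMs - report.startTimestampMs
);
const meanWallClockMs: number = mean(wallClockDurationsMs);

// Option 1: nonCachedDurationMs yields three near-identical events
// for what was a single real build.
const nonCachedEventCount: number = reportsForSameOperation.length;
const meanNonCachedMs: number = mean(
  reportsForSameOperation.map((report) => report.nonCachedDurationMs)
);
```

In this toy data set the timestamp-based mean lands far below the one real 15 s build, while the nonCachedDurationMs approach keeps the mean close to 15 s but counts the same build three times.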

I propose a new wasExecutedOnThisMachine flag that monorepo maintainers can then use in their plugins to decide whether or not they want to process the given operation's data.
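As a rough sketch of how a plugin might consume the proposed flag, the helper below keeps only the operations that report `wasExecutedOnThisMachine` as true. The `OperationResult` shape and the `selectLocallyExecuted` helper are hypothetical and exist only for this illustration; how a real telemetry plugin receives its data is not shown in this PR.

```typescript
// Hypothetical, simplified operation result. Only the proposed flag and one
// duration field are modeled; the real telemetry record has more fields.
interface OperationResult {
  nonCachedDurationMs?: number;
  wasExecutedOnThisMachine?: boolean;
}

// Keep only operations that this machine reports as executed locally.
// Results without the flag are dropped here; that is a choice made for this
// sketch, and a maintainer could just as well decide to keep them.
function selectLocallyExecuted(
  results: Record<string, OperationResult>
): Record<string, OperationResult> {
  const local: Record<string, OperationResult> = {};
  for (const name of Object.keys(results)) {
    if (results[name].wasExecutedOnThisMachine === true) {
      local[name] = results[name];
    }
  }
  return local;
}

// Invented sample data keyed by operation name.
const sampleResults: Record<string, OperationResult> = {
  "project-a (build)": { nonCachedDurationMs: 15000, wasExecutedOnThisMachine: true },
  "project-b (build)": { nonCachedDurationMs: 9000, wasExecutedOnThisMachine: false },
  "project-c (build)": { nonCachedDurationMs: 4000 }
};

const localOnly: Record<string, OperationResult> = selectLocallyExecuted(sampleResults);
```

If every agent applies a filter like this before emitting events, each operation should be counted once, by the agent that actually ran it.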

How it was tested

Tested in this repository, using the sharded-repo sandbox.

Impacted documentation

Anything where Rush describes writing your own telemetry plugin.


dmichon-msft · 2024-05-31T01:37:58Z

libraries/rush-lib/src/cli/scriptActions/PhasedScriptAction.ts

+            wasExecutedOnThisMachine:
+              !operationResult.cobuildRunnerId ||
+              operationResult.cobuildRunnerId === cobuildConfiguration?.cobuildRunnerId,


Strictly speaking, does a replay from cache count as "executed on this machine"?

aramissennyeydd · May 31, 2024

Open to other names here. Strictly speaking, kind of and kind of not, right? Cobuilds are intended to be state restores across machines, as though the operation was built on any given cobuild agent. But according to the data, that build wasn't executed on this machine, because the cobuild runner ID from the state file and the one in this machine's cobuild config don't match.
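For reference while discussing the name, the condition in the diff above can be read as a small standalone predicate. This is only a restatement of that expression with hypothetical parameter names, not code from the PR; in particular, it shows that a result carrying no runner ID at all evaluates to true, whatever produced it.

```typescript
// Restates the boolean expression from the diff as a pure function.
// `operationRunnerId` stands in for operationResult.cobuildRunnerId and
// `localRunnerId` for cobuildConfiguration?.cobuildRunnerId; both parameter
// names are chosen for this sketch.
function wasExecutedOnThisMachine(
  operationRunnerId: string | undefined,
  localRunnerId: string | undefined
): boolean {
  // No runner ID recorded on the result: treated as local.
  // Otherwise the recorded runner must match this machine's runner.
  return !operationRunnerId || operationRunnerId === localRunnerId;
}

const noRunnerRecorded: boolean = wasExecutedOnThisMachine(undefined, "agent-a");
const sameRunner: boolean = wasExecutedOnThisMachine("agent-a", "agent-a");
const otherRunner: boolean = wasExecutedOnThisMachine("agent-b", "agent-a");
const noLocalConfig: boolean = wasExecutedOnThisMachine("agent-b", undefined);
```

Only the last two cases, where a runner ID was recorded and differs from the local one, come out as false.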

aramissennyeydd added 3 commits May 30, 2024 16:31

feat(cobuilds,telemetry): allow differentiating telemetry from operat…

c3cb846

…ions that were executed remotely and locally

add changeset

1fe7d79

revert adding telemetry

92bb11b

aramissennyeydd requested review from iclanton, octogonz, apostolisms, D4N14L and dmichon-msft as code owners May 30, 2024 21:00

undeprecate nonCachedDurationMs

05e6c95

iclanton approved these changes May 31, 2024


dmichon-msft reviewed May 31, 2024

