-
Notifications
You must be signed in to change notification settings - Fork 134
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft Statement of Work - Test reliability lead #1629
Comments
@nodejs/tsc as discussed in the TSC meeting today a first cut at what a statement of work for a test flakiness lead might look like. |
Adding to agenda so that we review/get feedback in a future meeting. |
I think we should explicitly add:
Otherwise this might just optimize towards marking all the tests as flaky and let them rot in the status files, which isn't ideal. |
@joyeecheung updated, thanks for the suggestion. |
I think we should have a more measurable success criteria, such as:
I would structure the agreement as:
Alternatively, if the person is a member of the TSC, I would put at: daily rate of XXX * number of days worked, capped at zzz. |
That would be assuming the list of tests doesn't change while this is happening, which is unlikely to be true. Other contributors can always alter the tests as necessary or mark tests as flaky as they see fit, or add more tests while all these are happening, and could use some eyes watching the status of the new tests or otherwise new flakes still come up and we won't be much better off. What we care about is whether the overall situation improves, while the situation isn't always static without the hired individual doing anything. Also, identifying this list is also non-trivial work, especially when it could be challenging to triage and identify a correct list. It wouldn't be too meaningful if the list contains a lot of false positives, yet eliminating false positives can already be difficult enough. If we want a quantitative measurement, then I think the rate of a passing node-test-commit CI on the main branch is already enough (which has been around 0% for some time). |
I also think we want somebody who will do more than just fix specific tests, helping to improve how we manage and resolve flaky tests though automation and tools is just as important as fixing specific flaky tests. |
I've not been privy to this conversation so I might be missing the mark with this comment, but I think it will be more understandable and more professional-sounding if you call it a "test reliability lead" rather than a "test flakiness lead". In a formal/professional document, I wouldn't refer to "flakiness" but instead refer to "reliability" (or "unreliability"). |
@Trott like that suggestion, incorporated. |
The test flakiness lead will be expected to:
Duration
Success looks like
The text was updated successfully, but these errors were encountered: