Add automatic benchmarks to Github Actions #103

nukelet · 2024-03-13T09:24:59Z

This is a WIP attempt at implementing automatic benchmarks in the project's CI/CD pipeline (#45). This should prove useful for catching potential performance regressions in the future.

I tried to achieve this by launching a headless sway session (using WLR_BACKENDS=headless) with a config that creates a virtual, headless output that wayshot can capture. We then run wayshot to take a screenshot and use flamegraph to plot a heatmap of the functions in wayshot that used the most CPU time, and also run a few benchmarks with hyperfine. The results of both are then saved as artifacts for later inspection.

Currently there are three main issues:

The flamegraph plot is currently missing some symbols (probably due to misconfigured kernel options in the runner's VM)
We probably would need to use a tool such as bencher to actually keep track of past benchmark results and issue alerts when a significant performance regression is detected
I'm not entirely sure whether running these benchmarks in a Github Actions runner is the best idea, maybe a self-hosted runner would be better (more deterministic)?

Here is an example of a successful CI/CD run with these changes, as well as the flamegraph/hyperfine artifacts produced: https://github.com/nukelet/wayshot/actions/runs/8261924814

I'd like to get some feedback on whether this looks like a step in the right direction or if another approach would be better.

Shinyzenith · 2024-03-13T09:38:22Z

Hi can you point your pr to freeze feat Andreas
It currently points to main which is not our current dev branch.

Shinyzenith · 2024-03-13T09:39:25Z

Since it was Andreas who introduced me to flamegraph and suggested to have a testing suite, let me just ping them to get their opinion on it too.

CC: @AndreasBackx

AndreasBackx · 2024-03-15T00:41:10Z

This would be a great direction imo. I agree that running this on CI/CD might not be super accurate, though I would say let's give it a shot and see how stable those measurements are for now. This would definitely be better than nothing.

I haven't got much experience with benchmarking tools out there, though I would again focus on what's free and simply experiment. We can make this a non-blocking signal in CI/CD or elsewhere. Maybe make a bot automatically post the information on a PR. Though even simply making this easy to run already is hugely impactful.

Would be nice to see benchmarks for a combination of screen resolutions, scalings, and amount of them. I'd say a permutation of the following would already be plenty:

Resolutions: 1080p, 1440p, 4K
Scaling: 1, 1.25, 1.5, 1.75, 2 (though perhaps only for 4K and maybe 1440p as laptops might use those)
Amount of monitors: 1, 3

This might be too many permutations though we can look at the most important ones. Also would be interesting to see this for maybe 8K (7680x4320) or 4K super ultrawides (7680x2160).

Finally, would be nice to have this easily visible when making PRs. I don't know if GitHub actions makes it easy to automatically give some feedback or not.

…enchmark

nukelet · 2024-03-21T12:51:13Z

Hi again!

Finally, would be nice to have this easily visible when making PRs. I don't know if GitHub actions makes it easy to automatically give some feedback or not.

Just pushed a workflow to automatically comment the benchmark results on PRs. Currently it will comment on every PR sync (new commit, force push, etc), so I'm not sure if it would be too spammy, but we can adjust that later if needed. I'm also working on the other test cases you suggested :-)

nukelet · 2024-03-21T12:57:03Z

Also, I spent way longer than I'd like to admit trying to figure out a way to get the bot to post the flamegraph heatmap SVG as an image in the comment, but that proved quite tricky without hosting the image outside of Github. Maybe we could convert it to PNG and upload to Imgur, or just upload the raw SVG somewhere (since Github apparently has support for displaying them)?

Shinyzenith · 2024-03-21T14:41:02Z

GitHub has a native CDN right? I believe it's under the usercontent subdomain. Can we just keep the svg data in memory and pass it to the cdn?

nukelet · 2024-03-21T16:04:59Z

It does, but AFAIK it only works when you manually add a file through the browser, similarly to this (I don't think we can reproduce this in CI/CD since it would at the very least require user authentication).

I just had the idea of uploading the SVG to a gist (since it's just code anyway) and then pointing to it in the PR comment. Let's see if this works. :D

Also, had to fix a silly permission error that caused the last actions run to fail.

Shinyzenith · 2024-03-21T16:11:38Z

Feel free to contact me on discord. I'm willing to buy some hosting to make this easier for you. Don't want to spam GitHub and potentially get blocked.

Add benchmark jobs to Github Actions

292e839

nukelet changed the base branch from main to freeze-feat-andreas March 13, 2024 10:04

AndreasBackx and others added 4 commits March 21, 2024 09:32

First version of scaling fix. (waycrate#85)

37326d5

Add benchmark jobs to Github Actions

f284deb

Run benchmarks on separate workflow and automatically comment results

8c244cd

Merge branch 'feat-automatic-benchmark-comment' into feat-automatic-b…

16e57f8

…enchmark

Fix permission error for the benchmark workflow

bfcec45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add automatic benchmarks to Github Actions #103

Add automatic benchmarks to Github Actions #103

nukelet commented Mar 13, 2024

Shinyzenith commented Mar 13, 2024 •

edited

Loading

Shinyzenith commented Mar 13, 2024

AndreasBackx commented Mar 15, 2024 •

edited

Loading

nukelet commented Mar 21, 2024

nukelet commented Mar 21, 2024

Shinyzenith commented Mar 21, 2024

nukelet commented Mar 21, 2024

Shinyzenith commented Mar 21, 2024

Add automatic benchmarks to Github Actions #103

Are you sure you want to change the base?

Add automatic benchmarks to Github Actions #103

Conversation

nukelet commented Mar 13, 2024

Shinyzenith commented Mar 13, 2024 • edited Loading

Shinyzenith commented Mar 13, 2024

AndreasBackx commented Mar 15, 2024 • edited Loading

nukelet commented Mar 21, 2024

nukelet commented Mar 21, 2024

Shinyzenith commented Mar 21, 2024

nukelet commented Mar 21, 2024

Shinyzenith commented Mar 21, 2024

Shinyzenith commented Mar 13, 2024 •

edited

Loading

AndreasBackx commented Mar 15, 2024 •

edited

Loading