Add a performance measuring top-level user guide page #10539

malteneuss · 2024-11-08T17:00:21Z

Another step of the user guide improvement initiative #9214:

Add a simple profiling user-guide page based on https://discourse.haskell.org/t/ghc-profiling-a-cabal-project-with-an-interactive-application/10465/2?u=malteneuss, which the author allowed us to use.

Feel free to modify yourself if that is faster.

Patches conform to the coding conventions.
Is this a PR that fixes CI? If so, it will need to be backported to older cabal release branches (ask maintainers for directions).

Kleidukos · 2024-11-08T17:11:17Z

Thanks a bunch for starting this. Several things:

I would recommend that you make use of the ghc-prof-options key, which is used like this:

ghc-prof-options: -fprof-auto -fno-prof-count-entries -fprof-auto-calls

You mention Speedscope and the -pj format. Is the JSON produced by -pj natively understood by Speedscope? Or do you still need something like https://github.com/mpickering/hs-speedscope ?

doc/how-to-analyze-haskell-code-performance.rst

malteneuss · 2024-11-09T15:06:13Z

Thanks for the quick review @Kleidukos and @geekosaur. I've added all your review remarks. Yes, the json output can be loaded into speedscope directly. This https://github.com/mpickering/hs-speedscope only deals with eventlog files, which irrc is another profiling report format to analyze memory consumption (some topic for a future section in this guide), although it also measures cpu performance.

doc/how-to-analyze-haskell-code-performance.rst

tchoutri · 2024-11-09T16:30:19Z

@malteneuss Much better, only two more changes to make and we should be good to go! :)

jasagredo

My understanding is that eventlog profiling is far superior than just -pj, and it is (for recent GHCs) just a matter of passing -l -p instead (or if a smaller eventlog is wanted -l-agu -p). I have personally never used the json profiling report.

doc/how-to-analyze-haskell-code-performance.rst

malteneuss · 2024-11-09T19:53:38Z

@jasagredo Do you have a link to a post or blog that discusses the differences? I have never used it. Would you like to add a second section showing how to use it in another MR? I think we should add another section for profiling memory, where eventlog should be discussed after all.

ulysses4ever · 2024-11-10T03:09:28Z

Thank you for taking this on! It's amazing to see efforts in expanding the guide section! I have certain critique below but, please, realize that documentation is a matter where reasonable people can disagree. I can survive the current version, especially given that you already got the mandatory two approvals.

IMO the text uses a cabal workflow that is suboptimal. In particular:

It advocates for explicit cabal build, while you never need an explicit build in simple scenarios (including profiling). You should use cabal run where possible and rely on cabal to manage (re)compilation.
(In part, because of (1)) it has to use $(cabal list-bin my-app) in an earlier, supposedly lighter subsection. This is long (i.e. annoying), quite platform-dependent (i.e. it assumes bash; I normally don't use bash even under Linux, for instance), and perfectly avoidable (see below).
In an earlier, supposedly lighter subsection, it mentions adding profiling-related GHC options in a cabal file. Even though it's done as a note, this idea comes too early (advanced options should come later) and also gives a bad advice (you shouldn'g use cabal file for it -- a project file is the right place).

I propose solving most (all) of these with the following reformatting.

Start with cabal configure and explain that it creates cabal.project.local with local configuration.
Now, simply cabal run works.
Now you can mention the advanced GHC options. You are able to put them in the local project file at this point. Optionally, you could have a little subsection at the very end instead of here. You could also say two words why these options are good in there.
Now comes the section Profiling your dependencies too.

ulysses4ever

I temporarily block it to make sure you have enough time to respond.

doc/how-to-analyze-haskell-code-performance.rst

jasagredo · 2024-11-10T05:53:05Z

Re-reading the document again I am unsure this describes Cabal's job on profiling. I think the section should be more about what options does cabal need to enable profiling than how to produce a profiling report, therefore I think the title is misleading and we are stepping into GHC's User Guide territory.

My suggestion would be to leave most of the description and RTS options to the GHC User guide and describe only the following in the text:

A brief explanation on why building for profiling is different (one has to link to the profiling RTS and insert cost-centres)
How to enable profiling (either via flag or via cabal.project)
The different options to enable profiling (library-vanilla for example)
The different customizations cabal provides (like profiling-detail which uses different names than GHC) and how they map to GHC options
How to pass other GHC profiling options.
How to enable profiling for specific dependencies or for all dependencies.
The usual caveats (for example I build with --enable-profiling, then using only run will re-build without profiling because the flags used during building are not persistent)

This way the choice of p vs pj, or what files are produced or where to load them is deferred to the User Guide, as it is GHC-specific business.

This way we don't mix "How to profile" and "How to configure for profiling". The latter is Cabal's job, but the former is GHC or general Haskell information which should not live in Cabal's docs I think. It will always be too specific or too general what you can say there and it might depend on GHC versions and so on.

Mikolaj · 2024-11-12T12:01:54Z

OTOH, for some users a full example session of profiling would be immensely valuable and precisely what they are looking for, in despair reaching even for the cabal documentation. So maybe at least link to the original discourse post, saying it contains practical examples, including how to tweak GHC to profile best, which is out of scope of the cabal guide?

jasagredo · 2024-11-12T12:19:31Z

I think this deserves a place in something like https://haskell.foundation/hs-opt-handbook.github.io, not in cabal's documentation. Cabal docs should talk about how to configure cabal for the different profiling options, but not about how to profile or interpret the results.

tchoutri · 2024-11-12T12:39:10Z

@jasagredo I disagree part of what you said. The cabal manual should describe how to operate cabal. Interpretation of the results certainly deserves to be centralised in the optimisation handbook but producing a profile is certainly well within what one could expect of the cabal manual.

jasagredo · 2024-11-12T13:32:39Z

I think there is a hope that Cabal's docs could tell you how to do everything, but IMHO that is not how it should be.

For example I think Rust usually has had a much more comprehensible documentation than Haskell, and they usually made very correct choices in my opinion.

The equivalent to what we are discussing here is this page on "The Cargo Book" https://doc.rust-lang.org/cargo/reference/profiles.html which describes the options that exist to customize optimization or debug information.

Then it defers most of the information to "The rustc book" which describes how each option works. How to do profiling is covered in "The Rust Performance book" https://nnethercote.github.io/perf-book/profiling.html which links to different tools to produce and analyze profiles.

Why is this different than in Haskell? Because Rust performance can be analyzed by standard tools, whereas in Haskell it is the GHC RTS the one that produces its own reports, and third party tools the ones that interpret those reports in different ways.

GHC's User Guide already speaks about how to produce reports and gives a brief outline on how to analyze them, but it is mostly via 3rd party tools that those reports are consumed (profiteur, eventlog2html, hp2pretty/hp2ps, speedscope, ...). It therefore is reasonable that each tool explains their business, and having a central place that outlines the overall complete process, which I think is the Haskell Optimization Handbook.

In any case, both producing and analyzing documentation live outside of "The Cargo Book" in the Rust ecosystem, which I think is the right choice, and as such I would argue we could do the same and leave this outside of the cabal documentation.

jasagredo · 2024-11-12T14:48:53Z

Maybe I should re-phrase my suggestion after my wall of text above (sorry for that):

cabal documentation should explain just how to configure your project for profiling
It could mention explicitly that RTS options will have to be passed to produce a report, but for what flags to pass that is GHC's work.
Distinction between memory and CPU profiling is GHC's work.
Interpreting reports is Haskell Optimization Handbook's work.

malteneuss · 2024-11-12T20:23:49Z

Here's the second proposal after having it streamlined similar to @ulysses4ever suggestions. I don't mention cabal.project.local as i find it too unintuitive why it exists in the first place (cabal.project is also "local"; maybe something for another how-to-configure things guide).

I'm with @Mikolaj and @tchoutri on having a full example session (if i need such a guide, i don't want to scramble snippets together from different places) and that the main interface where we configure things is Cabal here. However, i emphasize now that Cabal does only a configuring part, and GHC the actual work; and i mention the optimization handbook.

geekosaur · 2024-11-12T20:28:38Z

I don't mention cabal.project.local as i find it too unintuitive why it exists in the first place (cabal.project is also "local"; maybe something for another how-to-configure things guide).

cabal.project is shared between all developers of a particular project; cabal.project.local is for individual developers to customize or adapt to their local machine/development environment.

doc/how-to-analyze-haskell-code-performance.rst

ulysses4ever · 2024-11-13T03:07:00Z

cabal.project is shared between all developers of a particular project; cabal.project.local is for individual developers to customize or adapt to their local machine/development environment.

correct. Even simpler rule of thumb is: cabal.project holds options that are useful to store in Git and cabal.project.local is for local experiments (like profiling) that, in general, shouldn't be committed to Git.

jasagredo

As this is going forward anyways, let me add some suggestions to improve it. I won't "Request changes" so feel free to disregard my comments.

doc/how-to-analyze-haskell-code-performance.rst

jasagredo · 2024-11-14T08:35:56Z

doc/how-to-analyze-haskell-code-performance.rst

+The first step to build your application, e.g. ``my-app``, with profiling enabled, and
+the second step to run it to collect a report, can be done with a single ``cabal run`` command:
+
+.. code-block:: console
+
+      $ cabal run --enable-profiling --profiling-detail=late my-app -- +RTS -pj -RTS
+      <program runs and finishes>


I would rather first mention the cabal configure path, then in a note mention that you can do this in one go but it is not persistent.

Yes, please!

jasagredo · 2024-11-14T08:37:03Z

doc/how-to-analyze-haskell-code-performance.rst

+      $ cabal run --enable-profiling --profiling-detail=late my-app -- +RTS -pj -RTS
+      <program runs and finishes>


I would mention that there are three outputs, in particular -p, -pj and -l -p and that you will, just for educational purposes, show the -pj in particular.

From my comment below, I would actually mention the three if we are going to mention any at all.

Yes, please! The eventlog property that Javier is getting at is called time series, I believe? https://en.wikipedia.org/wiki/Time_series

jasagredo · 2024-11-14T08:39:32Z

doc/how-to-analyze-haskell-code-performance.rst

+Finally, a profiling JSON report is written to a ``<app-name>.prof`` file,
+i.e. ``my-app.prof``, in the current directory.
+Load the profiling report file  ``my-app.prof`` into a visualizer
+and look for performance bottlenecks. One popular open-source
+`flame graph <https://www.brendangregg.com/flamegraphs.html>`__
+visualizer is
+`Speedscope <https://speedscope.app>`__,
+which runs in the browser and can open this JSON file directly.


I would mention something like:

-pj produces JSON output, can be visualized in speedscope.

-p produces GHC's own .prof format, can be visualized in profiteur or ghcprofview.

-l -p produces an eventlog that can be visualized in speedscope by first converting it via hs-speedscope.

Also I'm not 100% sure of that I'm about to say, but my understanding is that -p and -pj shows total time spent, and even in speedscope you see totals. One can see how much time a function took, but not when.

I think the eventlog path shows totals but also shows when this happened, i.e. information will be interleaved with GCs or switching contexts, and split by capabilities. IMHO this is more useful and the thing I usually use.

jasagredo · 2024-11-14T08:42:37Z

doc/how-to-analyze-haskell-code-performance.rst

+The ``cabal run`` command above is essentially a shorthand for
+
+.. code-block:: console
+
+    $ cabal build --enable-profiling --profiling-detail=late my-app
+    $ cabal list-bin my-app
+    /path/to/my-app
+    $ /path/to/my-app +RTS -pj -RTS
+    <program runs and finishes>


As said, I would mention this first and then the compressed command.

Co-authored-by: Javier Sagredo <[email protected]>

malteneuss requested review from geekosaur and Kleidukos November 8, 2024 17:01

geekosaur requested changes Nov 8, 2024

View reviewed changes

malteneuss requested a review from geekosaur November 9, 2024 15:06

malteneuss added the documentation label Nov 9, 2024

malteneuss force-pushed the add_profiling_guide branch from 9839ef1 to 0370522 Compare November 9, 2024 15:14

malteneuss mentioned this pull request Nov 9, 2024

[Initiative] Improve Cabal documentation structure to become more beginner-friendly #9214

Open

19 tasks

malteneuss force-pushed the add_profiling_guide branch from 0370522 to 8988bee Compare November 9, 2024 15:26

geekosaur approved these changes Nov 9, 2024

View reviewed changes

doc/how-to-analyze-haskell-code-performance.rst Outdated Show resolved Hide resolved

tchoutri requested changes Nov 9, 2024

View reviewed changes

doc/how-to-analyze-haskell-code-performance.rst Outdated Show resolved Hide resolved

jasagredo reviewed Nov 9, 2024

View reviewed changes

doc/how-to-analyze-haskell-code-performance.rst Outdated Show resolved Hide resolved

ulysses4ever reviewed Nov 9, 2024

View reviewed changes

doc/how-to-analyze-haskell-code-performance.rst Outdated Show resolved Hide resolved

malteneuss force-pushed the add_profiling_guide branch from 77341b0 to d326c39 Compare November 9, 2024 19:46

malteneuss requested review from tchoutri and geekosaur November 9, 2024 19:48

geekosaur approved these changes Nov 9, 2024

View reviewed changes

tchoutri approved these changes Nov 9, 2024

View reviewed changes

Kleidukos approved these changes Nov 9, 2024

View reviewed changes

malteneuss added the merge me Tell Mergify Bot to merge label Nov 9, 2024

mergify bot added the ready and waiting Mergify is waiting out the cooldown period label Nov 9, 2024

ulysses4ever requested changes Nov 10, 2024

View reviewed changes

doc/how-to-analyze-haskell-code-performance.rst Outdated Show resolved Hide resolved

mergify bot added the merge delay passed Applied (usually by Mergify) when PR approved and received no updates for 2 days label Nov 11, 2024

Add top-level performance measuring guide page

c0aacca

malteneuss force-pushed the add_profiling_guide branch from 48cb15a to c0aacca Compare November 12, 2024 20:14

malteneuss requested review from geekosaur, Kleidukos and ulysses4ever November 12, 2024 20:25

geekosaur approved these changes Nov 12, 2024

View reviewed changes

doc/how-to-analyze-haskell-code-performance.rst Show resolved Hide resolved

jasagredo reviewed Nov 14, 2024

View reviewed changes

malteneuss and others added 2 commits November 14, 2024 17:56

Update doc/how-to-analyze-haskell-code-performance.rst

f42a969

Co-authored-by: Javier Sagredo <[email protected]>

Update doc/how-to-analyze-haskell-code-performance.rst

4825f85

Co-authored-by: Javier Sagredo <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a performance measuring top-level user guide page #10539

Add a performance measuring top-level user guide page #10539

malteneuss commented Nov 8, 2024

Kleidukos commented Nov 8, 2024

malteneuss commented Nov 9, 2024 •

edited

Loading

tchoutri commented Nov 9, 2024

jasagredo left a comment

malteneuss commented Nov 9, 2024 •

edited

Loading

ulysses4ever commented Nov 10, 2024

ulysses4ever left a comment

jasagredo commented Nov 10, 2024 •

edited

Loading

Mikolaj commented Nov 12, 2024

jasagredo commented Nov 12, 2024

tchoutri commented Nov 12, 2024

jasagredo commented Nov 12, 2024

jasagredo commented Nov 12, 2024

malteneuss commented Nov 12, 2024

geekosaur commented Nov 12, 2024

ulysses4ever commented Nov 13, 2024

jasagredo left a comment

jasagredo Nov 14, 2024

ulysses4ever Nov 14, 2024

jasagredo Nov 14, 2024 •

edited

Loading

jasagredo Nov 14, 2024

ulysses4ever Nov 14, 2024

jasagredo Nov 14, 2024

jasagredo Nov 14, 2024

jasagredo Nov 14, 2024

		$ cabal run --enable-profiling --profiling-detail=late my-app -- +RTS -pj -RTS
		<program runs and finishes>

Add a performance measuring top-level user guide page #10539

Are you sure you want to change the base?

Add a performance measuring top-level user guide page #10539

Conversation

malteneuss commented Nov 8, 2024

Kleidukos commented Nov 8, 2024

malteneuss commented Nov 9, 2024 • edited Loading

tchoutri commented Nov 9, 2024

jasagredo left a comment

Choose a reason for hiding this comment

malteneuss commented Nov 9, 2024 • edited Loading

ulysses4ever commented Nov 10, 2024

ulysses4ever left a comment

Choose a reason for hiding this comment

jasagredo commented Nov 10, 2024 • edited Loading

Mikolaj commented Nov 12, 2024

jasagredo commented Nov 12, 2024

tchoutri commented Nov 12, 2024

jasagredo commented Nov 12, 2024

jasagredo commented Nov 12, 2024

malteneuss commented Nov 12, 2024

geekosaur commented Nov 12, 2024

ulysses4ever commented Nov 13, 2024

jasagredo left a comment

Choose a reason for hiding this comment

jasagredo Nov 14, 2024

Choose a reason for hiding this comment

ulysses4ever Nov 14, 2024

Choose a reason for hiding this comment

jasagredo Nov 14, 2024 • edited Loading

Choose a reason for hiding this comment

jasagredo Nov 14, 2024

Choose a reason for hiding this comment

ulysses4ever Nov 14, 2024

Choose a reason for hiding this comment

jasagredo Nov 14, 2024

Choose a reason for hiding this comment

jasagredo Nov 14, 2024

Choose a reason for hiding this comment

jasagredo Nov 14, 2024

Choose a reason for hiding this comment

malteneuss commented Nov 9, 2024 •

edited

Loading

malteneuss commented Nov 9, 2024 •

edited

Loading

jasagredo commented Nov 10, 2024 •

edited

Loading

jasagredo Nov 14, 2024 •

edited

Loading