Auto-generate benchmarks with genthat, and run them #22

vogr · 2021-08-13T13:50:35Z

(Note: this PR requires a patched version of genthat, see PRL-PRG/genthat#162)

This PR makes it possible to extract benchmarks from CRAN packages using genthat, and to automatically generate the necessary fields in rebench.conf (including the number of inner-iterations per benchmark).

All of this can be done in a reproducible way using MRAN (pinned to the day 2020-02-28, one day before the release of R 3.6.3 so that all the packages are compatible with R 3.6.2).

The steps necessary to generate and run the benchmarks are:

1. Install the dependencies and the packages from which to extract calls (defined in packages.txt): see RBenchmarking/Setup/genthat/README.md (install_genthat.R, install_pkgs.R, extract_testcases.R). All these steps can be automated using the Docker image built from Setup/genthat/Dockerfile (details in the README). You should then copy the benchmarks to Benchmarks/genthat-CRAN/generated.
   - optionally: check that the results are stable over several iterations (check_against_recorded_retv.sh)
   - optionally: decrease the number of benchmarks by picking a single file per function in each package (pick_one_testcase.sh)
   - optionally (but recommended): compute the number of iterations necessary for each benchmark so that they all run for 200ms with R, as a baseline (with min_nb_iter.R, see Setup/genthat/inner_it/README.md)
2. Run the benchmarks using rebench. Everything is automated in the file Setup/run.sh, as for the other benchmarks. The configuration file will be generated by genthat_rebenchconf.py from the names of the benchmarks and the number of iterations determined in the previous step.
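
The calibration in the last optional sub-step and the configuration generation in step 2 can be sketched roughly as follows. This is a Python illustration under stated assumptions, not the code of min_nb_iter.R or genthat_rebenchconf.py: the doubling strategy, the function names `min_inner_iterations` and `rebench_entries`, and the shape of the emitted entries (a quoted benchmark name with an `extra_args` value holding the inner-iteration count) are all hypothetical.

```python
import json
import time


def min_inner_iterations(run_once, target_s=0.2, max_iter=1 << 20):
    """Return the first power of two n for which n consecutive calls of
    run_once were measured to take at least target_s seconds.

    Hypothetical strategy: double n until the batch is long enough,
    giving up at max_iter.
    """
    n = 1
    while n < max_iter:
        start = time.perf_counter()
        for _ in range(n):
            run_once()
        if time.perf_counter() - start >= target_s:
            return n
        n *= 2
    return max_iter


def rebench_entries(inner_iterations):
    """Build one entry per benchmark, sorted by name.

    Names go through json.dumps so that unusual file names
    (for example "a<-b.R") are emitted as double-quoted strings.
    The entry layout is an assumption made for illustration.
    """
    lines = []
    for name in sorted(inner_iterations):
        lines.append(f"- {json.dumps(name)}:")
        lines.append(f"    extra_args: {inner_iterations[name]}")
    return "\n".join(lines)
```

Quoting every name, rather than only the problematic ones, keeps the generator simple and avoids having to decide which characters are safe unquoted.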

You do not have to actually run step 1: the generated files are already saved in the repo (as their generation takes a large amount of time, and because it makes sense to "freeze" the benchmarks for reproducibility).

To actually run step 2, the Docker image used for rebench needs some modifications (adapted from container/benchmark/Dockerfile in https://github.com/reactorlabs/rir):

ARG CI_COMMIT_SHA
FROM registry.gitlab.com/rirvm/rir_mirror:$CI_COMMIT_SHA
ENV R_LIBS="/opt/r_library"
ENV PATH="$PATH:/opt/rir/external/custom-r/bin"
RUN apt-get update && \
    DEBIAN_FRONTEND=noninteractive apt-get install -y -qq python3-pip sudo && \
    apt-get clean && rm -rf /var/cache/apt/lists && \
    git clone --depth 1 https://github.com/smarr/ReBench.git /opt/ReBench && cd /opt/ReBench && pip3 install . && \
    mv /usr/local/bin/rebench-denoise /usr/local/bin/rebench-denoise.bkp && cp /usr/bin/false /usr/local/bin/rebench-denoise
RUN git clone --depth 10 https://github.com/vogr/RBenchmarking.git /opt/rbenchmarking && cd /opt/rbenchmarking && git checkout 12573c102bac99b644ea89ec3d59acde129d7b37
RUN /opt/rbenchmarking/Setup/genthat/install_pkgs.R /opt/rbenchmarking/Setup/genthat/packages.txt /opt/r_library

The last two lines were modified: they use the modified RBenchmarking branch and install the R packages necessary to run the benchmarks. R_LIBS is also set accordingly (alternatively, the default folders could be used, but I wanted to prevent collisions with other potential R packages).

To actually run the benchmarks:

# update CI_COMMIT_SHA to match the rir version you want to use
$ docker build -t rir-rebench --build-arg CI_COMMIT_SHA=b3e7e854cc78fa42b6b1748effcd0586e00b9881 .

# run a transient container
$ docker run --rm -it rir-rebench bash

# run only the genthat benchmarks, with Rsh, and don't do reporting
$ /opt/rbenchmarking/Setup/run.sh /opt/rbenchmarking/rebench.conf /opt/rbenchmarking/Benchmarks /opt/rir/build/release "e:PIR-LLVM s:genthat-CRAN -R"

The next step would be to actually decide which packages to extract calls from (currently only 8 packages were chosen, at random), and to select a relevant subset of the generated files (currently, one file has been kept per function, for a total of 51 files).
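
Keeping one file per function could look roughly like the sketch below. It is written in Python for illustration and is not the logic of pick_one_testcase.sh: the directory layout `<package>/<function>/<file>.R`, the helper name `pick_one_per_function`, and the rule "keep the first file in sorted order" are all assumptions.

```python
from pathlib import PurePosixPath


def pick_one_per_function(paths):
    """Keep the first file (in sorted order) for each (package, function).

    Assumes the hypothetical layout <package>/<function>/<file>.R;
    paths with fewer than three components are ignored.
    """
    chosen = {}
    for path in sorted(paths):
        parts = PurePosixPath(path).parts
        if len(parts) < 3:
            continue  # not under <package>/<function>/
        key = parts[-3:-1]  # (package, function)
        chosen.setdefault(key, path)
    return sorted(chosen.values())
```

A different selection rule (for example the longest-running testcase per function) would only change which path is stored for each key.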

Note: if the PR PRL-PRG/genthat#162 gets merged into master, the script install_genthat.R should be updated to install genthat from the master branch instead of the only-calls branch.

o- · 2021-08-16T12:43:07Z

very cool. thanks a lot @vogr. do you know how long it takes to run the full thing?

vogr added 30 commits August 3, 2021 14:45

Tooling for benchmarks generated by genthat.

bad7b51

Keep some generated tests (from yaml package) in the repo.

dee0ff7

Quote entry name for genthat benchmarks in rebench.conf.

23033e7

Add script to download and install R packages.

798fb9b

Download and install in a single script.

ec641dc

Commit packages.txt file

cafbeb8

Specify repo when downloading devtools

617f4c1

Do not depend on genthat for setup_R_packages.sh

0d47711

Add genthat-CRAN in the suits in rebench.conf

eee62cd

Add prodlim tests in the repo.

640490b

Improve script to install packages: only use stdlib functions.

b15a4b3

Remove freeze.sh script: no needed since we use MRAN.

079d6f0

intall_pkgs.sh: make saving the source optionnal.

a2bc055

Script to extract testcases from the installed packages.

ac3bbb7

Add parameter to change number of inner iterations.

079420b

Fix check in number of arguments.

45d74f2

Add README.md explaining how to extract testcases using the Dockerfile.

5108f1d

Install dependancies for extract_testcases.

54c8b5c

Add Dockerfile to automate test extraction.

6d843d7

Add main entrypoint script for Dockerfile

93647dc

Typos in README and run.sh

536b228

Split installation of genthat and of pkgs in two scripts.

973e519

Typo in README.md

11ede08

Also pin CRAN snapshot for installation of genthat.

94bdf27

Do not try to extract source to null directory.

c6046ca

Optionnaly, do not specify install directory

793ed35

Do not put class name to lower in genthat benchmarks

b58a7a3

Count number of inner-iterations necessary to reach 200ms of runtime per outer-iteration

b5d7d32

Correctly generate the benchmarks: only keep function that can be run by GNU R and prune.

3f7518d

Read number of inner-iterations from the file previously generated

14240c9

vogr added 29 commits August 10, 2021 10:18

Typo

e7ec0c6

Use parallel record_retv in instructions.

645f500

Correctly save seed.

da79d8a

Add selection of generated tests with their retv (filtered to keep only those that succeed with GNUR)

f039551

Use same pattern for extension added to failed tests.

943b5a4

Add comment about isolation in min_nb_iter

5799897

Update usage of min_nb_iter.R

f4780e3

Don't load doParallel in scripts, use it through namespace.

f589d2e

Move capture.output outside of the hot loop in min_inner_it computation.

4dfdbc1

min_nb_iter: source in a user-created environment

a999691

Update n_inner_it command in README

4ea7194

Commit n_inner_it for the 2000 first testcases.

bccf9e8

Update packages.txt

5e6cb03

Remove mistakenly added line in packages.txt

68ecea1

Keep only the extracted testcases from the 100 first packages.

e6d0241

Document scripts in inner_it

89ac82d

Genthat is needed when running the benchmarks.

c99ff8d

Use the right result variable in the debug message.

fc0ee9f

Quote benchmark name in rebench command

This prevent problems with benchmark named a<-b.R for instance.

fcdec7e

Normalize path of extfile and Rfile in genthat-CRAN harness

387e911

Move the generated, but currently unused, genthat-benchmark files out the genthat-CRAN directory (else they would be detected by the configuration scripts).

6e9bfa5

Make the genthat files compatible with the new harness.

c2a88bb

Make the genthat harness compatible with the new genthat files.

e0430e5

Remove the archived tests

deaae4f

Keep only a few tests.

e3b7745

Record retv is now integrated in genthat.

af50f50

Use the PRL-PRG branch for the modified genthat, instead of the one owned by vogr

c3ddcbf

Use correct retv name in logging in harness.

ff7fc06

Update the extfile so that they match the expected format.

12573c1
