Slide 1

Benchmarking your Scientific Python Packages Using ASV and GitHub Actions
Anissa Zacharias | February 5, 2024

Slide 2

Preview: What We're Covering and Why

We wanted to implement benchmarking that:
‣ We could develop alongside our package code
‣ Wouldn't clutter up our package repo with results
‣ Would show us how our benchmark runs changed over time

Here we'll cover:
‣ Airspeed Velocity (ASV), the package we used for benchmarking
‣ How to (or at least how we) set up benchmarking and a benchmark archive
‣ Some details about our GitHub Actions configuration
‣ Discussion

Slide 3

Airspeed Velocity: Background

• benchmarking over the package lifetime
• can track runtime, memory consumption, and custom values
• highly customizable
• a fair amount of initial setup work

[Screenshot: numpy's (old) ASV setup]

https://github.com/airspeed-velocity/asv
https://asv.readthedocs.io

Slide 4

Airspeed Velocity: Writing Benchmarks

• Benchmarks go in a folder in the benchmarked repo (location configured in asv.conf.json)
• Benchmarks can be organized into separate files

Types of benchmarks:
1. Timing - time_*()
2. Memory - mem_*()
3. Peak memory - peakmem_*()
4. Raw timing - timeraw_*()
5. Tracking - track_*()

benchmarks/import.py:

class Import:
    """Benchmark importing geocat-comp."""

    def timeraw_import_geocat_comp(self):
        return "import geocat.comp"

You can set individual benchmark attributes like timeout, setup, teardown, repeat counts, and more. Benchmarks can have version numbers if you decide to alter a benchmark.
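For illustration, a minimal sketch of a benchmark class that uses a few of these attributes; the class name, data, and sizes here are hypothetical examples, not benchmarks from geocat-comp:

import numpy as np

class TimeArrayOps:
    """Hypothetical timing/peak-memory benchmarks (illustrative only)."""

    # per-benchmark attributes
    timeout = 120   # seconds before the benchmark is aborted
    repeat = 5      # how many times ASV repeats the timing

    # bump this if you alter the benchmark, so old results are not
    # compared against the new definition
    version = "1"

    def setup(self):
        # setup() runs before each benchmark and is not timed
        self.data = np.random.default_rng(0).random((1000, 1000))

    def time_mean(self):
        np.mean(self.data)

    def peakmem_copy(self):
        self.data.copy()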

Slide 5

Airspeed Velocity: Important Commands

$ asv quickstart
  Used to generate config files and set up a new benchmarking suite

$ asv run main..mybranch
  Run all commits on a branch since branching off main

$ asv run v0.1^!
  Benchmark a single commit or tag

$ asv run --skip-existing
  Skips running benchmarks that have existing results

$ asv publish
  Generates html from benchmark results

$ asv preview
  Used to preview the results locally

https://asv.readthedocs.io/en/stable/commands.html

Slide 6

Structure: Overall

[Diagram]

Python Package Repository
• Package Code
• Benchmarks Directory
  ‣ benchmark python files
  ‣ configuration file

On push to main, runs benchmarks and pushes results to the archive repo.

Benchmark Archive Repository
• configuration file
• archived results (benchmarks.json plus result files)

On push to main, makes html from the archived results and deploys it to gh pages.

gh pages website (static html)

Slide 7

Structure: Overall (annotated)

[Same diagram as the previous slide, with callouts:]

(1) Python Package Repository: the original code that we wanted to benchmark
(2, 2.5) Benchmark Archive Repository and the gh pages website: after setup, everything from this point on happens automatically
The dashboard deploys from a GitHub Action to a hosted website: https://ncar.github.io/geocat-comp-asv/

Slide 8

Structure: The Benchmarked Package Repo (1)

To add ASV to an existing project:
• make a benchmarks directory
• add benchmarks
• add a configuration file
• add GitHub Actions for automation

[Diagram detail: Python Package Repository with Package Code and a Benchmarks Directory (benchmark python files, configuration file); on push to main, runs benchmarks and pushes results to the archive repo]
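A hedged sketch of what the resulting layout can look like; the file and directory names below are pieced together from the config on Slide 10 and the workflow link on Slide 11, and may not match geocat-comp's exact tree:

geocat-comp/
├── geocat/                      # package code
├── build_envs/
│   └── asv-bench.yml            # conda env the benchmarks run in (referenced from asv.conf.json)
├── benchmarks/
│   ├── asv.conf.json            # ASV config; "repo": ".." and "benchmark_dir": "."
│   └── import.py                # benchmark files
└── .github/workflows/
    └── asv-benchmarking.yml     # runs benchmarks on push to main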

Slide 9

Structure: The Benchmark Archive Repo (2, 2.5)

• a separate benchmark archive repo avoids cluttering up the main package repository with results

To set up the archive repo:
• add a top-level config file
• add a GitHub Action to make static html and deploy GitHub Pages

[Diagram detail: Benchmark Archive Repository with a configuration file and archived results (benchmarks.json plus result files); on push to main, makes html from the archived results and deploys the static html to the gh pages website]

Slide 10

Airspeed Velocity: Setting Up Config Files

Package Repo (for running the benchmarks):

{
    "version": 1,
    "project": "geocat-comp",
    "project_url": "https://geocat-comp.readthedocs.io",
    "repo": "..",
    "branches": ["main"], // for git
    "environment_type": "conda",
    "install_timeout": 600,
    "show_commit_url": "http://github.com/NCAR/geocat-comp/commit/",
    "pythons": ["3.10"],
    "conda_channels": ["conda-forge"],
    "conda_environment_file": "../build_envs/asv-bench.yml",
    "matrix": {
        "python": [""],
    },
    "benchmark_dir": ".",
}

Archive Repo (for generating the published/previewed html hosted on Pages):

{
    "version": 1,
    "project": "geocat-comp",
    "project_url": "https://github.com/NCAR/geocat-comp",
    "repo": "https://github.com/NCAR/geocat-comp",
    "branches": ["main"],
    "dvcs": "git",
    "environment_type": "conda",
    "benchmark_dir": "benchmarks"
}

Slide 11

GitHub Actions: Automating Benchmarks

Workflow: https://github.com/NCAR/geocat-comp/blob/main/.github/workflows/asv-benchmarking.yml
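The full workflow is at the link above. A minimal sketch of the general shape such a workflow can take; the action versions, secret name, environment name, and results paths are illustrative assumptions, not the exact contents of the linked file:

name: ASV Benchmarking

on:
  push:
    branches: [main]

jobs:
  benchmark:
    runs-on: ubuntu-latest
    defaults:
      run:
        shell: bash -l {0}   # login shell so the conda env is activated
    steps:
      - uses: actions/checkout@v4
        with:
          fetch-depth: 0     # ASV benchmarks git commits, so it needs full history
      - uses: conda-incubator/setup-miniconda@v3
        with:
          environment-file: build_envs/asv-bench.yml   # assumed env file
          activate-environment: asv-bench              # assumed env name
      - name: Run benchmarks
        run: |
          cd benchmarks
          asv machine --yes          # record machine info non-interactively
          asv run --skip-existing    # only benchmark commits without results
      - name: Push results to the archive repo
        run: |
          # clone the archive repo with a token, copy new results, commit, push
          git clone "https://x-access-token:${{ secrets.ARCHIVE_TOKEN }}@github.com/NCAR/geocat-comp-asv.git"
          cp -r benchmarks/results/. geocat-comp-asv/results/   # assumed results layout
          cd geocat-comp-asv
          git config user.name "github-actions[bot]"
          git config user.email "github-actions[bot]@users.noreply.github.com"
          git add results
          git commit -m "Add benchmark results" && git push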

Slide 12

GitHub Actions: Automating Dashboard Creation

Workflow: https://github.com/NCAR/geocat-comp-asv/blob/main/.github/workflows/make-publish.yml
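Again, the real workflow is at the link above. A minimal sketch of the pattern in the archive repo; the action versions, Python version, and output directory are illustrative assumptions:

name: Make and Publish

on:
  push:
    branches: [main]

jobs:
  publish:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: "3.11"
      - name: Install asv
        run: pip install asv
      - name: Build static html from archived results
        run: asv publish
      - name: Deploy to GitHub Pages
        uses: peaceiris/actions-gh-pages@v3   # assumed deploy action
        with:
          github_token: ${{ secrets.GITHUB_TOKEN }}
          publish_dir: ./html   # wherever html_dir in asv.conf.json points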