
Add a performance benchmarking tool #419

Closed
wholmgren opened this issue Feb 2, 2018 · 9 comments · Fixed by #1049

Comments

@wholmgren
Member

We should look into using a package like asv for performance benchmarks. PRs such as #409 would benefit from a standard set of performance tests that cover multiple use cases.

@wholmgren wholmgren added this to the 0.6.0 milestone Feb 2, 2018
@wholmgren wholmgren modified the milestones: 0.6.0, Someday Aug 7, 2018
@kandersolar
Member

There is also pytest-benchmark, which could integrate with the test suite, but I think it might be better to have a separate benchmark suite because a lot of the tests don't really reflect realistic usage.

As far as running benchmarks in a CI service, this thread has some insights: astropy/astropy#6149
Although it appears pandas does it anyway: https://github.com/pandas-dev/pandas/blob/master/.github/workflows/ci.yml#L74-L90

Is there still interest in benchmarking pvlib? I was playing around with asv a bit tonight and would be happy to pick up where #530 left off.

@mikofski
Member

Kevin, benchmarking was also proposed as a GSoC idea, check out the wiki.

@wholmgren
Member Author

wholmgren commented Mar 21, 2020

Definitely interested! I might soon have dedicated hardware available for this if/when the software is ready for it.

And feel free to take over my pr or start fresh.

@kandersolar
Member

Oh sorry, I didn't realize, thanks @mikofski. If it doesn't get chosen as a GSoC project then I'll pick this up.

@mikofski
Member

Perhaps you could help mentor? Ok with me if you want to take it on anytime, don't let GSoC interfere 😀 thanks!

@wholmgren
Member Author

wholmgren commented Mar 21, 2020 via email

@kandersolar
Member

After playing around with asv a bit more, I'd like to get feedback on an implementation plan. Some quick background for people who haven't used asv before: it's sort of like pytest in that you create a folder of code snippets you want to benchmark and invoke asv on them to generate a set of timings. Those timings are associated with the machine and the pvlib git commit used to run the benchmarks. When new code is committed, the benchmarks are rerun, creating a history of timings. This history is then used to generate an HTML report like this example from astropy that shows the timing history for each benchmark. asv also has some other nice features like regression detection logic and a binary search to determine which commit introduced a regression.
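For anyone who hasn't used asv before, here is a minimal sketch of what one module in the benchmark folder might look like (the file name, class, and particular pvlib calls are just illustrative, not the actual suite; asv's convention is to call `setup` before each run and to time any method whose name starts with `time_`):

```python
# benchmarks/solarposition.py -- illustrative sketch, not the actual suite.
# asv runs setup() before each benchmark and times any method named time_*.
import pandas as pd

from pvlib import solarposition


class SolarPosition:

    def setup(self):
        # one year of 1-minute timestamps at an arbitrary location
        self.times = pd.date_range(
            start='2020-01-01', end='2021-01-01', freq='1min', tz='Etc/GMT+7'
        )
        self.lat, self.lon = 35.0, -106.0

    def time_get_solarposition(self):
        solarposition.get_solarposition(self.times, self.lat, self.lon)
```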

A goal that seems useful and achievable to me is to have a machine somewhere that does a nightly benchmark test on whatever commits were added to master that day. Running the benchmarks manually and inspecting the results is a bit of a hassle, so I want it to be automated and generating results consistently over time so that they are available whenever they're needed.

Choice of machine: In terms of timing stability, a dedicated machine is ideal. However, a virtual cloud server is more convenient. I've been running a basic pvlib benchmark suite on a cheapo cloud VPS and getting surprisingly stable results -- not as stable as the astropy timings, but good enough to detect significant shifts. Could be that I'm just getting lucky and running during stable conditions.

Tracking timing history: the timing history is stored in JSON files, so it is reasonable to use git to keep track of them over time. asv also has a convenient command to push the HTML report to the gh-pages branch for hosting on GitHub Pages. However, keeping track of the results in the main pvlib repo would add a lot of clutter to master's commit history. NumPy and Astropy, and probably others, keep the results in a separate repository, which seems like a good idea to me. That way a cron job could push to it as much as it wants without making a mess of the main repo's commit history.

Long term it might be cool to somehow integrate benchmarks with a CI PR check, but I think it makes sense to take smaller steps and just benchmark master for now to get more familiar with asv and figure out how to get the most value from it. So to summarize, here's what I'm proposing:

  1. Create a new benchmarks folder (at the top level?) in this repo for the benchmark suite; it would look something like this: https://github.com/kanderso-nrel/pvlib-python/tree/asv_setup/asv_bench/benchmarks
  2. Create a new pvlib-benchmarks repository in the pvlib organization to store benchmark results and the HTML plotting report (like the astropy example). The HTML would be on the gh-pages branch so that it would be available at http://pvlib.github.io/pvlib-benchmarks or something like that.
  3. Set up one or more benchmark machines with a nightly cron job that pulls master, runs the suite on any new commits, and pushes the results and HTML report to pvlib-benchmarks so that the GitHub Pages website stays up to date and shows the entire history (see the sketch after this list). @wholmgren mentioned a dedicated machine may be available, but either way I volunteer to maintain a cheapo VPS as a point for comparison.
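To make step 3 a bit more concrete, here is a rough sketch of what the nightly job could look like (written in Python just for illustration; the paths are invented, and wiring the output into a separate pvlib-benchmarks repo would take some additional git/asv configuration beyond what's shown here):

```python
# nightly_benchmarks.py -- rough sketch of the cron job in step 3.
# Paths are made up; asv.conf.json would need to be set up so the results
# and HTML end up in the separate pvlib-benchmarks repo (extra git
# commit/push steps for the JSON results are omitted here).
import subprocess


def run(cmd, cwd):
    # run a shell command and fail loudly if it errors
    subprocess.run(cmd, cwd=cwd, shell=True, check=True)


PVLIB = '/home/benchmarks/pvlib-python'   # clone that contains the asv suite
SUITE = PVLIB + '/benchmarks'             # directory with asv.conf.json

# pick up whatever landed on master today
run('git checkout master && git pull', cwd=PVLIB)

# 'asv run NEW' benchmarks only commits that don't have results yet
run('asv run NEW', cwd=SUITE)

# rebuild the HTML report and push it to a gh-pages branch
run('asv publish', cwd=SUITE)
run('asv gh-pages', cwd=SUITE)
```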

Does this sound like a good plan?

@mikofski
Member

Wow, great research Kevin! Sounds good to me. My preference would be for a dedicated machine, paid for by a dedicated source. Who is paying for the cheapo VPS?

@kandersolar
Member

I have a squad of small VPSs I use for personal projects and have just been testing on one of those. The cost is a drop in the bucket, not worth worrying about. Agreed that a dedicated machine is preferable.
