
Plot expectation value benchmarks #168

Merged

natestemen merged 37 commits into main from plot-expectation-value on Feb 13, 2025
Conversation

Member

@natestemen natestemen commented Jan 14, 2025

Description

This PR refactors the expectation value benchmarking script to ensure it works with the run_benchmarks.sh script. It also introduces new circuits to broaden the coverage of the expectation value tests. Scripts to visualize relative and absolute errors across different compilers over time are added, with one plot added to the README.

@jordandsullivan we have the option to plot relative or absolute error, but relative error is much higher for ucc, and I don't understand why.

@Misty-W Misty-W linked an issue Jan 14, 2025 that may be closed by this pull request
@natestemen natestemen marked this pull request as ready for review January 15, 2025 05:33
Collaborator

@jordandsullivan jordandsullivan left a comment


Thanks for your work on this! Great getting to hack in person together.

I'm wondering: why are the relative errors in the hundreds in the first place? What are the actual expectation values? I'd say we want to report errors as percentages.

@natestemen
Member Author

> why are the relative errors in the hundreds

I think it's because the ideal values are coming out so close to 0 that the relative errors are blowing up. You can find all the most recent results in benchmarks/results/expval_2025-01-14_20.csv. E.g. for QFT the ideal expectation value (last column) is $\mathcal{O}(10^{-21})$, so it's easy to be $>100\%$ off.

ucc,qft,ZZZZZZZZZZ,4.7704895589362195e-18,4.771491994589455e-18,4759.8985323285315,-1.002435653235501e-21
qiskit,qft,ZZZZZZZZZZ,8.673617379884035e-19,8.68364173641639e-19,866.2542786051877,-1.002435653235501e-21
pytket,qft,ZZZZZZZZZZ,-3.2526065174565133e-18,3.2516040818032778e-18,3243.703544769454,-1.002435653235501e-21
cirq,qft,ZZZZZZZZZZ,2.168404344971009e-19,2.178428701503364e-19,217.31356965129692,-1.002435653235501e-21

Not sure what the best course of action here is.
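A minimal sketch of the blow-up, using the ucc row above and assuming relative error is computed as |simulated − ideal| / |ideal| (the benchmark script's exact formula may differ):

```python
# Relative error diverges as the ideal value approaches zero.
simulated = 4.7704895589362195e-18  # ucc simulated <ZZZZZZZZZZ> for QFT (row above)
ideal = -1.002435653235501e-21      # ideal value: effectively zero

abs_error = abs(simulated - ideal)
rel_error = abs_error / abs(ideal)  # assumed formula

print(f"absolute error: {abs_error:.3e}")  # ~4.771e-18 (tiny)
print(f"relative error: {rel_error:.1f}")  # ~4759.9, matching the rel-error column
```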

@jordandsullivan
Collaborator

jordandsullivan commented Jan 15, 2025

Okay, just as a sanity check, can you plot the simulated and ideal expectation values with standard deviations, similar to what I did here for #58 (where I was running on real hardware)?
[Screenshot: plot of simulated vs. ideal expectation values with error bars]

This reminds me, we can also simply measure an array of observables in addition to ZZZZZZ, like I did there. Maybe just add in some like XIIIIIII or XXXXXZZZZZ, etc.
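A rough sketch of such a sanity-check plot, assuming per-compiler simulated values and standard deviations have already been collected (the numbers below are illustrative placeholders, not real benchmark output):

```python
import matplotlib.pyplot as plt

# Placeholder data: simulated expectation values with uncertainties per
# compiler, plus the shared ideal value for the circuit/observable pair.
compilers = ["ucc", "qiskit", "pytket", "cirq"]
simulated = [4.8e-18, 8.7e-19, -3.3e-18, 2.2e-19]
stddevs = [1e-18, 1e-18, 1e-18, 1e-18]  # illustrative standard deviations
ideal = -1.0e-21

fig, ax = plt.subplots()
ax.errorbar(compilers, simulated, yerr=stddevs, fmt="o", capsize=4, label="simulated")
ax.axhline(ideal, linestyle="--", color="gray", label="ideal")
ax.set_ylabel("Expectation value")
ax.legend()
plt.show()
```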

Collaborator

@jordandsullivan jordandsullivan left a comment


Looking good, a few suggestions.

@bachase
Collaborator

bachase commented Feb 10, 2025

@natestemen #220 (and f6a53bd in particular) is my attempt to fix the cirq issue. Let me know if this helps with your noise model approach (or I can try testing that myself too).

@natestemen
Member Author

> I noticed there are many changes to the qasm files. Can you summarize why they had to change?

There are some new additions (circuits of sizes that are reasonable to simulate). These were added so we could have something to test on. If you see any QASM files that were modified, let me know, because that was definitely not the intention.

@natestemen natestemen requested a review from Misty-W February 11, 2025 15:41
@Misty-W Misty-W modified the milestone: 0.4.3 Feb 11, 2025
@natestemen natestemen requested a review from bachase February 11, 2025 17:11
Collaborator

@bachase bachase left a comment


Looks great overall.

Several nits and one typo to consider.

Collaborator

Was it intentional to delete this file?

ax.set_xlabel("Date")
ax.set_ylabel("Average Absolute Error")
ax.grid(True)
ax.legend(title="Compiler")
Collaborator

Do we want to use the same version labels shown in the other plots?

Member Author

Hmmm I think probably yes, but I'll leave that for a subsequent PR.

@natestemen natestemen requested a review from bachase February 12, 2025 22:26
Collaborator

@Misty-W Misty-W left a comment


LGTM, thanks @natestemen !


simulator = AerSimulator(
    method="density_matrix", noise_model=depolarizing_noise
)
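For context, a minimal sketch of how a depolarizing noise model like the `depolarizing_noise` above might be constructed with qiskit-aer; the error rates and gate names are illustrative assumptions, not the PR's actual settings:

```python
from qiskit_aer import AerSimulator
from qiskit_aer.noise import NoiseModel, depolarizing_error

# Illustrative only: error probabilities and gate lists are assumptions.
depolarizing_noise = NoiseModel()
depolarizing_noise.add_all_qubit_quantum_error(
    depolarizing_error(0.001, 1), ["u1", "u2", "u3"]  # 1-qubit gates
)
depolarizing_noise.add_all_qubit_quantum_error(
    depolarizing_error(0.01, 2), ["cx"]  # 2-qubit gates
)

simulator = AerSimulator(method="density_matrix", noise_model=depolarizing_noise)
```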
def parse_arguments() -> tuple[str, str, str, bool]:
Collaborator

@bachase bachase Feb 13, 2025


Another style nit for the future: we should consider [argparse](https://docs.python.org/3/library/argparse.html) or [click](https://click.palletsprojects.com/en/stable/) for managing command-line parsing. There are likely some other libraries out there these days too.
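A quick sketch of what that could look like with argparse; the argument names below mirror the `tuple[str, str, str, bool]` signature but are guesses at the script's actual parameters:

```python
import argparse


def parse_arguments() -> tuple[str, str, str, bool]:
    # Argument names are hypothetical, inferred from how run_benchmarks.sh
    # invokes the script (QASM file, compiler, results folder, log flag).
    parser = argparse.ArgumentParser(description="Run an expectation value benchmark.")
    parser.add_argument("qasm_file", help="path to the QASM circuit file")
    parser.add_argument("compiler", help="compiler to benchmark (e.g. ucc, qiskit)")
    parser.add_argument("results_folder", help="directory to write results into")
    parser.add_argument("--log", action="store_true", help="log expectation values")
    args = parser.parse_args()
    return args.qasm_file, args.compiler, args.results_folder, args.log
```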

for qasm_file in "${QASM_EXPVAL_FILES[@]}"; do
    for compiler in "${COMPILERS[@]}"; do
        # Combine the common folder path with the QASM file
        full_qasm_file="${QASM_FOLDER}${qasm_file}"

        # Build the command, passing the results folder as an argument
-       command="python3 $(dirname "$0")/expval_benchmark.py \"$full_qasm_file\" \"$compiler\" \"$RESULTS_FOLDER\" $LOG_EXPVAL"
+       command="poetry run python $(dirname "$0")/expval_benchmark.py \"$full_qasm_file\" \"$compiler\" \"$RESULTS_FOLDER\" $LOG_EXPVAL"
Collaborator


One thing to be careful about here is whether this script itself is run via poetry, which is how I think the prior benchmark runs were done in the GitHub Actions (e.g. they were run via `poetry run ./benchmarks/scripts ...`).

Update: surprisingly, this seems to work!

Collaborator

@bachase bachase left a comment


Looks great and thanks for all the changes.

I'll create a few issues for some of the nits/style suggestions that were broader than just this PR.

@natestemen natestemen merged commit b92d2fd into main Feb 13, 2025
1 check passed
@natestemen natestemen deleted the plot-expectation-value branch February 13, 2025 18:43
Development

Successfully merging this pull request may close these issues:

Add expectation value plot to GH benchmarks pipeline