This GitHub Repo contains version 1 of a new 1-week curriculum on LLM Evals being added to ARENA 3.0. The final exercises content is inside the chapter3_llm_evals
folder. The rest of the repo are code for replicating various materials needed to create the exercises (you can ignore). When running the notebooks, make sure you set the working directory as ARENA_evals/chapter3_llm_evals
.
If you are testing or contributing our materials, do the following:
- Clone the repo and make a new branch
- Make a PR with the corresponding changes to both:
exercises/part.../solutions.py
(for bugs in the solution) orexercises/part.../tests.py
(for bugs in the test functions)instructions/pages.sec.../sec_n_title.py
(for any text or code displayed on our streamlit)