Juniper is a formally-specified computer algebra library written in Rust and formalized in Lean 4. This is meant to be a toy project to demonstrate how these systems can work together, not a ready-made or feature-rich library.
A CAS, or Computer Algebra System, is a library or program which can symbolically manipulate algebraic statements. For example, simplifying the statement (+ (^ (sin x) (+ (sin (/ π 2)) (cos (* 2 π)))) (^ (cos (inv (inv x))) 2))
to 1
is a problem CASs are built to solve. Juniper is a CAS library, written in Rust, which uses formal definitions from Lean 4 to understand which mathematical rules are applicable to a given statement.
A high-level overview:
Proven theorems in the Lean 4 project are exported as JSON and imported into Rust. Then, those theorems are converted into rewriting rules by extracting Forall
s as variables. Finally, egg is used to find the simplest form for a given statement using the rewriting rules obtained from Lean.
For a more detailed explanation, see the Lean README and the Rust README.
While the basic datatype of numbers in this project is ℚ, the operating domain of its mathematical rules is not. All numbers and expressions should be expected to be in ℝ, but only the subset ℚ of ℝ is represented by constants. Approximation is also done in ℝ (but represented by floats).
The point of this project is not to create a perfectly sound CAS using Lean proofs1, more to demonstrate how results in Lean can be automatically leveraged and utilized for computer algebra (or other similar rule-rewriting systems). A few things result from this distinction:
- The conceptions of certain mathematical concepts are not the same in Lean as they are in Juniper. For a myriad of reasons, many mathematical operators in Lean are defined in ways which are unusual for the uninitiated. The constant folding in Juniper doesn't line up exactly with Lean's understanding of those operators.
- Constant folding in Juniper is also not formally specified. Accomplishing that would essentially require an entire secondary conversion process, but with more complex conversion for computable definitions of functions (which is quite far out of the scope of this project).
- Equality saturation is also not formalized (mostly because it comes from egg).
- Parsing, printing, and transpilation is not verified beyond unit tests.
- create
LeanExpr
->egg::Pattern
infrastructure - create JuniperLean attribute to automatically turn marked theorems into JSON, and export them to a given file
- import JSON in Rust, turn it into a set of rewrites, and use that for the
egg::Runner
- create build.rs to track Lean files and rerun some Lean command to capture JSON
- add scientific number parsing to JuniperBigRational
- split juniper_bin into juniper_repl and juniper_lib
- write readmes and documentation for what this is and how it operates
- add license(s)?
- more expansive
lean_to_rewrite
architecture - add support for automatic conditionals (e.g. encoding
x
as a rewrite condition forx → a / 2 = a * (1 / 2)
) tolean_to_rewrite
- add environment support to the repl (being able to assign variables, e.g.
(= x (* 5 y))
andx / 5
→y
) - test juniper_math_expression
- test juniper_lean_to_rewrite
- test juniper_lib
- write the actual set of theorems for conversion into the CAS
- fix ne soundness issue? not really sure how to approach this one
- create system to turn
egg::Explanation
into Lean proofs (either textually (lol) or with a proof certificate)
Future work in this area might work to:
- resolve unsoundness issues
- equivalent operator definitions
- speicified constant folding
- formal verification
- resolve usability issues
- a more complete repl
- more interface features
- a GUI
- create an integration into Lean
- using the CAS as a command to generate a proof suggestion
- or to create a more complete rule transition system
- conditionals embedded into the language itself for evalution (good for avioding soundness issues related to conditional rewrites)
- extracting mvars
- computable function transfer
Also, important to mention another project which is working in an adjacent direction: lean-egg (doing equality saturation from within Lean about Lean statements).
Footnotes
-
Although, with a little effort, a very similar project could accomplish that! ↩