This repository provides the code for studying how well automatic metrics for First-Order Logic (FOL) closeness align with human judgement.
- Clone this repository to your local machine:
git clone https://github.com/RamyaKeerthy/AlignmentFOL
- Set up the environment by installing the required dependencies:
pip install -r requirements.txt
To generate data for the evaluation, use the Jupyter notebooks in the notebook directory. They create the files needed to replicate the evaluation results reported in the paper.
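As a quick sanity check after running the notebooks, the snippet below shows one way the generated data could be loaded and inspected; the file name and field names are hypothetical placeholders, not the repository's actual schema.

```python
import json

# Hypothetical sketch: inspect a file produced by the data-generation notebooks.
# "data/perturbations.json", "nl", and "fol" are placeholder names only.
with open("data/perturbations.json") as f:
    records = json.load(f)

# Print a few natural-language sentences alongside their FOL annotations.
for record in records[:3]:
    print(record.get("nl"), "->", record.get("fol"))
```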
Perturbation evaluations
Perturbations can be generated using notebook/get_perturbations. Based on the generated perturbations, run the evaluation script to obtain scores for the seven metrics:
python run_eval_pert.py
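For intuition about what a closeness score over FOL strings looks like, here is a minimal sketch that compares a perturbed formula against its reference using a plain character-overlap ratio; this is not one of the seven metrics from the paper, only an illustration of the predicted-versus-reference scoring pattern.

```python
from difflib import SequenceMatcher

def string_overlap_score(predicted_fol: str, reference_fol: str) -> float:
    """Illustrative closeness score: character-overlap ratio between two FOL strings.

    This is NOT one of the seven metrics computed by run_eval_pert.py; it only
    demonstrates scoring a predicted formula against a reference.
    """
    return SequenceMatcher(None, predicted_fol, reference_fol).ratio()

reference = "∀x (Dog(x) → Animal(x))"
perturbed = "∀x (Dog(x) ∧ Animal(x))"  # operator perturbation: → replaced by ∧
print(f"overlap score: {string_overlap_score(perturbed, reference):.3f}")
```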
Sample evaluations
Sample data can be generated using notebook/get_samples. Run the sample evaluation script with the following command:
python run_eval_samples.py
Note: Sample generation requires an API key to access the GPT model.
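A minimal sketch of how a sample could be generated with the OpenAI Python client, assuming an OPENAI_API_KEY environment variable; the actual prompt, model name, and output handling in notebook/get_samples may differ.

```python
import os
from openai import OpenAI

# Hypothetical sketch of sample generation; requires OPENAI_API_KEY to be set.
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

def nl_to_fol(sentence: str) -> str:
    """Ask a GPT model to translate a natural-language sentence into FOL."""
    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder model name
        messages=[
            {"role": "system", "content": "Translate the sentence into a first-order logic formula."},
            {"role": "user", "content": sentence},
        ],
    )
    return response.choices[0].message.content

print(nl_to_fol("Every dog is an animal."))
```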
The evaluation code is adapted from LogicLlama.
This code is licensed under the MIT License and is available for research purposes.
If you use this code or reference this work, please cite: *Assessing the Alignment of FOL Closeness Metrics with Human Judgement*.