git clone https://github.com/ozmig77/factcheck-ko-2020
pip install -r requirements.txt
- The link to docs_noun.json is given here. Save the file under data/.
- The link to the pretrained checkpoints is given here. Save them under ss/checkpoints/ and rte/checkpoints/.
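Before training or evaluating, it can help to confirm that the downloads landed where the scripts expect them. The following is a minimal sketch, not part of the repository: the helper name missing_paths is hypothetical, and it only checks the three locations named above.

```python
from pathlib import Path

# Locations named in the download steps above (relative to the repo root).
EXPECTED = [
    "data/docs_noun.json",
    "ss/checkpoints",
    "rte/checkpoints",
]


def missing_paths(root=".", expected=EXPECTED):
    """Return the expected paths that do not exist under root (hypothetical helper)."""
    base = Path(root)
    return [p for p in expected if not (base / p).exists()]


if __name__ == "__main__":
    missing = missing_paths()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All expected files and directories are present.")
```

Run it from the repository root. An empty result means the three paths exist. It does not validate the contents of the files.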
- data/claims_korquad.json: Supporting documents for the Factcheck (based on Korean Wikipedia)
- data/{train, val, test}.json: Annotated dataset for the Factcheck
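To get a first look at the annotated splits, something like the following could be used. It is a sketch under assumptions: the record schema is not described here, so the code only loads each file with the standard json module and reports the top-level container, assuming that container is a list or a dict. The function names load_split and summarize are hypothetical.

```python
import json
from pathlib import Path


def load_split(name, data_dir="data"):
    """Load one split (train, val, or test) from data_dir as parsed JSON."""
    path = Path(data_dir) / f"{name}.json"
    with path.open(encoding="utf-8") as f:
        return json.load(f)


def summarize(obj):
    """Describe the top-level container without assuming a record schema."""
    return f"{type(obj).__name__} with {len(obj)} entries"


if __name__ == "__main__":
    for split in ("train", "val", "test"):
        try:
            print(split, "->", summarize(load_split(split)))
        except FileNotFoundError:
            print(split, "-> file not found")
```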
For DR (document retrieval), run python document/extract_noun.py. This will create data/docs_noun.json.
To train SS (sentence selection), run python ss/train_ss.py. This will create ss/checkpoints/best_ckpt.pth.
To train RTE (recognizing textual entailment), run python rte/train_rte.py. This will create rte/checkpoints/best_ckpt.pth.
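The DR, SS, and RTE steps above can be chained from one script. The sketch below is an illustration rather than something shipped with the repository: the names STEPS, build_command, and run_steps are hypothetical, and it assumes the three scripts run without extra arguments from the repository root, as the commands above suggest.

```python
import subprocess
import sys

# (stage, script, expected output), taken from the steps listed above.
STEPS = [
    ("DR", ["document/extract_noun.py"], "data/docs_noun.json"),
    ("SS", ["ss/train_ss.py"], "ss/checkpoints/best_ckpt.pth"),
    ("RTE", ["rte/train_rte.py"], "rte/checkpoints/best_ckpt.pth"),
]


def build_command(script_args, python=sys.executable):
    """Prefix the script and its arguments with the Python interpreter."""
    return [python, *script_args]


def run_steps(steps=STEPS, runner=subprocess.run):
    """Run each stage in order; check=True stops at the first failing script."""
    for name, args, output in steps:
        print(f"[{name}] running {' '.join(args)} (expected output: {output})")
        runner(build_command(args), check=True)
```

Calling run_steps() from the repository root would run the three stages in sequence and stop at the first one that exits with an error.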
For evaluation, run python eval.py --k 5. Processing the 4459 test claims takes about 100 minutes; you can reduce the time by setting k to 1.
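As a rough guide, the quoted figures work out to a little over one second per claim. The arithmetic below is only a back-of-the-envelope sketch: it assumes the run time scales linearly with the number of claims, and it says nothing about how much time a smaller k saves. The function names are hypothetical.

```python
TOTAL_MINUTES = 100  # approximate run time quoted above
NUM_CLAIMS = 4459    # number of test claims quoted above


def seconds_per_claim(total_minutes=TOTAL_MINUTES, num_claims=NUM_CLAIMS):
    """Average wall-clock seconds spent per claim."""
    return total_minutes * 60 / num_claims


def estimated_minutes(num_claims, per_claim_seconds=None):
    """Estimated minutes for num_claims claims, assuming linear scaling."""
    if per_claim_seconds is None:
        per_claim_seconds = seconds_per_claim()
    return num_claims * per_claim_seconds / 60


print(f"{seconds_per_claim():.2f} s per claim")                # 1.35 s per claim
print(f"{estimated_minutes(500):.1f} min for 500 claims")      # 11.2 min for 500 claims
```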
Our results: R@5 94.93%, R@1 95.38% on SS, 64.21% on RTE.
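For readers unfamiliar with the R@k notation, one common definition counts a claim as a hit when at least one gold evidence sentence appears among the top k retrieved sentences. The sketch below implements that definition on toy data. It is an assumption about the metric, not the code in eval.py, and the sentence ids are made up for illustration.

```python
def hit_at_k(ranked_ids, gold_ids, k):
    """True if any gold id appears among the first k ranked ids."""
    return any(i in gold_ids for i in ranked_ids[:k])


def recall_at_k(examples, k):
    """Fraction of examples with a hit in the top k.

    examples: list of (ranked_ids, gold_ids) pairs.
    """
    if not examples:
        raise ValueError("examples must not be empty")
    hits = sum(hit_at_k(ranked, gold, k) for ranked, gold in examples)
    return hits / len(examples)


# Toy data with made-up sentence ids.
examples = [
    (["s3", "s1", "s7"], {"s1"}),  # gold sentence at rank 2
    (["s2", "s5", "s9"], {"s2"}),  # gold sentence at rank 1
    (["s4", "s6", "s8"], {"s0"}),  # gold sentence not retrieved
]
print(recall_at_k(examples, 1))  # 1 of 3 claims is a hit
print(recall_at_k(examples, 5))  # 2 of 3 claims are hits
```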
See demo.ipynb for a demo.