Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for SQuAD 2.0 #157

Closed
1 task
fmikaelian opened this issue Jun 20, 2019 · 6 comments
Closed
1 task

Support for SQuAD 2.0 #157

fmikaelian opened this issue Jun 20, 2019 · 6 comments
Assignees

Comments

@fmikaelian
Copy link
Collaborator

fmikaelian commented Jun 20, 2019

  • Train new pre-trained models on SQuAD 2.0
@fmikaelian fmikaelian changed the title Add SQuAD 2.0 model Support for SQuAD 2.0 Jun 25, 2019
@thomasvn
Copy link
Contributor

thomasvn commented Jul 8, 2019

Hi!

Just curious but what are the specs for the machine used to fine-tune the model on SQuADv1.1 and SQuADv2.0?

How long did 1.1 take you? And how long do you think 2.0 will take you?

@andrelmfarias
Copy link
Collaborator

Hi @thomasvn,

The fine-tuning of the Reader (BERT for QA) was run in an AWS EC2 p3.2xlarge machine (GPU Tesla V100 16GB). It took about 2 hours to complete (2 epochs on SQuAD 1.1 train was enough to achieve SOTA results on SQuAD 1.1 dev).

I think that 2 epochs on SQuAD 2.0 would take a little bit more than 3h

@thomasvn
Copy link
Contributor

thomasvn commented Jul 8, 2019

Awesome. Thanks for putting it into perspective!

@andrelmfarias andrelmfarias self-assigned this Jul 12, 2019
@alex-movila
Copy link

there is already pretrained model here
Could you adapt it fo cdQA?

@andrelmfarias
Copy link
Collaborator

Thanks @alex-movila, we will test this pre-trained model with cdQA and if it runs good we will probably integrate it to the package

@fmikaelian
Copy link
Collaborator Author

Closing this issue since it is part of the XLNet roadmap (PR #205, Issue #196)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants