
Reduce model size when using Bert Embedder #2897

Closed
faizan30 opened this issue May 28, 2019 · 3 comments

Comments

@faizan30

System (please complete the following information):

  • OS: Linux
  • Python version: 3.7.1
  • AllenNLP version: 0.8.3

Question

  • I'm working on a text classification task using PretrainedBertEmbedder. The size of the classification model is over 400 MB. How can I reduce the model size while still using BERT?

Token embedder:

"token_embedders": {
"bert": {
"type": "bert-pretrained",
"pretrained_model": "bert-base-uncased",
"top_layer_only": true
}
}

Token indexer:

"token_indexers": {
    "bert": {
        "type": "bert-pretrained",
        "pretrained_model": ".pretrained/bert/bert-base-uncased.tar.gz",
        "do_lowercase": true,
        "max_pieces": 100
    }
}
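
For what it's worth, bert-base-uncased alone is roughly 110M parameters, about 420 MB at float32, so the encoder itself dominates the file. A quick sketch for checking where the megabytes go (assuming the weights file is a plain PyTorch state dict and that the BERT parameter keys contain ".bert_model.", both of which may differ in your setup):

    import torch

    # Path and key substring below are assumptions; adjust to your archive.
    state = torch.load("model_state.th", map_location="cpu")
    bert_bytes = sum(v.numel() * v.element_size()
                     for k, v in state.items() if ".bert_model." in k)
    total_bytes = sum(v.numel() * v.element_size() for v in state.values())
    print(f"BERT: {bert_bytes / 1e6:.0f} MB of {total_bytes / 1e6:.0f} MB total")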

@faizan30 changed the title Reduce model size using Bert Embedder → Reduce model size when using Bert Embedder on May 29, 2019
@kernelmachine
Contributor

I don't think you can reduce the model size beyond your classifier's parameters unless you retrain a smaller BERT. If you need more efficient training, tricks like mixed-precision training (#2149) and gradient accumulation (#2721) are in the works.
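
For reference, gradient accumulation amounts to something like the following in plain PyTorch (a minimal sketch, not the AllenNLP implementation tracked in #2721; the {"loss": ...} return convention is assumed):

    import torch

    def train_with_accumulation(model, optimizer, data_loader, accumulation_steps=4):
        # Update weights once every `accumulation_steps` batches, approximating
        # a larger effective batch size without the extra memory.
        model.train()
        optimizer.zero_grad()
        for step, batch in enumerate(data_loader):
            loss = model(**batch)["loss"]           # assumes the model returns {"loss": ...}
            (loss / accumulation_steps).backward()  # scale so accumulated grads average out
            if (step + 1) % accumulation_steps == 0:
                optimizer.step()
                optimizer.zero_grad()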

@faizan30
Author

Are the BERT embeddings saved as part of the weights file? If so, is there a way to separate my model weights from the BERT weights?
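
Something along these lines is what I have in mind (a rough sketch; the parameter prefix is a guess and would need to match the actual state_dict key names):

    import torch

    # Guessed prefix for the BERT parameters inside the saved state dict; the
    # actual prefix depends on the model configuration, so inspect state.keys().
    BERT_PREFIX = "text_field_embedder.token_embedder_bert.bert_model."

    state = torch.load("model_state.th", map_location="cpu")

    # Keep only the task-specific (non-BERT) weights.
    task_state = {k: v for k, v in state.items() if not k.startswith(BERT_PREFIX)}
    torch.save(task_state, "task_only_state.th")

    # At load time, the pretrained BERT weights would come from the original
    # archive, and the task weights are overlaid with strict=False so the
    # missing BERT keys are tolerated:
    #     model.load_state_dict(task_state, strict=False)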

@faizan30
Author

Thanks for the response @kernelmachine .
