-
Notifications
You must be signed in to change notification settings - Fork 2.3k
PretrainedTransformerMismatchedIndexer fails silently when given empty strings as input. #4612
Comments
PretrainedTransformerMismatchedIndexer
fails silently when given empty strings as input.
Thanks for the thorough bug report! Just FYI, I fixed the pip freeze output to be what was intended; you can see that it's now in a drop down, so it's not so annoying. Can you also include what I agree that throwing an error somewhere is the right fix. I'm not certain where that error should be, but right now I'm thinking probably in the tokenizer. Seeing the output of |
Ah, thanks for fixing pip freeze. It turns out that, in addition to empty strings, weird unicode characters can also mess things up; presumably they're not recognized by the token indexer. I added an example in the code snippet above to show what happens. I've included a
|
Ok, figured it out. We need a allennlp/allennlp/modules/token_embedders/pretrained_transformer_mismatched_embedder.py Line 108 in e840a58
That resolves the issue. Can you open a PR for this that includes a simple test based on the minimal example you gave above? That would be awesome. |
I changed that line to I've got a PR for this, but I didn't want to submit since the test suite wouldn't pass if the PR were accepted as is. Let me know if I should submit anyhow. |
Sorry I wasn't specific enough; it's the division by zero that's the problem, so you need to clamp the lengths to a min of 1. So, |
PR submitted: #4615 |
Checklist
master
branch of AllenNLP.pip freeze
.Description
When the
PretrainedTransformerMismatchedEmbedder
is used to embed an empty string''
, runningbackward
on the output of the embedder producesnan
gradients in the transformer parameters, but it's tough to track down where they came from. It's easier to see with an example; see "Steps to Reproduce".Suggested fix: The
PretrainedTransformerMismatchedEmbedder
should throw an error or give a warning when given an empty string as input. I'm happy to implement this if I can get some guidance on the appropriate files to change.The output of
pip freeze
is long and (I think) uninformative, so I've moved it to the bottom of this bug report.Related issues or possible duplicates
Steps to reproduce
Runnable example below. The first example will not produce
nan
gradients, but the second one will.Environment
OS: Linux
Python version: 3.7.6
Pip freeze output:
``` -e git+https://github.com/allenai/allennlp.git@73220d7#egg=allennlp -e git+https://github.com/allenai/allennlp-models.git@a730fed9424bcbe21186fc7866b195ea9ac7ecc5#egg=allennlp_models attrs==19.3.0 autopep8 @ file:///tmp/build/80754af9/autopep8_1592412889138/work backcall @ file:///home/conda/feedstock_root/build_artifacts/backcall_1592338393461/work beautifulsoup4==4.8.1 blis==0.4.1 boto3==1.14.20 botocore==1.17.20 catalogue==1.0.0 certifi==2020.6.20 chardet==3.0.4 click==7.1.2 conllu==3.0 cymem==2.0.3 decorator==4.4.2 docutils==0.15.2 filelock==3.0.12 flake8==3.8.3 flaky==3.7.0 future==0.18.2 h5py==2.10.0 idna==2.10 importlib-metadata @ file:///tmp/build/80754af9/importlib-metadata_1593446408836/work ipdb==0.11 ipython==7.9.0 ipython-genutils==0.2.0 javapackages==4.3.2 jedi @ file:///home/conda/feedstock_root/build_artifacts/jedi_1592619900914/work jmespath==0.10.0 joblib==0.16.0 jsonnet @ file:///home/conda/feedstock_root/build_artifacts/jsonnet_1590349750875/work jsonpickle==1.4.1 lxml==4.5.2 mccabe==0.6.1 more-itertools==8.4.0 murmurhash==1.0.2 mypy @ file:///tmp/build/80754af9/mypy_1593442617121/work mypy-extensions==0.4.3 nltk==3.5 numpy==1.19.0 overrides==3.1.0 packaging==20.4 pandas==0.25.2 parso==0.7.0 pexpect==4.8.0 pickleshare==0.7.5 plac==1.1.3 pluggy==0.13.1 preshed==3.0.2 prompt-toolkit==2.0.10 protobuf==3.12.2 psutil==5.7.0 ptyprocess==0.6.0 py==1.9.0 py-rouge==1.1 pycodestyle==2.6.0 pyflakes==2.2.0 Pygments==2.6.1 pyparsing==2.4.7 pytest==5.4.3 python-dateutil==2.8.1 python-Levenshtein==0.12.0 pytz==2020.1 PyXB==1.2.4 regex==2020.6.8 requests==2.24.0 responses==0.10.15 rope==0.17.0 s3transfer==0.3.3 sacremoses==0.0.43 scikit-learn==0.23.1 scipy==1.5.1 semantic-version==2.8.5 sentencepiece==0.1.91 six @ file:///home/conda/feedstock_root/build_artifacts/six_1590081179328/work soupsieve==2.0.1 spacy==2.2.4 srsly==1.0.2 tensorboardX==2.1 thinc==7.4.0 threadpoolctl==2.1.0 tokenizers==0.8.1rc1 toml @ file:///tmp/build/80754af9/toml_1592853716807/work torch==1.6.0 tqdm==4.47.0 traitlets==4.3.3 transformers==3.0.2 typed-ast==1.4.1 typing-extensions @ file:///tmp/build/80754af9/typing_extensions_1592847887441/work urllib3==1.25.9 wasabi==0.7.0 wcwidth @ file:///home/conda/feedstock_root/build_artifacts/wcwidth_1592931742287/work word2number==1.1 zipp==3.1.0 ```The text was updated successfully, but these errors were encountered: