XLM question-answering pipeline is flaky #28000

Closed

fxmarty opened this issue Dec 13, 2023 · 1 comment
Comments

fxmarty (Contributor) commented Dec 13, 2023

System Info

transformers main; also tested on commits from the last three weeks, same issue

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

from transformers import AutoTokenizer, AutoModelForQuestionAnswering, pipeline

# Run the same example repeatedly; the failure below only occurs on some iterations.
for i in range(50):
    model = AutoModelForQuestionAnswering.from_pretrained("hf-internal-testing/tiny-random-XLMModel")
    tokenizer = AutoTokenizer.from_pretrained("hf-internal-testing/tiny-random-XLMModel")
    pipe = pipeline("question-answering", model=model, tokenizer=tokenizer)
    question = "Whats my name?"
    context = "My Name is Philipp and I live in Nuremberg."
    outputs = pipe(question, context)

This sometimes fails with:

Traceback (most recent call last):
  File "<tmp 4>", line 23, in <module>
    outputs = pipe(question, context)
  File "/home/fxmarty/hf_internship/transformers/src/transformers/pipelines/question_answering.py", line 393, in __call__
    return super().__call__(examples[0], **kwargs)
  File "/home/fxmarty/hf_internship/transformers/src/transformers/pipelines/base.py", line 1132, in __call__
    return next(
  File "/home/fxmarty/hf_internship/transformers/src/transformers/pipelines/pt_utils.py", line 125, in __next__
    processed = self.infer(item, **self.params)
  File "/home/fxmarty/hf_internship/transformers/src/transformers/pipelines/question_answering.py", line 563, in postprocess
    "start": np.where(char_to_word == token_to_orig_map[s])[0][0].item(),
KeyError: 5

Expected behavior

No error. I can have a look if I have time.
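For reference, a minimal sketch of one way this KeyError can arise (illustration only, not the actual pipeline code; the dictionary contents and array sizes below are made up): token_to_orig_map only has entries for context tokens, so if the selected start/end token index lands on a question or special token, the lookup in postprocess raises a KeyError.

import numpy as np

# Hypothetical illustration: only context tokens are mapped to original words.
token_to_orig_map = {2: 0, 3: 1, 4: 2}   # token index -> original word index (context only)
start_logits = np.random.rand(8)         # tiny random model -> scores are essentially noise
s = int(np.argmax(start_logits))         # can land on a question/special token, e.g. 5

if s not in token_to_orig_map:
    print(f"token index {s} has no entry in token_to_orig_map -> KeyError in postprocess")
else:
    print(f"token index {s} maps to original word {token_to_orig_map[s]}")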

amyeroberts (Collaborator) commented

Hi @fxmarty - thanks for raising this!

To help with debugging - has this been observed with other checkpoints or only the tiny random ones for testing?
