Feature request

Longformer takes `global_attention_mask` as input in the current transformers ONNX export. Hence, it is currently not supported with ORTModel.
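As a rough illustration of what supporting this would involve, here is a minimal sketch of running such an export directly with onnxruntime, feeding the extra `global_attention_mask` input that ORTModel does not currently forward. The file name `longformer.onnx`, the input names, and the shapes are assumptions based on the transformers export, not taken from optimum itself:

```python
# Minimal sketch (not the ORTModel API): run a Longformer ONNX export with
# onnxruntime directly, passing the extra global_attention_mask input.
import numpy as np
import onnxruntime as ort

# Assumed file name; produced beforehand, e.g. with the transformers ONNX export.
session = ort.InferenceSession("longformer.onnx", providers=["CPUExecutionProvider"])

batch_size, seq_len = 1, 1024
input_ids = np.random.randint(0, 30000, size=(batch_size, seq_len), dtype=np.int64)
attention_mask = np.ones((batch_size, seq_len), dtype=np.int64)

# Global attention on the first token only (the usual choice for classification).
global_attention_mask = np.zeros((batch_size, seq_len), dtype=np.int64)
global_attention_mask[:, 0] = 1

outputs = session.run(
    None,
    {
        "input_ids": input_ids,
        "attention_mask": attention_mask,
        "global_attention_mask": global_attention_mask,
    },
)
print(outputs[0].shape)
```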
Motivation

Before going forward, it could be good to benchmark Longformer with the transformers ONNX export vs. the custom export at https://github.com/microsoft/onnxruntime/tree/main/onnxruntime/python/tools/transformers/models/longformer
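If it helps, a rough timing loop along these lines could serve for that comparison; the two file names are placeholders, and the custom ONNX Runtime export may expose different input names, so the feed dict would need adjusting:

```python
# Rough latency sketch for comparing two Longformer ONNX files
# (transformers export vs. the custom ONNX Runtime export).
import time
import numpy as np
import onnxruntime as ort

def bench(path, feed, n_runs=20):
    session = ort.InferenceSession(path, providers=["CPUExecutionProvider"])
    session.run(None, feed)  # warmup run
    start = time.perf_counter()
    for _ in range(n_runs):
        session.run(None, feed)
    return (time.perf_counter() - start) / n_runs

seq_len = 1024
feed = {
    "input_ids": np.random.randint(0, 30000, size=(1, seq_len), dtype=np.int64),
    "attention_mask": np.ones((1, seq_len), dtype=np.int64),
    "global_attention_mask": np.zeros((1, seq_len), dtype=np.int64),
}

# Placeholder paths for the two exports being compared.
for path in ["longformer_transformers.onnx", "longformer_custom.onnx"]:
    print(path, f"{bench(path, feed) * 1000:.1f} ms/run")
```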
Your contribution

I can see how we could support this, but I'm worried that adding custom cases like this to ORTModel adds overhead.
As an alternative, would it be possible/reasonable to use another model's config to optimize Longformer? For example, putting
{ "model_type": "bert" }
in config.json to optimize Longformer? Thanks @fxmarty
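For reference, the suggestion above would roughly correspond to running ONNX Runtime's transformers optimizer on the Longformer export while declaring it a BERT-like model. This is only a sketch, not something validated for Longformer: whether the BERT fusion patterns actually match Longformer's attention graph is exactly the open question, and the path, num_heads, and hidden_size values are placeholders:

```python
# Sketch of optimizing a Longformer ONNX export with model_type="bert",
# as suggested above. Untested for Longformer; values are placeholders.
from onnxruntime.transformers import optimizer

optimized = optimizer.optimize_model(
    "longformer.onnx",   # placeholder path to the exported model
    model_type="bert",   # treat the graph as BERT-like
    num_heads=12,
    hidden_size=768,
)
optimized.save_model_to_file("longformer_optimized.onnx")
```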