[Ascend NPU] Improve torchbenchmark results #42

shink · 2025-01-26T02:30:16Z

Some benchmarks fail on NPU, and there is still work we can do to improve the results.

Torchbenchmark issues tracker

CV models

vision_maskrcnn

Installation problems

fastNLP_Bert
opacus_cifar10 (Network problem)
torch_multimodal_clip (Network problem)

OOM

llava

NPU out of memory. Tried to allocate 174.00 MiB (NPU 0; 29.50 GiB total capacity; 28.26 GiB already allocated; 28.26 GiB current active; 34.72 MiB free; 29.10 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation.
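
To judge whether the fragmentation hint applies, it helps to pull the numbers out of the message and compare reserved against allocated memory. The following is a minimal sketch, not part of torchbenchmark or torch_npu: the helper names `parse_oom_message` and `cached_but_unused_mib` are hypothetical, and the regular expressions assume the message layout shown above.

```python
import re

# The message reported for llava, copied from this issue (truncated after the figures).
OOM_MESSAGE = (
    "NPU out of memory. Tried to allocate 174.00 MiB (NPU 0; 29.50 GiB total capacity; "
    "28.26 GiB already allocated; 28.26 GiB current active; 34.72 MiB free; "
    "29.10 GiB reserved in total by PyTorch)"
)

# Units that appear in the message, converted to MiB.
_UNIT_TO_MIB = {"MiB": 1.0, "GiB": 1024.0}


def parse_oom_message(message):
    """Extract the memory figures (in MiB) from an allocator OOM message."""
    patterns = {
        "requested": r"Tried to allocate ([\d.]+) (MiB|GiB)",
        "total": r"([\d.]+) (MiB|GiB) total capacity",
        "allocated": r"([\d.]+) (MiB|GiB) already allocated",
        "free": r"([\d.]+) (MiB|GiB) free",
        "reserved": r"([\d.]+) (MiB|GiB) reserved in total",
    }
    figures = {}
    for name, pattern in patterns.items():
        match = re.search(pattern, message)
        if match:
            figures[name] = float(match.group(1)) * _UNIT_TO_MIB[match.group(2)]
    return figures


def cached_but_unused_mib(figures):
    """Memory held by the caching allocator but not backing any live tensor."""
    return figures["reserved"] - figures["allocated"]


figures = parse_oom_message(OOM_MESSAGE)
print(f"requested: {figures['requested']:.2f} MiB")
print(f"reserved - allocated: {cached_but_unused_mib(figures):.2f} MiB")
```

Here the gap between reserved and allocated memory is under 1 GiB on a 29.50 GiB device, so the card is close to full either way. If tuning the split size is still worth trying, torch_npu is understood to read allocator options from the `PYTORCH_NPU_ALLOC_CONF` environment variable; treat that as an assumption and check it against the torch_npu documentation for the installed version.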

Runtime Error

sam_fast: No module named 'triton'
timm_efficientdet: not_implemented
simple_gpt: not_implemented
simple_gpt_tp_manual: not_implemented
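
One way to keep failures such as the missing `triton` module from being counted as NPU runtime errors is to check optional dependencies before launching a model and report the run as skipped. This is a sketch under stated assumptions: `MODEL_REQUIREMENTS`, `missing_modules`, and `skip_reason` are hypothetical names, not part of torchbenchmark, and the mapping only records what this issue reports.

```python
import importlib.util

# Hypothetical mapping from benchmark name to top-level modules it needs.
# Only the sam_fast entry comes from this issue.
MODEL_REQUIREMENTS = {
    "sam_fast": ["triton"],
}


def missing_modules(required):
    """Return the top-level module names in `required` that cannot be located."""
    return [name for name in required if importlib.util.find_spec(name) is None]


def skip_reason(model):
    """Return a human-readable skip reason, or None if the model can be attempted."""
    missing = missing_modules(MODEL_REQUIREMENTS.get(model, []))
    if missing:
        return f"{model}: missing module(s) {', '.join(missing)}"
    return None


for model in ("sam_fast", "llava"):
    reason = skip_reason(model)
    print(reason if reason else f"{model}: dependencies found")
```

The check uses `importlib.util.find_spec`, which looks a module up without importing it, so probing for a dependency does not trigger any of that package's import-time side effects.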

shink self-assigned this Jan 26, 2025

shink added the Ascend NPU label Jan 26, 2025

shink mentioned this issue Feb 13, 2025

[Feat] Use torchbenchmark configuration #47

Open

hipudding mentioned this issue Mar 10, 2025

PyTorch Integration test #49

Open
