[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value #22921

chilo-ms · 2024-11-21T19:58:15Z

Use TensorRT and CUDA version fetched at runtime to get the hash value which determines the cache name.

The old way to get the version is at compile/build time that might have some issues in some cases,
ex:
TRT EP uses the TRT version which we or users built against at compile time.
However, users can change different TRT version at run time, that can cause issue because TRT EP always checks the "fixed" TRT version, not the TRT version it uses now. This can cause TRT EP to use incompatible TRT engine cache.

see the github issue here:
#22382 (comment)

yf711

LGTM
I've tested on FRCNN model and now hash values are different when promoting trt lib version.

…time to generate hash value (microsoft#22921) Use TensorRT and CUDA version fetched at **runtime** to get the hash value which determines the cache name. The old way to get the version is at compile/build time that might have some issues in some cases, ex: TRT EP uses the TRT version which we or users built against at compile time. However, users can change different TRT version at run time, that can cause issue because TRT EP always checks the "fixed" TRT version, not the TRT version it uses now. This can cause TRT EP to use incompatible TRT engine cache. see the github issue here: microsoft#22382 (comment)

update

77bebe1

chilo-ms requested review from jywu-msft, yf711 and jingyanwangms November 21, 2024 19:59

chilo-ms added 3 commits December 2, 2024 23:20

add InitProviderOrtApi

2e1d022

fix typo

18e4c0d

still use ORT_VERSION at build time

2b89724

yf711 approved these changes Dec 4, 2024

View reviewed changes

jywu-msft approved these changes Dec 4, 2024

View reviewed changes

chilo-ms merged commit 9b9f881 into main Dec 4, 2024
95 checks passed

chilo-ms deleted the chi/trt_ep_refactor branch December 4, 2024 05:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value #22921

[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value #22921

chilo-ms commented Nov 21, 2024 •

edited

Loading

yf711 left a comment

[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value #22921

[TensorRT EP] Use TRT/CUDA/ORT version from runtime instead of build time to generate hash value #22921

Conversation

chilo-ms commented Nov 21, 2024 • edited Loading

yf711 left a comment

Choose a reason for hiding this comment

chilo-ms commented Nov 21, 2024 •

edited

Loading