
✨[Feature] Support auto data type transformation between int64 <-> int32 #1346

Closed
bowang007 opened this issue Sep 12, 2022 · 6 comments
Labels
feature request New feature or request priority: high release: v1.3 Tagged to be included in v1.3


@bowang007
Collaborator

bowang007 commented Sep 12, 2022

Is your feature request related to a problem? Please describe.
In this graph:

INFO: [Torch-TensorRT - Debug Build] - Partitioned Graph: [Segment Block @0:
    Target: TensorRT

    Graph: graph(%index.1 : Tensor,
      %data.1 : Tensor):
  %2 : int = prim::Constant[value=4]() # test_int64.py:28:0
  %3 : bool = prim::Constant[value=0]() # test_int64.py:28:0
  %4 : NoneType = prim::Constant()
  %index : Tensor = aten::to(%index.1, %2, %3, %3, %4) # test_int64.py:28:0
  %data.3 : Tensor = aten::mul(%data.1, %data.1) # test_int64.py:29:0
  return (%index, %data.3)

Segment Block @1:
    Target: Torch

    Graph: graph(%data.3 : Tensor,
      %index : Tensor):
  %2 : int = prim::Constant[value=1]() # test_int64.py:30:0
  %0 : Tensor = aten::scatter(%data.3, %2, %index, %2) # test_int64.py:30:0
  return (%0)

%index is converted to int32 in Segment Block @0, but in Segment Block @1 the aten::scatter function requires an int64 index and receives an int32 tensor instead.
This happens because TensorRT does not support int64, so Torch-TensorRT casts all int64 tensors to int32 before running them in TensorRT. However, when partitioning is enabled, some operators that fall back to Torch still require int64 inputs to run.
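A minimal standalone repro of the failure mode (assuming PyTorch is installed; this does not involve Torch-TensorRT itself, it only shows that aten::scatter rejects an int32 index):

```python
import torch

data = torch.zeros(3, 5)
index = torch.tensor([[0, 1, 2]])        # default dtype int64 -> scatter works
data.scatter(1, index, 1.0)

# Simulate the int64 -> int32 truncation that the TensorRT segment performs:
index32 = index.to(torch.int32)
try:
    data.scatter(1, index32, 1.0)        # scatter requires an int64 index
except RuntimeError as e:
    print("scatter failed:", e)
```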

Describe the solution you'd like
This could be supported by recording every aten::to operation and inserting casts at the boundaries between Torch and TensorRT segments.
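The proposed boundary-cast logic can be sketched in plain Python (hypothetical function names and a string-based dtype model, not the Torch-TensorRT API): the TensorRT segment records which tensors it truncated from int64 to int32, and a cast back to int64 is inserted before any Torch segment that consumes them as int64.

```python
def run_trt_segment(inputs):
    """Stand-in for a TensorRT segment: truncates any int64 input to int32
    (TensorRT has no int64) and records which tensors were truncated."""
    outputs, truncated = {}, set()
    for name, dtype in inputs.items():
        if dtype == "int64":
            outputs[name] = "int32"
            truncated.add(name)
        else:
            outputs[name] = dtype
    return outputs, truncated

def cast_for_torch_segment(values, truncated, needed_int64):
    """Insert aten::to-style casts back to int64 at the segment boundary
    for tensors the Torch segment requires as int64."""
    casted = dict(values)
    for name in needed_int64:
        if name in truncated:
            casted[name] = "int64"
    return casted

# Segment @0 (TensorRT) truncates %index; Segment @1 (Torch) runs
# aten::scatter, which needs an int64 index.
values, truncated = run_trt_segment({"index": "int64", "data": "float32"})
values = cast_for_torch_segment(values, truncated, needed_int64={"index"})
print(values["index"])  # "int64" again, so the Torch segment can run
```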

@bowang007 bowang007 added the feature request New feature or request label Sep 12, 2022
@bowang007 bowang007 self-assigned this Sep 12, 2022
@ncomly-nvidia
Contributor

@inocsin for viz.

@inocsin
Contributor

inocsin commented Sep 13, 2022

We should also record which values have been truncated in the conversion process.

@Christina-Young-NVIDIA
Collaborator

Duplicate of TensorRT #1546. Is this already supported in the current codebase? Bo needs to confirm so that we can close this issue.

@peri044
Collaborator

peri044 commented Jan 4, 2023

@Christina-Young-NVIDIA
Collaborator

This one is indeed already supported in the master. @bowang007 to confirm and close.

@bowang007
Collaborator Author

Supported in #1407, closing.
