Update kernel matching logic: decouple from op schemas and remove kernel def hashes #12791

edgchen1 · 2022-08-30T19:15:17Z

Motivation

Currently, ORT minimal builds use kernel def hashes to map each node to the kernel that executes it when loading the model. Because the kernel def hashes must be known ahead of time, this works for statically registered kernels, and it works well for the CPU EP.
For this approach to work, the kernel def hashes must also be known at ORT format model conversion time, which means the EP with statically registered kernels must also be enabled then. This is not an issue for the always-available CPU EP. However, we do not want to require that any EP which statically registers kernels is always available too.
Consequently, we explore another approach to match nodes to kernels that does not rely on kernel def hashes. An added benefit of this is the possibility of moving away from kernel def hashes completely, which would eliminate the maintenance burden of keeping the hashes stable.

Approach

In a full build, ORT uses some information from the ONNX op schema to match a node to a kernel. We want to avoid including the ONNX op schema in a minimal build to reduce binary size. Essentially, we will take the necessary information from the ONNX op schema and make it available in a minimal build.
We will decouple the ONNX op schema from the kernel matching logic. The kernel matching logic will instead rely on per-op information which can either be obtained from the ONNX op schema or another source.
This per-op information must be available in a minimal build when there are no ONNX op schemas. We can put it in the ORT format model.
Existing uses of kernel def hashes to look up kernels can be replaced with the updated kernel matching logic. We no longer need to store kernel def hashes in the ORT format model’s session state and runtime optimization representations. We no longer need to keep the logic to generate and ensure stability of kernel def hashes.
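
As a rough illustration of the approach above, the following sketch matches a node to a kernel using per-op information held in a small resolver instead of an op schema. Every name in it (`OpId`, `TypeStrResolverSketch`, `Node`, `KernelDef`, `Matches`, `FindKernel`) is hypothetical and chosen for this example. The commit list mentions a `KernelTypeStrResolver` class, but its real interface is not shown in this conversation and may differ substantially. The sketch also assumes exact op version matching and constrains inputs only, which is a simplification.

```cpp
#include <cassert>
#include <cstddef>
#include <map>
#include <set>
#include <string>
#include <tuple>
#include <utility>
#include <vector>

// All types here are hypothetical stand-ins, not ORT's actual classes.

// Identifies an op. This sketch requires an exact version match to stay short.
struct OpId {
  std::string domain;
  std::string op_type;
  int since_version;
  bool operator<(const OpId& other) const {
    return std::tie(domain, op_type, since_version) <
           std::tie(other.domain, other.op_type, other.since_version);
  }
};

// Per-op information: kernel type string (e.g. "T") -> indices of the node
// inputs that the type string constrains.
using TypeStrToInputs = std::map<std::string, std::vector<std::size_t>>;

// Holds the per-op information. It could be filled from op schemas in a full
// build, or loaded from the model file in a build without op schemas.
class TypeStrResolverSketch {
 public:
  void Register(const OpId& op, TypeStrToInputs info) {
    const bool inserted = table_.emplace(op, std::move(info)).second;
    assert(inserted && "op registered twice");
    (void)inserted;  // avoid an unused-variable warning when NDEBUG is set
  }

  // Returns the constrained input indices, or nullptr if the op or the type
  // string is unknown.
  const std::vector<std::size_t>* Resolve(const OpId& op,
                                          const std::string& type_str) const {
    const auto op_it = table_.find(op);
    if (op_it == table_.end()) return nullptr;
    const auto ts_it = op_it->second.find(type_str);
    return ts_it == op_it->second.end() ? nullptr : &ts_it->second;
  }

 private:
  std::map<OpId, TypeStrToInputs> table_;
};

struct Node {
  OpId op;
  std::vector<std::string> input_types;  // e.g. {"float", "float"}
};

struct KernelDef {
  std::string name;
  OpId op;
  // Type string -> element types this kernel supports for it.
  std::map<std::string, std::set<std::string>> type_constraints;
};

// A kernel matches when the op ids agree and every input constrained by one
// of the kernel's type strings has a supported element type.
bool Matches(const Node& node, const KernelDef& kernel,
             const TypeStrResolverSketch& resolver) {
  if (node.op < kernel.op || kernel.op < node.op) return false;
  for (const auto& constraint : kernel.type_constraints) {
    const auto* inputs = resolver.Resolve(node.op, constraint.first);
    if (inputs == nullptr) return false;
    for (const std::size_t idx : *inputs) {
      if (idx >= node.input_types.size()) return false;
      if (constraint.second.count(node.input_types[idx]) == 0) return false;
    }
  }
  return true;
}

// Returns the first matching kernel, or nullptr if none matches.
const KernelDef* FindKernel(const Node& node,
                            const std::vector<KernelDef>& kernels,
                            const TypeStrResolverSketch& resolver) {
  for (const auto& kernel : kernels) {
    if (Matches(node, kernel, resolver)) return &kernel;
  }
  return nullptr;
}
```

In this sketch, `FindKernel` depends only on the resolver, so where the per-op information came from does not affect the matching code. That separation is the point of the decoupling described above.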

lgtm-com · 2022-09-15T23:42:20Z

This pull request introduces 1 alert and fixes 1 when merging da5c9f4 into 739b567 - view on LGTM.com

new alerts:

1 for Unused static function

fixed alerts:

1 for Explicit returns mixed with implicit (fall through) returns

lgtm-com · 2022-09-16T05:30:25Z

This pull request introduces 6 alerts and fixes 1 when merging 716b756 into b935524 - view on LGTM.com

new alerts:

5 for Uncontrolled data used in path expression
1 for Unused static function

fixed alerts:

1 for Explicit returns mixed with implicit (fall through) returns

edgchen1 · 2022-09-16T23:59:22Z

winml/test/scenario/cppwinrt/CustomOps.cpp

@@ -593,7 +593,7 @@ static void CustomKernelWithCustomSchema() {
    floatTensorEdgeDesc.edgeType = MLOperatorEdgeType::Tensor;
    floatTensorEdgeDesc.tensorDataType = MLOperatorTensorDataType::Float;



@fdwr @jeffbloo please take a look at these changes too

fdwr · 2022-09-17

🤔 Looks like a custom op WinML test. @smk2007? Seems okay 🤷‍♂️ (T to T1 makes sense, given the definition above), but I'm content if the test passes.

edgchen1 · 2022-09-17T00:41:18Z

onnxruntime/core/providers/cpu/cpu_execution_provider.cc

@@ -790,8 +790,6 @@ class ONNX_OPERATOR_KERNEL_CLASS_NAME(kCpuExecutionProvider, kOnnxDomain, 17, ST
 // !!PLEASE READ BELOW!! Following that, add new entries above this comment

 /*  *** IMPORTANT! ***
- If kernel registrations are incorrectly updated, ORT format models get broken as the kernel hashes may be invalidated.


@skottmckay should this be replaced with another comment? I think it's still important to correctly register kernels

skottmckay · 2022-09-19

How would they register it incorrectly or correctly in a way that matters here, given that the kernel def builder no longer generates a hash?

snnn

The changes to yaml files LGTM. Sorry I didn't look into the other files.

lgtm-com · 2022-09-17T02:01:31Z

This pull request introduces 1 alert and fixes 1 when merging 396a957 into b48f71f - view on LGTM.com

new alerts:

1 for Unused static function

fixed alerts:

1 for Explicit returns mixed with implicit (fall through) returns

edgchen1 · 2022-09-20T01:11:03Z

/azp run orttraining-linux-gpu-ci-pipeline

azure-pipelines · 2022-09-20T01:11:13Z

Azure Pipelines successfully started running 1 pipeline(s).

edgchen1 · 2022-09-20T18:48:27Z

/azp run Linux Eager Mode CI Pipeline

azure-pipelines · 2022-09-20T18:48:37Z

Azure Pipelines successfully started running 1 pipeline(s).

edgchen1 added 30 commits May 3, 2022 07:02

Save work.

0db5888

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

c346b73

Save work

21545be

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

0746d1b

save work

983294d

Remove unused code.

4d6ba51

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

43f6340

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

8859fbb

save work

106eba9

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

33a940c

Fix to pass tests.

ed0d91d

save work

bed0a2e

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

568ebbf

Update flatbuffers schema.

1cc880f

Save work

8a8e4b2

Save work.

a3c78b2

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

f96986c

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

9463b11

Update compile_schema.py to first delete generated Python files.

0753293

save changes

1b21a0c

build fix

3f9e936

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

532467c

small fix

31fd820

Update KernelRegistry, KernelRegistryManager, KernelTypeStrResolver classes and fix usages.

92436bf

Add KernelTypeStrResolver parameter to IExecutionProvider::GetCapability.

b35c048

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

937d694

save/load kernel_type_str_resolver

9a4acad

remove kernel hashes from graph partitioning, other updates

ee010ba

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

8a2863b

Merge remote-tracking branch 'origin/master' into edgchen1/kernel_matching_experiment

8e6af8b

edgchen1 added 3 commits September 15, 2022 13:25

address PR comments

00f057f

Add reference about ORT format model breaking change to version check error message.

5222c10

Merge remote-tracking branch 'origin/main' into edgchen1/static_kernel_update_fix

da5c9f4

edgchen1 requested a review from snnn September 15, 2022 22:01

small fixes

716b756

fdwr requested a review from jeffbloo September 16, 2022 04:51

Merge remote-tracking branch 'origin/main' into edgchen1/static_kernel_update

adf351a

edgchen1 added 2 commits September 16, 2022 17:04

more fixes

e15fbe8

update comments referring to kernel def hashes

396a957

snnn approved these changes Sep 17, 2022

jeffbloo approved these changes Sep 20, 2022

skottmckay approved these changes Sep 20, 2022

edgchen1 merged commit 454f77c into main Sep 20, 2022

edgchen1 deleted the edgchen1/static_kernel_update branch September 20, 2022 21:25

edgchen1 mentioned this pull request Sep 21, 2022

Consolidate enabled/default kernel def type constraints #13034

Merged

jywu-msft mentioned this pull request Sep 21, 2022

Add CANN EP #12416

Merged
