OVEP 1.21.0 Development Updates #23080
Conversation
@jywu-msft @adrianlizarraga Kindly review & merge.

@microsoft-github-policy-service agree company="Intel"

/azp run Linux OpenVINO CI Pipeline

Azure Pipelines successfully started running 1 pipeline(s).

/azp run Big Models,Linux Android Emulator QNN CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,ONNX Runtime Web CI Pipeline

Azure Pipelines successfully started running 9 pipeline(s).

/azp run Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,Windows x64 QNN CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline

Azure Pipelines successfully started running 5 pipeline(s).

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline

Azure Pipelines successfully started running 3 pipeline(s).
### Description

OVEP development changes for the ORT 1.21 release.

### Motivation and Context

- Has critical bug fixes
- Improved performance optimizations for both memory & inference latency (intel#513)
- Enabled model compilation using NPUW (intel#508)
- Fixed support for EPContext embed mode 0 for lower memory utilization
- Updated NuGet package name to `Intel.ML.OnnxRuntime.OpenVino`
- Fixed QDQ stripping logic on NPU
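Given the package rename noted above, consumers would presumably reference the new NuGet ID with the standard .NET CLI (a sketch only; the package ID is taken from this PR's description, and no specific version is implied):

```shell
# Add the renamed OpenVINO EP package to a .NET project
# (replaces any reference to the previous package name).
dotnet add package Intel.ML.OnnxRuntime.OpenVino
```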