[QNN EP] Enable QnnGpu backend in QNN EP. #24435
Conversation
/azp run Linux QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows ARM64 QNN CI Pipeline,Linux Android Emulator QNN CI Pipeline
Azure Pipelines successfully started running 4 pipeline(s).
Force-pushed from ce75ea8 to 5cdbae7
/azp run Big Models,Linux Android Emulator QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows GPU TensorRT CI Pipeline,Windows x64 QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI
/azp run Linux CPU CI Pipeline, Linux CPU Minimal Build E2E CI Pipeline, Linux GPU CI Pipeline, Linux GPU TensorRT CI Pipeline, MacOS CI Pipeline, ONNX Runtime Web CI Pipeline, onnxruntime-binary-size-checks-ci-pipeline, Linux QNN CI Pipeline, Linux OpenVINO CI Pipeline
Azure Pipelines successfully started running 3 pipeline(s).
Azure Pipelines successfully started running 6 pipeline(s).
/azp run Windows GPU Doc Gen CI Pipeline
Commenter does not have sufficient privileges for PR 24435 in repo microsoft/onnxruntime
@HectorSVC The Windows GPU doc gen failure looks like a network issue. If it is a required check, could you please rerun it? In fact, all of the pipeline failures look unrelated to this change.
From a follow-up PR that excludes QnnGpu.dll from the Windows x64 NuGet package:

Description
Excludes QnnGpu.dll from the Windows x64 NuGet package because it is not available for that architecture.
Motivation and Context
Fixes a failure in the QNN packaging pipeline:

```shell
CreateNativePackage:
  Generating nuspec for the native Microsoft.ML.OnnxRuntime.QNN nuget package...
  python ..\tools\nuget\generate_nuspec_for_native_nuget.py --package_version 1.22.0-dev-20250421-0439-2abab8d --package_name Microsoft.ML.OnnxRuntime.QNN --target_architecture x64 --build_config RelWithDebInfo --native_build_path D:\a\_work\1\b\RelWithDebInfo\RelWithDebInfo --packages_path D:\a\_work\1\b\packages --ort_build_path D:\a\_work\1\b --sources_path D:\a\_work\1\s --commit_id 2abab8d --is_release_build False --execution_provider None --nuspec_name NativeNuget.nuspec
  1 file(s) copied.
  1 file(s) copied.
  nuspec_name: NativeNuget.nuspec
  Bundling native shared library artifacts into Microsoft.ML.OnnxRuntime nuget package...
  nuget pack NativeNuget.nuspec
  Attempting to build package from 'NativeNuget.nuspec'.
##[error]EXEC(0,0): Error NU5019: File not found: 'D:\a\_work\1\b\RelWithDebInfo\RelWithDebInfo\QnnGpu.dll'.
EXEC : error NU5019: File not found: 'D:\a\_work\1\b\RelWithDebInfo\RelWithDebInfo\QnnGpu.dll'. [D:\a\_work\1\s\csharp\OnnxRuntime.CSharp.proj]
##[error]csharp\OnnxRuntime.CSharp.proj(109,5): Error MSB3073: The command "nuget pack NativeNuget.nuspec" exited with code 1.
```

Introduced by this PR: #24435
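As a rough illustration only (this is not the actual generate_nuspec_for_native_nuget.py change), an architecture-conditional exclusion like the one described above typically has the following shape; the function name and file list here are hypothetical.

```python
# Illustrative sketch only: gate QnnGpu.dll on the target architecture when
# collecting QNN runtime binaries for the nuspec. The function and the list
# of files below are hypothetical, not taken from the real script.
def qnn_runtime_binaries(target_architecture: str) -> list[str]:
    files = ["QnnCpu.dll", "QnnHtp.dll"]
    # Per the description above, QnnGpu.dll is not available for x64, so it is
    # only packaged for other architectures to avoid the NU5019 error.
    if target_architecture != "x64":
        files.append("QnnGpu.dll")
    return files

print(qnn_runtime_binaries("x64"))    # ['QnnCpu.dll', 'QnnHtp.dll']
print(qnn_runtime_binaries("arm64"))  # ['QnnCpu.dll', 'QnnHtp.dll', 'QnnGpu.dll']
```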
Description
Enable the GPU backend for the onnxruntime QNN EP as well.
Motivation and Context
Why is this change required? What problem does it solve?
It allows the QNN EP to also run on the QNN GPU backend.
With this change, many models such as resnet_50, google_vit_base_fp32, and squeezenet1.0-7 can now run fully on the QNN EP GPU backend. The pass counts for the onnxruntime node tests and versioned operator tests on the GPU backend are now comparable to those on the HTP backend.
Note: Currently, QNN_LOG_LEVEL_DEBUG needs to be enabled to run correctly.
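For context, a minimal sketch of how a session would select the QNN GPU backend through the existing QNN EP provider options. The model file name and input shape are illustrative, and whether ONNX Runtime's verbose log severity is what maps to QNN_LOG_LEVEL_DEBUG is an assumption based on the note above.

```python
import numpy as np
import onnxruntime as ort

# Verbose session logging; per the note above, QNN_LOG_LEVEL_DEBUG currently
# needs to be enabled for the GPU backend to run correctly. Assuming ORT's
# verbose severity is what maps to the QNN debug log level.
sess_options = ort.SessionOptions()
sess_options.log_severity_level = 0  # 0 = VERBOSE

# "backend_path" is the QNN EP provider option that selects the QNN backend
# library; pointing it at QnnGpu.dll (libQnnGpu.so on Linux/Android) targets
# the GPU backend instead of QnnHtp.dll / QnnCpu.dll.
qnn_provider_options = {"backend_path": "QnnGpu.dll"}

# "resnet_50.onnx" is one of the models named above; file name and input
# shape are illustrative placeholders.
session = ort.InferenceSession(
    "resnet_50.onnx",
    sess_options=sess_options,
    providers=[("QNNExecutionProvider", qnn_provider_options)],
)

input_name = session.get_inputs()[0].name
dummy_input = np.random.rand(1, 3, 224, 224).astype(np.float32)
outputs = session.run(None, {input_name: dummy_input})
print([o.shape for o in outputs])
```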