[WebNN] Fallback unsupported integer input and output of a WebNN graph to int32 #24425


Merged 3 commits into microsoft:main on Apr 20, 2025

Conversation

@Honry (Contributor) commented Apr 15, 2025

Some WebNN backends support only a limited set of data types for the inputs and outputs of a WebNN graph, yet support more data types for intermediate nodes. To work around this limitation, we implement a data type fallback mechanism. (Note: currently we only support falling back to int32 for certain integer data types.)

If a data type is not supported for a graph's input or output but is supported for intermediate nodes, we will:

  1. Save the input MLTensor with 'int32' data type,
  2. Convert the input data from ORT to int32,
  3. Insert a cast operation into the WebNN graph to convert the input back to its original data type,
  4. Insert a cast operation into the WebNN graph to convert the output to 'int32',
  5. Convert the output data from int32 back to its original data type.
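The data conversions at the ORT boundary (steps 2 and 5) can be sketched as below. This is an illustrative sketch, not the PR's actual code: the helper names `convertToInt32`/`convertFromInt32` and the list of fallback types are assumptions.

```typescript
// Hypothetical helpers for the int32 fallback, not ORT's real implementation.

type FallbackType = 'int8' | 'uint8' | 'uint32' | 'int64';

// Step 2: widen input data from its original integer type to int32.
function convertToInt32(data: Int8Array | Uint8Array | Uint32Array | BigInt64Array): Int32Array {
  const out = new Int32Array(data.length);
  for (let i = 0; i < data.length; i++) {
    // For this sketch, int64 elements are assumed to fit in int32.
    out[i] = Number(data[i]);
  }
  return out;
}

// Step 5: narrow int32 output data back to the tensor's original type.
function convertFromInt32(data: Int32Array, type: FallbackType) {
  switch (type) {
    case 'int8': return Int8Array.from(data);
    case 'uint8': return Uint8Array.from(data);
    case 'uint32': return Uint32Array.from(data);
    case 'int64': return BigInt64Array.from(data, (v) => BigInt(v));
  }
}
```

Steps 3 and 4 correspond to inserting WebNN `cast` operations into the graph so that the rest of the graph still sees the original data type.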

@Honry (Contributor, Author) commented Apr 15, 2025

@fdwr, @guschmue, PTAL, thanks!

@fs-eire (Contributor) commented Apr 15, 2025

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,ONNX Runtime Web CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline

@fs-eire (Contributor) commented Apr 15, 2025

/azp run Linux QNN CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Linux Android Emulator QNN CI Pipeline,Android CI Pipeline,iOS CI Pipeline,ONNX Runtime React Native CI Pipeline,Linux DNNL CI Pipeline,Linux MIGraphX CI Pipeline,Linux ROCm CI Pipeline

Azure Pipelines successfully started running 7 pipeline(s).

Azure Pipelines will not run the associated pipelines, because the pull request was updated after the run command was issued. Review the pull request again and issue a new run command.

@fdwr added the `ep:WebNN` (WebNN execution provider) label on Apr 15, 2025
@fdwr (Contributor) left a comment

Thank you for adding this. I hope it gets pushed down a level into the Chromium CoreML backend so that callers wouldn't need to repeat this manual tensor conversion and cast-node insertion, but this demonstrates that it's possible.

@Honry force-pushed the input-data-type-fallback branch from e5858e4 to 66ba571 on April 16, 2025 09:04
@Honry (Contributor, Author) commented Apr 16, 2025

> Thank you for adding this. I hope it gets pushed down a level into the Chromium CoreML backend so that callers wouldn't need to repeat this manual tensor conversion and cast-node insertion, but this demonstrates that it's possible.

Indeed, according to Reilly's comment, it can't be done in Chromium. :(

@fdwr (Contributor) commented Apr 17, 2025

Merge conflicts 💥.

Honry added 2 commits April 17, 2025 08:12
…h to int32

@Honry force-pushed the input-data-type-fallback branch from 66ba571 to 5962428 on April 17, 2025 00:26
@Honry (Contributor, Author) commented Apr 17, 2025

> Merge conflicts 💥.

@fdwr, fixed. Please help restart the CI.

@fdwr previously approved these changes Apr 17, 2025
@fdwr (Contributor) left a comment

👍

@fdwr (Contributor) commented Apr 17, 2025

`'webnnDataTypeToSize' was used before it was defined` (@typescript-eslint/no-use-before-define)

https://github.com/microsoft/onnxruntime/actions/runs/14505301882/job/40694637717?pr=24425#step:3:333

@Honry (Contributor, Author) commented Apr 17, 2025

> `'webnnDataTypeToSize' was used before it was defined` (@typescript-eslint/no-use-before-define)
>
> https://github.com/microsoft/onnxruntime/actions/runs/14505301882/job/40694637717?pr=24425#step:3:333

Oops, fixed.
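For context, `@typescript-eslint/no-use-before-define` fires when a binding is read above its declaration. A minimal reproduction of the fix is sketched below; the map contents and the `tensorByteLength` helper are assumptions for illustration, not the PR's actual code.

```typescript
// The rule requires a `const` to be declared before the first statement that
// reads it; otherwise the lint fires (and at runtime the read would hit the
// temporal dead zone). Map contents here are assumed, not ORT's real table.

// Fix: declare the lookup table first...
const webnnDataTypeToSize = new Map<string, number>([
  ['int8', 8],
  ['uint8', 8],
  ['int32', 32],
  ['int64', 64],
]);

// ...then reference it afterwards.
function tensorByteLength(dataType: string, elementCount: number): number {
  const bits = webnnDataTypeToSize.get(dataType);
  if (bits === undefined) {
    throw new Error(`Unsupported WebNN data type: ${dataType}`);
  }
  return (bits / 8) * elementCount;
}
```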

@fdwr (Contributor) commented Apr 17, 2025

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline,Windows GPU WebGPU CI Pipeline,Windows OpenVINO CI Pipeline

@fdwr (Contributor) commented Apr 17, 2025

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

@fdwr (Contributor) commented Apr 17, 2025

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI

@fdwr (Contributor) commented Apr 17, 2025

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

Azure Pipelines successfully started running 1 pipeline(s).

Azure Pipelines successfully started running 2 pipeline(s).

Azure Pipelines successfully started running 3 pipeline(s).


@fdwr (Contributor) left a comment

👍

@Honry (Contributor, Author) commented Apr 18, 2025

(screenshots of the failing CI check)

@fdwr, it looks like this failing task is unrelated to my change. Do I need to rebase onto main?

@fdwr (Contributor) commented Apr 19, 2025

> @fdwr, it looks like this failing task is unrelated to my change. Do I need to rebase onto main?

Restarting the failed job. If it fails again, try re-merging (or rebasing). Alas, it's a required test.

@fs-eire fs-eire merged commit 67c87a1 into microsoft:main Apr 20, 2025
70 of 76 checks passed
ashrit-ms pushed a commit that referenced this pull request Apr 24, 2025
…h to int32 (#24425)

intbf pushed a commit to intbf/onnxruntime that referenced this pull request Apr 25, 2025
…h to int32 (microsoft#24425)


Signed-off-by: bfilipek <bartlomiej.filipek@intel.com>
Labels: ep:WebNN (WebNN execution provider)
Projects: None yet
Development: Successfully merging this pull request may close these issues: None yet
3 participants