Insights: microsoft/onnxruntime
Overview
201 Pull requests merged by 65 people
-
[WebNN] Support AveragePool with count_include_pad == 1
#24465 merged
Apr 20, 2025 -
Add infrastructure for auto EP selection
#24430 merged
Apr 20, 2025 -
[WebNN] Fallback unsupported integer input and output of a WebNN graph to int32
#24425 merged
Apr 20, 2025 -
Disambiguate the winml OrtModel with the model editing API OrtModel.
#24474 merged
Apr 19, 2025 -
Rename matmul_4bits_quantizer.py to matmul_nbits_quantizer.py
#24472 merged
Apr 19, 2025 -
[Docs] EPcontext error handling
#24471 merged
Apr 19, 2025 -
Fix Windows_CI_GPU_DML_Dev_x86 and Windows_CI_GPU_DML_Dev_arm64 pipeline steps
#24365 merged
Apr 18, 2025 -
Allow EpContext models with input/output models completely in buffers
#24463 merged
Apr 18, 2025 -
Add session config to return an error if model needs to be compiled
#24416 merged
Apr 18, 2025 -
Upgrade transformers to 4.48.0 for llama2
#24302 merged
Apr 18, 2025 -
[VitisAI EP] Implement new overload of CreateProvider() called with session options
#24445 merged
Apr 18, 2025 -
[OVEP] update: Introduce enable_causallm provider option in OVEP (preview)
#24457 merged
Apr 18, 2025 -
[QNN EP] Update the generated Qnn context binary file name to align with the design doc
#24461 merged
Apr 17, 2025 -
Validate CreateSessionFromArray with ep.context_enable enabled
#24176 merged
Apr 17, 2025 -
[webgpu] Supports batch and zero points in MatMulNBits WideTileProgram
#24390 merged
Apr 17, 2025 -
include corresponding Nuget version info to Node.js binding
#24450 merged
Apr 17, 2025 -
[DML EP] Support in-memory external data TensorProto
#24391 merged
Apr 17, 2025 -
[OpenVINO EP] Implement new overload of CreateProvider() for OpenVINO EP
#24406 merged
Apr 17, 2025 -
coreml: fix wrong C++ code in documentation
#24403 merged
Apr 17, 2025 -
[Native WebGPU] Handle corner cases in naive kernel.
#24438 merged
Apr 17, 2025 -
[nodejs] update Node.js binding document for 1.22 release
#24452 merged
Apr 17, 2025 -
[QNN EP] Reverting a recent logging change for QNN GPU only,
#24444 merged
Apr 17, 2025 -
Fix cuda memory access violation in GQA FlashAttention
#24447 merged
Apr 17, 2025 -
Fix MatmulTransposeFusion when input A and B are the same
#24373 merged
Apr 17, 2025 -
[nodejs] add missing header files for linux build
#24448 merged
Apr 16, 2025 -
[WebNN EP] Automatically use ml-tensor for outputs
#24282 merged
Apr 16, 2025 -
Fix compile issue in Azure EP unit test
#24446 merged
Apr 16, 2025 -
[nodejs] upgrade N-API version to 6
#24443 merged
Apr 16, 2025 -
Add GQA fusion for CUDA EP
#24335 merged
Apr 16, 2025 -
ONNXRuntime OpenVINO - Release 1.22
#24394 merged
Apr 16, 2025 -
Update QNN version to 2.33.2
#24440 merged
Apr 16, 2025 -
[QNN EP] Enable QnnGpu backend in QNN EP.
#24435 merged
Apr 16, 2025 -
[nodejs] support Node.js binding in multi env
#24366 merged
Apr 16, 2025 -
Add static quantization runner
#24114 merged
Apr 16, 2025 -
Support canonical EP names in SessionOptionsAppendExecutionProvider
#24433 merged
Apr 16, 2025 -
update the QNN download link
#24439 merged
Apr 16, 2025 -
Clean up Compile API
#24436 merged
Apr 16, 2025 -
[nodejs] allow installing DLLs from Nuget feed
#24418 merged
Apr 15, 2025 -
Enable Inference Results Saving in onnx-test-runner
#24210 merged
Apr 15, 2025 -
[webgpu] Fix batch-norm for ort-web-tests
#24404 merged
Apr 15, 2025 -
[node.js] fix handling null value for externalData
#24428 merged
Apr 15, 2025 -
Fix the Python API docs update pipeline
#24434 merged
Apr 15, 2025 -
[web] fix 'npm run pull:wasm' for main branch
#24429 merged
Apr 15, 2025 -
[Doc] EPContext with weight sharing
#24141 merged
Apr 15, 2025 -
[Native WebGPU] Support shared memory version of ReduceOps
#24399 merged
Apr 15, 2025 -
[MacOS] Add MLProgram Gather op for CoreML EP
#24387 merged
Apr 15, 2025 -
Fix doc gen issue
#24424 merged
Apr 15, 2025 -
[Native WebGPU EP] Increase error tolarance limit f16
#24420 merged
Apr 15, 2025 -
Fix typo in option text s/buildings/bindings
#24412 merged
Apr 15, 2025 -
workaround linux CI pipeline: pin triton to v3.2.0
#24423 merged
Apr 15, 2025 -
[webgpu] Enable DP4A MatMul generation path for Qualcomm
#24408 merged
Apr 15, 2025 -
Add Resize cubic mode without antialias (scales = [1, ≥1, ≥1, 1])
#24385 merged
Apr 15, 2025 -
Support mixed precision in quantization for RTN
#24401 merged
Apr 14, 2025 -
[WebGPU EP] Fixes bugs in slice operator implementation
#24415 merged
Apr 14, 2025 -
Replace gsl::narrow with narrow in xnnpack code
#24392 merged
Apr 14, 2025 -
[webgpu] move comments out from WGSL in FlashAttention impl
#24400 merged
Apr 14, 2025 -
[CPU] Add 8bit support to matmulnbits quantizer
#24384 merged
Apr 14, 2025 -
Add API to compile a model
#24207 merged
Apr 12, 2025 -
ORT-OVEP Doc update
#24395 merged
Apr 12, 2025 -
Update protobuf-java to 3.25.5
#24333 merged
Apr 12, 2025 -
Bump vite from 6.2.5 to 6.2.6 in /js/web/test/e2e/exports/testcases/vite-default
#24396 merged
Apr 11, 2025 -
[Native WebGPU EP] Add InstranceNormalization
#24369 merged
Apr 11, 2025 -
Migrate OpenVino Pipeline to Github Actions
#24297 merged
Apr 11, 2025 -
[webgpu] fix 2 bugs in Conv/ConvTranspose
#24388 merged
Apr 11, 2025 -
[QNN EP] Add support for int64 shape input of Expand Op
#24389 merged
Apr 11, 2025 -
[webgpu] Use workgroup memory to reduce register pressure
#24286 merged
Apr 11, 2025 -
MlasTranspose multi-threads support.
#24261 merged
Apr 11, 2025 -
[web] allow NPM tests to run nodejs binding for webgpu
#24370 merged
Apr 11, 2025 -
Make test `CApiTest.RequestLoadCancellation` deterministic
#24348 merged
Apr 10, 2025 -
Remove build-nuget from dml-vs-2022.yml
#24372 merged
Apr 10, 2025 -
[WebNN EP] Support GroupQueryAttention(GQA)
#23416 merged
Apr 10, 2025 -
Pin wheel version to 0.45.1 (#24349)
#24367 merged
Apr 10, 2025 -
fix compile on latest 24.02
#24364 merged
Apr 9, 2025 -
Update unknown provider error message with current providers
#24352 merged
Apr 9, 2025 -
[EP Perf] Extension to post benchmark perf from local devices
#24236 merged
Apr 9, 2025 -
[WebNN] Support MatMulNBits op
#24142 merged
Apr 9, 2025 -
[web] fix TypeScript typing and add a test case
#24354 merged
Apr 9, 2025 -
Support WebGPU build for android and ios
#24308 merged
Apr 9, 2025 -
[QNN EP] Add support for Int64 tensors
#24351 merged
Apr 9, 2025 -
[QNN-EP] LoRAv2 Document update
#24205 merged
Apr 9, 2025 -
Pin wheel version to 0.45.1
#24349 merged
Apr 9, 2025 -
Merge 'main' into 'win-ort-main' @ 39e585ff2b
#24353 merged
Apr 9, 2025 -
[webgpu][dawn API optimization] workgroup dispatch
#24329 merged
Apr 8, 2025 -
Group build args
#24337 merged
Apr 8, 2025 -
[WebGPU EP] Exclude zero-dim input test case for WebGPU EP.
#24350 merged
Apr 8, 2025 -
[web] revise flag `ort.env.wasm.simd`
#24314 merged
Apr 8, 2025 -
ROCm: Remove -Wno-interference-size compiler flag
#24326 merged
Apr 8, 2025 -
[webgpu] optimize SkipLayerNormalization operator
#24164 merged
Apr 8, 2025 -
[webgpu] fix bias-add
#24336 merged
Apr 8, 2025 -
[webgpu] Fix bias_split_gelu
#24342 merged
Apr 8, 2025 -
Remove explicit batch network flag for TRT 10+
#24298 merged
Apr 8, 2025 -
[webgpu] fix the reflect mode issue of Pad
#24202 merged
Apr 8, 2025 -
webgpu support for DequantizeLinear
#24268 merged
Apr 8, 2025 -
Use WASM f32x4 relaxed min/max for relaxed simd build
#24324 merged
Apr 8, 2025 -
[webgpu] Flash attention for generation
#23808 merged
Apr 8, 2025 -
Bump version to 1.21.1
#24328 merged
Apr 8, 2025 -
[VitisAI EP] export InferShapes to VitisAIEP
#23881 merged
Apr 8, 2025 -
[Native WebGPU] Exclude WebGPU EP from Conv3D tests.
#24327 merged
Apr 7, 2025 -
[webgpu] Fix ROUND_PREFER_CEIL issue of Resize operator
#24229 merged
Apr 7, 2025 -
Update Vitis-AI-ExecutionProvider.md
#24254 merged
Apr 7, 2025 -
Implement load cancellation ability
#24257 merged
Apr 7, 2025 -
[webgpu][dawn API optimization] reduce number of calls to buffer APIs
#24315 merged
Apr 7, 2025 -
[webgpu] Use 1D dispatch groups for attention
#24228 merged
Apr 7, 2025 -
Add ConvTranspose cache key
#24317 merged
Apr 5, 2025 -
Fix 'minimal_power' to 'minimum_power' for DirectML performance selection (perf test)
#24303 merged
Apr 5, 2025 -
[webgpu][dawn API optimization] reduce number of calls to wgpuDeviceGetQueue
#24313 merged
Apr 4, 2025 -
[Native WebGPU] Add Conv, ConTranspose and FusedConv
#24186 merged
Apr 4, 2025 -
Add support for uint8_t as data type for GatherBlockQuantized
#24239 merged
Apr 4, 2025 -
Update packaging pipeline for Nodejs binding
#24301 merged
Apr 4, 2025 -
Support Gemma3 with Clip fused attention
#24280 merged
Apr 4, 2025 -
[WebGPU] fix cache key of AttentionProbs/VxAttentionScore
#24309 merged
Apr 4, 2025 -
Bump vite from 6.2.4 to 6.2.5 in /js/web/test/e2e/exports/testcases/vite-default
#24312 merged
Apr 4, 2025 -
[WebGPU] fix Pad cache key
#24305 merged
Apr 4, 2025 -
[QNN-EP] Fix ONNX context model helper.
#24271 merged
Apr 4, 2025 -
Cherry picks for 1.21.1
#24188 merged
Apr 4, 2025 -
upgrade action shellcheck to v1.30.0
#24304 merged
Apr 4, 2025 -
[webgpu] Optimize MatMulNBits for f16 Block32 prefill performance
#23908 merged
Apr 4, 2025 -
Update publish-nuget.yml to correct feed.
#24299 merged
Apr 4, 2025 -
[QNN-EP] Enhance QNN-EP support for Softmax with opset < 13.
#24180 merged
Apr 3, 2025 -
Bump image-size from 1.1.1 to 1.2.1 in /js/react_native/e2e
#24278 merged
Apr 3, 2025 -
Bump next from 15.2.3 to 15.2.4 in /js/web/test/e2e/exports/testcases/nextjs-default
#24283 merged
Apr 3, 2025 -
[webgpu][dawn API optimization] reduce number of calls to wgpuDeviceHasFeature
#24281 merged
Apr 3, 2025 -
[webgpu] test_layer_normalization_3d_axis0_epsilon
#24276 merged
Apr 3, 2025 -
Expose TRT preview features as EP option
#24212 merged
Apr 3, 2025 -
Support load TensorRT V3 plugin
#24211 merged
Apr 3, 2025 -
Pin vcpkg version
#24284 merged
Apr 3, 2025 -
[WebGPU EP] Implements ceil mode for Average Pool
#24270 merged
Apr 2, 2025 -
Adding build-system to pyproject.toml
#24216 merged
Apr 2, 2025 -
Ensure to use correct GPU device in RunSince when it's invoked by new thread
#24192 merged
Apr 2, 2025 -
Migrate pull:wasm to github action
#24269 merged
Apr 2, 2025 -
[VitisAI] Fixed include error.
#24199 merged
Apr 2, 2025 -
Exclude onnxruntime-inference-examples directory from Component Gover…
#24258 merged
Apr 1, 2025 -
Update xcode and iphoneSimulatorVersion after MacOS-14
#24260 merged
Apr 1, 2025 -
[WebGPU EP] fixes bugs in split implementation
#24259 merged
Apr 1, 2025 -
Bump vite from 6.2.3 to 6.2.4 in /js/web/test/e2e/exports/testcases/vite-default
#24255 merged
Apr 1, 2025 -
Bump dsaltares/fetch-gh-release-asset from 1.1.0 to 1.1.2
#24248 merged
Mar 31, 2025 -
Add docs for QNN EP `backend_type` provider option
#24253 merged
Mar 31, 2025 -
upgrade dawn version to 4cb1f9be152a4fa6bb695c08cd707ab078a1e2fb
#24247 merged
Mar 31, 2025 -
Add shader key validation step in WebGPU CI pipeline
#24243 merged
Mar 31, 2025 -
[js/web] Add Wasm Relaxed SIMD support to wasm backend
#22794 merged
Mar 31, 2025 -
Extend pyright exclude list in pyproject.toml
#24246 merged
Mar 31, 2025 -
[webgpu] Fix opset-12 softmax nhwc issue
#24227 merged
Mar 31, 2025 -
[QNN EP] Add platform-agnostic EP option to specify QNN backend, `backend_type`
#24235 merged
Mar 31, 2025 -
Bump actions/cache from 3 to 4
#24250 merged
Mar 31, 2025 -
Bump actions/setup-python from 4 to 5
#24251 merged
Mar 31, 2025 -
[webgpu] fix LayerNorm with empty input
#24244 merged
Mar 29, 2025 -
[webgpu] Fix test_layer_normalization_2d_axis0
#24223 merged
Mar 29, 2025 -
Update linux-dnnl.yml: rename the pipeline
#24240 merged
Mar 29, 2025 -
[WebGPU EP] If Implementation for WebGPU EP
#24242 merged
Mar 29, 2025 -
update the readme doc for the tool ep_weight_sharing_ctx_gen
#24233 merged
Mar 29, 2025 -
Migrate Web CI into github actions
#24219 merged
Mar 28, 2025 -
Migrate Linux GPU pipelines to Github Actions
#24232 merged
Mar 28, 2025 -
RoPE fp16 avx
#23772 merged
Mar 28, 2025 -
Add React Native namespace back in for iOS
#24218 merged
Mar 28, 2025 -
Improve Shape Inference for GQA
#24143 merged
Mar 28, 2025 -
Fix the pipeline that failed because of vcpkg
#24226 merged
Mar 28, 2025 -
Generate unique names for SliceSplit fusion.
#24217 merged
Mar 27, 2025 -
Further reduce work load for Mac CI pipeline
#24197 merged
Mar 27, 2025 -
[webgpu-ep] Fix test_batchnorm_example
#24184 merged
Mar 27, 2025 -
Move Linux ARM64 CI pipeline and Linux DNNL CI pipeline to Github Actions
#24190 merged
Mar 27, 2025 -
Remove all CG template from pipelines
#24193 merged
Mar 27, 2025 -
Rolling back the python/cuda
#24170 merged
Mar 27, 2025 -
Disable KleidiAI in Python Packaging pipeline MacOS build
#24194 merged
Mar 27, 2025 -
Make the custom nuget packaging pipeline 1ES commpliant.
#24191 merged
Mar 27, 2025 -
[JSEP] adjust edge case logic for scatternd
#24172 merged
Mar 26, 2025 -
upgrade QNN to version 2.32.0.250228
#23977 merged
Mar 26, 2025 -
fix triggering for "Validate Gradle Wrapper" pipeline
#24181 merged
Mar 26, 2025 -
revise mac os pipeline to reduce the amount of jobs
#24177 merged
Mar 26, 2025 -
[wasm] remove --vcpkg in wasm build
#24179 merged
Mar 26, 2025 -
[WebGPU EP] Add GEMM implementation
#24023 merged
Mar 26, 2025 -
[QNN EP] ARM64EC python package remove --vcpkg in build
#24174 merged
Mar 26, 2025 -
Update the min GCC version
#24148 merged
Mar 25, 2025 -
Migrate Zip-Nuget Package Pipeline to 1ES
#23609 merged
Mar 25, 2025 -
Fix layout transformer for FusedConv
#24169 merged
Mar 25, 2025 -
[onnxruntime_perf_test] Fix custom_allocator_ destruction order.
#24136 merged
Mar 25, 2025 -
Bump vite from 6.2.1 to 6.2.3 in /js/web/test/e2e/exports/testcases/vite-default
#24167 merged
Mar 25, 2025 -
Move Linux CPU CI pipeline to Github Actions
#24154 merged
Mar 25, 2025 -
Limit the Pipeline ability to build cuda 11
#24073 merged
Mar 25, 2025 -
Change type len from int to size_t
#24157 merged
Mar 25, 2025 -
Proper Error Message when fp16 model is used for Beam Search in CPU
#24151 merged
Mar 25, 2025 -
Upgrade Big Model pipeline CUDA from 11.8 to 12.x
#24156 merged
Mar 25, 2025 -
[Native WebGPU] Add Matmul
#24046 merged
Mar 25, 2025 -
add cache "onnxnodetests" for node tests
#24150 merged
Mar 25, 2025 -
[js] Add API for accessing metadata of a model's input/output
#23937 merged
Mar 24, 2025 -
[js/web] allow bundler import condition for not bundling wasm
#24014 merged
Mar 24, 2025 -
[webgpu] add option to perserve device and enable in unittest
#24115 merged
Mar 24, 2025 -
Address Windows CUDA build issue
#24149 merged
Mar 24, 2025 -
refactor mac CI pipelines
#24138 merged
Mar 24, 2025 -
[CPU] Add fp16 support to sparse attention
#24015 merged
Mar 24, 2025 -
[mobile] Add Android NuGet BrowserStack test to NuGet packaging pipeline
#23580 merged
Mar 24, 2025 -
[Shape Inference] Add shape inference for QLinearAdd and QLinearMul ops
#24090 merged
Mar 24, 2025 -
Bump next from 15.1.2 to 15.2.3 in /js/web/test/e2e/exports/testcases/nextjs-default
#24132 merged
Mar 24, 2025 -
Fix attention QK linkage error
#24134 merged
Mar 24, 2025 -
Update package.json to make the dist avaliable again
#23991 merged
Mar 23, 2025 -
Update T5 Onnx Export and Optimization
#23949 merged
Mar 23, 2025 -
Deleted the constant SKIP_CUDA_TEST_WITH_DML
#24113 merged
Mar 22, 2025 -
Refactor Webnn IsSupported*() to use constant initializers.
#24118 merged
Mar 21, 2025 -
add test cases for webgpu ep in web
#24117 merged
Mar 21, 2025 -
Integrate KleidiAI for MatMulNBits via MlasQNBitGemm
#23627 merged
Mar 21, 2025 -
skip MOE python test when MPI is not installed
#24116 merged
Mar 21, 2025
45 Pull requests opened by 36 people
-
[webgpu] Use 64 as the workgroup size of DP4AMatMulQuantize
#24129 opened
Mar 21, 2025 -
Fix #24130 by adding an empty dependency group to the code that generates Microsoft.ML.OnnxRuntime.nuspec
#24131 opened
Mar 21, 2025 -
[WIP] hack to investigate parallelization settings
#24147 opened
Mar 24, 2025 -
Fuse Initializers Graph Transform
#24175 opened
Mar 25, 2025 -
[CUDA] upgrade cudnn front end to 1.11
#24189 opened
Mar 26, 2025 -
Set shared memory type based on options during the compilation phase
#24196 opened
Mar 26, 2025 -
fix: GatherND output shape infer error
#24206 opened
Mar 27, 2025 -
Creating a doc for Github Action
#24215 opened
Mar 27, 2025 -
Add CMake option to enable saturation checker for ConvSymKernelAvx2
#24220 opened
Mar 27, 2025 -
Change Github Action's triggers
#24222 opened
Mar 28, 2025 -
Add python bindings to the global thread pool functionality
#24238 opened
Mar 28, 2025 -
Only enable CUDA language if needed
#24256 opened
Mar 31, 2025 -
[QNN-EP] Support for Upsample operator
#24265 opened
Apr 1, 2025 -
Enable tests that pass on the scoreboard
#24294 opened
Apr 3, 2025 -
Remove tests from current_failing_tests list
#24295 opened
Apr 3, 2025 -
Fix ReverseSequence handling when sequence_length == 0
#24320 opened
Apr 6, 2025 -
Remove Unnecessary List Conversions in Input Validation
#24331 opened
Apr 7, 2025 -
Enable SME for sgemm and sbgemm through KleidiAI
#24346 opened
Apr 8, 2025 -
[WebGPU EP] Add EINSUM implementation
#24358 opened
Apr 9, 2025 -
[VitisAI] enable weights sharing
#24359 opened
Apr 9, 2025 -
[ort-build] Pass ORT_EXTRA_INTERFACE_FLAGS to onnxruntime_session
#24368 opened
Apr 9, 2025 -
Implement experimental intermediate cross CPU EP allocation
#24371 opened
Apr 9, 2025 -
Update whisper transformer module to 4.48.0
#24382 opened
Apr 10, 2025 -
No float from chars on gcc9
#24393 opened
Apr 11, 2025 -
Add LSX support for S8S8 and S8U8 GEMM kernels
#24397 opened
Apr 11, 2025 -
Add ability to disable Model Editor API
#24402 opened
Apr 12, 2025 -
Fix reproducible
#24409 opened
Apr 14, 2025 -
Print out result of find_package
#24410 opened
Apr 14, 2025 -
Do not add LD_LIBRARY_PATH to search path when not defined
#24411 opened
Apr 14, 2025 -
Remove executable permission bit from source files
#24413 opened
Apr 14, 2025 -
(WIP) Utilizing llama.cpp to expand support of more quantization types with…
#24414 opened
Apr 14, 2025 -
Added ort 1.22 roadmap.
#24422 opened
Apr 14, 2025 -
[VitisAI] refactor VitisAI EP for open source
#24426 opened
Apr 15, 2025 -
[WebNN] Always execute decomposed *SimplifiedLayerNormalization in FP32
#24437 opened
Apr 16, 2025 -
Integration with ONNX 1.18.0
#24449 opened
Apr 16, 2025 -
NV TensorRT RTX EP - initial commit
#24456 opened
Apr 17, 2025 -
add GatherBlockQuantized to webgpu ep
#24460 opened
Apr 17, 2025 -
[CoreML] Add support for int64
#24462 opened
Apr 17, 2025 -
[QNN-EP] Translate FP-to-Bool Cast by NotEqual.
#24466 opened
Apr 18, 2025 -
update test case for test_convtranspose_autopad_same
#24477 opened
Apr 19, 2025 -
[WebGPU] Optimize GEMM with vec4
#24478 opened
Apr 21, 2025 -
Bump ruff from 0.9.5 to 0.11.6
#24480 opened
Apr 21, 2025 -
Refine device discovery a bit more.
#24481 opened
Apr 21, 2025 -
Add script to run GH workflows for a branch.
#24482 opened
Apr 21, 2025 -
[WIP][webgpu] Support any block size matmulnbits efficiently
#24483 opened
Apr 21, 2025
43 Issues closed by 31 people
-
Exception during initialization using Intel NPU (Intel AI boost)
#23305 closed
Apr 16, 2025 -
[Performance] CUDAExecutionProvider without RoiAlign (opset 16 version)
#21990 closed
Apr 15, 2025 -
[Mobile] I After onnx is compiled using NDK, the compiled result is then linked to the C++ program
#24024 closed
Apr 15, 2025 -
[Build] Build for android with xnnpack enabled and Exceptions disabled has error with gsl::narrow
#24383 closed
Apr 15, 2025 -
[Build] build onnxruntime for vsinpu error
#23316 closed
Apr 14, 2025 -
Inputs are reordered by TensorRT provider
#22729 closed
Apr 11, 2025 -
[Build] Cuda Execution Provider library is needed despite we only use TensoRT Execution provider
#22960 closed
Apr 11, 2025 -
[Build] pip Installation Failure on aarch64 (NVIDIA Jetson Orin)
#24380 closed
Apr 10, 2025 -
[Build] cublasLt64_11.dll is required even when building cuda 12
#24360 closed
Apr 10, 2025 -
[Web] TypeScript typings are not available with moduleResolution: bundler
#24343 closed
Apr 9, 2025 -
[Web] Upgrading from 1.20.1 to 1.21.* breaks Segment Anything models on WebGPU
#23183 closed
Apr 9, 2025 -
[Web] Allow to disable the SIMD detection check
#24292 closed
Apr 8, 2025 -
[Build] Build python interface for Onnxruntime-qnn on aarch64 Open Embedded Linux
#24102 closed
Apr 8, 2025 -
Access violation in nvcuda64.dll on Session::Run
#24339 closed
Apr 8, 2025 -
Python 3.11.11 can't install onnxruntime==1.20.1
#24338 closed
Apr 8, 2025 -
ONNX cannot save the XGBoost binary classifier properly when trained on an imbalanced dataset.
#24334 closed
Apr 7, 2025 -
[Mobile] GPU for Android: Multiple publish output files with the same relative path were found
#24272 closed
Apr 4, 2025 -
[Mobile] NNAPI cannot Split without num_outputs
#24274 closed
Apr 4, 2025 -
[Documentation] Please document that NuGet package needs vc_redist
#24310 closed
Apr 4, 2025 -
[Build] CUDA version linkage
#23841 closed
Apr 4, 2025 -
[Build] ORT, DML, OpenVINO Python wheel build - "OpenVINOExecutionProvider doesn't support memcpy"
#23824 closed
Apr 3, 2025 -
Kernel error for T5-style beam search with FP-16 subgraphs
#23728 closed
Apr 3, 2025 -
[Build] Onnxruntime v1.21.0 fails to build with GCC-13.3.0 when given x86_64 `-mf16c` flag
#24289 closed
Apr 3, 2025 -
[Build] absl build error
#24245 closed
Apr 3, 2025 -
c++ bert onnx output nan
#24263 closed
Apr 2, 2025 -
onnx export failing - diffusion model support
#24234 closed
Mar 31, 2025 -
[to delete]
#24252 closed
Mar 31, 2025 -
c# onnxruntime bug
#24213 closed
Mar 28, 2025 -
Error: two nodes with same node name (/GatherSliceToSplitFusion/)
#24203 closed
Mar 27, 2025 -
[Mobile] Memory crash after repeated inference with dynamic shape input
#22520 closed
Mar 26, 2025 -
[onnxruntime-node] Unable to create Inference Session after upgrade to v1.21.0
#24173 closed
Mar 25, 2025 -
[Shape Inference] Failure in shape inference for QLinearAdd using symbolic_shape_infer.py
#24028 closed
Mar 25, 2025 -
[Build] 'SKIP_CUDA_TEST_WITH_DML': undeclared identifier
#24054 closed
Mar 22, 2025 -
CUDA inference on Azure's partial GPU
#24039 closed
Mar 21, 2025
64 Issues opened by 55 people
-
[Feature Request] Please bring more detailed information on error message.
#24479 opened
Apr 21, 2025 -
Need help - C++ ONNX Failing
#24476 opened
Apr 19, 2025 -
[Web] WebGPU performance discrepancy between ONNX and ORT formats in browser: WASM ops dominate in ORT model
#24475 opened
Apr 19, 2025 -
ORT aborts when loading the attached model
#24473 opened
Apr 19, 2025 -
Integrate with ONNX 1.18.0 release branch
#24468 opened
Apr 18, 2025 -
[Build] building ONNX fail with error "undefined"
#24467 opened
Apr 18, 2025 -
[Feature Request] Add Fusion Transformer for WebNN EP Decomposed GQA Node
#24454 opened
Apr 17, 2025 -
OnnxRuntime C# bindings and multi gpu memory allocation.
#24453 opened
Apr 17, 2025 -
[Web] WebGPU Incorrect predictions in ONNX model when using Electron on Intel devices
#24442 opened
Apr 16, 2025 -
onnxruntime InferenceSession error
#24441 opened
Apr 16, 2025 -
[Build] Build incorrectly assumes that AVX-VNNI is a core part of AVX2
#24432 opened
Apr 15, 2025 -
Internal computational problems using quantified model inference
#24427 opened
Apr 15, 2025 -
[Performance] [QNN EP] Performance gap between onnxruntime QNN EP and Genie from QNN SDK.
#24417 opened
Apr 14, 2025 -
Incompatible dimensions for matrix multiplication due to ORT_ENABLE_ALL
#24407 opened
Apr 14, 2025 -
[Performance][Regression] Numerical mismatch between ORT 1.21.0 and PyTorch
#24398 opened
Apr 11, 2025 -
GPU Memory Leak on CUDAExecutionProvider with Specific Batch Size Sequence
#24376 opened
Apr 10, 2025 -
How to confirm if the current hardware supports operators such as fp16/int8/int4?
#24375 opened
Apr 10, 2025 -
quantize onnx models to INT8
#24374 opened
Apr 10, 2025 -
[Build] CUDA Minimal build still needs CUDNN_HOME to be specified
#24361 opened
Apr 9, 2025 -
onnxruntime errors out due to ORT_ENABLE_EXTENDED optimization: Error merging shape info for output
#24340 opened
Apr 8, 2025 -
how to get memory allocate detail in model infer?
#24323 opened
Apr 7, 2025 -
[Feature Request] Please publish 3.13t wheels for Windows
#24318 opened
Apr 5, 2025 -
GLU Operator gives different Results on Dml EP compared to CPU EP
#24311 opened
Apr 4, 2025 -
[Performance] Require Advance Profiling when running with DmlExecutionProvuder
#24306 opened
Apr 4, 2025 -
GetDimensionsCount return wrong value for dynamic shape model
#24300 opened
Apr 3, 2025 -
[Performance] Memory usage difference on Windows and Linux
#24296 opened
Apr 3, 2025 -
[Build] Onnxruntime v1.21.0 fails to build with GCC-13
#24290 opened
Apr 3, 2025 -
SIGSEGV when calling OrtSession.run()
#24288 opened
Apr 3, 2025 -
OnnxRuntime 1.21.0 Java package failed to load on Windows
#24287 opened
Apr 3, 2025 -
[Feature Request] Consider support int4/uint4 for reshape op of default CPU EP
#24285 opened
Apr 3, 2025 -
[Build] Build script for MacOS fails for targets older than 13.4 because tests can not be built
#24277 opened
Apr 2, 2025 -
[Build] Building v1.21.0: unsupported instruction 'vpdpbusds'
#24275 opened
Apr 2, 2025 -
[Build] OpenVINO ep for macOS
#24273 opened
Apr 2, 2025 -
ONNX preloaded dlls are incompatible with CUDNN torch version
#24266 opened
Apr 1, 2025 -
[Build] v1.21.0 - QNNExecutionProvider dissapeared for snapdragon elite X devices
#24262 opened
Apr 1, 2025 -
[Build] [Bug] The compiler doesn't support BFLOAT16!!! on Jetson Nano
#24230 opened
Mar 28, 2025 -
Vector Assertion Failure in InferenceSession Init with Hotplugged-Off Cores on ARM (v1.21.0)
#24221 opened
Mar 27, 2025 -
Questions about using AMD VitisAI EP, how can i run my model on AMD NPU?
#24214 opened
Mar 27, 2025 -
OpenVINO EP not able to use CPU device
#24208 opened
Mar 27, 2025 -
Bug: inconsistent output with transformer models between CUDA and CPU execution providers
#24204 opened
Mar 27, 2025 -
Python Session.run_async Causes Program Exit
#24200 opened
Mar 27, 2025 -
error converting to onnx model
#24198 opened
Mar 27, 2025 -
[Feature Request] Verbose Logging support for onnxruntime-qnn Python package
#24185 opened
Mar 26, 2025 -
[Perf Test] DirectML "performance_preference" option non-functional due to catch22 bug
#24182 opened
Mar 26, 2025 -
[Web] Can't install behind NTLM proxy
#24178 opened
Mar 25, 2025 -
[Build] Must build on Ubuntu 20.04 with gcc 9
#24168 opened
Mar 25, 2025 -
QNN as ONNXruntime backend hangs while executing graph
#24166 opened
Mar 25, 2025 -
[Mobile][WebGPU][FeatureRequest] No true support for WebGPU
#24165 opened
Mar 25, 2025 -
onnxruntime gpu package for Aarch64?
#24159 opened
Mar 25, 2025 -
onnxruntime errors out due to ORT_ENABLE_BASIC optimization: Unexpected data type for Clip 'min' input of 11
#24158 opened
Mar 25, 2025 -
[Build]Linker error when building for macCatalyst: Object file built for macOS
#24153 opened
Mar 24, 2025 -
[Build] MacOS universal binary build failure: "error: unknown target CPU 'armv8-a'"
#24152 opened
Mar 24, 2025 -
[Build] error C2653: 'system_clock': is not a class or namesp ace name
#24145 opened
Mar 24, 2025 -
segmentation fault while using onnxruntime==1.21.0
#24144 opened
Mar 24, 2025 -
onnxruntime-mobile implementation on custom execution provider
#24135 opened
Mar 22, 2025 -
Microsoft.ML.OnnxRuntime Nuget package should have a 'native' dependency group
#24130 opened
Mar 21, 2025
117 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[WebNN EP] Support MultiHeadAttention(MHA)
#24079 commented on
Apr 21, 2025 • 16 new comments -
Fix an issue in wasm nortti build and add minimal build support for vcpkg
#24012 commented on
Apr 10, 2025 • 16 new comments -
Enabling c++20 on linux
#17816 commented on
Apr 2, 2025 • 3 new comments -
Add trace event control for ORT Web performance profiling
#23393 commented on
Apr 18, 2025 • 2 new comments -
[WIP][Native WebGPU] Remove explicit split operator in GQA
#23909 commented on
Apr 15, 2025 • 2 new comments -
[QNN-EP] Add BQ and LPBQ support in QNN EP
#24097 commented on
Mar 24, 2025 • 1 new comment -
[Build] Missmatch between CMake config and folder structure of onnxruntime-linux-x64-1.21.0.tgz
#24003 commented on
Apr 11, 2025 • 0 new comments -
Bad Allocation Error in ONNX Runtime on Windows x86 CPU When Processing Multiple Images Sequentially
#23938 commented on
Apr 11, 2025 • 0 new comments -
in cmake/CMakeList.txt all avx related option all set off, do we need do anything to use avx features?
#11833 commented on
Apr 11, 2025 • 0 new comments -
[Web] No way to prevent the default wasm from being bundled
#24009 commented on
Apr 12, 2025 • 0 new comments -
Turning on coreml and turning off coreml are two results
#24033 commented on
Apr 13, 2025 • 0 new comments -
TensorRT Support for Multiple Profiles
#23965 commented on
Apr 14, 2025 • 0 new comments -
[Build] The onnxruntime_tools-1.7.0 tarball on the PYPI site doesn't have requirements.txt and other files
#24048 commented on
Apr 14, 2025 • 0 new comments -
[Build] PyInstaller build with onnxruntime issues with DT_RUNPATH
#24044 commented on
Apr 14, 2025 • 0 new comments -
[Bug] Coqui VITS ONNX model can't be statically quantized.
#16738 commented on
Apr 15, 2025 • 0 new comments -
onnxruntime-web requires webpack, but many users use metro
#24052 commented on
Apr 15, 2025 • 0 new comments -
[Mobile] Maui with ONNX Runtime does not compile for IOS
#22661 commented on
Apr 16, 2025 • 0 new comments -
CoreML failed: Unable to get shape for output
#23262 commented on
Apr 16, 2025 • 0 new comments -
[Performance] does acl support fused conv?
#24063 commented on
Apr 16, 2025 • 0 new comments -
Enable tests that pass locally; remove duplicates
#24123 commented on
Mar 21, 2025 • 0 new comments -
[Build] error: array index 7 is past the end of the array (that has type '__m256[4]')
#23180 commented on
Apr 10, 2025 • 0 new comments -
[Build] memory leaked
#23915 commented on
Apr 10, 2025 • 0 new comments -
[Build] .pc file asks for -lonnxruntime but onnxruntime.a isn't installed
#23959 commented on
Apr 10, 2025 • 0 new comments -
[preprocess] Pad is not folded in Conv when opset_import is > 20
#23973 commented on
Apr 10, 2025 • 0 new comments -
[Performance] does onnxruntime 1.19.0 support sve?
#23983 commented on
Apr 10, 2025 • 0 new comments -
[Build] Unable to Compile ONNX Runtime 1.20.1 with ARMNN Provider on ARM Cortex A78
#23014 commented on
Apr 10, 2025 • 0 new comments -
[Feature Request] use Onnxruntime TensorRT execution provider with lean tensorRT runtime
#23082 commented on
Apr 10, 2025 • 0 new comments -
[Web] Different model output with WebGPU vs WASM (or Python with the CUDA EP)
#24070 commented on
Apr 9, 2025 • 0 new comments -
[DO NOT UNPIN] ORT 1.21.0 Release Candidates available for testing
#23885 commented on
Apr 8, 2025 • 0 new comments -
Using directML to inference accelerate onnxruntime, a crash occurred.
#22514 commented on
Apr 8, 2025 • 0 new comments -
[DO NOT UNPIN] ORT Nightly Package Name Change
#22541 commented on
Apr 8, 2025 • 0 new comments -
Error when I use cuda_runtime.h and OpenVINO EP at the same time
#23941 commented on
Apr 7, 2025 • 0 new comments -
Abs node runs into error with bf16 tensor
#23875 commented on
Apr 6, 2025 • 0 new comments -
[VSINPU]Fix gather OP with scalar indice issue
#24061 commented on
Apr 11, 2025 • 0 new comments -
Bump clang-format from 19.1.7 to 20.1.0
#24058 commented on
Apr 16, 2025 • 0 new comments -
Bump @babel/runtime from 7.25.6 to 7.26.10 in /js/react_native/e2e
#23994 commented on
Apr 11, 2025 • 0 new comments -
Bump @babel/helpers from 7.25.6 to 7.26.10 in /js/react_native/e2e
#23993 commented on
Apr 11, 2025 • 0 new comments -
[DRAFT do not review] Convert graph initializers into OrtValue
#23979 commented on
Mar 24, 2025 • 0 new comments -
[TEST] depthtospace
#23929 commented on
Apr 17, 2025 • 0 new comments -
Add OpenCL EP
#23830 commented on
Apr 17, 2025 • 0 new comments -
NHWC DepthToSpace U8 and its transformation
#23784 commented on
Apr 15, 2025 • 0 new comments -
(WIP) bitnet and t-mac
#23540 commented on
Mar 22, 2025 • 0 new comments -
enable global thread pool in python
#23495 commented on
Mar 25, 2025 • 0 new comments -
Matmul nbits to optimize memory layout for avx instructions
#22203 commented on
Apr 3, 2025 • 0 new comments -
CMake exports for static onnxruntime
#22173 commented on
Apr 17, 2025 • 0 new comments -
[Delivery] Win ARM64 wheels + QNN
#19162 commented on
Apr 20, 2025 • 0 new comments -
Creating ORT inference session from onnx model gives segmentation fault
#24087 commented on
Apr 19, 2025 • 0 new comments -
[Build] Compile error with onnxruntime_providers_cuda.vcxproj
#24099 commented on
Apr 19, 2025 • 0 new comments -
[Performance] Performance Bottleneck due to intra_op_num_threads being set globally
#24101 commented on
Apr 19, 2025 • 0 new comments -
[Build] build error for windows
#23166 commented on
Apr 19, 2025 • 0 new comments -
[Web] `Tensor.fromImage` crops, doesn't resize
#24050 commented on
Apr 18, 2025 • 0 new comments -
Add option "any" for DirectML EP device_filter to onnxruntime perftest binary
#24068 commented on
Apr 18, 2025 • 0 new comments -
Wrong indexing in CPUIDInfo::ArmLinuxInit
#24092 commented on
Apr 18, 2025 • 0 new comments -
ImportError: Unable to import dependency onnxruntime
#24120 commented on
Apr 17, 2025 • 0 new comments -
Crashes when executing model quantification on Deeplabv3
#23985 commented on
Apr 17, 2025 • 0 new comments -
OnnxRuntime gives different outputs on A100 v/s H100 GPU
#24027 commented on
Apr 17, 2025 • 0 new comments -
[Build] compilation error: invalid instruction mnemonic 'vcvtneeph2ps'
#22519 commented on
Apr 17, 2025 • 0 new comments -
[Web] How to use JSEP and WebGPU in static library (missing jsepAlloc or jsepInit)
#23072 commented on
Apr 16, 2025 • 0 new comments -
[Mobile] run speech using sherpa-onnx in the speech module, but if you want to use onnx inference in the translation module, you cannot initialize the ORT task.
#24062 commented on
Apr 16, 2025 • 0 new comments -
C++ Runtime does not recognize supposedly correct input.
#16430 commented on
Mar 28, 2025 • 0 new comments -
[Mobile] Dynamic Shape Challenge: Enabling LLM on QNN-HTP
#23832 commented on
Mar 28, 2025 • 0 new comments -
[Feature Request] Global Threadpool in Python API
#23523 commented on
Mar 27, 2025 • 0 new comments -
the memory leak using valgrind
#23762 commented on
Mar 27, 2025 • 0 new comments -
[Web] Getting Started link on onnxruntime.ai website broken
#23764 commented on
Mar 27, 2025 • 0 new comments -
Blank output issue with CUDAExecutionProvider - Onnx Model Converted to fp16
#23797 commented on
Mar 27, 2025 • 0 new comments -
[Build] Unsupported AVX512-FP16 Instructions in MLAS (vcvtneeph2ps, vcvtneoph2ps)
#24025 commented on
Mar 27, 2025 • 0 new comments -
[Performance] nearest neighbor Resize operator is significantly slower than pytorch for 3D tensors
#14596 commented on
Mar 26, 2025 • 0 new comments -
Assistance with adjusting default Arena Allocator C/C++ API
#23768 commented on
Mar 26, 2025 • 0 new comments -
Why the output of the ONNX MatMul node never be the same as what PyTorch gives?
#23792 commented on
Mar 26, 2025 • 0 new comments -
Application is getting crashed while creating session for the onnxruntime-qnn with QnnCpu backend option.
#24082 commented on
Mar 26, 2025 • 0 new comments -
[TensorRT ExecutionProvider] Cannot infer the model on a GPU device with an ID other than 0
#21276 commented on
Mar 25, 2025 • 0 new comments -
Question about the ONNX Runtime 1.20.2 binary release
#23721 commented on
Mar 25, 2025 • 0 new comments -
[Mobile] Unable to load models in Xamarin iOS
#16463 commented on
Mar 25, 2025 • 0 new comments -
[Web] WASM sigmoid producing numbers below 0 or above 1
#23943 commented on
Mar 24, 2025 • 0 new comments -
[Build] Compilation error when building Onnxrt 1.20.1 with flag onnxruntime_CUDA_MINIMAL=ON with TRT 10.7.23 and Cudnn 9.6.0.74
#23504 commented on
Mar 24, 2025 • 0 new comments -
[Feature Request] Where op for bool input
#24127 commented on
Mar 24, 2025 • 0 new comments -
Adding Execution Provider into ONNX RT
#23732 commented on
Mar 24, 2025 • 0 new comments -
the memory usage not release
#23774 commented on
Mar 24, 2025 • 0 new comments -
[Web] [Feature Request] Ability to abort
#23703 commented on
Mar 23, 2025 • 0 new comments -
[Build] no match for ‘operator=’ (operand types are ‘OrtMemoryInfo’ and ‘const OrtDevice’) in memory_info.cc line 44 when onnxruntime_ENAABLE_MEMORY_PROFILE is enabled
#23750 commented on
Mar 23, 2025 • 0 new comments -
Can load Fluxonnx Modal Components using InferenceSession
#23770 commented on
Mar 23, 2025 • 0 new comments -
[Build] Android x86_64 Cross Compiling on Mac OS
#23648 commented on
Mar 22, 2025 • 0 new comments -
OpenVino Runtime Exception. Unexpected: CPU plug-in doesn't support If operation with dynamic rank. Operation name: input.15
#23757 commented on
Mar 22, 2025 • 0 new comments -
[Mobile] [urgent] iOS application crash at CreateEnv (pointer being freed was not allocated)
#23759 commented on
Mar 22, 2025 • 0 new comments -
Segmentation fault while loading CUDA Provider
#16146 commented on
Mar 21, 2025 • 0 new comments -
OnnxRuntime for Windows on Arm as Arm64EC variant?
#15403 commented on
Mar 21, 2025 • 0 new comments -
[Regression] Floating-point overflow with v1.21
#24119 commented on
Mar 21, 2025 • 0 new comments -
Tensor Backing Buffer Mismatch Detected in Buffer Reuse
#23739 commented on
Mar 21, 2025 • 0 new comments -
[C++, Linux] Segmentation fault when run OrtApi::Run
#23897 commented on
Apr 6, 2025 • 0 new comments -
[Web] Facing this error in WebGPU: Model warmup failed: Error: input 'detection' is missing in 'feeds'.
#23921 commented on
Apr 6, 2025 • 0 new comments -
Xnnpack execution provider Resize::IsOnnxNodeSupported causes crash for models where Resize layer scales tensor is an empty tensor
#23886 commented on
Apr 5, 2025 • 0 new comments -
[OpenVINO GPU] OpenVINO EP shouldn't override the "ACCURACY" precision to "FP32"
#23895 commented on
Apr 5, 2025 • 0 new comments -
[Documentation] Memory Leak in TensorRTProvider example
#23901 commented on
Apr 5, 2025 • 0 new comments -
[Build] WASM static lib build fails: no member named 'Negate' in 'onnxruntime::MLFloat16'
#23769 commented on
Apr 5, 2025 • 0 new comments -
[Training] IR version incompatibility in artifact generation for on-device training
#20726 commented on
Apr 5, 2025 • 0 new comments -
[Build] Can NPU enablement be optional?
#22985 commented on
Apr 4, 2025 • 0 new comments -
[Performance]Do onednn executors depend on Intel platform
#23795 commented on
Apr 4, 2025 • 0 new comments -
[Build] Cross-compile for Android on Windows error
#23796 commented on
Apr 4, 2025 • 0 new comments -
[CPU EP] GatherND crashes with division by zero when batch dimensions mismatch between input and indices
#23828 commented on
Apr 4, 2025 • 0 new comments -
preprocess issues around MeanReduce/Reshape nodes and negative axes
#23868 commented on
Apr 4, 2025 • 0 new comments -
[OpenVINO] SessionOptionsAppendExecutionProvider_OpenVINO API loads NULL config file
#23871 commented on
Apr 4, 2025 • 0 new comments -
The Pad operator has a calculation error in the "reflect" mode.
#23878 commented on
Apr 4, 2025 • 0 new comments -
Please document how to build with new execution provider [Documentation Request]
#20654 commented on
Apr 3, 2025 • 0 new comments -
perf_view shows nothing after json load
#15927 commented on
Apr 3, 2025 • 0 new comments -
[Tests] 1 test fails: OptimizerInitializerTest.LoadExternalData: it throws a different type.
#23816 commented on
Apr 3, 2025 • 0 new comments -
[Build] Where is official build for Unity?
#19964 commented on
Apr 3, 2025 • 0 new comments -
[Performance] model inference in onnxruntime is toooooo slow
#23282 commented on
Apr 3, 2025 • 0 new comments -
C++ inference with GPU (CUDA)
#13934 commented on
Apr 2, 2025 • 0 new comments -
[web] `ort.InferenceSession.create` silently hangs/fails on iOS/iPad browsers if COEP/COOP headers are set
#11679 commented on
Apr 2, 2025 • 0 new comments -
[Performance] Why does inference occupy so much memory?
#23867 commented on
Apr 2, 2025 • 0 new comments -
Importing onnxruntime on AWS Lambdas with ARM64 processor causes crash
#10038 commented on
Apr 1, 2025 • 0 new comments -
Non-zero status code returned while running Resize node
#13975 commented on
Apr 1, 2025 • 0 new comments -
Always getting "Failed to create CUDAExecutionProvider"
#11092 commented on
Mar 31, 2025 • 0 new comments -
[nodejs-binding] Crash during InferenceSession initialization: "Check failed: node->IsInUse()"
#23794 commented on
Mar 30, 2025 • 0 new comments -
onnxruntime slower than xgboost & lightgbm in batch predictions
#886 commented on
Mar 28, 2025 • 0 new comments -
Microsoft.ML.OnnxRuntime.QNN 1.20.1 includes unnecessary files in win-arm64.
#23781 commented on
Mar 28, 2025 • 0 new comments