Only enable CUDA language if needed #24256

gedoensmax · 2025-03-31T20:30:47Z

Description

In my testing it is not needed to even enable the CUDA language in case we are building with CUDA_MINIMAL. If not doing so CUDA toolkit does not have to be installed and fully registered with VS Studio on windows for example.

cc @chilo-ms

gedoensmax · 2025-03-31T21:00:58Z

onnxruntime/test/testdata/custom_op_library/custom_op_library.cc

+#ifdef USE_CUDA_MINIMAL
    Cuda::RegisterOps(domain);
    Cuda::RegisterOps(domain_v2);
-
+#endif


If a memcpy would be used as custom op that should work without needing cvcc during compile. Let me know if that would be an accepted change.

Not very clear to me why we don't test Cuda::RegisterOps() in non USE_CUDA_MINIMAL, i.e. normal cuda build now?

snnn · 2025-03-31T21:16:14Z

/azp run Big Models, Linux CPU Minimal Build E2E CI Pipeline, Linux QNN CI Pipeline, ONNX Runtime Web CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2025-03-31T21:16:45Z

Azure Pipelines successfully started running 8 pipeline(s).

chilo-ms · 2025-04-01T18:49:01Z

The minimal CUDA Windows CI has following error:

cuda_provider_factory.obj(0,0): Error LNK2019: unresolved external symbol "void __cdecl onnxruntime::cuda::Explicit_Impl_Cast(struct CUstream_st *,float const *,double *,unsigned __int64)" (?Explicit_Impl_Cast@cuda@onnxruntime@@YAXPEAUCUstream_st@@PEBMPEAN_K@Z) referenced in function "void __cdecl onnxruntime::cuda::Impl_Cast<float,double>(struct CUstream_st *,float const *,double *,unsigned __int64)" (??$Impl_Cast@MN@cuda@onnxruntime@@YAXPEAUCUstream_st@@PEBMPEAN_K@Z)

gedoensmax · 2025-04-01T21:57:14Z

Yeah found the remaining kernel - cast ops for float to double that TRT has registered as fallback.

chilo-ms · 2025-04-03T17:44:40Z

/azp run Big Models, Linux CPU Minimal Build E2E CI Pipeline, Linux QNN CI Pipeline, ONNX Runtime Web CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline, Windows x64 QNN CI Pipeline

azure-pipelines · 2025-04-03T17:45:10Z

Azure Pipelines successfully started running 7 pipeline(s).

chilo-ms · 2025-04-03T18:52:12Z

Yeah found the remaining kernel - cast ops for float to double that TRT has registered as fallback.

But the cuda::Impl_Cast() being called in TRT EP still needs the corresponding Explicit_Impl_Cast() implementation in unary_elementwise_ops_impl.cu that you excluded from the last commit.

So, it will still end up with linker error in cuda minimal build CI:
Error LNK2019: unresolved external symbol "void __cdecl onnxruntime::cuda::Explicit_Impl_Cast...

gedoensmax added 3 commits March 31, 2025 15:28

only enable CUDA language if needed

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

999ba13

Use CUDA Toolkit include dirs

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

f9ec136

Remove custom ops since we cannot compile a custom op using cuda

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

8b495e4

gedoensmax commented Mar 31, 2025

View reviewed changes

do not support float to double cast when build without CUDA

Loading
Loading status checks…

93d76f9

gedoensmax marked this pull request as draft April 16, 2025 11:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Only enable CUDA language if needed #24256

Only enable CUDA language if needed #24256

gedoensmax commented Mar 31, 2025

gedoensmax Mar 31, 2025

chilo-ms Apr 3, 2025 •

edited

Loading

snnn commented Mar 31, 2025

azure-pipelines bot commented Mar 31, 2025

chilo-ms commented Apr 1, 2025

gedoensmax commented Apr 1, 2025

chilo-ms commented Apr 3, 2025

azure-pipelines bot commented Apr 3, 2025

chilo-ms commented Apr 3, 2025

Only enable CUDA language if needed #24256

Are you sure you want to change the base?

Only enable CUDA language if needed #24256

Conversation

gedoensmax commented Mar 31, 2025

Description

gedoensmax Mar 31, 2025

Choose a reason for hiding this comment

chilo-ms Apr 3, 2025 • edited Loading

Choose a reason for hiding this comment

snnn commented Mar 31, 2025

azure-pipelines bot commented Mar 31, 2025

chilo-ms commented Apr 1, 2025

gedoensmax commented Apr 1, 2025

chilo-ms commented Apr 3, 2025

azure-pipelines bot commented Apr 3, 2025

chilo-ms commented Apr 3, 2025

chilo-ms Apr 3, 2025 •

edited

Loading