Skip to content

Commit d6df4f2

Browse files
authoredApr 5, 2025··
Add ConvTranspose cache key (#24317)
### Description <!-- Describe your changes. --> ### Motivation and Context <!-- - Why is this change required? What problem does it solve? - If it fixes an open issue, please link to the issue here. -->
1 parent 55c1a3b commit d6df4f2

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed
 

Diff for: ‎onnxruntime/core/providers/webgpu/nn/conv_backprop_webgpu.cc

+2
Original file line numberDiff line numberDiff line change
@@ -181,6 +181,8 @@ ConvTranspose2DProgram CreateConvTranspose2DProgram(const std::vector<const Tens
181181
program.AddInput({bias, ProgramTensorMetadataDependency::TypeAndRank, reduced_bias_shape, components});
182182
}
183183
program.AddOutput({output, ProgramTensorMetadataDependency::Rank, reduced_output_shape, components})
184+
.CacheHint(std::to_string(input_channels_remainder) + "-" + std::to_string(pack_input_as4) + std::to_string(components) +
185+
"-" + std::to_string(b_components) + "-" + std::to_string(a_components) + "-" + std::to_string(is_channels_last ? 1 : 0))
184186
.AddUniformVariables({{static_cast<uint32_t>(output_size)}, {strides}, {kernel_dims}, {dilations}, {effective_kernel_dims}, {local_pads}, {static_cast<uint32_t>(input_channels_per_group_int)}, {static_cast<uint32_t>(input_channels_per_group)}, {static_cast<uint32_t>(output_channels_per_group)}})
185187
.SetDispatchGroupSize((output_size + WORKGROUP_SIZE - 1) / WORKGROUP_SIZE);
186188

0 commit comments

Comments
 (0)
Please sign in to comment.