Add python bindings to the global thread pool functionality #24238

khoover · 2025-03-28T19:13:43Z

Description

Allows users to configure and enable the global thread pool via Python, and have inference sessions use it instead of session-local thread pools.

Motivation and Context

Forked off of #23495 to take over implementation, see issue #23523.

Our particular use case involves a single service instance serving thousands of individual models, each relatively small (e.g. small decision trees). Creating individual services for each model is too much overhead, and attempting to start several thousand thread-pools is a non-starter. We could possibly have each session be single-threaded, but we would like to be able to separate the request handler thread count from the compute thread count (e.g. 2 handler threads but 4 intra-op ones).

snnn · 2025-03-28T19:17:51Z

We don't have a good way to shutdown the thread pool. It should be done before the python process started to exit and unload DLLs. So this doc: https://learn.microsoft.com/en-us/windows/win32/dlls/dynamic-link-library-best-practices#deadlocks-caused-by-lock-order-inversion
So, generally speaking, if you have a thread pool and you declared it as a global var, then you must manually shutdown it, otherwise it will cause a deadlock. The issue was highly reproducible. However, recently probably the restriction is gone? I am not sure. But the Windows doc is not updated. So I am not very sure if this will work.

snnn · 2025-03-28T19:23:15Z

Like this: apache/mxnet#11163

khoover · 2025-04-17T22:48:22Z

@microsoft-github-policy-service agree company="Instacart"

khoover · 2025-04-18T16:40:13Z

We don't have a good way to shutdown the thread pool. It should be done before the python process started to exit and unload DLLs. So this doc: https://learn.microsoft.com/en-us/windows/win32/dlls/dynamic-link-library-best-practices#deadlocks-caused-by-lock-order-inversion So, generally speaking, if you have a thread pool and you declared it as a global var, then you must manually shutdown it, otherwise it will cause a deadlock. The issue was highly reproducible. However, recently probably the restriction is gone? I am not sure. But the Windows doc is not updated. So I am not very sure if this will work.

Would adding onnxruntime::Environment::ShutdownGlobalThreadPools and then hooking it into atexit in Python work? I imagine the existing C++ API has examples of needing to do this as well, no?

EDIT: I see, the native implementation would just let Environment destruct at the end of the program, and that handles shutting down. But the Python bindings have a static shared_ptr<Environment>, and it's really just the thread pools that need to go.

…tting the global threading options

snnn · 2025-04-21T19:04:00Z

On Windows when a thread shuts down, the OS also needs to notify all loaded DLLs in case if they need to do any cleanup work. It cannot be too late. So, in general all user created threads should be shutdown before unloading DLLs. Theoretically speaking, there is another way of doing this: the cleanup functions should test if the currently the process is shutting down, if yes, giving up doing any clean up. We do not have such logics in ORT yet. We may consider doing it later.

khoover force-pushed the add-python-global-thread-pool branch from 6a2b524 to 44d5cf6 Compare April 17, 2025 23:27

alex-halpin and others added 3 commits April 18, 2025 16:47

enable global thread pool in python

9c19ed7

Lint fixes

42b4287

Include utility for std::move

Loading
Loading status checks…

510733f

khoover force-pushed the add-python-global-thread-pool branch from 44d5cf6 to 510733f Compare April 18, 2025 16:47

khoover added 2 commits April 18, 2025 22:14

Create threadpool shutdown method for Environment

c8453d1

Register global thread pool shutdown via atexit in Python

Loading
Loading status checks…

5baf1ee

khoover marked this pull request as ready for review April 18, 2025 22:36

Register the shutdown when the EnvInitializer is created, not when se…

Loading
Loading status checks…

89a602d

…tting the global threading options

Fix typo

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.

GPG key ID: B5690EEEBB952194

Verified
Learn about vigilant mode

Loading
Loading status checks…

6f0ce1e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add python bindings to the global thread pool functionality #24238

Add python bindings to the global thread pool functionality #24238

khoover commented Mar 28, 2025 •

edited

Loading

snnn commented Mar 28, 2025

snnn commented Mar 28, 2025

khoover commented Apr 17, 2025

khoover commented Apr 18, 2025 •

edited

Loading

snnn commented Apr 21, 2025

Add python bindings to the global thread pool functionality #24238

Are you sure you want to change the base?

Add python bindings to the global thread pool functionality #24238

Conversation

khoover commented Mar 28, 2025 • edited Loading

Description

Motivation and Context

snnn commented Mar 28, 2025

snnn commented Mar 28, 2025

khoover commented Apr 17, 2025

khoover commented Apr 18, 2025 • edited Loading

snnn commented Apr 21, 2025

khoover commented Mar 28, 2025 •

edited

Loading

khoover commented Apr 18, 2025 •

edited

Loading