to C API
GIL optional, disabled by default: https://github.com/colesbury/nogil
Allows parallelizing well up to the number of available cores
Replaces CPython's allocator with https://github.com/microsoft/mimalloc
Helps GPU-heavy workloads
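As a rough illustration of the "parallelizing up to the number of available cores" point, the sketch below runs a CPU-bound toy workload on a thread pool. The function names `count_primes` and `run_in_threads` are made up for this example and are not part of the nogil fork. On a standard GIL build the multi-threaded run is expected to take roughly as long as running the tasks one after another, while a build with the GIL disabled should be able to spread them across cores. No timings are claimed here, since they depend on the machine and the interpreter.

```python
import os
import time
from concurrent.futures import ThreadPoolExecutor


def count_primes(limit):
    """CPU-bound toy workload: count primes below `limit` by trial division."""
    count = 0
    for n in range(2, limit):
        if all(n % d for d in range(2, int(n ** 0.5) + 1)):
            count += 1
    return count


def run_in_threads(limit, workers):
    """Run `workers` copies of the workload on a thread pool.

    Returns (list of results, elapsed wall-clock seconds).
    """
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = list(pool.map(count_primes, [limit] * workers))
    return results, time.perf_counter() - start


if __name__ == "__main__":
    workers = os.cpu_count() or 1
    _, single = run_in_threads(20_000, 1)
    _, multi = run_in_threads(20_000, workers)
    # With the GIL, `multi` grows with the number of workers;
    # without it, `multi` should stay much closer to `single`.
    print(f"1 thread: {single:.2f}s, {workers} threads: {multi:.2f}s")
```

Running the same script under a regular CPython build and under a GIL-free build is a simple way to compare how the two scale.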
challenge was to maintain a stable, non-trivial concurrent communication link with hardware units while utilizing a 1 Gbit/s UDP/IP connection to the maximum. Naturally, we started with the multiprocessing package, but at some point, it became clear that most CPU time was consumed by the data transfers between the data processing stages, not by data processing itself. The CPython multithreading implementation based on GIL was a dead end too. When we found out about the “nogil” fork of Python it took a single person less than half a working day to adjust the codebase to use this fork and the results were astonishing.
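The transfer overhead described above can be pictured with a minimal two-stage pipeline built on threads. This is only a sketch under stated assumptions: `receiver`, `processor`, and `run_pipeline` are hypothetical names, the "packets" are synthetic buffers rather than real UDP data, and nothing here reflects the quoted team's actual code. The point it shows is that `queue.Queue` hands each buffer to the next stage by reference, whereas `multiprocessing.Queue` pickles every object and sends it through a pipe to another process.

```python
import queue
import threading

SENTINEL = None  # marks the end of the stream


def receiver(out_q, n_packets, packet_size):
    """Hypothetical stand-in for the receive stage: emits fixed-size buffers."""
    for i in range(n_packets):
        out_q.put(bytearray([i % 256]) * packet_size)
    out_q.put(SENTINEL)


def processor(in_q, totals):
    """Processing stage: gets each buffer by reference, with no pickling or copying."""
    while True:
        packet = in_q.get()
        if packet is SENTINEL:
            break
        totals.append(sum(packet))


def run_pipeline(n_packets=100, packet_size=1024):
    """Connect the two stages with a bounded in-process queue and run them to completion."""
    q = queue.Queue(maxsize=16)
    totals = []
    stages = [
        threading.Thread(target=receiver, args=(q, n_packets, packet_size)),
        threading.Thread(target=processor, args=(q, totals)),
    ]
    for t in stages:
        t.start()
    for t in stages:
        t.join()
    return totals
```

With the GIL in place, the two stages in such a design cannot execute Python bytecode at the same time. Without the GIL they can run on separate cores while still sharing buffers in one address space.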