Jess Shapiro - Everything at Once: Python's Many Concurrency Models

EVERYTHING AT ONCE Python’s Many Concurrency Models Jess Shapiro (she/her)

Concurrency ▪ Doing multiple things “at once” ▪ Concurrency isn’t
just “on” or “off” ▪ Many available options in Python ▪ Asyncio coroutines ▪ Python threads ▪ GIL-released threads ▪ Multiprocessing ▪ Distributed tasks

Parallelism ▪ Do things actually happen simultaneously? ▪ How does
performance scale when you add more CPUs?

Minimum Schedulable Unit ▪ Code is made up of semantic
chunks ▪ How big are the chunks that can be run independently?

Data Sharing and Isolation ▪ How isolated is data between
tasks? ▪ How long does data stay the same for? ▪ What tools can be used to share data?

Asyncio Coroutines ▪ One coroutine runs at a time ▪
MSU: “Awaitable Block” ▪ Global state is shared and consistent within each block ▪ Event loop

Awaitable Block

Asyncio Coroutine Sample Code

Python Threads ▪ One thread runs (GIL) ▪ MSU: “Bytecode”
▪ Global state is shared, but consistent only for single-bytecode ops* ▪ Combined scheduling

Which of these is a single-bytecode operation? ▪ x +=
1 ▪ func(**kw) ▪ dict.items() ▪ ‘{y}’.format(y=x.val) Lesson: Bytecode atomicity is essential, but it’s not there for you. Don’t count on it.

Python Threading Sample Code

GIL-released Threads ▪ Multiple threads run simultaneously ▪ MSU: Host
processor instruction (x86, etc) ▪ Global state is shared but unreliable ▪ OS-scheduled

GIL-released Thread Sample Code

Multiprocessing ▪ Multiple processes run simultaneously ▪ MSU: Host processor
instruction (x86, etc) ▪ Global state starts the same as parent, but evolves independently ▪ OS-scheduled

Multiprocessing Sample Code

Distributed Tasks ▪ Multiple tasks run simultaneously ▪ MSU: varies;
often the entire application for some subset of data ▪ Global state totally independent; often “process-like” ▪ Central orchestrator

Distributed Task Sample Code

When to use each? ▪ Asyncio – Performance is I/O-bound
rather than CPU-bound – Starting new codebase without synchronous legacy code ▪ Threads – Need preemptive multitasking – Integrate synchronous code – Need fine-grained concurrency – Python “glue” for GIL-unlocked C ▪ Processes – Don’t need substantial inter-task communication – Full parallelism required for Python code ▪ Distributed tasks – Highly-segmentable and distributable workload – Need for shared state minimal – Large enough load to overcome perf overhead of orchestrator

Acknowledgements ▪ Allison Kaptur & Chris Fenner for slide review
▪ Friends & mentors for their belief and support ▪ https://carbon.now.sh for snippet images ▪ Mazarine on Market in SF for avocado toast ▪ YOU!

Contact Me ▪ Email: [email protected] ▪ Twitter: @transgingerjess ▪ Github:
@haikuginger

Jess Shapiro - Everything at Once: Python's Man...

Jess Shapiro - Everything at Once: Python's Many Concurrency Models

PyCon 2019

More Decks by PyCon 2019

Other Decks in Programming

Featured

Transcript

EVERYTHING AT ONCE Python’s Many Concurrency Models Jess Shapiro (she/her)

Concurrency ▪ Doing multiple things “at once” ▪ Concurrency isn’t

Parallelism ▪ Do things actually happen simultaneously? ▪ How does

Minimum Schedulable Unit ▪ Code is made up of semantic

Data Sharing and Isolation ▪ How isolated is data between

Asyncio Coroutines ▪ One coroutine runs at a time ▪

Awaitable Block

Asyncio Coroutine Sample Code

Python Threads ▪ One thread runs (GIL) ▪ MSU: “Bytecode”

Which of these is a single-bytecode operation? ▪ x +=

Python Threading Sample Code

GIL-released Threads ▪ Multiple threads run simultaneously ▪ MSU: Host

GIL-released Thread Sample Code

Multiprocessing ▪ Multiple processes run simultaneously ▪ MSU: Host processor

Multiprocessing Sample Code

Distributed Tasks ▪ Multiple tasks run simultaneously ▪ MSU: varies;

Distributed Task Sample Code

When to use each? ▪ Asyncio – Performance is I/O-bound

Acknowledgements ▪ Allison Kaptur & Chris Fenner for slide review

Contact Me ▪ Email: [email protected] ▪ Twitter: @transgingerjess ▪ Github: