Kaigi On Rails

@nateberkopec for Kaigi on Rails 2022 JA text on slides
by @cafedomancer Queues in Rails Apps Sidekiq, Puma, and the GVL Rails ΞϓϦ έʔγϣϯʹ ͓͚ΔΩϡʔ

Hi Nate, Our application is really slow, but our dashboards
look OK. We don't really understand what's going on. Can you help us? Nate ༷ ͍ͭ΋͓ੈ࿩ʹͳ͓ͬͯΓ·͢ɻ ฐࣾͰӡ༻͍ͯ͠ΔΞϓϦέʔ γϣϯʹύϑΥʔϚϯε্ͷ໰୊ ͕͋Γɺ͓٬༷͔Βෆຬͷ੠͕د ͤΒΕ͍ͯ·͢ɻμογϡϘʔυ ΛݟΔݶΓͰ͸ɺಛʹ໰୊͸ݟ౰ ͨΓ·ͤΜͰͨ͠ɻฐࣾͷํͰ ͸ɺͲͷΑ͏ͳ໰୊͕ى͖͍ͯΔ ͷ͔Λ೺ѲͰ͖͓ͯΓ·ͤΜɻ͓ ๩͍͠ͱ͜ΖେมڪॖͳͷͰ͢

Queues Ωϡʔ

Nate Berkopec

Agenda Three kinds of queues • Sidekiq • Puma/Unicorn •
GVL 3 ͭͷछྨͷΩϡʔ ΞδΣϯμ

Queue Ωϡʔ “Servers” αʔόʔ Work ࡞ۀ

Queues increase availability Ωϡʔ͸Մ༻ੑΛ޲্ͤ͞Δ

Christoph Roser/allaboutlean.com

Queues can be organized in a “queueing system”. Ωϡʔ͸ʮΩϡʔΠϯάγεςϜʯʹ Αͬͯߏ੒͞ΕΔ

Total Time = Queue Time + Service Time ߹ܭ࣌ؒ =
Ωϡʔ࣌ؒ + αʔϏε࣌ؒ

Sidekiq

Queues (Redis) Process types, with threads ϓϩηεʮλΠϓʯͱεϨου Ωϡʔ

Important Point / ॏཁͳϙΠϯτ Add queue time instrumentation for Sidekiq
Ωϡʔ࣌ؒͷܭଌπʔϧΛ Sidekiq ʹಋೖ͠Α͏

Queues (Redis) Process types, with threads ϓϩηεʮλΠϓʯͱεϨου Ωϡʔ

Domain Driven Queues •invoices •mailers •payments υϝΠϯۦಈΩϡʔ •੥ٻॻ •ϝʔϧ •ࢧ෷͍

Every Job Class has an “SLA” ͢΂ͯͷδϣϒΫϥεʹ͸ SLA ͕͋Δ

within_30_seconds within_5_minutes within_1_hour within_1_week

“Shards” w/ Consistent Hashing ίϯγεςϯτ ϋογϡ๏ʹΑΔ γϟʔυ

Predicted Latency = Current rate of job processing x #
of Jobs in Queue ༧૝͞ΕΔϨΠςϯγʔ = ݱࡏͷδϣϒॲཧ཰ x Ωϡʔͷδϣϒ਺

https://tinyurl.com/kellysidekiq

TODO 1. Arrange queues based on SLAs 2. Create alerts
based on queue SLAs 1. SLA ʹج͍ͮ ͯΩϡʔΛ   ഑ஔ͠Α͏ 2. SLA ʹج͍ͮ ͯΞϥʔτΛ   ࡞੒͠Α͏

Header: X-Request-Start

More queue time? 1 worker 4 workers Ωϡʔ࣌ؒͳ͕௕ ͍ͷ͸ͲΕ͔ʁ

4 pods 1 worker 1 pod 4 workers More queue
time? Ωϡʔ࣌ؒͳ͕௕ ͍ͷ͸ͲΕ͔ʁ

Queue time = 1/s

1 master process per container ίϯςφ͋ͨΓ 1 Ϛελʔϓϩηε

Always use at least 4 child processes/workers ࠷௿Ͱ΋ 4 ͭͷϓϩηε/ϫʔΧʔΛ࢖༻͠Α͏

TODO 1. Measure web request queueing 2. Set up autoscaling
based on this 3. Use at least 4 child processes 1. Web ϦΫΤετ ͷΩϡʔΠϯά Λܭଌ͠Α͏ 2. ্هʹج͍ͮͯ ΦʔτεέʔϦ ϯάΛ   ઃఆ͠Α͏ 3. গͳ͘ͱ΋ 4 ͭ ͷϓϩηεΛ   ࢖༻͠Α͏

Global VM Lock (GVL)

Great Valuable Lock Ractor VM Lock

1.Exit 2.I/O 3.Interrupt (100ms)

100 Threads?

# of Threads Service Time

tinyurl.com/gvlruby

wait_for_less_busy_worker tinyurl.com/pumasleep

Worker 1: 0/5 Threads Busy Worker 2: 1/5 Threads Busy
Worker 3: 4/5 Threads Busy

TODO 1. Set thread count to 5 (web) or 10
(Sidekiq) 2. Monitor service time under load 3. Upgrade to Puma 6 1. εϨου਺Λ 5 (web) ·ͨ͸ 10 (Sidekiq) ʹ   ઃఆ͠Α͏ 2. ෛՙ࣌ͷαʔϏε࣌ ؒΛ   ϞχλϦϯά͠Α͏ 3. Puma 6 ʹόʔδϣϯ   Ξοϓ͠Α͏

TODO 1. Arrange queues based on SLAs 2. Create alerts
based on queue SLAs 3. Measure web request queueing 4. Set up autoscaling based on this 5. Use at least 4 child processes 6. Set thread count to 5 (web) or 10 (Sidekiq) 7. Monitor service time under load 8. Upgrade to Puma 6 1. SLA ʹج͍ͮͯΩϡʔΛ   ഑ஔ͠Α͏ 2. SLA ʹج͍ͮͯΞϥʔτΛ   ࡞੒͠Α͏ 3. Web ϦΫΤετͷΩϡʔΠϯ άΛܭଌ͠Α͏ 4. ্هʹج͍ͮͯΦʔτεέʔ ϦϯάΛઃఆ͠Α͏ 5. গͳ͘ͱ΋ 4 ͭͷϓϩηεΛ ࢖༻͠Α͏ 6. εϨου਺Λ 5 (web) ·ͨ͸ 10 (Sidekiq) ʹઃఆ͠Α͏ 7. ෛՙ࣌ͷαʔϏε࣌ؒΛϞχ λϦϯά͠Α͏ 8. Puma 6 ʹόʔδϣϯΞοϓ͠ Α͏

@nateberkopec for Kaigi on Rails 2022 JA text on slides
by @cafedomancer Queues in Rails Apps Sidekiq, Puma, and the GVL Rails ΞϓϦ έʔγϣϯʹ ͓͚ΔΩϡʔ

Kaigi On Rails

Kaigi On Rails

More Decks by Nate Berkopec

Featured

Transcript