Slide 22
Slide 22 text
Okay, mystery is solved now:
● 1. Preferred leader election upon broker-restart causes many clients to open connection in a
short period, then SYN queue filled up
● => 2. Meanwhile, some producer connections fall-back to be established through SYN cookies
● => 3. Due to kernel bug, window scaling-factor became inconsistent between broker/producer.
Broker: `advertised_window = window >> 7`, Producer: `window = advertised_window << 1`
● => 4. Producer waits ack from broker too frequently due to small window size.
At the same time, broker delayed acks because sufficient data isn’t received.
● => 5. Producer starts to take insanely long time to complete sending out request.