Upgrade to Pro — share decks privately, control downloads, hide ads and more …

#5 “Experiences Evaluating DCTCP”

#5 “Experiences Evaluating DCTCP”

cafenero_777

June 10, 2023
Tweet

More Decks by cafenero_777

Other Decks in Technology

Transcript

  1. $ which • Experiences Evaluating DCTCP • Lawrence Brakmo, Boris

    Burkov, Greg Leclercq, Murat Mugan • Facebook • Linux Plumbers Conference 2018
  2. Agenda • ֓ཁͱಡ΋͏ͱͨ͠ཧ༝ • Abstract • Introduction • Overview •

    Intra-Rack Test Results • Inter-Rack Test Results • Conclusions • References
  3. ֓ཁͱಡ΋͏ͱͨ͠ཧ༝ • ֓ཁ • େن໛DC Closӡ༻ͰͷDCTCPύϑΥʔϚϯεൺֱ݁Ռ • Linux kernel΍NIC’s fi

    rmware, CPU utilizationͷൺֱ • ಡ΋͏ͱͨ͠ཧ༝ • ࣮ࡍͷେن໛Closߏ੒Ͱͷੑೳൺֱ΍ϋϚΔϙΠϯτ͕஌Γ͍ͨͨΊ
  4. Introduction • TCP30೥ͷྺ࢙ • fully, fairly, utilize the available bandwidth

    • ᫔᫓੍ޚ,Van Jacobson 1988 • ଟ͘͸loss-based algorithm • ύέϩε͢Δ·ͰΩϡʔʹஷΊΔ->(tail)ϨΠςϯγ૿Ճʂ • ύέϩεΛݕ஌͢ΔͨΊʹύέϩεΛ”଴ͭ” • ຊ຤స౗ʁ
  5. Introduction (Cont.) • Congestion avoidance algorithm • ϩε͢Δ”લ”ʹ᫔᫓ݕ஌ • Ωϡʔͷ੒௕Λ᫔᫓લஹͱͯ͠ݕ஌

    • TCP-Vegas, BBR: RTTΛར༻ • DCTCP: ECNΛར༻ • Reno: • ᫔᫓ݕ஌: cwnd 50%ݮʂ • RTTΑΓ΋୹͍᫔᫓΋ݕ஌ͯ͠͠·͏ • DCTCP: • cwndͷݮΒ͠ํΛ޻෉ • RTTຖʹbyteׂ߹ΛτϥοΩϯά • 100%᫔᫓->cwnd 50%ݮ • 50%᫔᫓->cwnd25%ݮ • ࣮ࡍ͸ҠಈฏۉΛ࠾༻ • 100%᫔᫓͔ͭҎલʹ᫔᫓͕ແ͚Ε͹ • -> cwnd 1/32ݮ
  6. Overview • Intra-Rack Test • 3 Sender -> 1 Receiver

    • 1MB & 10KB RPC • DCTCPಈ࡞֬ೝ • ύέϩεݮ, tail latencyݮ, fairness • Inter-Rack Tests • 3 Worker Racks -> 3 Storage Racks • workers read storage • netestoΛ༻͍ͯղੳ • https://github.com/facebook/fbkutils/tree/master/netesto
  7. Intra-Rack Tests Results • 1. ᫔᫓੍ޚʹدΒͳ͍ෆެฏੑͷ໰୊ • Server 1: 25%,

    Server 2: 25%, Server 3: 50% • εΠονόοϑΝઃܭʹґଘ • Work-around: Bu ff er AͷΈ࢖͏͜ͱͰެฏੑ୲อ • 2. ECNΛ࢖ͬͨ৔߹ͷϑϩʔͷެฏੑͷภΓ໰୊ • 25GbpsΛֻ͚Δɻ΋͏Ұͭͷϑϩʔ͸0.5Gbps͔͠ग़ͳ͍ • 60usͱ1.2msͷόΠϞʔμϧ෼෍ʹͳͬͯ͠·ͬͨ • NICϑΝʔϜ΢ΣΞͷ৽ػೳ͕ݪҼ -> ແޮԽ fl ow# RTT։࢝࣌ؒ (ܦա࣌ؒ) ૹ৴όΠτ਺ Ϩʔτ
  8. Intra-Rack Tests Results (Cont.) • 3. DCTCPͷʹΑΔߴtail-latency໰୊ • CubicΑΓDCTCPͷํ͕஗͍ɻɻɻ •

    2015೥ͷpatchʹىҼ • όάमਖ਼ޙ • 10KBͷੑೳ͕େ෯޲্ • 1MBͷํ͸૬ରతʹੑೳΛऔΒΕͨͨΊɺ
 Cubicͱൺ΂ͯେ͖͍ • RPCαΠζؒͷෆެฏੑ΋ղܾ
  9. Inter-Rack Test Results • Worker͕storageΛಡΉϕϯνϚʔΫ • 3rack, 70% of FSW

    links • DCTCP͕ͱͯ΋ྑ͍݁Ռ • +1% CPUෛՙఔ౓ • 2rack, 99% of FSW links • DCTCPͷdiscardߴ͍ɺCPUෛՙ • ECNϚʔΫ͕63.7%
  10. Inter-Rack Test Results (Cont.) • 1rack, 99.9% of FSW links

    • discards, ࠶ૹଟ͍ɻɻ • ߴෛՙ࣌ͷDCTCPͷޮՌ͸ݶఆతʁ • ਓ޻తͳϕϯνϚʔΫա͗ͨʁ • prodͰಉ͜͡ͱ͕ى͖Δ͔ෆ໌ • CPUෛՙݮ • ͓ͦΒ͘᫔᫓ͷͨΊcwnd͕ݮ -> ECNϚʔΩϯάݮͷͨΊɻ
  11. EoP