
How Patroni solved Database Reliability at Gojek

Kumar Abhijeet
March 28, 2024

Transcript

  1. Agenda: Gojek - Scale and Microservices; Databases & Reliability; Patroni & 5 9s of Availability; Deep dive into Patroni; Managing Patroni on production - Lessons & Experiences
  2. Cloud Provider’s Compute Uptime: >= 99.9% (< 8h 41m of downtime/year); across multiple zones: >= 99.99% (< 52m of downtime/year)
  3. [Diagram labels] App Server Workloads; PostgreSQL VMs; API Traffic; LB; New Master; Old Master; Replica; shared_buffers=16MB; shared_buffers=2GB
  4. Patroni: Open Source and actively maintained by Zalando; converts PostgreSQL systems into Highly Available, Fault Tolerant, Disaster Ready
  5. Patroni: Almost instantaneous failovers (~1-2s); Way cheaper than running managed DB solutions; Cluster Management made easy; Multi Region HA Deployments
  6. Patroni at Gojek: 200+ clusters running on Production; ~60 TB of data flows in/out every day; Guarantees less than 10 MB of data loss; Consul as DCS and service discovery; IaC everywhere!
  7. Patroni at Gojek: TF Modules for Provisioning/Chef for configuration; Sync/Async replication choices; All round observability!; Secure and granular role-based access; PR based workflow for infra provisioning
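
The downtime figures on slide 2 come from simple arithmetic: an availability target leaves a fixed fraction of the year as allowable downtime. The sketch below is not from the talk. The function name `downtime_budget` and the 365-day year are assumptions made for illustration. Under those assumptions, 99.9% works out to roughly 8h 45m per year and 99.99% to roughly 52m, which is close to, though not identical with, the figures quoted on the slide.

```python
from datetime import timedelta

# Assumption: a plain 365-day year; other conventions shift the result slightly.
SECONDS_PER_YEAR = 365 * 24 * 60 * 60


def downtime_budget(availability_pct: float) -> timedelta:
    """Return the maximum yearly downtime permitted by an availability target.

    availability_pct is a percentage such as 99.9 or 99.99.
    """
    unavailable_fraction = 1 - availability_pct / 100
    # Round to whole seconds so floating-point noise does not leak into the result.
    return timedelta(seconds=round(SECONDS_PER_YEAR * unavailable_fraction))


if __name__ == "__main__":
    # Three nines, four nines, and the "5 9s" target named in the agenda.
    for target in (99.9, 99.99, 99.999):
        print(f"{target}% availability -> {downtime_budget(target)} of downtime per year")
```

Each additional nine cuts the yearly budget by a factor of ten, which is why the "5 9s" target in the agenda leaves only a few minutes per year and depends on fast automated failover.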