Upgrade to Pro — share decks privately, control downloads, hide ads and more …

How Patroni solved Database Reliability at Gojek

Kumar Abhijeet
March 28, 2024
14

How Patroni solved Database Reliability at Gojek

Kumar Abhijeet

March 28, 2024
Tweet

Transcript

  1. Agenda Gojek - Scale and Microservices Databases & Reliability Patroni

    & 5 9s of Availability Deep dive into Patroni Managing Patroni on production - Lessons & Experiences
  2. Cloud Provider’s Compute Uptime >= 99.9% < 8h 41m of

    downtime/year Across multiple zones >= 99.99% < 52m of downtime/year
  3. App Server Workloads PostgreSQL VMs API Traffic LB New Master

    Old Master Replica shared_buffers=16MB shared_buffers=2GB
  4. Patroni Open Source and actively maintained by Zalando Converts PostgreSQL

    systems into Highly Available Fault Tolerant Disaster Ready
  5. Patroni Almost instantaneous failovers (~1-2s) Way cheaper than running managed

    DB solutions Cluster Management made easy Multi Region HA Deployments
  6. Patroni at Gojek 200+ clusters running on Production ~60 TB

    of data flows in/out every day Guarantees less than 10MBs of data loss Consul as DCS and service discovery IAC everywhere!
  7. Patroni at Gojek TF Modules for Provisioning/Chef for configuration Sync/Async

    replication choices All round observability! Secure and granular role-based access PR based workflow for infra provisioning