Slide 1

Slide 1 text

Distributed PostgreSQL is a game changer Parikshit Savjani (@talktosavjani) Postgres Team at Microsoft

Slide 2

Slide 2 text

I work for I specialize in My current focus Parikshit Savjani @talktosavjani

Slide 3

Slide 3 text

Image by Simon Steinberger @talktosavjani PostgreSQL is more popular than ever

Slide 4

Slide 4 text

One of most loved and wanted databases in Stack Overflow’s 2019 Developer Survey https://insights.stackoverflow.com/survey/2019?utm_source=so-owned&utm_medium=blog&utm_campaign=dev-survey-2019&utm_content=launch-blog https://db-engines.com/en/blog_post/76 Ranked DBMS of the Year for 2 consecutive years by DB-Engines PostgreSQL is more popular than ever @talktosavjani

Slide 5

Slide 5 text

https://db-engines.com/en/ranking_trend/system/PostgreSQL PostgreSQL is more popular than ever 2013 2014 2015 2016 2017 2018 2019 2020 200 100 300 400 500 © February 2020, DB-Engines.com @talktosavjani

Slide 6

Slide 6 text

Why Postgres? Why Now? @talktosavjani

Slide 7

Slide 7 text

Why Postgres? Why Now? • Truly Open Source (No single owner) @talktosavjani

Slide 8

Slide 8 text

No content

Slide 9

Slide 9 text

Why Postgres? Why Now? • Truly Open Source (No single owner) • Feature rich & Highly Extensible @talktosavjani

Slide 10

Slide 10 text

Why Postgres? Why Now? • Truly Open Source (No single owner) • Feature rich & Highly Extensible • Robust, Reliable & Rich Ecosystem @talktosavjani

Slide 11

Slide 11 text

Why Postgres? Why Now? • Truly Open Source (No single owner) • Feature rich & Highly Extensible • Robust, Reliable & Rich Ecosystem • Speed of innovation and releases @talktosavjani

Slide 12

Slide 12 text

Why Postgres? Why Now? • Truly Open Source (No single owner) • Feature rich & Highly Extensible • Robust, Reliable & Rich Ecosystem • Speed of innovation and releases • Available as a service in all major clouds @talktosavjani

Slide 13

Slide 13 text

“Postgres is the new Linux of databases”- Unknown @talktosavjani

Slide 14

Slide 14 text

Common cloud native app patterns Multi-tenant SaaS applications @talktosavjani

Slide 15

Slide 15 text

Common cloud native app patterns Real time Operational Analytics @talktosavjani

Slide 16

Slide 16 text

Common cloud native app patterns Hybrid Transactional Analytical System @talktosavjani

Slide 17

Slide 17 text

Common cloud native app patterns Multi-tenant SaaS applications Real-time operational analytics Transactional/OLTP applications Massive data volume and processing needs @talktosavjani

Slide 18

Slide 18 text

Postgres Parallel Processing capabilities Postgres 9.6 Parallel Sequential scans, Joins and Aggregates Postgres 10 Parallel B-tree, bitmap heap scans, merge join, non-correlated sub queries. Postgres 11 Parallel Hash joins, DDL, Index Builds, Parallel partitions scans Postgres 12 Parallel Queries in Serializable Isolation Mode Postgres 13 Parallel Vacuum Credits: Amit Kapila (@kapila_amit) @talktosavjani

Slide 19

Slide 19 text

“Distributed PostgreSQL is the need of the hour” @talktosavjani

Slide 20

Slide 20 text

CITUS IS An Extension to Postgres @talktosavjani

Slide 21

Slide 21 text

Citus is Open Source @talktosavjani

Slide 22

Slide 22 text

Not a fork @talktosavjani

Slide 23

Slide 23 text

Citus shards Postgres database horizontally @talktosavjani

Slide 24

Slide 24 text

Distributed PostgreSQL scales better APPLICATION SELECT FROM GROUP BY company_id, avg(spend) AS avg_campaign_spend compaigns company_id; METADATA COORDINATOR NODE WORKER NODES W1 W2 W3 … Wn SELECT company_id sum(spend), count(spend) … FROM campaigns_2001 … SELECT company_id sum(spend), count(spend) … FROM campaigns_2009 … SELECT company_id sum(spend), count(spend) … FROM campaigns_2017 … @talktosavjani

Slide 25

Slide 25 text

A PERFORMANCE COMPARISON Distributing Postgres with Hyperscale (Citus) @talktosavjani

Slide 26

Slide 26 text

Microsoft Windows team relies on Citus and Postgres (on Azure) for mission-critical shiproom decisions Read more in our aka.ms/azure-postgres-blog: https://techcommunity.microsoft.com/t5/azure-database- for-postgresql/architecting-petabyte-scale-analytics-by- scaling-out-postgres-on/ba-p/969685 @talktosavjani

Slide 27

Slide 27 text

How do you manage distributed Citus cluster Your new best friend to manage distributed systems @talktosavjani

Slide 28

Slide 28 text

DEMO Deploying Citus cluster on Kubernetes @talktosavjani

Slide 29

Slide 29 text

Citus Azure Data Studio postgRest postgresql-hll pg_cron postgresql-topn pgloader @talktosavjani

Slide 30

Slide 30 text

CITUSDATA.COM/NEWSLETTER Do you get the Citus Newsletter? @talktosavjani

Slide 31

Slide 31 text

@tapoueh PGCONFINDIA ~ a 40% off discount on any edition Source: theartofpostgresql.com @talktosavjani

Slide 32

Slide 32 text

More popular than ever Distributed PostgreSQL = need of hour Citus shards Postgres & enables distributed processing Kubernetes = cluster management & portability Microsoft Postgres @talktosavjani

Slide 33

Slide 33 text

Distributed Postgres truly is changing the game. @talktosavjani

Slide 34

Slide 34 text

A good newsletter is like a good GIN index. Sign up for the Citus Newsletter @talktosavjani

Slide 35

Slide 35 text

@talktosavjani • @citusdata • @AzureDBPostgres ध"यवाद | Thank U AskAzureDBforPostgreSQL@service.microsoft.com