Postgres – Past, Present Future

Postgres Past | Present | Future

Hi! I’m Craig I work at Heroku. I’m a product
manager, this means I do whatever engineers don’t want to. I blog some and curate postgres weekly @craigkerstiens

It might help to explain that the pronunciation is "post-gres"
or "post-gres-cue-ell", not "post-gray-something". ! I heard people making this same mistake in presentations at this past weekend's Postgres Anniversary Conference :-( Arguably, the 1996 decision to call it PostgreSQL instead of reverting to plain Postgres was the single worst mistake this project ever made. It seems far too late to change now, though. ! regards, tom lane

Good ! Postgres Postgres-QL

Good ! Postgres Postgres-QL Bad ! Postgres-SQL Postgray-SQL

Why Postgres - TLDR Datatypes Conditional Indexes Transactional DDL Foreign
Data Wrappers Concurrent Index Creation Extensions Common Table Expressions Fast Column Addition Listen/Notify Table Inheritance Per Transaction sync replication Window functions NoSQL inside SQL Momentum

TLDR in quote “Postgres - it’s the emacs of databases”

Postgres The Past Well over 5 years ago

Past | Present | Future In Time 1986 Development Began
1989 First major release 1995 SQL Support

MVCC Past | Present | Future Each query sees transactions
committed before it

MySQL Won Past | Present | Future

MySQL Won Past | Present | Future Was easier to
install ! Simpler replication setup

Postgres early focus Past | Present | Future We care
about safety and integrity of data

Postgres The Present 5 years until now

Past | Present | Future Datatypes Numeric Integer Precision Floating
Point Serial Monetary Character Character Var Char String Binary Date/Time Time DateTime Intervals Timezones Boolean Enums Geometric Points Lines Boxes Polygons Circles Network Addresses Inet Cidr Mac Address UUID XML Arrays

Past | Present | Future Postgres 8.3

Past | Present | Future UUID hstore Full text search
Postgres 8.3

Past | Present | Future Postgres 8.3 - UUID Much
better than serial keys Just works

Past | Present | Future Postgres 8.3 - UUID Much
better than serial keys Just works create extension uuid; create table foo (id uuid);

Past | Present | Future Postgres 8.3 - hstore

Past | Present | Future Postgres 8.3 - hstore hstore
- horrible name

- horrible name key-value store in a column

- horrible name key-value store in a column Even better

- horrible name key-value store in a column Even better Can index on keys/values

- horrible name key-value store in a column Even better Can index on keys/values Can filter on keys values

hstore in action

create extension hstore; ! create table foo (example hstore); !
INSERT INTO foo VALUES ( ‘key => "examplevalue", state => “California”', ); ! SELECT * FROM foo WHERE example->key = ‘examplevalue’; ! hstore in action

Past | Present | Future Postgres 8.3 - full text
search Stemming Ranking / Boost Support Multiple languages Fuzzy search for mispelling Accent support

Past | Present | Future Postgres 8.4 - AKA SQLs
your friend

Past | Present | Future Postgres 8.4 - AKA SQLs
your friend Window function ! Common Table Expressions

Past | Present | Future Postgres 8.4 - Window functions
Aggregate over set, compute some value at a row ! “A window function performs a calculation across a set of table rows that are somehow related to the current row. This is comparable to the type of calculation that can be done with an aggregate function.”

SELECT email, users.data->'state', sum(total(items)), rank() OVER (PARTITION BY users.data->'state' ORDER
BY sum(total(items)) desc) FROM users, purchases WHERE purchases.user_id = users.id GROUP BY 1, 2; window functions in action

window functions in action SELECT email, users.data->'state', sum(total(items)), rank() OVER
(PARTITION BY users.data->'state' ORDER BY sum(total(items)) desc) FROM users, purchases WHERE purchases.user_id = users.id GROUP BY 1, 2;

Past | Present | Future Postgres 8.4 - CTEs

Past | Present | Future Postgres 8.4 - CTEs If
you like writing SQL you’re weird

you like writing SQL you’re weird If you like reading SQL see a counselor

you like writing SQL you’re weird If you like reading SQL see a counselor Window functions make SQL bearable

you like writing SQL you’re weird If you like reading SQL see a counselor Window functions make SQL bearable It’s like a view within your query that you can reference

WITH top_5_products AS ( SELECT products.*, count(*) FROM products, line_items
WHERE products.id = line_items.product_id GROUP BY products.id ORDER BY count(*) DESC LIMIT 5 ) ! SELECT users.email, count(*) FROM users, line_items, top_5_products WHERE line_items.user_id = users.id AND line_items.product_id = top_5_products.id GROUP BY 1 ORDER BY 1; CTEs in action

Past | Present | Future Postgres 9.0 - Finally replication

Replication

Replication Listen/Notify

Past | Present | Future Postgres 9.1 - a JVM
like foundation

Past | Present | Future Postgres 9.1 - a JVM
like foundation Foreign Tables Extensions Sync Rep Unlogged tables KNN

Past | Present | Future Postgres 9.1 - Foreign Tables

Past | Present | Future Postgres 9.1 - Foreign Tables
Query from in Postgres to something else Between Postgres databases Other sources Other DBs Redis LDAP S3 Twitter

Past | Present | Future Postgres 9.1 - Extensions

Past | Present | Future Postgres 9.1 - Extensions Easier
method for installing contrib Extensions grow More to come

Past | Present | Future Postgres 9.1 - Sync rep

Past | Present | Future Postgres 9.1 - Sync rep
9.0 was a foundation Sync rep = granular control Per transaction basis

Past | Present | Future Postgres 9.1 - unlogged tables

Past | Present | Future Postgres 9.1 - KNN

Past | Present | Future Indexes

Past | Present | Future Indexes B-Tree ! GIN !
GiST ! SP-GiST ! KNN

Past | Present | Future Indexes

Past | Present | Future Indexes - B-Tree Default Usually
what you want

Past | Present | Future Indexes - GIN Multiple values
in single column Array/hstore

Past | Present | Future Indexes - GiST Values that
can overlap Full text search Shapes GIS

Past | Present | Future Indexes - Others B-Tree !
GIN ! GiST ! SP-GiST ! KNN ! VODKA

Past | Present | Future Postgres 9.2 - User friendly
finally

finally Index Only Scans

finally Index Only Scans Pg_stat_statements

finally Index Only Scans Pg_stat_statements JSON Datatype

Pg Stat Statements $ select * from pg_stat_statements where query
~ 'from users where email'; ! ! userid │ 16384 dbid │ 16388 query │ select * from users where email = ?; calls │ 2 total_time │ 0.000268 rows │ 2 shared_blks_hit │ 16 shared_blks_read │ 0 shared_blks_dirtied │ 0 shared_blks_written │ 0 local_blks_hit │ 0 local_blks_read │ 0 local_blks_dirtied │ 0 local_blks_written │ 0 temp_blks_read │ 0 temp_blks_written │ 0 time_read │ 0 time_write │ 0

Pg Stat Statements SELECT (total_time / 1000 / 60) as
total, (total_time/calls) as avg, query FROM pg_stat_statements ORDER BY 1 DESC LIMIT 100;

Pg Stat Statements total | avg | query --------+--------+------------------------- 295.76
| 10.13 | SELECT id FROM users... 219.13 | 80.24 | SELECT * FROM ... (2 rows) !

Performance Guidelines

Past | Present | Future Performance

Past | Present | Future Performance Aim for high cache
hit rate

hit rate Frequent queries execute in under 10 ms

hit rate Frequent queries execute in under 10 ms Rarer queries in under 100 ms

hit rate Frequent queries execute in under 10 ms Rarer queries in under 100 ms Reports, whatever’s right for your app

Explain # EXPLAIN SELECT last_name FROM employees WHERE salary >=
50000; QUERY PLAN -------------------------------------------------- Seq Scan on employees width=6) Filter: (salary >= 50000) (3 rows) (cost=0.00..35811.00 rows=1

Explain # EXPLAIN SELECT last_name FROM employees WHERE salary >=
50000; QUERY PLAN -------------------------------------------------- Seq Scan on employees width=6) Filter: (salary >= 50000) (3 rows) startup time max time rows return (cost=0.00..35811.00 rows=1

Explain Analyze # EXPLAIN ANALYZE SELECT last_name FROM employees WHERE
salary >= 50000; QUERY PLAN -------------------------------------------------- Seq Scan on employees (cost=0.00..35811.00 rows=1 width=6) (actual time=2.401..295.247 rows=1428 loops=1) Filter: (salary >= 50000) Total runtime: 295.379 (3 rows) ! Filter: (salary >= 50000) (3 rows) startup time max time rows return actual time 2.401..295.247 rows=1428 295.379

Pg Stat Statements total | avg | query --------+--------+------------------------- 295.76
| 10.13 | SELECT id FROM users... 219.13 | 80.24 | SELECT * FROM ... (2 rows) !

finally Index Only Scans ! Pg_stat_statements ! JSON Datatype

JSON CREATE TABLE bar (foo json); ! SELECT '{"id":1,"email": "[email protected]",}'::json;

Past | Present | Future Postgres 9.3 - App Dev
Friendly

Friendly Materialized Views

Friendly Materialized Views Writeable FDWs

Friendly Materialized Views Writeable FDWs Better JSON

Postgres The Future some fact some speculation

Past | Present | Future Postgres 9.4 - Prewarm !
Refresh materialized view ! Ordered set aggregates ! JSONB ! Logical decoding ! !

Past | Present | Future Postgres 9.4 - Prewarm

Past | Present | Future Postgres 9.4 - Prewarm Replicas
are often cold Run a utility, warm em up

Past | Present | Future Postgres 9.4 - Refresh materialized
view

view Materialized views can’t be read while updating

view Materialized views can’t be read while updating Now they can

view Materialized views can’t be read while updating Now they can Yes, they were of minimal use before

Past | Present | Future Postgres 9.4 - Ordered set
aggregates

Past | Present | Future Postgres 9.4 - Ordered set
aggregates I kinda understand these Median Percentile Hypothetical values

Past | Present | Future Postgres 9.4 - Logical Decoding

WAL is binary

WAL is binary Decoding = SQL

WAL is binary Decoding = SQL Means cross version upgrades, multimaster, etc

Past | Present | Future Postgres 9.4 - JSONB

Past | Present | Future Postgres 9.4 - JSONB Binary
JSON Easier to get performance (GIN)

Postgres The Future some fact some speculation

Past | Present | Future Speculation

Past | Present | Future Speculation Upsert

Past | Present | Future Speculation Upsert Multimaster

Past | Present | Future Speculation Upsert Multimaster Extensions -
apt-get

apt-get Foreign data wrappers

apt-get Foreign data wrappers Document storage

apt-get Foreign data wrappers Document storage Pluggable storage engines

Recap PG was and still is safe for your data

Recap PG became user friendly for power users

Recap PG b e c a m e u s
e r f r i e n d l y f o r a p p l i c a t i o n developers

Recap PG i s b e c o m i
n g a platform for your data, more than relational database

Why Postgres - TLDR Datatypes Conditional Indexes Transactional DDL Foreign
Data Wrappers Concurrent Index Creation Extensions Common Table Expressions Fast Column Addition Listen/Notify Table Inheritance Per Transaction sync replication Window functions NoSQL inside SQL Momentum

Fin. @craigkerstiens http://speakerdeck/u/craigkerstiens

Postgres – Past, Present Future

Postgres – Past, Present Future

More Decks by Craig Kerstiens

Featured

Transcript