Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Speed Up Your Database

Speed Up Your Database

Are your queries slow? Learn how to speed them up through better SQL crafting and use of meaningful indexes. You will understand what works well and what doesn't, and will walk away with a checklist for faster databases. Through examples and benchmarks, I will demonstrate how to go from almost a minute of SQL execution to less than a millisecond. I expect that you will all be itching to analyze queries to see how much you can shave off.

Anna Filina

October 08, 2014
Tweet

More Decks by Anna Filina

Other Decks in Programming

Transcript

  1. Multitude of queries • Loops: • Reduce number of queries.

    • Check for lazy loading in your ORM. 5 foreach ($pictures as $id) { query_pic_details($id); }
  2. Lazy loading • Data loaded on demand. • Use LEFT

    JOIN to get all related data in one go. 6 echo $picture->album->user->name;
  3. Count queries • Manual: add custom code before queries. •

    Auto: ORMs (like Doctrine) have listeners. 8
  4. Slow query log • Open my.cnf • Queries taking 0.5s

    or longer will be logged. • Open the log file and analyze the queries. 9 slow_query_log=ON long_query_time=0.5 slow_query_log_file=/path/to/file
  5. Simple query • Get all pictures from user #1 •

    38 seconds. 11 SELECT picture.id, picture.title FROM picture LEFT JOIN album ON picture.album.id = album.id WHERE album.user_id = 1;
  6. Explain 13 • Prefix with EXPLAIN: • Each row =

    table scanned: EXPLAIN SELECT picture.id, picture.title FROM picture LEFT JOIN album ON picture.album.id = album.id WHERE album.user_id = 1; +----+-------------+---------+------+---------------+------+---------+------+---------+--------------------------------+ | id | select_type | table | type | possible_keys | key | key_len | ref | rows | Extra | +----+-------------+---------+------+---------------+------+---------+------+---------+--------------------------------+ | 1 | SIMPLE | album | ALL | NULL | NULL | NULL | NULL | 20000 | Using where | | 1 | SIMPLE | picture | ALL | NULL | NULL | NULL | NULL | 2000000 | Using where; Using join buffer | +----+-------------+---------+------+---------------+------+---------+------+---------+--------------------------------+
  7. Explain 14 • Details: • Because of the join, we’re

    scanning all albums for each picture. • Total 40 billion rows scanned. table : album key : NULL rows : 20,000 table : picture key : NULL rows : 2,000,000
  8. Add an index 17 • album.id • Same query down

    to 2.38 sec. • We just saved 36 sec on every request! ALTER TABLE album ADD PRIMARY KEY(id);
  9. Under the hood 18 • Details: • With the index,

    we find our album right away. • Total 2,000,000 rows scanned. table : album key : PRIMARY rows : 1 table : picture key : NULL rows : 2,000,000
  10. Add a better index 19 • picture.album_id • Now down

    to 0.12 sec. • 317 times faster than original. ALTER TABLE picture ADD INDEX(album_id);
  11. Under the hood 20 • Details: • Total 200,000 rows

    scanned. table : album key : NULL rows : 20,000 table : picture key : album_id rows : 100
  12. Add a better index 21 • album.user_id • Now down

    to 0.10 sec. • Gained another 17% in speed. ALTER TABLE album ADD INDEX(user_id);
  13. Under the hood 22 • Details: • With the second

    index, we’re scanning 100 pics. • Total 10,000 rows scanned. table : album key : user_id rows : 100 table : picture key : album_id rows : 100
  14. Indexes used 23 • Primary and foreign keys make for

    great indexes. • Also whatever you use in JOINs and WHEREs.
  15. Index criteria 24 • Change frequency • Selectivity: how many

    different values? • Good examples: • Bad examples: user.gender picture.views user.id picture.date
  16. Airport customs 26 • 5 customs officers, 1 min to

    process a traveler. • 5 travelers / min = fluid. • 20 travelers / min = queue slowly builds. • 1000 travellers / min = 3h wait if you arrive on 2nd minute.
  17. Smaller indexes 27 • BIGINT as index is slower than

    TINYINT. • Use UNSIGNED to double the maximum value.
  18. Functions 29 • Avoid functions on an indexed column: •

    The index will be ignored here. SELECT id, title FROM picture WHERE YEAR(create_date) >= 2011;
  19. Limit 30 • Now down to 0.0009 sec. • 100

    times faster than the last query. SELECT picture.id, picture.title FROM picture LEFT JOIN album ON picture.album.id = album.id WHERE album.user_id = 1 LIMIT 25;
  20. Other constraints 31 • Range: • Only makes sense if

    that column is indexed: SELECT picture.id, picture.title FROM picture LEFT JOIN album ON picture.album.id = album.id WHERE album.user_id = 1 AND picture.id BETWEEN 26 AND 50; # 0.023 sec ALTER TABLE picture ADD INDEX(id);
  21. Other constraints 32 • IN clause: SELECT picture.id, picture.title FROM

    picture LEFT JOIN album ON picture.album.id = album.id WHERE album.user_id = 1 AND picture.id IN (15,26,29,32,45); # 0.048 sec
  22. Split tables 33 • MySQL can partition your tables transparently.

    ◦ Creates multiple files on disk. ◦ Shows data as a single table.
  23. Archives 34 • MySQL ARCHIVE storage engine: ◦ Doesn’t support

    UPDATE or DELETE ◦ Very fast for SELECT • You can archive old transactions, news, etc.
  24. Split databases 35 • Horizontal partitionning, sharding. • Users 1-1000

    and their data in database #1 • Users 1001-2000 and their data in database #2
  25. Checklist 36 ✓Count queries. ✓Log the slow ones. ✓Use EXPLAIN

    and check number of rows scanned. ✓Try different indexes, remove unused ones. ✓Use small indexes, preferably numeric. ✓Don’t use functions on indexed columns. ✓Limit number of results. ✓Split tables and databases.
  26. Even more fun stuff 38 • Setup read slaves •

    Denormalize where appropriate • SSD hard drive
  27. Services • Development: PHP, JS, etc. • Fix problems: bugs,

    performance, etc. • Coaching & workshops. • Advisor: testing strategy, architecture, etc. 39