Care and Feeding of Spark SQL Catalyst Optimizer

A practical approach to a highly technical topic. First, understand the basics of how Spark transforms declarative SQL into code that can execute across a cluster. Then look at two case studies of query planning gone wrong in production. Finally, stop by the pharmacy for prescriptions that treat the most common issues with Spark queries.

Rose Toomey

June 25, 2020

Transcript