Practical Performance Analysis and Tuning for Cloudera Impala

by Greg Rahn

Published October 30, 2013 in Technology


Impala brings SQL to Hadoop, and with it SQL performance tuning for those using the platform. This technical session covers several topics in Impala performance analysis: understanding query execution plans, using query hints, interpreting Impala’s built-in query instrumentation, and examining Impala’s hardware resource utilization for different queries and workloads. We’ll also discuss design choices such as table partitioning and file formats, including the Parquet columnar storage format for Hadoop.