Ahana 101: An introduction to Ahana Cloud for Presto on AWS, SaaS for Presto on AWS

Ahana
September 09, 2021

Presto is the fastest-growing query engine, used by companies like Facebook, Uber, Twitter and many more. While powerful, Presto can be complicated to run on your own, especially for a smaller team that may not have the required skill set.

That’s where Ahana comes in. Ahana Cloud is SaaS for Presto, giving teams of all sizes the power to deploy and manage Presto on AWS. Ahana takes care of hundreds of deployment and management configurations of Presto including attaching/detaching external data sources, configuration parameters, tuning, and much more.

In this webinar, Ram will discuss why companies are using Ahana Cloud for their Presto deployments and give an overview of Ahana, including:

The Ahana SaaS console
How easy it is to add data sources like AWS S3 and integrate catalogs like Hive
Features like autoscaling and Data Lake Caching for 5x performance

Transcript

  1. What is Presto?
     • Open source, distributed MPP SQL query engine
     • Originally developed at Facebook as a replacement for Hive
     • Query in Place: no need to move data (ETL) from the source
     • Federated Querying: join data across different sources and formats
     • ANSI SQL Compliant
     • Designed from the ground up for fast analytic queries against data of any size
     • Proven on petabytes of data
     • SQL-On-Anything
     • Federated querying and pluggable architecture to support many connectors
     • Open source, hosted on GitHub
     • https://github.com/prestodb
  2. Presto Use Cases
     • Data Lakehouse analytics
     • Reporting & dashboarding
     • Interactive ad hoc querying
     • Transformation using SQL (ETL)
     • Federated querying across data sources
  3. At A Glance
     • Ahana - The Company
     • Ahana Cloud is SaaS to Query Data Lakes
     • Simplifies SQL analytics on cloud data lakes like S3
     Team Ahana: Cloud, Database & Presto Experts
     • Steven Mih, Cofounder, CEO
     • Dipti Borkar, Cofounder, Chief Products Officer
     • Dave Simmen, Cofounder, Chief Technical Officer
     Awards
     • 2021 DBTA Best Data 100
     • 2021 Stevie Best Startup
     • 2021 Coolest Analytics
     • 2021 Top 10 Hot Big Data
     • 2020 Datanami Best Big Data Startup
     • 2020 Open Source 100
  4. Challenges with SQL on Open Data Lakes
     Cloud DW / AWS Serverless options get very expensive for growing data volumes
     ▪ Cloud data warehouse costs grow much faster than compute engine costs
     ▪ Serverless options like AWS Athena charge per query and get expensive
     “Do it yourself” approach is complicated
     ▪ Big data skills in platform teams are limited
     ▪ Presto is complicated and operationally very time consuming
     Presto on AWS in the form of AWS Athena has limited capabilities and doesn’t scale
     ▪ Limited concurrency of 20 per account
     ▪ No visibility into cluster logs or query logs; no flexibility or control over scale
  5. Ahana Cloud for Presto (architecture diagram)
     Ahana Console (Control Plane), in the Ahana Cloud Account
     • Cluster orchestration, consolidated logging, security & access, billing & support
     • Ahana console oversees and manages every Presto cluster
     In-VPC Presto Clusters (Compute Plane), in the Customer Cloud Account
     • Ad hoc (cluster 1), test (cluster 2), prod (cluster N)
     • Glue, S3, RDS, Elasticsearch
     • In-VPC orchestration of Presto clusters, where metadata, monitoring, and data sources reside
  6. Ahana Cloud – Reference Architecture
     • Distributed SQL engine with proven scalability
     • Interactive ANSI SQL queries
     • Query data where it lives with Federated Connectors (no ETL)
     • High concurrency
     • Separation of compute and storage
  7. How Carbon uses PrestoDB in the Cloud with Ahana to Power its Real-time Customer Dashboards
     Jordan Hoggart, Data Engineer at Carbon
  8. Ahana Cloud for Presto - Summary
     ▪ Brings SQL on AWS S3 with an open data lake
     ▪ Presto compute brought to your data in your VPC in your account
     ▪ Fully managed Presto cluster life cycle including idle-time management
     ▪ Query AWS DBs: RDS/MySQL, RDS/Postgres, Elasticsearch, Redshift
     ▪ Cloud-native and highly available running on Kubernetes
     ▪ Bring your own
        ▪ BI tool / Data Science Notebook
        ▪ Metadata Catalog
        ▪ Transaction Manager
     Easy to use · 3x Price Performance · Open & Flexible
  9. Give it a spin
     • Ahana Cloud is available on the AWS Marketplace
     • Sign up for a 14-day free trial here: https://ahana.io/sign-up
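
The "Federated Querying" point from the first slide can be made concrete with a small sketch. Presto addresses tables as catalog.schema.table, so a single SQL statement can join tables that live in different data sources. The catalog, schema, table, and column names below are hypothetical and do not come from the deck; the snippet only builds the SQL text and does not connect to a cluster.

```python
# Hypothetical catalogs, schemas, and tables, for illustration only.
LAKE_TABLE = "hive.web.page_views"   # e.g. files on S3 exposed through a Hive/Glue catalog
DB_TABLE = "mysql.crm.customers"     # e.g. a table in an RDS MySQL database


def qualified_name_parts(name: str) -> tuple[str, str, str]:
    """Split a Presto table name into (catalog, schema, table)."""
    parts = name.split(".")
    if len(parts) != 3:
        raise ValueError(f"expected catalog.schema.table, got {name!r}")
    return parts[0], parts[1], parts[2]


def federated_join_sql(lake_table: str, db_table: str, key: str) -> str:
    """Build one SQL statement that joins tables from two different catalogs."""
    for name in (lake_table, db_table):
        qualified_name_parts(name)  # validate the three-part form
    return (
        f"SELECT c.{key}, count(*) AS views\n"
        f"FROM {lake_table} AS v\n"
        f"JOIN {db_table} AS c ON v.{key} = c.{key}\n"
        f"GROUP BY c.{key}"
    )


if __name__ == "__main__":
    print(federated_join_sql(LAKE_TABLE, DB_TABLE, "customer_id"))
```

The resulting statement could be submitted through any Presto client or BI tool; the join runs inside Presto, so neither table has to be copied into the other system first.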