Ahana 101: An introduction to Ahana Cloud for Presto on AWS, SaaS for Presto on AWS

Ahana
September 09, 2021

Presto is the fastest-growing query engine, used by companies like Facebook, Uber, Twitter, and many more. While powerful, Presto can be complicated to run on your own, especially for a smaller team that may not have the required skill set.

That’s where Ahana comes in. Ahana Cloud is SaaS for Presto, giving teams of all sizes the power to deploy and manage Presto on AWS. Ahana takes care of hundreds of deployment and management configurations of Presto including attaching/detaching external data sources, configuration parameters, tuning, and much more.

In this webinar Ram will discuss why companies are using Ahana Cloud for their Presto deployments and give an overview of Ahana including:

The Ahana SaaS console
How easy it is to add data sources like AWS S3 and integrate catalogs like Hive
Features like Data Lake Caching for 5x performance and autoscaling

Transcript

  1. An Introduction to Ahana Cloud
    Ram @ ahana.io
    Sept 09 2021

  2. Agenda
    • What is Presto?
    • What is Ahana?
    • Ahana Demo

  3. What is Presto?

  4. What is Presto?
    • Open source, distributed MPP SQL query engine
    • Originally developed at Facebook as a replacement for Hive
    • Query in place -- no need to move data (ETL) from the source
    • Federated querying -- join data from different sources and formats
    • ANSI SQL compliant
    • Designed from the ground up for fast analytic queries against data of any size
    • Proven on petabytes of data
    • SQL-on-Anything
    • Federated querying and pluggable architecture to support many connectors
    • Open source, hosted on GitHub
    • https://github.com/prestodb
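
The federated-querying bullets above can be illustrated with a small sketch. Presto addresses tables as catalog.schema.table, so a single SQL statement can join tables served by different connectors. The catalog, schema, table, and column names below (hive.web.clicks, mysql.crm.customers, customer_id) are hypothetical, and the helpers only build the SQL text; they do not connect to a cluster.

```python
def qualified(catalog: str, schema: str, table: str) -> str:
    """Return a fully qualified Presto table name: catalog.schema.table."""
    return f"{catalog}.{schema}.{table}"


def federated_join_sql(left: str, right: str, key: str) -> str:
    """Build a SQL join across two fully qualified tables on a shared key."""
    return (
        f"SELECT l.{key}, count(*) AS events\n"
        f"FROM {left} AS l\n"
        f"JOIN {right} AS r ON l.{key} = r.{key}\n"
        f"GROUP BY l.{key}"
    )


# Hypothetical names: a click log in a Hive catalog (for example, files on S3)
# joined with a customer table in a MySQL catalog.
clicks = qualified("hive", "web", "clicks")
customers = qualified("mysql", "crm", "customers")
sql = federated_join_sql(clicks, customers, "customer_id")
print(sql)
```

The resulting statement could be submitted through any Presto client; the point is only that both sources appear in one query, with no ETL step in between.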

  5. Presto Use Cases
    • Data lakehouse analytics
    • Reporting & dashboarding
    • Interactive ad hoc querying
    • Transformation using SQL (ETL)
    • Federated querying across data sources

  6. Presto Users

  7. Introducing Ahana Cloud

  8. At A Glance
    • Ahana - The Company
    • Ahana Cloud is SaaS to Query Data Lakes
    • Simplifies SQL analytics on cloud data lakes like S3
    Team Ahana: Cloud, Database & Presto Experts
    • Steven Mih, Cofounder, CEO
    • Dipti Borkar, Cofounder, Chief Products Officer
    • Dave Simmen, Cofounder, Chief Technical Officer
    Awards
    • 2021 DBTA Best Data 100
    • 2021 Stevie Best Startup
    • 2021 Coolest Analytics
    • 2021 Top 10 Hot Big Data
    • 2020 Datanami Best Big Data Startup
    • 2020 Open Source 100

  9. Ahana Cloud is a fully managed, cloud-native Presto service

  10. Challenges with SQL on Open Data Lakes
    Cloud DW / AWS serverless options get very expensive for growing data volumes
    ▪ Cloud data warehouse costs grow much faster than compute engine costs
    ▪ Serverless options like AWS Athena charge per query and get expensive
    “Do it yourself” approach is complicated
    ▪ Big data skills in platform teams are limited
    ▪ Presto is complicated and operationally very time consuming
    Presto on AWS, like AWS Athena, has limited capabilities and doesn’t scale
    ▪ Limited concurrency of 20 per account
    ▪ No visibility into cluster logs or query logs; no flexibility / control on scale
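
The cost concern on this slide can be made concrete with a rough break-even sketch. It compares a usage-based bill, modeled here as a charge per terabyte scanned, against a fixed-size cluster billed per node-hour. Every price and size below is a hypothetical placeholder chosen for easy arithmetic, not an AWS or Ahana list price, and the function names are illustrative only.

```python
def usage_based_cost(tb_scanned: float, price_per_tb: float) -> float:
    """Cost of a usage-based service that bills on data scanned."""
    return tb_scanned * price_per_tb


def cluster_cost(hours: float, nodes: int, price_per_node_hour: float) -> float:
    """Cost of a fixed-size cluster billed per node-hour."""
    return hours * nodes * price_per_node_hour


def break_even_tb(hours: float, nodes: int,
                  price_per_node_hour: float, price_per_tb: float) -> float:
    """Data volume at which both pricing models cost the same."""
    return cluster_cost(hours, nodes, price_per_node_hour) / price_per_tb


# Hypothetical inputs: a 4-node cluster running for 720 hours (about a month)
# at 0.50 per node-hour, versus 5.00 per TB scanned.
monthly_cluster = cluster_cost(720, 4, 0.50)
threshold = break_even_tb(720, 4, 0.50, 5.00)
print(monthly_cluster, threshold)
```

Under these placeholder numbers, scanning more than the break-even volume in a month makes the usage-based model the more expensive one; real comparisons depend on actual prices, query patterns, and cluster utilization.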


  11. Ahana Cloud Architecture

  12. Ahana Cloud for Presto
    Ahana Cloud Account: Ahana Console (Control Plane)
    • CLUSTER ORCHESTRATION
    • CONSOLIDATED LOGGING
    • SECURITY & ACCESS
    • BILLING & SUPPORT
    Ahana console oversees and manages every Presto cluster
    Customer Cloud Account: In-VPC Presto Clusters (Compute Plane)
    • AD HOC CLUSTER 1
    • TEST CLUSTER 2
    • PROD CLUSTER N
    • Glue, S3, RDS, Elasticsearch
    In-VPC orchestration of Presto clusters, where metadata, monitoring, and data sources reside

  13. Ahana Cloud – Reference Architecture
    • Distributed SQL engine with proven scalability
    • Interactive ANSI SQL queries
    • Query data where it lives with Federated Connectors (no ETL)
    • High concurrency
    • Separation of compute and storage

  14. Ahana Demo
    Zero to Presto in 30 mins

  15. Value of Ahana Cloud
    3x Better Price/Performance VS
    3x faster SQL data transformation jobs at same price

  16. How Carbon uses PrestoDB in the Cloud with Ahana to Power its Real-time Customer Dashboards
    Jordan Hoggart, Data Engineer at Carbon

  17. Ahana Cloud for Presto - Summary
    ▪ Brings SQL on AWS S3 with an open data lake
    ▪ Presto compute brought to your data in your VPC in your account
    ▪ Fully managed Presto cluster life cycle including idle-time management
    ▪ Query AWS DBs - RDS/MySQL, RDS/Postgres, Elasticsearch, Redshift
    ▪ Cloud-native and highly available running on Kubernetes
    ▪ Bring your own
      ▪ BI tool / Data Science Notebook
      ▪ Metadata Catalog
      ▪ Transaction Manager
    Easy to use, 3x Price Performance, Open & Flexible

  18. Wrapping Up

  19. Give it a spin
    • Ahana Cloud is available on the AWS Marketplace
    • Sign up for a 14-day free trial here: https://ahana.io/sign-up

  20. Thank you!
    Stay Up-to-Date with Ahana
    Website: https://ahana.io/
    Blogs: https://ahana.io/blog/
    Twitter: @ahanaio

  21. Questions?