rights reserved. Amazon Confidential and Trademark. Distributed engine layer OTFs’ layer Data layer Read consistent/old data Write data with rollback support change/delete single or multiple records Read/Write through open table format https://pages.awscloud.com/rs/112-TZM-766/images/AWS-Black-Belt_2023_Datalake-Format- On-AWS_0516_v1.pdf 19 Open Table Format (OTF) レイヤーを挟んで 高度なニーズに対応する
rights reserved. Amazon Confidential and Trademark. Source Data AWS Glue Data Catalog Iceberg base Data Lake Amazon Athena Data Ingestion Data Analytics Iceberg/Spark Amazon EMR AWS Glue ETL Amazon Managed Service for Apache Flink Iceberg/Flink Streaming Batch Amazon Managed Streaming for Apache Kafka 全体アーキテクチャ例 25
rights reserved. Amazon Confidential and Trademark. 32 • ACID transactions with Athena & Iceberg Workshop • Accelerate data science feature engineering on transactional data lakes using Amazon Athena with Apache Iceberg • Amazon Athena、Amazon EMR、および AWS Glue を使用した Apache Iceberg データレイクの構築 • Perform upserts in a data lake using Amazon Athena and Apache Iceberg • Build a transactional data lake using Apache Iceberg, AWS Glue, and cross-account data shares using AWS Lake Formation and Amazon Athena • Use Apache Iceberg in a data lake to support incremental data processing • Build a real-time GDPR-aligned Apache Iceberg data lake • Automate replication of relational sources into a transactional data lake with Apache Iceberg and AWS Glue • Interact with Apache Iceberg tables using Amazon Athena and cross account fine-grained permissions using AWS Lake Formation • Build a serverless transactional data lake with Apache Iceberg, Amazon EMR Serverless, and Amazon Athena 参考リソース