Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Amazon S3 NYJavaSIG 2024-12-12
Search
sullis
December 12, 2024
Programming
0
140
Amazon S3 NYJavaSIG 2024-12-12
Amazon S3
NYJavaSIG
New York, NY
sullis
December 12, 2024
Tweet
Share
More Decks by sullis
See All by sullis
Amazon S3 - Portland Java User Group 2024-09-17
sullis
0
68
Netty - Montreal Java User Group 2024-05-21
sullis
0
150
Netty Chicago Java User Group 2024-04-17
sullis
0
980
Java 21 - Portland Java User Group 2023-10-24
sullis
0
300
Microbenchmarking with JMH - Portland 2023-03-14
sullis
0
130
Code generation on the Java VM 2022-04-19
sullis
0
120
Mockito 2022-01-25
sullis
0
170
GitHub Actions 2021-12-16
sullis
0
40
Apache Struts and the Equifax data breach 2021-06-03
sullis
0
55
Other Decks in Programming
See All in Programming
盆栽転じて家具となる / Bonsai and Furnitures
aereal
0
3.6k
Djangoアプリケーション 運用のリアル 〜問題発生から可視化、最適化への道〜 #pyconshizu
kashewnuts
1
200
Alba: Why, How and What's So Interesting
okuramasafumi
0
240
Grafana Loki によるサーバログのコスト削減
mot_techtalk
1
110
Rubyでつくるパケットキャプチャツール
ydah
1
730
バックエンドのためのアプリ内課金入門 (サブスク編)
qnighy
8
1.7k
振り返れば奴(Cline)がいる
keiyagi
0
180
『改訂新版 良いコード/悪いコードで学ぶ設計入門』活用方法−爆速でスキルアップする!効果的な学習アプローチ / effective-learning-of-good-code
minodriven
29
5.1k
Linux && Docker 研修/Linux && Docker training
forrep
23
4.4k
Rails アプリ地図考 Flush Cut
makicamel
1
110
AIの力でお手軽Chrome拡張機能作り
taiseiue
0
160
ASP. NET CoreにおけるWebAPIの最新情報
tomokusaba
0
320
Featured
See All Featured
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
44
7k
Into the Great Unknown - MozCon
thekraken
34
1.6k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
49
2.3k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
160
15k
The Illustrated Children's Guide to Kubernetes
chrisshort
48
49k
GraphQLの誤解/rethinking-graphql
sonatard
68
10k
Large-scale JavaScript Application Architecture
addyosmani
510
110k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
175
51k
StorybookのUI Testing Handbookを読んだ
zakiyama
28
5.5k
A designer walks into a library…
pauljervisheath
205
24k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
20
2.4k
Why You Should Never Use an ORM
jnunemaker
PRO
55
9.2k
Transcript
Amazon S3 NYJavaSIG 2024-12-12 Sean Sullivan
AWS Developers
Agenda AWS re:Invent 2024 Amazon S3 AWS SDK Bonus topics
AWS re:Invent 2024 S3 Tables S3 Metadata
AWS re:Invent 2024 fully managed Iceberg tables
AWS re:Invent 2024 “Table buckets”
AWS re:Invent 2024 Iceberg tables as fi rst-class AWS resources
AWS re:Invent 2024
AWS re:Invent 2024 pom.xml
AWS re:Invent 2024 automatic generation of metadata that is captured
when S3 objects are added or modi fi ed stored in fully managed Apache Iceberg tables
Amazon S3 “Simple Storage Service” Launched on March 14, 2006
Amazon S3 S3 is an object storage service with an
HTTP REST API https://www.allthingsdistributed.com/2023/07/building-and-operating-a-pretty-big-storage-system.html
Amazon S3 “There is a frontend fl eet with a
REST API, a namespace service, a storage fl eet that’s full of hard disks, and a fl eet that does background operations.” https://www.allthingsdistributed.com/2023/07/building-and-operating-a-pretty-big-storage-system.html
S3 core concepts Buckets Objects
S3 core concepts An Amazon S3 object represents a fi
le or collection of data Every object must reside within a bucket
S3 bucket types General purpose buckets Directory buckets Table buckets
S3 bucket names an Amazon S3 bucket name is globally
unique the namespace is shared by all AWS accounts
S3 pricing https://aws.amazon.com/s3/pricing/ “You pay for storing objects in your
S3 buckets. The rate you’re charged depends on your objects' size, how long you stored the objects during the month, and the storage class”
S3 storage classes https://aws.amazon.com/s3/storage-classes/ “Amazon S3 o ff ers a
range of storage classes that you can choose from based on the performance, data access, resiliency, and cost requirements of your workloads.”
S3 storage classes https://aws.amazon.com/s3/storage-classes/ Standard Intelligent Tiering Express One Zone
… and many others
Storage class choice matters https://www.youtube.com/watch?v=RxgYNrXPOLw
S3 REST API 3314 pages
S3 operations Upload object List objects Download object Copy Move
Delete
using S3 in a Java application
AWS SDK for Java v1 AWS SDK for Java v2
AWS SDK for Kotlin
AWS SDK for Java v1 https://aws.amazon.com/blogs/developer/announcing-end-of-support-for-aws-sdk-for-java-v1-x-on-december-31-2025/
Open source SDK’s
AWS SDK for Java v2 pom.xml
AWS SDK for Java v2 pom.xml
AWS SDK for Java v2 : HTTP clients Apache Client
Netty Client CRT Client
AWS SDK for Java v2: CRT client pom.xml
CRT @ AWS re:Invent 2024 https://youtu.be/2DSVjJTRsz8?t=833
Di ff erent fl avors of S3 clients Async Sync
how to create an S3 bucket?
Creating an S3 bucket AWS Console UI AWS CLI AWS
SDK CloudFormation AWS CDK Terraform Pulumi Infrastructure as Code Other
CloudFormation
Pulumi
s3-playground https://github.com/sullis/s3-playground
s3-playground https://github.com/sullis/s3-playground
testing S3 locally Localstack MinIO Adobe S3Mock Testcontainers Testcontainers Testcontainers
S3 with MinIO S3MinioTest.java
how to upload an object? PutObjectRequest
how to retrieve an object? GetObjectRequest
how to upload large objects? CreateMultipartUploadRequest
parallel uploads? S3TransferManager
Big Data analytics?
Iceberg @ Net fl ix 2018 June 2018 https://www.youtube.com/watch?v=nWwQMlrjhy0 S3
Apache Iceberg 2024 A table format is a method of
structuring a dataset’s files to present them as a unified “table.”
Apache Iceberg 2024 In a data lake, all your data
is stored as files in some storage solution (e.g. Amazon S3)
AWS re:Invent 2023 Ryan Blue
AWS re:Invent 2023 S3
AWS re:Invent 2023
AWS re:Invent 2023 “Too many small fi les are a
problem”
Apache Iceberg project https://github.com/apache/iceberg
Iceberg pull request https://github.com/apache/iceberg/pull/11349
S3 Conditional Writes https://aws.amazon.com/about-aws/whats-new/2024/08/amazon-s3-conditional-writes/ Conditional writes can ensure there is
no existing object with the same key name in you bucket during PUT operations
S3 bucket permissions https://docs.aws.amazon.com/AmazonS3/latest/API/API_PutBucketLifecycleCon fi guration.html By default, all Amazon
S3 resources are private, including buckets, objects, and related subresources
S3 performance considerations?
S3 performance https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimizing-performance.html "your application can achieve at least 3,500
PUT/COPY/POST/DELETE or 5,500 GET/HEAD requests per second per partitioned Amazon S3 pre fi x”
S3 performance https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimizing-performance.html "There are no limits to the number
of pre fi xes in a bucket. You can increase your read or write performance by using parallelization”
S3 performance https://docs.aws.amazon.com/AmazonS3/latest/userguide/optimizing-performance.html “While Amazon S3 is scaling to your
new higher request rate, you may see some 503 (Slow Down) errors. These errors will dissipate when the scaling is complete.”
KubeCon November 2024
re:Invent December 2024
The End
Bonus content
AWS re:Invent 2024
AWS re:Invent 2024
AWS re:Invent 2024
AWS re:Invent 2024
AWS re:Invent 2024
AWS re:Invent 2024