Building large-scale batch of media with AWS

Building large-scale batch of media with AWS 2024/06/19 Japanglish Tech
Talk

Building a large-scale batch is diﬃcult. - It’s like tackling
big waves while many different batches spark and come along intensely. - We should prepare well.

Self-Introduction - My name is Kaito Murata - I work
at istyle, creating @cosme we are now at 25 years anniversary! - I am surfer lover - I am 5th year software developer keywords: Next.js, Node.js, AWS

Today I will talk about… How we have been building
the large scale batch of - existing batch system is composed of 30+ different jobs, and each batch inﬂuence several tables which inﬂuence 10+ subsystems - how we realized complexing development style with 7+ members. - how we replaced the existing on-premise batch with AWS system.

And you will learn - How to control large-scale batch
systems. like surfers control waves.

Batch System Requirements: - Composed of 30+ different batches, we
need a powerful tool to monitorize each batch’s behavior and performance. - As each batch runs and sometimes fails, we need the quickest way to re-run if any batch fails. Also, we should notice errors quickly. Infrastructure batch1 batch2 batchN ・・・

Infrastructure to meet the batch requirements: - Event Bridge -
Step Functions - ECS on Fargete ※ Be careful so as not to reoccur the same batch(Next Page).

Infrastructure to meet the batch requirements: - EventBridge ensures 1+
run in each job, and this sometimes causes more than 1 time run - To prevent this, use the unique ID of which each workﬂow issues in each time. Start End Run Error ID Veriﬁcation

How 7+ SWEs work simultaneously without conﬂicts? - We adapted
DI(Dependency Injection)patterns with Tsrynge. - While each repository(DB-connection)parts are separated, each repository function is used in each developer’s batch use case. - We started writing unit-tests from the ﬁrst.

Created Development Standard(Criteria) The criteria includes the points such as:
- Logging information is well written enough with parameters. - Unit Tests are written enough (which was made easy with DI pattern). - Summarization of every table and pages on which batch has inﬂuence. - Each member knows how to re-run the batch with documentation.

Logging and Error Monitorization - All errors are captured in
New Relic. - All errors and warnings are notiﬁed in a slack channel. - Any developers can ﬁx errors with manual.

Thank you for coming! Please follow me at x :
r_devops zenn: r_devops

Building large-scale batch of media with AWS

Building large-scale batch of media with AWS

muratak

More Decks by muratak

Featured

Transcript

Building large-scale batch of media with AWS 2024/06/19 Japanglish Tech

Building a large-scale batch is diﬃcult. - It’s like tackling

Self-Introduction - My name is Kaito Murata - I work

Today I will talk about… How we have been building

And you will learn - How to control large-scale batch

Batch System Requirements: - Composed of 30+ different batches, we

Infrastructure to meet the batch requirements: - Event Bridge -

Infrastructure to meet the batch requirements: - EventBridge ensures 1+

How 7+ SWEs work simultaneously without conﬂicts? - We adapted

Created Development Standard(Criteria) The criteria includes the points such as:

Logging and Error Monitorization - All errors are captured in

Thank you for coming! Please follow me at x :