Slide 35
Slide 35 text
● Implement CI/CD as early as possible
● Implement Infrastructure as Code (i.e: using Cloudformation) to
provision resources
○ EC2 instances
○ EMR cluster
○ Glue Jobs, Crawler
● Deploy for multi-environment (prod, stag, dev)
● Ensure all aspect in the pipeline (DAG, tables, resources) is defined
as code and versioned in Git
● Practice code review for every changes
5) Tips on Ops
35