Slide 49
Slide 49 text
Data Warehousing in the Modern Data Stack
@ongchinhwee
“Since storage and compute are dirt cheap, engineering time is
expensive, why not snapshot all your data (and append new
partitions for each ETL schedule)?”
● Does not apply for very large dimensions
● Does not preclude the importance of dimensional data modelling
Related reading: Functional Data Enginering - a modern paradigm for batch data processing (and related
talks) by Maxime Beauchemin, creator of Airflow and Superset