Delta lake setup
libraryDependencies += “io.delta” %% “delta-core” % “1.1.0”
val spark = SparkSession
.builder()
.con
fi
g(“spark.sql.extensions”,
“io.delta.sql.DeltaSparkSessionExtension”)
.con
fi
g(“spark.sql.catalog.spark_catalog”,
“spark.sql.delta.catalog.DeltaCatalog”)
.getOrCreate()
Slide 38
Slide 38 text
Timestamp Entity_name Entity_type Belongs to
2014-06-07 factory-worker-0 Pod StatefulSet
2014-06-07 factory-worker-1 Pod StatefulSet
2014-06-07 factory-worker-2 Pod StatefulSet
…
spark.read.format(“delta”).load(“/magic-db”)
Slide 39
Slide 39 text
“It’s possible to look directly
into the history of the data!”
Slide 40
Slide 40 text
Version Timestamp User Operation Comment
1 2014-06-07 Architect CREATE
Default name space
city
2 2014-06-08 Architect CREATE Factory
…
val deltaTable = DeltaTable.forPath(spark, “/magic-db”)
deltaTable.history()
Slide 41
Slide 41 text
0. World creation
Slide 42
Slide 42 text
1. Default namespace city creation
Slide 43
Slide 43 text
2. Factory creation
Slide 44
Slide 44 text
3. Forest discovery
Slide 45
Slide 45 text
4. The
fi
rst human’s visit
Slide 46
Slide 46 text
5. Pod lost
NAME READY STATUS AGE
pod/magic-db-cluster-0 1/1 Error 15m
pod/magic-db-cluster-1 1/1 Running 15m
pod/magic-db-cluster-2 1/1 Running 15m
Alice was sitting at the
cafe. Even her favourite
cheesecake could not
console her. She knew
s h e h a d t o d o
something to save the
pods and higher-order
functions.
Timestamp Entity_name Entity_type Belongs to
2021-11-01 magic-db-0 Pod CRD
2021-11-01 magic-db-1 Pod CRD
2021-11-01 magic-db-2 Pod CRD
…
Slide 61
Slide 61 text
val deltaTable = DeltaTable.forPath(“/magic-db”)
deltaTable.vacuum(3)
Slide 62
Slide 62 text
val deltaTable = DeltaTable.forPath(“/magic-db”)
deltaTable.vacuum(3)
Slide 63
Slide 63 text
Version Timestamp User Operation Comment
9 2021-11-01 Alice CREATE New pods created
10 2021-11-01 System DELETE All pods lost
val deltaTable = DeltaTable.forPath(spark, “/magic-db”)
deltaTable.history()