War stories of lighting a Spark in the Kubernetes sea

•Big data developer at Captif y •Diversity&Inclusion ambassador for Captify
Kyiv offic e •Women Who Code Kyiv Data Engineering Lead and Mento r •Speaker and traveller Roksolana Diachuk

01 02 03 How to run Spark on Kubernetes Research
stage Project development stage Agenda 04 Conclusions

What big data engineers do?

Spark jobs cluster managers

Kubernetes objects landscape Deployment Pod StatefulSet ReplicaSet Volume DaemonSet Service
Namespace Custom object

Kubernetes operator etcd Postgres Operator Postgres Deploymen t /StatefulSet Postgres
Deploymen t /StatefulSet Custom automation for work fl ow actions State kubectl apply

Spark Kubernetes operator Controllers Submission runner Spark Pod Monitor Mutating
Admission Webhook

Spark Kubernetes operator Controllers Submission runner Spark Pod Monitor Mutating
Admission Webhook API Server / Scheduler kubectl spark-app.yaml Spark Application Object Spark Application Pod Events Driver Executor Executor

Volume 1. Research

Documentation

Yaml file launch Bash script launch Story №1

NAME READY STATUS RESTARTS AG E spark-driver 0/1 Pending 0
0 s spark-driver 0/1 Init:0/1 0 0 s spark-driver 0/1 Init:Error 0 3s Story №1

Problem Solution Remove restart policy Story №1. API updates Custom
objects API update ¯\_(ツ)_/¯

apiVersion: sparkoperator.k8s.io/v1alpha1 kind: SparkApplicatio n metadata : name: spark-p i
namespace: defaul t spec : type: Scal a image: gcr.io/ynli-k8s/spark:v2.4. 0 mainClass: org.apache.spark.examples.SparkP i mainApplicationFile: local:///tmp/jars/spark_example.jar mode: cluste r deps: { } Spark-app.yaml

driver : coreLimit: 1000 m cores: 0. 1 labels :
version: 2.4. 0 memory: 1024 m serviceAccount: spar k executor : cores: 1 instances: 1 labels: version: 2.4. 0 memory: 1024 m imagePullPolicy: Neve r restartPolicy: Never Spark-app.yaml

Story №2 Docker images

NAME READY STATUS RESTARTS AG E sparkoperator 0/1 Pending 0
0 s sparkoperator 0/1 ContainerCreating 0 0 s sparkoperator 0/1 Error 0 5 s sparkoperator 0/1 CrashLoopBackOff 0 9s Story №2. Docker images

Problem Solution Docker image upgrade, images storage policies Story №2.
Docker images Outdated Docker image version

Story №3. Library choice Spark Application Executor Executor Driver Integration
tests

Story №3. Library choice

Problem Solution Limited library functionality More detailed research and discussion

Story №4. Subresources

The server could not find the requested resource message at
http://host:port/apis/ sparkoperator.k8s.io/v1alpha1/ namespaces/default/sparkapplications/ spark-example/status REST API call Story №4. Subresources

Problem Solution Spark operator bug Statuses added to spark-operator-crds.yaml Story
№4. Subresources

apiVersion: apiextensions.k8s.io/v1beta1 kind: CustomResourceDefinition metadata : name: sparkapplications.sparkoperator.k8s.io … subresources:
status: {} spark-operator-crds.yaml

Lessons learned • Thorough research and results discussion • Read
the documentation carefully • Community is everything

Volume 2. Project development

Project initiation

Story №1. Expertise

Problem Solution Lack of expertise with big data stack on
k8s Constant discussions and team education Consequence CI/CD creation took months Story №1. Expertise

Story №2.Logging and monitoring

spec: driver: javaOptions: -Dlog4j.configuration = /path/to/log4j.properties executor: javaOptions: -Dlog4j.configuration =
/path/to/log4j.properties Story №2.Logging and monitoring

Problem Solution log4j file configuration is not picked up Building
a config map and mounting it into Spark custom object Story №2.Logging and monitoring

Story №3.Infrastructure support

Story №3.Infrastructure support Problem Solution Shared development cluster Agreements about
the policies Consequence Data loss and constant infrastructure changes

Lessons learned • Keep in mind challenges while choosing the
tech stack • Make sure there’s enough expertise for the project development • Communicate a lot

• Expertise development in the departmen t • 1 production-level
project with missed deadline s • …lots of sleepless nights \_(ツ)_/ Results

Building big data infrastructures on top of Kubernetes is very
challenging but do not give up, it is fun! (may produce headaches and eye twitching)

Resources • Running Spark on Kubernetes documentation • Kubernetes documentatio
n • K. Hightower. Kubernetes: Up & Runnin g • G. Kim. Project Phoenix

https://github.com/GoogleCloudPlatform/ spark-on-k8s-operator Spark-k8s operator repo

github.com/ kubernetes-client/java github.com/fabric8io/ kubernetes-client Java-k8s client Fabric8io

dead_flowers22 roksolana-d My contact info roksolanadiachuk roksolanad

Thank you for attention!

War stories of lighting a Spark in the Kubernet...

War stories of lighting a Spark in the Kubernetes sea

More Decks by Roksolana

Featured

Transcript