Slide 1

Slide 1 text

Navigating the Azure Cloud at Digitec Galaxus Our Journey and Lessons Learned Digitec Galaxus AG Olivier Girard | Gerald Schermann 16.04.2024

Slide 2

Slide 2 text

Olivier Girard Domain Architect Platform / Online Shop

Slide 3

Slide 3 text

Gerald Schermann Software Architect Platform & SysOps Arlberg Servus Grüezi 2014

Slide 4

Slide 4 text

Our Cloud Journey 1 - The Past 2 - The Present 3 – The Learnings 0 - About Us

Slide 5

Slide 5 text

What we're about and how we work

Slide 6

Slide 6 text

Innovative Ambitious Responsible Cooperative Piratesque

Slide 7

Slide 7 text

We also run six additional shops in the EU galaxus.ch digitec.ch galaxus.fr galaxus.de galaxus.at galaxus.it galaxus.nl galaxus.be

Slide 8

Slide 8 text

Our Applications

Slide 9

Slide 9 text

9 Our Cloud Journey – The Past

Slide 10

Slide 10 text

On-prem Hosting Manual Deployment 2015 … digitec.ch/galaxus.ch Revenue 645M Product Catalog 300K Orders 2M 350 300 250 200 150 100 50 2007 2015 2018 2024 Employees Product Development Monolith Layered Architecture ERP OnlineShop DWH Monolith DB TL PO SA Dev 5 Scrum Teams 2015

Slide 11

Slide 11 text

Cloud Hosting (Azure/GCP) Continuous Delivery ... 2018 Monolith Layered Architecture ERP OnlineShop DWH Monolith DB CH + galaxus.de Revenue 970M Product Catalog 2M Orders >4M 350 300 250 200 150 100 50 2007 2015 2018 2024 Employees Product Development TL PO SA Dev > 10 Scrum Teams OnlineShop vNext GraphQL Community Search Microservices 2018

Slide 12

Slide 12 text

Hosting pre-2017 CDN Windows LB VMs Shop/ERP VMs Shop/ERP VMs Shop/ERP VMs Our own hardware … 2017

Slide 13

Slide 13 text

Hosting pre-2017: lift & shift CDN Windows LB VMs Shop/ERP VMs Shop/ERP VMs Shop/ERP VMs Microservices in AKS IIS reverse proxy … 2017

Slide 14

Slide 14 text

Hosting in 2017 CDN Windows LB VMs Shop/ERP VMs Shop/ERP VMs Shop/ERP VMs Microservices in AKS IIS reverse proxy 2017 BLACK FRIDAY

Slide 15

Slide 15 text

Black Friday 2017 2017

Slide 16

Slide 16 text

Hosting in 2017 CDN Windows LB VMs Shop/ERP VMs Shop/ERP VMs Shop/ERP VMs Microservices in AKS IIS reverse proxy BLACK FRIDAY 2017 Redis

Slide 17

Slide 17 text

Hosting in 2018 CDN Windows LB VMs Shop/ERP VMs Shop/ERP VMs Shop/ERP VMs Microservices in AKS Routing Layer AKS in-place upgrades? IIS reverse proxy Microservices in AKS Redis 2018

Slide 18

Slide 18 text

Hosting in 2018 CDN Windows LB VMs Shop/ERP VMs Shop/ERP VMs Shop/ERP VMs Routing Layer IIS reverse proxy Microservices in AKS Redis 2018 Single point of failure? Maybe? Wait until 2022 … the day the routers died

Slide 19

Slide 19 text

Hosting in 2019 / 2020 CDN Shop/ERP VMs Shop/ERP VMs Shop/ERP VMs Routing Layer Microservices in AKS Redis ERP VMs Containerization facts: From .NET Full Framework to .NET Core Replacing all Windows-only dependencies 9 months 2020

Slide 20

Slide 20 text

DevOps – Early Days VCS & Continuous Delivery SRE Team No deployments on Friday! Logging / Monitoring / Alerting Application Insights Prometheus Logs, Alerts, Metrics K8s (Workload) Metrics, Alerts Metric correlation & sampling Scaling issues Grafana 2020

Slide 21

Slide 21 text

More and more Data, and distributed DWH Monolith DB MS SQL (> 2 TB) Search Index Elastic ERP OnlineShop Business Intelligence Community Community Mongo DB Data propagation over Azure Service Bus nightly full-copy 2020

Slide 22

Slide 22 text

Our Cloud Journey – The Present

Slide 23

Slide 23 text

Containers everywhere … 2024 ERP Data Platform Galaxus in 7 countries Revenue 2370M Product Catalog 6M Orders 15M 350 300 250 200 150 100 50 2007 2015 2018 2024 Employees Product Development 35M Sessions/month 200 Page Views/sec >700 Services/DBs in prod >6M LoC TL PO SA Dev > 45 Scrum Teams Online Shop ShopProduct Modular Monolith Microservices (or Microliths) Unstructured Monolith Layered Architecture … Monolith DB 2024

Slide 24

Slide 24 text

Modular Architecture E-Commerce by DG Customer Service Retail Editorial HR … Sales Order Fullfillment Finance Marketplace Sales Category Management Warehousing Purchasing Provider Management Community Marketing Community

Slide 25

Slide 25 text

Data Platform Modular Architecture ERP Online Shop Backend Data Analysis Implementation 2024

Slide 26

Slide 26 text

Modular Architecture Online Shop Frontend Hosting/Ops 2024

Slide 27

Slide 27 text

Hosting CDN Routing Layer Microservices in AKS Redis CPU-bound Memory-bound «No interruptions» «Quarantine» Node Pools + Spot Instances Scale to Zero + Previews Infrastructure as Code Multi-Cluster Management 2024

Slide 28

Slide 28 text

Preview Deployments with Knative 2024 Preview Pod per PR “Regular” Pods based on main Other Services Service calls

Slide 29

Slide 29 text

Preview Deployments with Knative 2024

Slide 30

Slide 30 text

The day the routers died reboot recreation 2022

Slide 31

Slide 31 text

The day the routers died reboot recreation 2022

Slide 32

Slide 32 text

Culture: Post Mortems Document Incident Understand Root Cause Preventive Actions 2024

Slide 33

Slide 33 text

Data (is gold) Data Platform Big Query event-carried state transfer 2024 single writer principle data on the inside data on the outside data mesh

Slide 34

Slide 34 text

Autonomous Teams TL Scrum Team PO SA Dev mission OKR x-functional autonomous customers business 6-10 members 2024

Slide 35

Slide 35 text

We do Dev*Ops

Slide 36

Slide 36 text

Internal Developer Platform TL Feature Team PO SA Dev Platform Database Hosting Dev Tooling Observability Messaging / Kafka Data Platform Delivery (CI/CD) Networking Security TL Platform Team PO Platform Architect DevOps Engineer Data Engineer Data Team PO Data Scientist TL support / enabling / governance customer needs / feedback 2024

Slide 37

Slide 37 text

Dev*Ops and Platform Observability Datadog VCS & Continuous Deployment Security Kyverno Network Policies Managed Identities Security Team 2024

Slide 38

Slide 38 text

Developer Tools Dev Tools & Local Development Developer Portal DB Self-Service & (Cost) Transparency Missing Pieces Feature Team 2024

Slide 39

Slide 39 text

2024 Developer Tools Dev Tools & Local Development Developer Portal DB Self-Service & (Cost) Transparency Missing Pieces Feature Team

Slide 40

Slide 40 text

2024 Developer Tools Dev Tools & Local Development Developer Portal DB Self-Service & (Cost) Transparency Missing Pieces Feature Team

Slide 41

Slide 41 text

Developer Tools Dev Tools & Local Development Developer Portal DB Self-Service & (Cost) Transparency Missing Pieces Feature Team 2024

Slide 42

Slide 42 text

Our Cloud Journey – The Learnings

Slide 43

Slide 43 text

Top 3 Lessons Learned not-invented-here vs. managed / industry standard 1 distributed vs. modular 2 one-size fits all vs. specific solution 3 …

Slide 44

Slide 44 text

Next Up in Our Journey de-couple to scale (sagas, streaming) event-driven architecture Simplify and more self-service to increase productivity system overview / observability, explicit know how and governance holistic system view service mesh, federated governance and security multi/hybrid cloud …

Slide 45

Slide 45 text

No content

Slide 46

Slide 46 text

galaxus.ch/jobs Olivier Girard Domain Architect – Online Shop / Platform https://www.linkedin.com/in/oligirard/ Gerald Schermann Software Architect – Platform https://www.linkedin.com/in/gerald-schermann/ Tech Blog