Lock in $30 Savings on PRO—Offer Ends Soon! ⏳

The REWE Data Platform - A step towards a data ...

The REWE Data Platform - A step towards a data mesh

A data mesh in the wild – in this talk we will talk openly about our journey of setting up a data mesh at REWE International. We will cover the good, the bad and the ugly, starting from the reason why we decided on a data mesh, the initial attempts and challenges and
the current state and architecture. We will end with an overview of our future plans for the REWE data platform.

Avatar for Posedio

Posedio PRO

May 22, 2024
Tweet

More Decks by Posedio

Other Decks in Programming

Transcript

  1. THE REWE DATA PLATFORM 1. Vision 2. The billion-euro cake

    3. Building blocks of a data mesh 4.Challenges 5. What we got right – and the impact it had 6. A platform is never finished
  2. 4 REWE International AG is part of the REWE Group

    - one of Europe's leading trading and travel and tourism groups. REWE Interna+onal AG 9,21 bn Euro Total External Sales 2021 92 Thousand Employees 2021 2 Thousand Apprentices 2021 2,65 Thousand Markets 2021 The REWE International AG in figures
  3. 5 The IT department with around 480 employees, creates customized

    software and hardware solutions for all domestic and international divisions of the group. Focus IT: Pleased to meet you!
  4. 6 Jobs at REWE IT – Data Platform Interested to

    join us? https://rewe-group.jobs/jobs/it-handelinternational Contact details: Mail: [email protected]
  5. 11 • Obvious data error • Requires collaboration between different

    departments • Sensitive data (prices) • Difficult to propagate the fix through all the systems WHY A CAKE
  6. 13 MEANING • Data is scattered across teams • Meaining

    is unclear or different outside the context of the team
  7. 14 QUALITY • Quality gates do no exists, data is

    not checked • Data graveyards • By-product of the operational processes
  8. 15 SECURITY • Some data is sensitive • Usage of

    some data must be tracked and reported • The people that produce the data are not always aware of all the rules
  9. 16 AVAILABILITY • Data is difficult to find • It

    takes weeks to get access to data • It takes weeks to give access to data
  10. 17

  11. 18 DOMAIN OWNERSHIP • Article masterdata domain is responsible for

    all article data • The team makes the data available to all
  12. 19 DATA PRODUCTS • The team is responsible for the

    data quality • Internal customer satisfaction is tracked • Tests catch errors
  13. 20 FEDERATED GOVERNANCE • A governence team decides on rules

    and access control • Rules are enforced by the system
  14. 21 SELF-SERVICE DATA PLATFORM • Data is easy to find

    • Data is easy to publish and consume • The platform enforces the rules
  15. 27 ARCHITECTURE – DEEP DIVE Sources Storage and Processing Data

    Integration article curated Semantics Layer (Processing) product Observability/monitoring GCS ✔schema ✔quality data contract Data platform run orders raw Level 1 Level n Level 1 Level n central /shared HaFi specific Cloud composer central orchestration Meta data quality check to be published in the data catalog • Table/Data description • 70% column description Data catalog
  16. 29 • Multiplatform • Easy onboarding • Data contracts •

    MLOps and AI • Streaming data THE WAY FORWARD