Upgrade to Pro — share decks privately, control downloads, hide ads and more …

The REWE Data Platform

The REWE Data Platform

A step towards a data mesh

Posedio

May 22, 2024
Tweet

More Decks by Posedio

Other Decks in Programming

Transcript

  1. THE REWE DATA PLATFORM 1. Vision 2. The billion-euro cake

    3. Building blocks of a data mesh 4.Challenges 5. What we got right – and the impact it had 6. A platform is never finished
  2. 4 REWE International AG is part of the REWE Group

    - one of Europe's leading trading and travel and tourism groups. REWE Interna+onal AG 9,21 bn Euro Total External Sales 2021 92 Thousand Employees 2021 2 Thousand Apprentices 2021 2,65 Thousand Markets 2021 The REWE International AG in figures
  3. 5 The IT department with around 480 employees, creates customized

    software and hardware solutions for all domestic and international divisions of the group. Focus IT: Pleased to meet you!
  4. 6 Jobs at REWE IT – Data Platform Interested to

    join us? https://rewe-group.jobs/jobs/it-handelinternational Contact details: Mail: [email protected]
  5. 11 • Obvious data error • Requires collaboration between different

    departments • Sensitive data (prices) • Difficult to propagate the fix through all the systems WHY A CAKE
  6. 13 MEANING • Data is scattered across teams • Meaining

    is unclear or different outside the context of the team
  7. 14 QUALITY • Quality gates do no exists, data is

    not checked • Data graveyards • By-product of the operational processes
  8. 15 SECURITY • Some data is sensitive • Usage of

    some data must be tracked and reported • The people that produce the data are not always aware of all the rules
  9. 16 AVAILABILITY • Data is difficult to find • It

    takes weeks to get access to data • It takes weeks to give access to data
  10. 17

  11. 18 DOMAIN OWNERSHIP • Article masterdata domain is responsible for

    all article data • The team makes the data available to all
  12. 19 DATA PRODUCTS • The team is responsible for the

    data quality • Internal customer satisfaction is tracked • Tests catch errors
  13. 20 FEDERATED GOVERNANCE • A governence team decides on rules

    and access control • Rules are enforced by the system
  14. 21 SELF-SERVICE DATA PLATFORM • Data is easy to find

    • Data is easy to publish and consume • The platform enforces the rules
  15. 27 ARCHITECTURE – DEEP DIVE Sources Storage and Processing Data

    Integration article curated Semantics Layer (Processing) product Observability/monitoring GCS ✔schema ✔quality data contract Data platform run orders raw Level 1 Level n Level 1 Level n central /shared HaFi specific Cloud composer central orchestration Meta data quality check to be published in the data catalog • Table/Data description • 70% column description Data catalog
  16. 29 • Multiplatform • Easy onboarding • Data contracts •

    MLOps and AI • Streaming data THE WAY FORWARD