Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Data Catalogs - Rebuild the Broken Promise

Data Catalogs - Rebuild the Broken Promise

Is Data Catalog still relevant? Why does most data catalog fail? What can we do to prevent the failures?

Ananth Packkildurai

April 23, 2023
Tweet

More Decks by Ananth Packkildurai

Other Decks in Technology

Transcript

  1. Data Catalogs
    - Rebuild the Broken Promise
    Ananth Packkildurai

    View Slide

  2. Slack
    Data
    Engineer
    Zendesk
    Principal Data
    Engineer
    Creator
    Schemata -
    Data Contract
    Platform
    Author
    Data
    Engineering
    Weekly

    View Slide

  3. Data Catalog is an expensive
    data ingestion platform you
    never intend to build.

    View Slide

  4. How Happy are you with Data Catalog?

    View Slide

  5. What is the
    purpose of
    a Data
    Catalog?
    Data Lineage
    Data Governance
    Data Discovery
    Metadata Management
    Collaboration

    View Slide

  6. Data Engineering in a Nutshell

    View Slide

  7. Data Catalog - A Disjointed Space

    View Slide

  8. MAD Landscape - 2012

    View Slide

  9. Data Engineering Function 2012

    View Slide

  10. MAD Landscape 2023

    View Slide

  11. Don’t Strive for
    comprehensive &
    Complete
    Coverage

    View Slide

  12. Don’t Desire for
    Single Source of
    Metadata

    View Slide

  13. What should
    we start
    thinking about
    it?

    View Slide

  14. Streamline Data Creation
    Process

    View Slide

  15. Shift Left

    View Slide

  16. Data Contract as a Code

    View Slide

  17. Headless Data Catalog

    View Slide

  18. https://schemata.app
    https://www.linkedin.com/in/ananthdurai
    [email protected]

    View Slide