Upgrade to Pro — share decks privately, control downloads, hide ads and more …

How to scale Data Ops

How to scale Data Ops

Data Engineering roles have increased 10x in the past 3 years.
Their role is to provide all the teams in a company with fresh, reliable, accurate data.

Despite that, very few fundamental changes have emerged in the ways we scale Data Ops. Companies today are faced with either dealing with fragmented data, or hiring more data engineers. But it doesn't have to be that way.

This deck talks about why you need data engineering, what should be your strategy and focus, the newer ways to integrate your different data silos, where to start your data consolidation efforts, how to scale processes data engineering teams, and what it will bring to your business in the end.

Romain Dardour

May 29, 2018
Tweet

Other Decks in Technology

Transcript

  1. - Co-founder, CEO @ Hull.io - Product guy - Advisor,

    Techstars Mentor - Always tried to unify all the things
 github.com/unity
 First company name... Unity ¯\_(ツ)_/¯ I’m Romain Dardour
  2. “The data you need
 to work in one tool, Is

    usually in another tool” — Romain Dardour
  3. 1900-1970 Factory Era
 The one who produces the most wins

    1970-2000 Mass Marketing Era
 The one who shouts the loudest wins 2000-2015 Online Marketing Era
 The one with the biggest online presence wins 2015+ Personalized Marketing Era
 The one with the best usage of data wins
  4. → Attribution → Team Alignment → Reporting → Personalization →

    Lead Qualification → Customer success → Product Qualified Leads → Growth Experiments → GDPR (Compliance)
  5. Data engineer ≠ Data Scientist Data Scientist Cleans, massages, organizes

    (big) data, performs statistics and analysis to develop insights, build models, find patterns, tell stories to stakeholders Data Engineer Develops, constructs, Tests, maintains architectures and large-scale processing systems - Acquire Data - Marry systems together - Recommend ways to Improve Data reliability
  6. Growth Engineers, BizOps Teams, Demand Gen Teams - Transversal, Horizontal,

    Growth mindset - All purpose enablers - “Getting things done” focus Skills
 SQL & NoSQL, APIs, Python, Ruby, Javascript, Database Systems,
 Data Modeling & ETL Tools, MapReduce, Data warehousing.
  7. What matters is how customer data is used What experiences

    can you create from your tools, teams & data?
  8. Step 0. Get in a room together. No dial-ins. Everyone

    around a whiteboard. Morning or whole day
  9. How could I send an email like this from Intercom?

    Hi Romain, I noticed you hadn’t got your ticket for Inbound.org for this year. Are you thinking of coming? We’ve a focus this year on in-house speakers, particularly from SaaS. Have you met Hana Abaza from Shopify? You know the deal. 3-days. Resort. Justin’s karaoke. And the weather in London’s really not great in September, so.. Let me know! Darmesh
  10. Payments Industry Location Hi Romain, I noticed you hadn’t got

    your ticket for Inbound.org for this year. Are you thinking of coming? We’ve a focus this year on in-house speakers, particularly from SaaS. Have you met Hana Abaza from Shopify? You know the deal. 3-days. Resort. Justin’s karaoke. And the weather in London’s really not great in September, so.. Let me know! Darmesh Friends
  11. #demo_requests Hana Abaza Head of Marketing Toronto $ Fit: Very

    Good Signals: Data Warehouse, Salesforce, $SHOP Matching tech: 8 Blog Reader for 2 months 
 Interested in data warehousing Viewed pricing 3 times this week 1 How could I send a notification like this from Slack?
  12. #demo_requests Hana Abaza Head of Marketing Toronto $ Fit: Very

    Good Signals: Data Warehouse, Salesforce, $SHOP Matching tech: 8 Blog Reader for 2 months 
 Interested in data warehousing Viewed pricing 3 times this week 1 Tech stack Website Scoring
  13. Incomplete Sources Inaccurate Sources Limited APIs Duplicate Items Legacy Data

    “Impedence Mismatch” between objects
 (i.e Accounts, Companies, Organizations…)
  14. Map all key profiles & profile data Contact Name Email

    Lead status Job title Location ...
  15. For the original source fields, we'd like the highest ranking

    source & details on the oldest date that any of these events occurred. If two events of the same rank occur, we want the earliest in the day. For the latest source fields, we'd like the highest ranking source & details on the most recent date that any of these events occurred. If two events of the same rank occur, we want the last in the day. Everything has the same logic on the account level, but we of course want the oldest source across all users on that account for the original source and the latest across all for the latest.
  16. 1

  17. What’s important isn’t what your data stack
 looks like. It’s

    how connected it is,
 and how easy it is to iterate on it. Everything is a loop around the customer profile.