Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Big Data on GoogleCloud

Big Data on GoogleCloud

Avatar for Kenneth Kinyanjui

Kenneth Kinyanjui

March 13, 2018
Tweet

More Decks by Kenneth Kinyanjui

Other Decks in Programming

Transcript

  1. Problem Solver ❖ Jr Software Engineer ❖ Jr Linux System

    Admin ❖ Rookie Entrepreneur ❖ Sr Engineer - Backend & Infrastructure ❖ Design Sprint Master ❖ Product Manager ❖ Still can’t get out of writing code ...
  2. This talk will tell help you know... ❖ What Big

    Data is? ❖ Why we should care about Big Data? ❖ When are we ready ? ❖ How we can leverage on Big Data with ease?
  3. Well it can be summarized in the 5vs ❖ Volume

    - Data Size ❖ Variety - Different forms of data sources ❖ Veracity - Uncertainty of Data ❖ Value - Impact ❖ Velocity - Speed of Change
  4. Hold on you need ... ❖ Data Warehouse ❖ Data

    Processing ❖ Data Streaming ❖ Highly Scalable Database ❖ Visualization ❖ Analyze the Data
  5. Stages of Companies/Startups Domain Early Stage High Growth Late Stage

    Users A few hundred to thousand Thousands to millions Millions to Billions Team <20 <1000 >1000 X-Functional Team Few Many Many Data Size GigaByte GigaByte-TeraByte TeraByte -PetaByte Agility Yes Yes Not necessarily Flexibility Not necessarily Yes Yes
  6. Ask yourself ❖ Which stage of the company life cycle

    am I ? ❖ How is my data collection process? ❖ Are we making decisions based on Data ? ❖ Is Experimentation and Validation at the core of our business? ❖ Is my data accessible ? ❖ Is my data clean ? ❖ Can we confidently say we are a data driven company?
  7. Keep in mind ... Its 3 Simple things ❖ Data

    Collection ❖ Data Analysis ❖ Data Visualization
  8. Data Analysis ❖ Easily Access your data ❖ Clean it

    ❖ Some Crunching of Data ❖ Some ML/AI