Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Big Data on GoogleCloud

Big Data on GoogleCloud

Kenneth Kinyanjui

March 13, 2018
Tweet

More Decks by Kenneth Kinyanjui

Other Decks in Programming

Transcript

  1. Problem Solver ❖ Jr Software Engineer ❖ Jr Linux System

    Admin ❖ Rookie Entrepreneur ❖ Sr Engineer - Backend & Infrastructure ❖ Design Sprint Master ❖ Product Manager ❖ Still can’t get out of writing code ...
  2. This talk will tell help you know... ❖ What Big

    Data is? ❖ Why we should care about Big Data? ❖ When are we ready ? ❖ How we can leverage on Big Data with ease?
  3. Well it can be summarized in the 5vs ❖ Volume

    - Data Size ❖ Variety - Different forms of data sources ❖ Veracity - Uncertainty of Data ❖ Value - Impact ❖ Velocity - Speed of Change
  4. Hold on you need ... ❖ Data Warehouse ❖ Data

    Processing ❖ Data Streaming ❖ Highly Scalable Database ❖ Visualization ❖ Analyze the Data
  5. Stages of Companies/Startups Domain Early Stage High Growth Late Stage

    Users A few hundred to thousand Thousands to millions Millions to Billions Team <20 <1000 >1000 X-Functional Team Few Many Many Data Size GigaByte GigaByte-TeraByte TeraByte -PetaByte Agility Yes Yes Not necessarily Flexibility Not necessarily Yes Yes
  6. Ask yourself ❖ Which stage of the company life cycle

    am I ? ❖ How is my data collection process? ❖ Are we making decisions based on Data ? ❖ Is Experimentation and Validation at the core of our business? ❖ Is my data accessible ? ❖ Is my data clean ? ❖ Can we confidently say we are a data driven company?
  7. Keep in mind ... Its 3 Simple things ❖ Data

    Collection ❖ Data Analysis ❖ Data Visualization
  8. Data Analysis ❖ Easily Access your data ❖ Clean it

    ❖ Some Crunching of Data ❖ Some ML/AI