Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Seeing at the Speed of Thought: Empowering Others Through Data Exploration

Seeing at the Speed of Thought: Empowering Others Through Data Exploration

Talk I gave at Big Data Visualisation Sydney 2017

Greg Goltsov

March 08, 2017
Tweet

More Decks by Greg Goltsov

Other Decks in Programming

Transcript

  1. Seeing at the speed of thought Empowering others through data

    exploration Greg Goltsov Senior Data Engineer @gregoltsov www.gregory.goltsov.info (will have link to slides)
  2. Seeing at the speed of thought Empowering others through data

    exploration yourself your team your company
  3. Touch Surgery Built marketing/sales dashboards for Fortune 10 companies Built

    educational dashboards for 4 of the top 10 world-rated medical universities All from scratch
  4. Appear Here World’s biggest online marketplace for retail spaces Internal

    recommendation system Highly visual debug interface for non-tech people
  5. Southern Cross Austereo Modernising the data pipeline Spearheading data-driven culture

    throughout the company Datasets covering 80% Australians weekly
  6. Make feedback fast “Check the dash in 15 mins” “I

    put your request into the backlog”
  7. The goal is to turn data into information, and information

    into insight. – Carly Fiorina, former HP CEO
  8. Ad-hoc queries Data pipeline Fast to develop Every query gets

    thrown away after Upfront investment Every integration builds foundations
  9. Aberdeen Group research Don’t use unstructured data Use unstructured data

    Happy with the ability to share data 18% 60% Pleased with the accessibility 20% 50%
  10. Volume Variety Velocity Machine learning "3D Data Management: Controlling Data

    Volume, Velocity and Variety”, Gartner Inc. 2001
  11. Ingest quickly Real-time schema-on- read exploration Push vetted insights into

    DW/BI Example: Spark, AWS Athena, Microsoft’s PowerBI
  12. Scale computation and storage separately Go from non-trivial data to

    dashboard in minutes Spark is 20-100x faster than MapReduce Turnkey solution: www.databricks.com OSS: Apache Zeppelin on AWS EMR Spark
  13. THANK YOU Speaker Name: Greg Goltsov Email: [email protected] Organized by

    UNICOM Trainings & Seminars Pvt. Ltd.
 [email protected] http://www.unicomlearning.com/2017/Big_Data_Visualization_Summit_Sydney