Greg Goltsov
March 08, 2017
Seeing at the Speed of Thought: Empowering Others Through Data Exploration
Talk I gave at Big Data Visualisation Sydney 2017
Transcript
Seeing at the speed of thought: Empowering others through data exploration
Greg Goltsov, Senior Data Engineer
@gregoltsov
www.gregory.goltsov.info (will have link to slides)
Seeing at the speed of thought. Empowering others through data exploration: yourself, your team, your company.
Touch Surgery: Built marketing/sales dashboards for Fortune 10 companies. Built educational dashboards for 4 of the top 10 world-rated medical universities. All from scratch.
Appear Here: World’s biggest online marketplace for retail spaces. Internal recommendation system. Highly visual debug interface for non-tech people.
Southern Cross Austereo: Modernising the data pipeline. Spearheading data-driven culture throughout the company. Datasets covering 80% of Australians weekly.
BI/DW tools
Remove barriers. Make feedback fast. Remove yourself.
Remove barriers
Remove barriers: Catalogued datasets with one-line import in Python vs. messy dataset in PDFs
Remove barriers: Dashboard with right filters, Excel export vs. “Can you run a query?”
Remove barriers. Foster curiosity.
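One way to picture "catalogued datasets with one-line import in Python" is a small registry that maps a dataset name to a loader. The following is only a minimal sketch: the talk does not show its implementation, so the `CATALOG` dictionary, the `load` and `list_datasets` helpers, and the sample rows are all hypothetical.

```python
import csv
import io

# Made-up rows standing in for a real source (database, S3 bucket, API, ...).
_SAMPLE_CSV = "station,listeners\nA,120\nB,95\n"


def _load_sample():
    """Parse the in-memory CSV into a list of dicts with typed values."""
    rows = csv.DictReader(io.StringIO(_SAMPLE_CSV))
    return [{"station": r["station"], "listeners": int(r["listeners"])} for r in rows]


# Hypothetical catalogue: dataset name -> (description, loader function).
CATALOG = {
    "sample_listeners": ("Made-up listener counts per station", _load_sample),
}


def list_datasets():
    """Return names and descriptions so people can browse what exists."""
    return {name: desc for name, (desc, _) in CATALOG.items()}


def load(name):
    """The 'one-line import': look the dataset up by name and load it."""
    try:
        _, loader = CATALOG[name]
    except KeyError:
        raise KeyError(f"Unknown dataset {name!r}; try one of {sorted(CATALOG)}") from None
    return loader()


data = load("sample_listeners")  # the one line an analyst has to write
```

The point of the design is that the analyst never needs to know where or how a dataset is stored; adding a dataset means adding one catalogue entry.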
Make feedback fast
Make feedback fast: Found a new trend via tinkering vs. “Tomorrow I’ll see results of the batch job”
Make feedback fast: “Check the dash in 15 mins” vs. “I put your request into the backlog”
Make feedback fast. Let people tinker.
Remove yourself
Remove yourself: Data pipeline + products vs. ad-hoc
Remove yourself. Don’t stand in the way.
Remove barriers. Make feedback fast. Remove yourself.
“The goal is to turn data into information, and information into insight.” – Carly Fiorina, former HP CEO
Data → Information → Insight (value ↑ towards insight, abundance towards data)
Example: Logs → Access pattern → Fraud
Example: Tweets → MOM trends → Key influencers
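The "Logs → Access pattern → Fraud" example can be sketched in a few lines. This is an illustration under stated assumptions, not a real fraud detector: the log format, the user names, and the threshold are all made up.

```python
from collections import Counter

# Data: raw log lines (made-up sample; the "user action" format is an assumption).
logs = [
    "alice login", "alice view", "bob login",
    "mallory login", "mallory login", "mallory login", "mallory login",
]

# Information: an access pattern, here the number of logins per user.
logins = Counter(line.split()[0] for line in logs if line.endswith(" login"))

# Insight: flag accounts whose login count exceeds an arbitrary threshold.
THRESHOLD = 3
suspicious = sorted(user for user, count in logins.items() if count > THRESHOLD)
```

Each step throws away volume and adds meaning: seven log lines become three counts, which become one name worth a closer look.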
Ad-hoc queries: Fast to develop. Every query gets thrown away after.
Data pipeline: Upfront investment. Every integration builds foundations.
Visualise your ETL. Augment your Data Warehouses with Data Lakes.
Sources → Extract → Transform → Load → Data Warehouse (time from data to insight)
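The Extract, Transform, Load flow above can be sketched with the standard library alone. This is a minimal illustration, assuming a made-up CSV source and an in-memory SQLite database standing in for the data warehouse; the table name `sales` and all values are hypothetical.

```python
import csv
import io
import sqlite3

# Source: a made-up CSV export standing in for an upstream system.
SOURCE = "date,amount\n2017-03-01, 10.50\n2017-03-01,4.50\n2017-03-02,n/a\n"


def extract(text):
    """Extract: read raw rows from the source as dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: coerce types and drop rows that fail validation."""
    clean = []
    for row in rows:
        try:
            clean.append((row["date"], float(row["amount"])))
        except ValueError:
            continue  # skip unparseable amounts such as "n/a"
    return clean


def load(conn, rows):
    """Load: write the cleaned rows into a fixed-schema warehouse table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (date TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")  # in-memory stand-in for a data warehouse
load(conn, transform(extract(SOURCE)))
totals = conn.execute(
    "SELECT date, SUM(amount) FROM sales GROUP BY date ORDER BY date"
).fetchall()
```

Note that the schema is fixed before any data is loaded, so the row with an unparseable amount never reaches the warehouse. That upfront decision is what costs time between data and insight.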
Volume, Variety, Velocity (“3D Data Management: Controlling Data Volume, Velocity and Variety”, Gartner Inc. 2001)
~80% of all data is unstructured (“Analysis of Unstructured Data: Applications of Text Analytics and Sentiment Mining”)
~80% of your data is unstructured
“Making sense of unstructured data isn’t about technology, it’s a business challenge” (http://www.ft.com/cms/s/0/de15414e-ebad-11e1-985a-00144feab49a.html#axzz2F3CM6G7g)
Aberdeen Group research:
Happy with the ability to share data: 18% (don’t use unstructured data) vs. 60% (use unstructured data)
Pleased with the accessibility: 20% (don’t use unstructured data) vs. 50% (use unstructured data)
Volume, Variety, Velocity, Machine learning (“3D Data Management: Controlling Data Volume, Velocity and Variety”, Gartner Inc. 2001)
Ingest quickly. Real-time schema-on-read exploration. Push vetted insights into DW/BI. Example: Spark, AWS Athena, Microsoft’s PowerBI
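Schema-on-read means the raw data is stored as it arrived and a schema is applied only when someone queries it. The sketch below illustrates the idea in plain Python rather than Spark or Athena; the `read_with_schema` helper, the field names, and the sample events are all hypothetical.

```python
import json

# Raw events are stored exactly as they arrived (made-up JSON lines);
# note that the fields differ from record to record.
RAW = """\
{"user": "a", "ts": "2017-03-01T10:00:00", "plays": 3}
{"user": "b", "ts": "2017-03-01T10:05:00", "plays": "7", "device": "ios"}
{"user": "c", "ts": "2017-03-01T10:09:00"}
"""


def read_with_schema(raw, schema):
    """Apply a schema at read time: pick the fields, cast, fill defaults.

    `schema` maps field name -> (cast function, default value).
    """
    for line in raw.splitlines():
        record = json.loads(line)
        yield {
            field: cast(record[field]) if field in record else default
            for field, (cast, default) in schema.items()
        }


# Two different views over the same stored data; no re-ingestion needed.
plays_schema = {"user": (str, None), "plays": (int, 0)}
device_schema = {"user": (str, None), "device": (str, "unknown")}

plays = list(read_with_schema(RAW, plays_schema))
devices = list(read_with_schema(RAW, device_schema))
total_plays = sum(row["plays"] for row in plays)
```

Because nothing is rejected at ingest time, a new question only needs a new schema, which is what keeps exploration fast; once a view proves useful it can be pushed into the warehouse with a fixed schema.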
Sources → Collect → Store → Process/Analyse → Data Warehouse (timeline: data, insight, insight)
Collect → Store → Process/Analyse
Look at data. A lot. (http://www.forbes.com/sites/gilpress/2016/03/23/data-preparation-most-time-consuming-least-enjoyable-data-science-task-survey-says)
Scale computation and storage separately.
Go from non-trivial data to dashboard in minutes.
Spark is 20-100x faster than MapReduce.
Turnkey solution: www.databricks.com
OSS: Apache Zeppelin on AWS EMR Spark
We made it! Now what? Human scale.
AirBnB Scaling Tribal Knowledge
THANK YOU
Speaker Name: Greg Goltsov
Email: [email protected]
Organized by
UNICOM Trainings & Seminars Pvt. Ltd.
[email protected]
http://www.unicomlearning.com/2017/Big_Data_Visualization_Summit_Sydney