2016 - Dillon Niederhut - What to do when your data is large, but not big
PyBay
September 25, 2016
Transcript
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
What to do when your data are large but not big
Dillon Niederhut
PyBay – the San Francisco Bay Area Python Conference
20 August 2016
about this talk
• data at github.com/deniederhut/pybay 2016
• python libraries: celery, h5py, numpy, pandas, pymongo
• other libraries: mongodb, rabbitmq, sqlite
about me
• dlab.berkeley.edu
• @DLabAtBerkeley
size concerns (from xkcd)
time concerns (always relevant)
code concerns (thanks Randall!)
sequential processing
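The slide text does not survive in this transcript, so what follows is only an illustrative sketch of one common sequential strategy: reading rows in fixed-size chunks and keeping a running aggregate, so the full dataset never has to sit in memory. The helper names (`iter_chunks`, `running_mean`) and the sample data are hypothetical, and the standard library is used here in place of the pandas and h5py tools the talk lists.

```python
import csv
import io
from itertools import islice


def iter_chunks(rows, chunk_size):
    """Yield lists of at most chunk_size rows from any iterable of rows."""
    iterator = iter(rows)
    while True:
        chunk = list(islice(iterator, chunk_size))
        if not chunk:
            return
        yield chunk


def running_mean(rows, column, chunk_size=1000):
    """Compute the mean of one numeric column, one chunk at a time."""
    total = 0.0
    count = 0
    for chunk in iter_chunks(rows, chunk_size):
        total += sum(float(row[column]) for row in chunk)
        count += len(chunk)
    return total / count if count else float("nan")


# Hypothetical data: an in-memory stand-in for a file too large to load at once.
sample = io.StringIO("value\n1\n2\n3\n4\n")
mean_value = running_mean(csv.DictReader(sample), "value", chunk_size=2)
```

Only one chunk is held in memory at a time, so peak memory depends on `chunk_size` rather than on the size of the file.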
parallel processing
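Again as an illustrative sketch rather than the talk's own code: one way to parallelize is to split the data into parts, hand each part to a worker, and combine the partial results. The talk lists celery and rabbitmq for this; the example below swaps in the standard library's `concurrent.futures` so it runs without a broker. The function names are hypothetical. A thread pool is shown, which suits I/O-bound work; for CPU-bound work `ProcessPoolExecutor` offers the same `map` interface.

```python
from concurrent.futures import ThreadPoolExecutor


def split(values, n_parts):
    """Split a list into n_parts contiguous slices of near-equal size."""
    size, extra = divmod(len(values), n_parts)
    parts, start = [], 0
    for i in range(n_parts):
        stop = start + size + (1 if i < extra else 0)
        parts.append(values[start:stop])
        start = stop
    return parts


def partial_sum_of_squares(part):
    """Work done independently by each worker on its own slice."""
    return sum(x * x for x in part)


def parallel_sum_of_squares(values, n_workers=4):
    """Map the partial computation over slices, then reduce the results."""
    parts = split(values, n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        return sum(pool.map(partial_sum_of_squares, parts))


result = parallel_sum_of_squares(list(range(10)), n_workers=3)
```

The pattern works because the partial sums are independent of one another; aggregates that need to see every row at once would need a different decomposition.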
contact
• dillon.niederhut.us
• @dillonniederhut