Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
2016 - Dillon Niederhut - What to do when your ...
Search
PyBay
September 25, 2016
0
76
2016 - Dillon Niederhut - What to do when your data is large, but not big
PyBay
September 25, 2016
Tweet
Share
More Decks by PyBay
See All by PyBay
2017 - The Packaging Gradient
pybay
2
950
2017 - Building Bridges: Stopping Python 2 without damages
pybay
0
670
2017 - Bringing Python 3 to LinkedIn
pybay
1
570
2017 - Python Debugging with PUDB
pybay
0
740
2017 - Opening up to Open Source
pybay
0
270
2017 - A Gentle Introduction to Text Classification with Deep Learning
pybay
2
200
2017 - Performant Asynchronous Programming at Quora
pybay
1
390
2017 - latus - a Personal Cloud Storage App written in Python
pybay
2
530
2017 - Everything You Ever Wanted to Know About Web Authentication in Python
pybay
3
650
Featured
See All Featured
Imperfection Machines: The Place of Print at Facebook
scottboms
269
13k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
116
20k
Done Done
chrislema
185
16k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.6k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.5k
Designing for Performance
lara
610
69k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
10
620
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.2k
Automating Front-end Workflow
addyosmani
1371
200k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
658
61k
The Power of CSS Pseudo Elements
geoffreycrofte
80
6k
Making the Leap to Tech Lead
cromwellryan
135
9.6k
Transcript
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
What to do when your data are large but not big Dillon Niederhut PyBay – the San Francisco Bay Area Python Conference 20 August 2016
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
about this talk • data at github.com/deniederhut/pybay 2016 • python libraries : celery, h5py, numpy, pandas, pymongo • other libraries : mongodb, rabbitmq, sqlite
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
about me • dlab.berkeley.edu • @DLabAtBerkeley
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
size concerns 1 1from xkcd
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
time concerns 2 2always relevant
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
code concerns 3 3thanks Randall!
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
sequential processing
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
parallel processing
Large data in python Dillon Niederhut Introduction Motivation Strategies Closing
contact • dillon.niederhut.us • @dillonniederhut