Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
PyCon India - Commodity Machine Learning; past,...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Andreas Mueller
September 25, 2016
2.7k
0
Share
PyCon India - Commodity Machine Learning; past, present and future
PyCon India 2016 keynote
Andreas Mueller
September 25, 2016
More Decks by Andreas Mueller
See All by Andreas Mueller
Automating Machine Learning
amueller
4
1.2k
Engineering Scikit-Learn V2
amueller
0
300
Advanced Machine Learning with Scikit-Learn for Pycon Amsterdam
amueller
0
300
Scikit-learn: New project features in 0.17
amueller
0
140
Bootstrapping machine learning
amueller
0
150
PyData Berlin 2014 Keynote: Commodity machine learnin
amueller
0
190
Advanced Machine Learning with Scikit-Learn
amueller
1
760
Machine Learning With Scikit-Learn ODSC SF 2015
amueller
4
1.8k
Machine Learning With Scikit-Learn - Pydata Strata NYC 2015
amueller
1
3k
Featured
See All Featured
The agentic SEO stack - context over prompts
schlessera
0
770
For a Future-Friendly Web
brad_frost
183
10k
Applied NLP in the Age of Generative AI
inesmontani
PRO
4
2.2k
Art, The Web, and Tiny UX
lynnandtonic
304
21k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.4k
Jess Joyce - The Pitfalls of Following Frameworks
techseoconnect
PRO
1
150
Large-scale JavaScript Application Architecture
addyosmani
515
110k
Become a Pro
speakerdeck
PRO
31
5.9k
StorybookのUI Testing Handbookを読んだ
zakiyama
31
6.7k
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
560
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
4.3k
エンジニアに許された特別な時間の終わり
watany
106
240k
Transcript
Commodity Machine Learning Past, present and future Andreas Mueller
What is machine learning?
Automatic Decision Making Spam? Yes No
Spam? Yes No
Programming Machine Learning
Machine learning is EVERYWHERE
None
None
None
Science Engineering Medicine ...
Commodity machine learning
past
+
None
dawn of open source tools...
The age of shell
Documentation? Testing?
Scikit-learn: User centric machine learning
.fit(X, y) .predict(X) .transform(X)
present
Choose your ecosystem.
Open! Documented! Tested!
Usability is key!
ML Frameworks PyMC, Edward, Stan theano, tensorflow, keras
None
from sklearn.model_selection import GridSearchCV from sklearn.pipeline import Pipeline
github.com/scikitlearncontrib/scikitlearncontrib
(near) Future
pip install scikitlearn==0.18rc2 0.18 for the release candidate:
sklearn.cross_validation sklearn.grid_search sklearn.learning_curve sklearn.model_selection
results = pd.DataFrame(grid_search.results_)
labels → groups n_folds → n_splits
from sklearn.cross_validation import KFold cv = KFold(n_samples, n_folds) for train,
test in cv: ... from sklearn.model_selection import KFold cv = KFold(n_folds) for train, test in cv.split(X, y): ...
from sklearn.mixture import GaussianMixture from sklearn.mixture import BayesianGaussianMixture
PCA() RandomizedPCA() PCA()
Gaussian Process Rewrite
Isolation Forests
Play from sklearn.neural_network import MLPClassifier Work import keras
pipe = Pipeline([('preprocessing', StandardScaler()), ('classifier', SVC())]) param_grid = {'preprocessing': [StandardScaler(),
None]} grid = GridSearchCV(pipe, param_grid)
40
(further) Future
Feature / Column names
from __future__ import sklearn.plotting
from __future__ import AutoClassifier
More Transparency
amueller.github.io @amuellerml @amueller
[email protected]