Upgrade to PRO for Only $50/Year—Limited-Time Offer! 🔥
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Atelier Datalab - volet technique
Search
Providenz - Laurent Paoletti
September 29, 2014
Technology
0
76
Atelier Datalab - volet technique
Stockage, analyse, visualisation de données et machine learning
Providenz - Laurent Paoletti
September 29, 2014
Tweet
Share
More Decks by Providenz - Laurent Paoletti
See All by Providenz - Laurent Paoletti
Introduction au machine learning
providenz
0
200
Des builds front plus rapides
providenz
0
49
Back to front
providenz
0
150
Machine Learning for the rest of us
providenz
1
190
Brunch, le builder pour les developpeurs pressés
providenz
0
160
Postgresql la plateforme de vos données
providenz
0
260
Performance web (Brown bag lunch)
providenz
0
42
Montée en charge
providenz
0
39
Présentation de django
providenz
0
44
Other Decks in Technology
See All in Technology
コミューンのデータ分析AIエージェント「Community Sage」の紹介
fufufukakaka
0
440
エンジニアとPMのドメイン知識の溝をなくす、 AIネイティブな開発プロセス
applism118
4
970
[JAWS-UG 横浜支部 #91]DevOps Agent vs CloudWatch Investigations -比較と実践-
sh_fk2
1
240
第4回 「メタデータ通り」 リアル開催
datayokocho
0
120
直接メモリアクセス
koba789
0
280
グレートファイアウォールを自宅に建てよう
ctes091x
0
140
ML PM Talk #1 - ML PMの分類に関する考察
lycorptech_jp
PRO
1
730
Haskell を武器にして挑む競技プログラミング ─ 操作的思考から意味モデル思考へ
naoya
2
750
品質のための共通認識
kakehashi
PRO
3
220
AI時代におけるアジャイル開発について
polyscape_inc
0
130
re:Invent 2025 ふりかえり 生成AI版
takaakikakei
1
180
eBPFとwaruiBPF
sat
PRO
4
2.5k
Featured
See All Featured
Fashionably flexible responsive web design (full day workshop)
malarkey
407
66k
Building Flexible Design Systems
yeseniaperezcruz
330
39k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
659
61k
Bash Introduction
62gerente
615
210k
Thoughts on Productivity
jonyablonski
73
5k
Designing for humans not robots
tammielis
254
26k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
162
15k
Reflections from 52 weeks, 52 projects
jeffersonlam
355
21k
Java REST API Framework Comparison - PWX 2021
mraible
34
9k
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
390
Fireside Chat
paigeccino
41
3.7k
The MySQL Ecosystem @ GitHub 2015
samlambert
251
13k
Transcript
DATALAB l ’atelier Laurent Paoletti @providenz TVT - 29 septembre
2014
DATA BIG DATA DATASCIENCE définitions
VOLUME VÉLOCITÉ VARIÉTÉ COMPLEXITÉ critères
DONNÉES STRUCTURÉES SEMI-STRUCTURÉES NON STRUCTURÉES typologie
TEXTE HORODATEES GÉOGRAPHIQUES SCIENCE - FINANCE LOGS GRAPHE IMAGE/SON/VIDEO typologie
OPENDATA SERVICES - API ORGANIQUE CROWDSOURCING OBJETS CONNECTÉS ACHAT SCRAPING
- EXTRACTION sources
sources - api
HOME SERVEUR(S) CLOUD CUSTOM ! GPU FPGA plateformes -infrastructure
FICHIERS excel csv hdf5 plateformes -persistance
DB RELATIONELLES ! MYSQL POSTGRESQL SQLSERVER, ORACLE plateformes -persistance
SIG:POSTGIS plateformes -persistance
GRAPHES: NEO4J plateformes -persistance
RECHERCHE : ELASTICSEARCH plateformes -persistance
HADOOP SPARK HBASE plateformes -persistance
MAP-REDUCE plateformes -persistance
EXTRACTION NETTOYAGE ETL analyse - préparation
FILTRAGE TRANSFORMATION STATISTIQUES analyse
R SQL PYTHON OPENREFINE analyse - outils
« capacité qu’on donne à une machine d’ingérer des données
à apprendre et de s’enrichir grâce à son expérience » machine learning
machine learning ANTI-SPAM RECOMMANDATIONS SCORING OPTIMISATION DE PRIX IDENTIFICATION
TRAINING DATA machine learning 101
machine learning 101
machine learning 101 setosa
machine learning 101
machine learning 101 DATASET MODELE DATA PREDICTION apprentissage humain
« For a long time, we thought that Tamoxifen was
roughly 80% effective for breast cancer patients. But now we know much more: we know that it’s 100% effective in 70% to 80% of the patients, and ineffective in the rest. » ! machine learning 101
machine learning regression classification !
machine learning - outils R JAVA PYTHON SAAS ! !
visualisation http://flowingdata.com/page/2/
http://www.brightpointinc.com/interactive/political_influence/index.html?source=d3js WEB visualisation
http://www.brightpointinc.com/interactive/political_influence/index.html?source=d3js visualisation
EXCEL - GNUPLOT PYTHON - MATPLOTLIB WEB - D3.JS !
! visualisation - outils
Général: http://www.oreilly.com/data/ Pandas: http://pandas.pydata.org/ R: http://www.r-project.org/ Python: https://www.python.org/ Machine learning:
http://scikit-learn.org/ Openrefine: http://openrefine.org/ Postgis: http://postgis.net/ Elasticsearch: http://www.elasticsearch.org/ Hadoop: http://hadoop.apache.org/ Spark: https://spark.apache.org/ Hbase: http://hbase.apache.org/ D3: http://d3js.org/ Bigml: https://bigml.com/ Prediction API: https://cloud.google.com/prediction/?hl=fr ressources
merci Laurent Paoletti @providenz TVT - 29 septembre 2014