Datalab Workshop - technical part
Providenz - Laurent Paoletti
September 29, 2014
Data storage, analysis, visualization, and machine learning
Transcript
DATALAB: the workshop. Laurent Paoletti (@providenz), TVT, September 29, 2014
Definitions: data, big data, data science
Criteria: volume, velocity, variety, complexity
Typology: structured, semi-structured, and unstructured data
Typology: text, timestamped, geographic, science/finance, logs, graph, image/sound/video
Sources: open data, services/APIs, organic, crowdsourcing, connected objects, purchase, scraping/extraction
Sources (API)
Platforms (infrastructure): home, server(s), cloud, custom (GPU, FPGA)
Platforms (persistence): files (Excel, CSV, HDF5)
Platforms (persistence): relational databases (MySQL, PostgreSQL, SQL Server, Oracle)
Platforms (persistence): GIS (PostGIS)
Platforms (persistence): graphs (Neo4j)
Platforms (persistence): search (Elasticsearch)
Platforms (persistence): Hadoop, Spark, HBase
Platforms (persistence): MapReduce
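As a rough illustration of the MapReduce idea named on this slide, here is a minimal single-process sketch in plain Python. It does not use Hadoop; it only mimics the map, shuffle, and reduce phases on a word count. The function names (`map_words`, `shuffle`, `reduce_counts`, `word_count`) and the sample documents are hypothetical and do not come from the talk.

```python
from collections import defaultdict


def map_words(document):
    """Map phase: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle(pairs):
    """Shuffle phase: group all emitted values by key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_counts(groups):
    """Reduce phase: sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(documents):
    """Run the three phases in sequence on a list of documents."""
    pairs = [pair for doc in documents for pair in map_words(doc)]
    return reduce_counts(shuffle(pairs))


if __name__ == "__main__":
    docs = ["big data", "open data", "data science"]  # made-up sample input
    print(word_count(docs))
```

In a real cluster the map and reduce calls would run on different machines and the shuffle would move data over the network; here everything runs in one process so the structure is easy to follow.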
Analysis (preparation): extraction, cleaning, ETL
Analysis: filtering, transformation, statistics
Analysis (tools): R, SQL, Python, OpenRefine
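The preparation and analysis steps listed on these slides (extraction, cleaning, filtering, transformation, statistics) can be sketched with the Python standard library alone. This is only an assumed, minimal example: the CSV content, the column names, and the helper names (`load_rows`, `clean`, `mean_by_city`) are invented for illustration and are not taken from the talk.

```python
import csv
import io
import statistics

# Made-up sample data, standing in for a real CSV export.
RAW_CSV = """city,temperature_c
Toulon,21.5
Toulon,
Marseille,19.0
Toulon,23.5
Marseille,not available
"""


def load_rows(text):
    """Extraction: parse CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Cleaning and filtering: keep rows whose temperature is a number."""
    cleaned = []
    for row in rows:
        try:
            value = float(row["temperature_c"])
        except (TypeError, ValueError):
            continue  # skip empty or non-numeric values
        cleaned.append({"city": row["city"], "temperature_c": value})
    return cleaned


def mean_by_city(rows):
    """Transformation and statistics: average temperature per city."""
    by_city = {}
    for row in rows:
        by_city.setdefault(row["city"], []).append(row["temperature_c"])
    return {city: statistics.mean(values) for city, values in by_city.items()}


if __name__ == "__main__":
    print(mean_by_city(clean(load_rows(RAW_CSV))))
```

The same pipeline could be written in R, SQL, or with a tool such as OpenRefine; the standard library is used here only to keep the sketch self-contained.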
Machine learning: "the ability given to a machine to ingest data, to learn, and to improve through its experience"
Machine learning: anti-spam, recommendations, scoring, price optimization, identification
Machine learning 101: training data
Machine learning 101: setosa
Machine learning 101: dataset, model, data, prediction, human learning
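The dataset, model, data, prediction flow on this slide can be illustrated with a very small classifier. The sketch below uses a nearest-centroid rule, which is a technique chosen here for brevity and not necessarily the one shown in the talk. The measurements are invented values (they are not the real Iris dataset), and the names `train`, `predict`, and `TRAINING_DATA` are hypothetical.

```python
# Invented (petal length, petal width) pairs with a label; not real Iris data.
TRAINING_DATA = [
    ((1.4, 0.2), "setosa"),
    ((1.5, 0.3), "setosa"),
    ((4.5, 1.5), "versicolor"),
    ((4.7, 1.4), "versicolor"),
]


def train(dataset):
    """Dataset -> model: compute the mean feature vector of each label."""
    sums, counts = {}, {}
    for features, label in dataset:
        acc = sums.setdefault(label, [0.0] * len(features))
        for i, value in enumerate(features):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def squared_distance(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def predict(model, features):
    """Data -> prediction: return the label with the closest centroid."""
    return min(model, key=lambda label: squared_distance(model[label], features))


if __name__ == "__main__":
    model = train(TRAINING_DATA)
    print(predict(model, (1.3, 0.2)))
    print(predict(model, (4.4, 1.3)))
```

Training happens once on labeled examples; prediction then applies the stored model to new, unlabeled measurements.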
Machine learning 101: "For a long time, we thought that Tamoxifen was roughly 80% effective for breast cancer patients. But now we know much more: we know that it's 100% effective in 70% to 80% of the patients, and ineffective in the rest."
Machine learning: regression, classification
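Regression predicts a number, whereas classification predicts a category. As a minimal sketch of regression, the code below fits a straight line by ordinary least squares in plain Python. The function names (`fit_line`, `predict_value`) and the sample points are assumptions made for this example only.

```python
def fit_line(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict_value(slope, intercept, x):
    """Apply the fitted line to a new input value."""
    return slope * x + intercept


if __name__ == "__main__":
    # Made-up points that lie exactly on y = 2x + 1.
    slope, intercept = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
    print(slope, intercept, predict_value(slope, intercept, 5))
```

With noisy real data the points would not lie exactly on a line, and the fitted slope and intercept would be the values that minimize the squared prediction error.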
Machine learning (tools): R, Java, Python, SaaS
Visualization: http://flowingdata.com/page/2/
Visualization (web): http://www.brightpointinc.com/interactive/political_influence/index.html?source=d3js
Visualization (tools): Excel / gnuplot, Python / Matplotlib, web / D3.js
Resources:
General: http://www.oreilly.com/data/
Pandas: http://pandas.pydata.org/
R: http://www.r-project.org/
Python: https://www.python.org/
Machine learning: http://scikit-learn.org/
OpenRefine: http://openrefine.org/
PostGIS: http://postgis.net/
Elasticsearch: http://www.elasticsearch.org/
Hadoop: http://hadoop.apache.org/
Spark: https://spark.apache.org/
HBase: http://hbase.apache.org/
D3: http://d3js.org/
BigML: https://bigml.com/
Prediction API: https://cloud.google.com/prediction/?hl=fr
Thank you. Laurent Paoletti (@providenz), TVT, September 29, 2014