Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Atelier Datalab - volet technique
Search
Providenz - Laurent Paoletti
September 29, 2014
Technology
0
76
Atelier Datalab - volet technique
Stockage, analyse, visualisation de données et machine learning
Providenz - Laurent Paoletti
September 29, 2014
Tweet
Share
More Decks by Providenz - Laurent Paoletti
See All by Providenz - Laurent Paoletti
Introduction au machine learning
providenz
0
210
Des builds front plus rapides
providenz
0
50
Back to front
providenz
0
150
Machine Learning for the rest of us
providenz
1
190
Brunch, le builder pour les developpeurs pressés
providenz
0
160
Postgresql la plateforme de vos données
providenz
0
270
Performance web (Brown bag lunch)
providenz
0
44
Montée en charge
providenz
0
47
Présentation de django
providenz
0
45
Other Decks in Technology
See All in Technology
超初心者からでも大丈夫!オープンソース半導体の楽しみ方〜今こそ!オレオレチップをつくろう〜
keropiyo
0
110
We Built for Predictability; The Workloads Didn’t Care
stahnma
0
140
Introduction to Sansan for Engineers / エンジニア向け会社紹介
sansan33
PRO
6
68k
AIと新時代を切り拓く。これからのSREとメルカリIBISの挑戦
0gm
1
2.8k
レガシー共有バッチ基盤への挑戦 - SREドリブンなリアーキテクチャリングの取り組み
tatsukoni
0
220
All About Sansan – for New Global Engineers
sansan33
PRO
1
1.4k
プロダクト成長を支える開発基盤とスケールに伴う課題
yuu26
4
1.3k
CDKで始めるTypeScript開発のススメ
tsukuboshi
1
480
Context Engineeringが企業で不可欠になる理由
hirosatogamo
PRO
3
620
AIエージェントに必要なのはデータではなく文脈だった/ai-agent-context-graph-mybest
jonnojun
0
100
セキュリティについて学ぶ会 / 2026 01 25 Takamatsu WordPress Meetup
rocketmartue
1
310
Agent Skils
dip_tech
PRO
0
120
Featured
See All Featured
Why Our Code Smells
bkeepers
PRO
340
58k
Bash Introduction
62gerente
615
210k
How to Think Like a Performance Engineer
csswizardry
28
2.4k
Building Applications with DynamoDB
mza
96
6.9k
HDC tutorial
michielstock
1
390
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
66
37k
Why Mistakes Are the Best Teachers: Turning Failure into a Pathway for Growth
auna
0
54
Building Better People: How to give real-time feedback that sticks.
wjessup
370
20k
Groundhog Day: Seeking Process in Gaming for Health
codingconduct
0
94
The AI Search Optimization Roadmap by Aleyda Solis
aleyda
1
5.2k
How to Align SEO within the Product Triangle To Get Buy-In & Support - #RIMC
aleyda
1
1.4k
How To Stay Up To Date on Web Technology
chriscoyier
791
250k
Transcript
DATALAB l ’atelier Laurent Paoletti @providenz TVT - 29 septembre
2014
DATA BIG DATA DATASCIENCE définitions
VOLUME VÉLOCITÉ VARIÉTÉ COMPLEXITÉ critères
DONNÉES STRUCTURÉES SEMI-STRUCTURÉES NON STRUCTURÉES typologie
TEXTE HORODATEES GÉOGRAPHIQUES SCIENCE - FINANCE LOGS GRAPHE IMAGE/SON/VIDEO typologie
OPENDATA SERVICES - API ORGANIQUE CROWDSOURCING OBJETS CONNECTÉS ACHAT SCRAPING
- EXTRACTION sources
sources - api
HOME SERVEUR(S) CLOUD CUSTOM ! GPU FPGA plateformes -infrastructure
FICHIERS excel csv hdf5 plateformes -persistance
DB RELATIONELLES ! MYSQL POSTGRESQL SQLSERVER, ORACLE plateformes -persistance
SIG:POSTGIS plateformes -persistance
GRAPHES: NEO4J plateformes -persistance
RECHERCHE : ELASTICSEARCH plateformes -persistance
HADOOP SPARK HBASE plateformes -persistance
MAP-REDUCE plateformes -persistance
EXTRACTION NETTOYAGE ETL analyse - préparation
FILTRAGE TRANSFORMATION STATISTIQUES analyse
R SQL PYTHON OPENREFINE analyse - outils
« capacité qu’on donne à une machine d’ingérer des données
à apprendre et de s’enrichir grâce à son expérience » machine learning
machine learning ANTI-SPAM RECOMMANDATIONS SCORING OPTIMISATION DE PRIX IDENTIFICATION
TRAINING DATA machine learning 101
machine learning 101
machine learning 101 setosa
machine learning 101
machine learning 101 DATASET MODELE DATA PREDICTION apprentissage humain
« For a long time, we thought that Tamoxifen was
roughly 80% effective for breast cancer patients. But now we know much more: we know that it’s 100% effective in 70% to 80% of the patients, and ineffective in the rest. » ! machine learning 101
machine learning regression classification !
machine learning - outils R JAVA PYTHON SAAS ! !
visualisation http://flowingdata.com/page/2/
http://www.brightpointinc.com/interactive/political_influence/index.html?source=d3js WEB visualisation
http://www.brightpointinc.com/interactive/political_influence/index.html?source=d3js visualisation
EXCEL - GNUPLOT PYTHON - MATPLOTLIB WEB - D3.JS !
! visualisation - outils
Général: http://www.oreilly.com/data/ Pandas: http://pandas.pydata.org/ R: http://www.r-project.org/ Python: https://www.python.org/ Machine learning:
http://scikit-learn.org/ Openrefine: http://openrefine.org/ Postgis: http://postgis.net/ Elasticsearch: http://www.elasticsearch.org/ Hadoop: http://hadoop.apache.org/ Spark: https://spark.apache.org/ Hbase: http://hbase.apache.org/ D3: http://d3js.org/ Bigml: https://bigml.com/ Prediction API: https://cloud.google.com/prediction/?hl=fr ressources
merci Laurent Paoletti @providenz TVT - 29 septembre 2014