Slide 1

Slide 1 text

Monitoring, Data Center et Machine Learning https://labs.ovh.com/machine-learning-platform Clement Bataille Christophe Rannou

Slide 2

Slide 2 text

WE OPERATE IN A FAST GROWING MARKET where availability is essential PUBLIC CLOUD & VPS PRIVATE CLOUD DEDICATED SERVER STORAGE NETWORK & SECURITY MULTI-LOCAL EVERYWHERE RAPID DELIVERY WORD-CLASS STANDARDS

Slide 3

Slide 3 text

OUR STORY

Slide 4

Slide 4 text

WE INNOVATE WITH EVERYONE WE ARE INDEPENDANT WE ARE THE ALTERNATIVE WHICH FREES UP THE CLOUD THROUGH CREATIVITY AND AN ENTREPRENEURIAL SPIRIT EMPLOYEES CUSTOMERS ECOSYSTEM OVH PARTNERS INNOVATION FOR FREEDOM

Slide 5

Slide 5 text

WE ARE A GLOBAL CLOUD PROVIDER WE ARE AN INDUSTRIAL PLAYER Hillsboro x1 17 Tbps 34 Point of Presence IN 3 CONTINENTS BHS x6 Vinthill x1 +1 400 000 CUSTOMERS IN 132 COUNTRIES Singapore x1 2 200 EMPLOYEES IN 18 COUNTRIES Sydney x1 Europe x18 BUILDER OF ITS OWN SERVERS SINCE 2002 + 1 million SERVERS built since 1999 29 DATA CENTERS in 12 locations EXISTING SINCE 1999

Slide 6

Slide 6 text

SIMPLE AND QUICK TO SET UP MULTI-LOCAL, CLOSE TO EACH PERSON THROUGHOUT THE WORLD WHOSE COST IS ACCESSIBLE AND PREDICTABLE REVERSIBLE, OPEN AND INTEROPERABLE TRANSPARENT AND RESPONSIBLE WE BUILD A DIFFERENT CLOUD a truly Cloud SMART TO GROW AND SUCCEED TOGETHER

Slide 7

Slide 7 text

ML for Time Series with Prescience 7

Slide 8

Slide 8 text

Toy Use Case 8 We are going to have a look at the international airline passengers prediction problem. The initial data is in the csv format : steps passengers 1949-01 112 1949-02 118 1949-03 132 1949-04 129

Slide 9

Slide 9 text

Problem Definition Detection Time Series • Specific Target • Ground Truth quality • Specific Target • Ground Truth quality • Forecast • Anomaly detection • Prediction horizon • Forecast • Anomaly detection • Prediction horizon 9

Slide 10

Slide 10 text

Problem definition 10 What kind of problem do you want to solve ? • Simple Classification • Simple Regression • Time series classification • Time series regression/forecast

Slide 11

Slide 11 text

Problem definition 11 • What is your label ? • What is your ordering column ? Month Passengers 1949-01 112 1949-02 118 1949-03 132 1949-04 129 Ordering column Label

Slide 12

Slide 12 text

Problem definition 12 • How many steps forward do you want to predict ? One year => 12 steps • Do you want to apply a discount ? No => 1 • Scoring metric ? Default => MSE 1 Year

Slide 13

Slide 13 text

Experiment definition 13 How many fold do you want to create ? 2

Slide 14

Slide 14 text

Experiment definition 14 How much budget to allow on optimization ? 10

Slide 15

Slide 15 text

Demo 15

Slide 16

Slide 16 text

Scores are deceitful • Data Leakage – Unusable – Dangerous – Difficult to detect – Difficult to correct • Wrong problem definition 16

Slide 17

Slide 17 text

Stock Value Prediction: How to get rich in 5 minutes Graphs: Vegard Flovik, How (not) to use Machine Learning for time series forecasting: Avoiding the pitfalls 17

Slide 18

Slide 18 text

Stock Value Prediction: Why I am not rich 18

Slide 19

Slide 19 text

Stock Value Prediction: What I should have done 19

Slide 20

Slide 20 text

Use cases

Slide 21

Slide 21 text

Electricity Comsuption forecast Sonde DC Site EDF WARP10 ML with Prescience Retrain model every week Forecast everyday

Slide 22

Slide 22 text

CEPH Key indicator: Forecast every 6h

Slide 23

Slide 23 text

VOIP Monitoring Chiffres Clés : 125 destinations 3 lignes par pays 1 point à l’heure

Slide 24

Slide 24 text

Temperature Anomaly Room Warp10 Arima Model Retrain every week Forecast by point Compare with Isolation Forest Alerting with TAT

Slide 25

Slide 25 text

Temperature Anomaly Rack Warp10 Mobil Mean Isolation Forest Alerting with TAT

Slide 26

Slide 26 text

Network forecast Key indicator : • 20K equipment • 100 metric by equipment • Point every minute

Slide 27

Slide 27 text

Network forecast • Trend Stationary • Differency Stationary Method stationary transformation Graphs: Investopedia, stationary and non-stationary processes : https://www.investopedia.com/articles/trading/07/stationary.asp

Slide 28

Slide 28 text

Network forecast Warp10 Heuristique de selection Regression Arima by TS Export to WARP10 + Mailing

Slide 29

Slide 29 text

T H A N K YO U @ChrisRannou Clement Bataille https://gitter.im/ovh/ai