data —> solutions to help people anticipate and adapt to climate variability and climate change [email protected] @NFauchereau https://github.com/nicolasfauchereau/
fundamentals! 3. A brief typology of ML algorithms! 4. ML in industry! 5. ML in the environmental sciences! 6. Development of a ML-based seasonal forecasting scheme for the Pacific Islands ! 7. Lessons learned and conclusions Outline The tools: Python and the scikit-learn ML library https://speakerdeck.com/nicolasf
the ability to learn without being explicitly programmed.” Arthur Samuel, 1959. “A computer program is said to learn from experience E with respect to some task T and some performance measure P, if its performance on T, as measured by P, improves with experience E.” Tom Mitchell, 1997.
31 2011. Note that all data are in mm. A: INITIAL DATA MSLA seasonal anomalies for 8 stations data for 1981 - 2010 Target ‘raw’ MSLA anomalies (mm) regression discretized (quintiles) classification
cloud computing • Satellite remote sensing • In-situ sensors arrays • Model outputs Environmental sciences’s BIG DATA era ? • Mature and stable ML libraries BIG data compute power accessible
growing in the ‘data science’ community ! • huge collection of libraries from linear algebra to bayesian analysis, visualisation etc ! • rapid prototyping to production
open-source (and free) • consistent API (Application Programming Interface) • comprehensive documentation • efficient algorithms • harnesses the power of the ‘Python scientific stack’ • very active development • http://scikit-learn.org/stable/index.html