A fourth paradigm? The challenges of using machine learning in physics

A FOURTH PARADIGM? THE CHALLENGES OF USING MACHINE LEARNING IN
PHYSICS KEITH T. BUTLER IOP OCT 2019

IOP OCT 2019 THE MEANING OF A FOURTH PARADIGM(?) APL
Materials 4, 053208 (2016)

IOP OCT 2019 HOW TO BENEFIT FROM ML IN PHYSICS
PROBLEMS ▸ Choose the right problems ▸ Choose the right models ▸ Choose the right data ▸ Understand the results (if you care to…)

IOP OCT 2019 CHOOSING THE RIGHT PROBLEMS 4 “Computers are
useless, they can only give you answers.” Picasso.

IOP OCT 2019 CHOOSING THE RIGHT PROBLEMS 5 KEY QUESTIONS
BEFORE GOING ML ▸ What do I want to achieve? ▸ How much data do I have/can I get? ▸ What kind of data do I have? ▸ Do I care more about prediction or inference? ▸ What kind of hardware do I have? 5

IOP OCT 2019 CHOOSING THE RIGHT MODEL UNDERSTAND THE OPTIONS
▸ Example decision tree (classical); neural network (deep) 6 Robustness Scaling Interpretability Simplicity Speed Accuracy ANN DT Traditional ML Deep NN Performance Data

IOP OCT 2019 AN OVERVIEW OF CLASSICAL ML MODELS 7
https://scikit-learn.org/stable/tutorial/machine_learning_map/index.html

IOP OCT 2019 SOME IMPORTANT CLASSES OF DEEP NEURAL NETWORKS
8 Fuller map available: https://towardsdatascience.com/the-mostly-complete-chart-of-neural-networks- explained-3fb6f2367464 Mulitlayer perceptrons Function approximation machine Recurrent Neural Networks Time series/sequential data eg text conversion Deep Convolutional Networks Image processing

IOP OCT 2019 CHOOSING THE RIGHT MODEL REPRESENTATIONS OF THE
PROBLEM ▸ Hand crafted descriptors ▸ Learned representations ▸ E.g Convolutional neural networks 9 Smaller datasets, with very structured inputs, classical models Larger datasets, with very unstructured inputs, missing data, combined data sources, deep models Traditional ML Deep NN

IOP OCT 2019 REPRESENTATIONS OF THE PROBLEM EXAMPLES ▸ Clean
structured data can often be converted/compressed 10 ▸ Noisy/incomplete datasets are difﬁcult to convert analytically ?

IOP OCT 2019 OBTAINING (THE RIGHT) DATA 14,197,122 images!!

IOP OCT 2019 OBTAINING (THE RIGHT) DATA ▸ Image recognition
exploded after the availability of the Imagenet challenge ▸ Physics data is not as easy to label as many other ‘simple’ data types

IOP OCT 2019 DEALING WITH DATA SCARCITY TRANSFER LEARNING Can
we use what we learned from other examples to speed up learning?

IOP OCT 2019 DEALING WITH DATA SCARCITY TRANSFER LEARNING http://scs.ryerson.ca/~aharley/vis/conv/ﬂat.html
Example - convolutional networks learn very general features in early layers

IOP OCT 2019 DEALING WITH DATA SCARCITY TRANSFER LEARNING https://github.com/materialsvirtuallab/megnet
Example - crystal neural network trained on energies can be used for dielectric properties DFPT calculation several hours on 10s to 100s of CPUs. MegNet calculation seconds on a desktop

IOP OCT 2019 DEALING WITH DATA SCARCITY ACTIVE LEARNING TRAIN
ON LABELLED DATA TEST ON UNLABELLED DATA Committee of models Identify regions of uncertainty

IOP OCT 2019 BUILDING BETTER DATASETS Nature, 2018, 559, 547.
The importance of metadata is becoming more apparent Initiatives like FAIRsFAIR https://www.force11.org/ group/fairgroup/fairprinciples

IOP OCT 2019 MAKING MODELS INTERPRETABLE ▸ Classical models are
often easy to interpret ▸ Deep models, learned representations can be more opaque

IOP OCT 2019 MAKING MODELS INTERPRETABLE Model performance Interpretability use
Sub-human Debug and improve Human Increase conﬁdence Super-human Learn from successs

IOP OCT 2019 INTERPRETABLE MODELS FOR NEUTRON SCATTERING Finding the
right signal can be a needle in a haystack

IOP OCT 2019 INTERPRETABLE MODELS FOR NEUTRON SCATTERING Build a
model discrimination network - ask it WHY it makes the choice.

IOP OCT 2019 INTERPRETABLE MODELS FOR NEUTRON SCATTERING The network
identiﬁes the same regions of E/Q space as a trained physicist. Could, in future, guide experiments of the same type.

IOP OCT 2019 SUMMARY ▸ AI/ML/Logistic regression has potential to
be a transformative tool ▸ To get the most from AI we should understand the relationships between models, data and physics ▸ Physics data tends to be scarce or messy or both, cutting edge research looks at how to deal with these issues ▸ Combining AI with traditional physics simulation offers a powerful route

IOP OCT 2019 ACKNOWLEDGMENTS ▸ SciML ▸ Rebecca, Tony, Jeyan,
Sam, Patrick ▸ ICL ▸ Kazuki, Aron ▸ UCL ▸ Dan

IOP OCT 2019 THANK YOU 25 NATURE, 2018, 559, 547.
@keeeto2000 @ml_sci keeeto.github.io www.scd.stfc.ac.uk/ Pages/Scientiﬁc- Machine-Learning.aspx

A fourth paradigm? The challenges of using mach...

A fourth paradigm? The challenges of using machine learning in physics

keeeto

More Decks by keeeto

Other Decks in Education

Featured

Transcript

A FOURTH PARADIGM? THE CHALLENGES OF USING MACHINE LEARNING IN

IOP OCT 2019 THE MEANING OF A FOURTH PARADIGM(?) APL

IOP OCT 2019 HOW TO BENEFIT FROM ML IN PHYSICS

IOP OCT 2019 CHOOSING THE RIGHT PROBLEMS 4 “Computers are

IOP OCT 2019 CHOOSING THE RIGHT PROBLEMS 5 KEY QUESTIONS

IOP OCT 2019 CHOOSING THE RIGHT MODEL UNDERSTAND THE OPTIONS

IOP OCT 2019 AN OVERVIEW OF CLASSICAL ML MODELS 7

IOP OCT 2019 SOME IMPORTANT CLASSES OF DEEP NEURAL NETWORKS

IOP OCT 2019 CHOOSING THE RIGHT MODEL REPRESENTATIONS OF THE

IOP OCT 2019 REPRESENTATIONS OF THE PROBLEM EXAMPLES ▸ Clean

IOP OCT 2019 OBTAINING (THE RIGHT) DATA 14,197,122 images!!

IOP OCT 2019 OBTAINING (THE RIGHT) DATA ▸ Image recognition

IOP OCT 2019 DEALING WITH DATA SCARCITY TRANSFER LEARNING Can

IOP OCT 2019 DEALING WITH DATA SCARCITY TRANSFER LEARNING http://scs.ryerson.ca/~aharley/vis/conv/ﬂat.html

IOP OCT 2019 DEALING WITH DATA SCARCITY TRANSFER LEARNING https://github.com/materialsvirtuallab/megnet

IOP OCT 2019 DEALING WITH DATA SCARCITY ACTIVE LEARNING TRAIN

IOP OCT 2019 BUILDING BETTER DATASETS Nature, 2018, 559, 547.

IOP OCT 2019 MAKING MODELS INTERPRETABLE ▸ Classical models are

IOP OCT 2019 MAKING MODELS INTERPRETABLE Model performance Interpretability use

IOP OCT 2019 INTERPRETABLE MODELS FOR NEUTRON SCATTERING Finding the

IOP OCT 2019 INTERPRETABLE MODELS FOR NEUTRON SCATTERING Build a

IOP OCT 2019 INTERPRETABLE MODELS FOR NEUTRON SCATTERING The network

IOP OCT 2019 SUMMARY ▸ AI/ML/Logistic regression has potential to

IOP OCT 2019 ACKNOWLEDGMENTS ▸ SciML ▸ Rebecca, Tony, Jeyan,

IOP OCT 2019 THANK YOU 25 NATURE, 2018, 559, 547.