Mehdi
January 26, 2018

Slides, thesis dissertation defense, deep generative neural networks for novelty generation

In recent years, significant advances in deep neural networks have enabled the creation of groundbreaking technologies such as self-driving cars and voice-enabled personal assistants. Almost all successes of deep neural networks concern prediction, whereas the initial breakthroughs came from generative models. Today, although we have very powerful deep generative modeling techniques, these techniques are essentially used for prediction or for generating known objects (i.e., good-quality images of known classes): any generated object that is a priori unknown is considered a failure mode (Salimans et al., 2016) or spurious (Bengio et al., 2013b). In other words, when prediction seems to be the only possible objective, novelty is seen as an error that researchers have been trying hard to eliminate. This thesis defends the point of view that, instead of trying to eliminate these novelties, we should study them and the generative potential of deep nets to create useful novelty, especially given the economic and societal importance of creating new objects in contemporary societies. The thesis sets out to study novelty generation in relation to the data-driven knowledge models produced by deep generative neural networks. Our first key contribution is the clarification of the importance of representations and their impact on the kinds of novelty that can be generated: a key consequence is that a creative agent might need to re-represent known objects to access various kinds of novelty. We then demonstrate that traditional objective functions of statistical learning theory, such as maximum likelihood, are not necessarily the best theoretical framework for studying novelty generation, and we propose several alternatives at the conceptual level. A second key result is the confirmation that current models, trained with traditional objective functions, can indeed generate unknown objects, showing that even though objectives like maximum likelihood are designed to eliminate novelty, practical implementations do generate it. Through a series of experiments, we study the behavior of these models and the novelty they generate. In particular, we propose a new task setup and metrics for selecting good generative models. Finally, the thesis concludes with a series of experiments clarifying the characteristics of models that can exhibit novelty: they show that sparsity, the noise level, and restricting the capacity of the net eliminate novelty, and that models that are better at recognizing novelty are also better at generating it.

Transcript

1. Deep generative neural networks for novelty generation: a foundational framework, metrics and experiments
   Mehdi Cherti, LAL/CNRS, Université Paris Saclay
   Supervised by: Balázs Kégl (LAL/CNRS, Université Paris Saclay) and Akın Kazakçı (Mines ParisTech)

2. Design theory
   • Early work: (Simon, 1969, 1973), design as "problem solving" (i.e., moving from an initial state to a desired state)
   • C-K theory: (Hatchuel et al., 2003), design as joint expansion of knowledge and concepts
   • Various formalisms of knowledge: set theory (Hatchuel et al., 2007), graphs (Kazakci et al., 2010), matroids (Le Masson et al., 2017)

3. Design theory
   • Through C-K, it acknowledges that knowledge is central
   • But it lacks computer-based experimental tools

4. Computational creativity
   • Enables experimentation, but the end goal is the object itself rather than studying the generative process
   • Fitness function barrier
   • No representation learning
   • Generation and evaluation are disconnected

5. ...but these powerful models are used to regenerate objects that we can easily relate to known objects…

6. • Although trained to generate what we know, some models can generate unrecognizable objects
   • However, these models and samples are considered spurious (Bengio et al., 2013) or a failure (Salimans et al., 2016)

7. • Goal of the thesis: study the generative potential of deep generative networks (DGNs) for novelty generation
   • Research questions:
     • What novelty can be generated by a DGN?
     • How to evaluate the generative potential of a DGN?
     • What are the general characteristics of DGNs that can generate novelty?
   • Method: we use computer-based simulations with deep generative models because
     • they offer a rich and powerful set of existing techniques
     • they can learn representations of objects
     • their generative potential has not been studied systematically

8. Outline
   1. Introduction
   2. The impact of representations on novelty generation
   3. Results
      3.a. Studying the generative potential of a deep net
      3.b. Evaluating the generative potential of deep nets
      3.c. Characteristics of models that can generate novelty
   4. Conclusion and perspectives

9. 2. The impact of representations on novelty generation
   In the design literature, it has been acknowledged that objects can be represented in multiple ways (Reich, 1995). What effect do representations have on novelty generation?

10. 2. The impact of representations on novelty generation
   • Suppose we have a dataset of 16 letters

11. 2. The impact of representations on novelty generation
   • Suppose we represent images in pixel space
   • We generate pixels uniformly at random (sketched below)
   • Everything is new, but there is no structure

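As a minimal illustration of this point (not from the deck; the batch size and the 16×16 image size are assumed placeholders), uniform pixel generation is a few lines of Python:

```python
import numpy as np

rng = np.random.default_rng(seed=0)

# Draw every pixel independently and uniformly in [0, 1]: each sample
# is "new" (never seen before), but none of them carries any structure.
images = rng.uniform(0.0, 1.0, size=(10, 16, 16))  # 10 random 16x16 images
```
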
12. 2. The impact of representations on novelty generation
   • Suppose we re-represent each letter using strokes
   • For instance: [stroke decomposition example shown on the slide]

13. 2. The impact of representations on novelty generation
   Pixel space vs. stroke space: representations change what you can generate.

14. 2. The impact of representations on novelty generation
   • How do we choose a "useful" representation for novelty generation?
   • Machine learning, and deep generative models in particular, provides ways to learn representations from data
   Q: Can we use those learned representations to generate novelty even if these models are not designed to do so?

15. 2. The impact of representations on novelty generation
   Summary:
   • Noise vs. novelty
   • Likelihood
   • Compression of representations

16. Research questions:
   • What novelty can be generated by deep generative nets (DGNs)?
   • How to evaluate the generative potential of a DGN?
   • What are the general characteristics of DGNs that can generate novelty?

17. 3.a. Studying the generative potential of a deep net
   • We observed that some models could generate novelty although they were not designed to do so
   • Thus, deep generative models have an unused generative potential
   • Can we demonstrate this more systematically?

18. 3.a. Studying the generative potential of a deep net (Kazakci, Cherti, Kégl, 2016)
   [Diagram: train data → learn → generative model → generate → ??]

19. 3.a. Studying the generative potential of a deep net
   We use a convolutional sparse autoencoder as the model; the training objective is to minimize the reconstruction error, under a sparsity constraint.

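The deck does not spell out the architecture, so the following PyTorch sketch is only one plausible reading: a small convolutional autoencoder whose code is made sparse by keeping the top-k feature activations at each spatial location, trained to minimize the reconstruction error. The layer sizes, the value of k, and the exact sparsity mechanism are assumptions.

```python
import torch
import torch.nn as nn

class SparseConvAE(nn.Module):
    """Convolutional autoencoder with a k-sparse code (illustrative sizes)."""

    def __init__(self, channels=32, keep=8):
        super().__init__()
        self.keep = keep
        self.enc = nn.Sequential(
            nn.Conv2d(1, channels, 5, padding=2), nn.ReLU(),
            nn.Conv2d(channels, channels, 5, padding=2),
        )
        self.dec = nn.Sequential(
            nn.Conv2d(channels, 1, 5, padding=2), nn.Sigmoid(),
        )

    def sparsify(self, h):
        # One possible sparsity constraint: at each spatial location,
        # keep the `keep` largest feature activations, zero out the rest.
        top, _ = h.topk(self.keep, dim=1)
        return h * (h >= top[:, -1:, :, :]).float()

    def forward(self, x):
        return self.dec(self.sparsify(self.enc(x)))

model = SparseConvAE()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

x = torch.rand(64, 1, 25, 25)               # toy batch of 25x25 (dim-625) images
loss = nn.functional.mse_loss(model(x), x)  # objective: reconstruction error
optimizer.zero_grad()
loss.backward()
optimizer.step()
```
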
20. 3.a. Studying the generative potential of a deep net
   [Diagram: input (dim 625) → encode → bottleneck → decode → reconstruction]
   Deep autoencoder with a bottleneck, from Hinton, G. E., & Salakhutdinov, R. R. (2006).

21. 3.a. Studying the generative potential of a deep net
   • We use an iterative method to generate new images
   • Start with a random image
   • Force the network to construct (i.e., interpret) it by applying f(x) = dec(enc(x))
   • Iterate until convergence (see the sketch below)

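In code, the procedure could look like the sketch below, reusing the autoencoder sketch above; the tolerance and the iteration cap are assumptions, since the slide only says "until convergence":

```python
import torch

def iterative_generate(model, shape=(1, 1, 25, 25), tol=1e-5, max_iters=100):
    """Start from a random image and repeatedly apply f(x) = dec(enc(x))."""
    x = torch.rand(shape)  # random starting image
    with torch.no_grad():
        for _ in range(max_iters):
            y = model(x)                   # f(x) = dec(enc(x))
            if (y - x).abs().max() < tol:  # stop near a fixed point of f
                break
            x = y
    return x
```
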
22. 3.a. Studying the generative potential of a deep net
   Our interpretation of the results:
   • Known: training digits
   • Representable: "combinations of strokes"

23. 3.a. Studying the generative potential of a deep net
   • Known: training digits
   • Representable: all digits that the model can generate
   • Valuable: all recognizable digits

24. 3.a. Studying the generative potential of a deep net
   • Known: training digits
   • Representable: "combinations of strokes"
   • Valuable: human selection

25. 3.b. Evaluating the generative potential of deep nets
   • We have one example of a deep generative model that can indeed generate novelty
   • Can we go further by automatically finding models that can generate novelty?

26. 3.b. Evaluating the generative potential of deep nets
   We designed a new setup and a set of metrics to find models that are capable of generating novelty.

27. 3.b. Evaluating the generative potential of deep nets
   Idea: simulate the unknown by
   • training on known classes
   • testing on classes known to the experimenter but unknown to the model
   Proposed setup: train on digits and test on letters, where letters are used as a proxy for evaluating the capacity of models to generate novelty.

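The deck does not name the datasets, but one way to realize this setup today (an assumption on our part) is MNIST for the digits and the "letters" split of EMNIST for the letters:

```python
from torchvision import datasets, transforms

to_tensor = transforms.ToTensor()

# Known classes: the generative models are trained on digits only.
digits = datasets.MNIST("data", train=True, download=True,
                        transform=to_tensor)

# Simulated unknown: letters are known to the experimenter but are never
# shown to the generative models; they only serve to evaluate the samples.
letters = datasets.EMNIST("data", split="letters", train=True,
                          download=True, transform=to_tensor)
```
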
28. 3.b. Evaluating the generative potential of deep nets
   [Diagram: learn → generative model → generated samples]
   Q: How many of those are letters?

29. 3.b. Evaluating the generative potential of deep nets
   To count letters, we learn a discriminator with 36 classes = 10 for digits + 26 for letters.
   [Diagram: learn → discriminator]

30. 3.b. Evaluating the generative potential of deep nets
   We then use the discriminator to score the models:
   [Diagram: discriminator → predict → number of letters]

31. 3.b. Evaluating the generative potential of deep nets
   The "number of letters" score is a proxy for finding models that generate images that are:
   • non-trivial
   • not recognizable as digits

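A sketch of the scoring step, assuming the generated samples come as a tensor and `clf` is the trained 36-way discriminator, with outputs 0-9 for the digit classes and 10-35 for the letter classes (this class layout is our assumption):

```python
import torch

def letter_count(clf, samples):
    """Count how many generated samples the discriminator labels as letters.

    Assumes classes 0-9 are digits and classes 10-35 are letters.
    """
    with torch.no_grad():
        predictions = clf(samples).argmax(dim=1)
    return int((predictions >= 10).sum())
```

A symmetric digit-count score (`predictions < 10`) gives the memorization-prone selection criterion discussed a few slides later.
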
32. 3.b. Evaluating the generative potential of deep nets
   • We run a large-scale experiment in which we train ~1000 models (autoencoders, GANs), varying their hyperparameters
   • From each model we generate 1000 images, then evaluate the model using our proposed metrics (the selection loop is sketched below)
   • Question we tried to answer: can we find models that can generate novelty?

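The selection loop might look like the following sketch; `hyperparameter_grid`, `train_model`, and `sample` are hypothetical stand-ins for the experiment pipeline, which the deck does not detail:

```python
def rank_models(hyperparameter_grid, train_model, sample, clf):
    """Train one generative model per configuration, generate 1000 images
    from each, and rank the models by their letter-count score.

    `train_model(hp)` and `sample(model, n)` are hypothetical helpers.
    """
    results = []
    for hp in hyperparameter_grid:           # ~1000 configurations
        model = train_model(hp)              # autoencoders, GANs, ...
        samples = sample(model, n=1000)      # 1000 images per model
        results.append((letter_count(clf, samples), hp))
    # High letter counts flag candidate novelty generators (see next slide).
    return sorted(results, key=lambda r: r[0], reverse=True)
```
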
33. 3.b. Evaluating the generative potential of deep nets
   • Selecting models by letter count leads to models that can generate novelty
   • Selecting models by digit count leads to models that memorize the training classes

34. 3.b. Evaluating the generative potential of deep nets
   • Known: training digits
   • Representable: "combinations of strokes"
   • Valuable: letters

35. 3.b. Evaluating the generative potential of deep nets
   We have shown that we can automatically find models that can generate novelty, as well as other models that cannot.

36. 3.b. Evaluating the generative potential of deep nets
   • Can we characterize the difference between models that can generate novelty and models that cannot?
   • We study a particular model architecture through a series of experiments

37. 3.c. Characteristics of models that can generate novelty
   • We study the effect of different ways of restricting the capacity of the representation on the same architecture
   • We find that restricting the capacity of the representation hurts the models' ability to generate novelty: more capacity, more novelty

38. Conclusion
   Main contributions:
   • Importance of representation for novelty generation
   • Current models can generate novelty even though they were not designed for that
   • We propose a new setup and a set of metrics to assess the capacity of models to generate novelty
   • We show that constraining the capacity of the representation can be harmful for novelty generation

39. Perspectives: immediate next steps
   • Explain why existing models can generate novelty
   • Propose an explicit training criterion to learn a representation suitable for novelty generation
   • Propose alternative generative procedures to random sampling
   • Experiment on more complex datasets and domains

40. Perspectives: future
   • Agent evolving over time: dynamic knowledge and value function
   • Multi-agent system so that agents get/give feedback and cooperate