Out of class novelty generation, an experimental foundation

Mehdi Cherti, Balázs Kégl (CNRS & Université Paris Saclay, Center for Data Science)
Akin Kazakci (Mines Paristech, Centre de gestion scientifique)
ICTAI 2017

Motivation
• A notable characteristic of human intelligence is novelty generation: the capacity to build, invent, or think up new objects from available knowledge.
• Examples: design of new products (e.g., in the engineering industry); artistic works (e.g., paintings, music); intellectual works (e.g., scientific theories)

Motivation
Can we build programs that can autonomously build new product designs, new paintings, new music styles, new scientific theories?

Questions we asked
• What is meant by generation of novelty?
• Can we generate novelty?
• How can a program generating novelty be evaluated?

Where we looked
• Design theory
• Computational creativity
• Machine learning

The importance of the representation
• The designer chooses a representation
• The representation reflects/encodes the designer's knowledge about the domain
• The designer uses the representation to generate new objects
• Different designers will choose different representations (because they have different knowledge), leading to completely different objects (Reich, 1995)

The fitness function barrier
• For most computational creativity systems, the value (fitness) function is fixed and predetermined.
• The representation (genotype) is also fixed and predetermined.
• The objects generated by the system reflect the designer's preferences, not the machine's.

Can we learn representations?
• Important subfield of machine learning: representation learning
• In ML, we know how to learn good representations for prediction (supervised learning) and how to evaluate them
• Q: What is a good representation for the generation of new objects?

Can we learn representations?
• Can we use generative models from ML to do that?
• Problem: current generative models in ML are mostly trained with maximum likelihood or some proxy of it => unlikely to generate "novelty"
[Figure: diagram with labels Train data, Learn, Generative model, Generate, Test data; caption: What ML wants]

Questions we asked
• What is meant by generation of novelty?
• Can we generate novelty?
• How can a program generating novelty be evaluated?

What is meant by generation of novelty?
Our definition attempt: Generate Novelty = Generate new types/classes/categories

Questions we asked
• What is meant by generation of novelty?
• Can we generate novelty, that is, new types?
• How can a program generating novelty be evaluated?

Can we generate new types? (Kazakci, 2016)
[Figure: diagram with labels Train data, Learn, Generative model, Generate; clusters found semi-manually]

Can we generate new types? (Kazakçı et al., 2016)

Can we generate new types?
In Kazakçı et al. (2016):
• We show that symbols of new types can be generated by carefully tuned autoencoders
• We take a first step toward defining a conceptual and experimental framework for novelty generation
• However, we make no attempt to design evaluation metrics
[Figure: a set of types (clusters) discovered by the model]

Questions we asked
• What is meant by generation of novelty?
• Can we generate novelty, that is, new types?
• This paper: How can a program generating novelty be evaluated?

How can a program generating novelty be evaluated?
Idea: simulate the unknown. Train on known classes, test on classes known to the experimenter but unknown to the model.
Examples:
• Train on all fashion styles up to 2000, test on fashion styles from 2000 onward
• Train on baroque and classical music, test on romantic music
• Train on drug-like molecules, test on malaria drugs
Our setup: train on digits, test on letters
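The "simulate the unknown" split can be sketched as below. This is only an illustration under stated assumptions: the label encoding (characters `'0'`-`'9'` and `'a'`-`'z'`), the function name `split_by_class`, and the toy sample names are all hypothetical and do not come from the slides.

```python
import string

# Assumed label encoding: one character per class.
KNOWN_CLASSES = set(string.digits)              # classes the generative model may see
HELD_OUT_CLASSES = set(string.ascii_lowercase)  # classes known only to the experimenter


def split_by_class(labeled_samples):
    """Split (sample, label) pairs into known-class and held-out-class sets."""
    train, held_out = [], []
    for sample, label in labeled_samples:
        if label in KNOWN_CLASSES:
            train.append((sample, label))
        elif label in HELD_OUT_CLASSES:
            held_out.append((sample, label))
    return train, held_out


# Toy data: strings stand in for images.
toy_data = [("img0", "3"), ("img1", "a"), ("img2", "7"), ("img3", "z")]
train_set, held_out_set = split_by_class(toy_data)
```

In this sketch the generative model would only ever be given `train_set`; `held_out_set` is reserved for the evaluation side.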

How can a program generating novelty be evaluated?
[Figure: diagram with labels Generative model, Learn, Generate]
Q: How many of those are letters?

How can a program generating novelty be evaluated?
Discriminator: 36 classes = 10 for digits + 26 for letters
[Figure label: Learn]

How can a program generating novelty be evaluated?
Discriminator: predict the number of letters
[Figure label: Predict]
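A minimal sketch of the letter count, assuming the discriminator outputs one posterior over the 36 classes per generated image. The class ordering (indices 0-9 for digits, 10-35 for letters) and the helper names `predicted_class`, `letter_count`, and `peaked` are assumptions made for illustration.

```python
NUM_DIGITS = 10   # assumed: class indices 0-9 are digits
NUM_LETTERS = 26  # assumed: class indices 10-35 are letters
NUM_CLASSES = NUM_DIGITS + NUM_LETTERS


def predicted_class(posterior):
    """Index of the most probable class in one posterior."""
    return max(range(len(posterior)), key=lambda k: posterior[k])


def letter_count(posteriors):
    """Number of generated samples whose most probable class is a letter."""
    return sum(1 for p in posteriors if predicted_class(p) >= NUM_DIGITS)


def peaked(k, mass=0.9):
    """Toy posterior putting `mass` on class k and spreading the rest evenly."""
    rest = (1.0 - mass) / (NUM_CLASSES - 1)
    return [mass if i == k else rest for i in range(NUM_CLASSES)]


# Three toy posteriors: one digit-like sample and two letter-like samples.
toy_posteriors = [peaked(3), peaked(12), peaked(35)]
n_letters = letter_count(toy_posteriors)
```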

How can a program generating novelty be evaluated?
[Figure panels: Low nb. letters; High nb. letters]
Problem: "noise" can get misclassified as a letter

How can a program generating novelty be evaluated?
Problem: "noise" gets misclassified as a letter
Solution: we use Objectness = posterior entropy
[Figure panels: Low objectness; High objectness]
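An entropy-based objectness score can be sketched as follows. The slide only states that objectness is based on the posterior entropy; the sign convention used here (negative entropy, so that a confident discriminator gives high objectness) is an assumption, as are the function names.

```python
import math


def posterior_entropy(posterior):
    """Shannon entropy (in nats) of one class posterior."""
    return -sum(p * math.log(p) for p in posterior if p > 0.0)


def objectness(posterior):
    """Assumed convention: negative entropy, so peaked posteriors score higher."""
    return -posterior_entropy(posterior)


# A uniform posterior over 36 classes stands in for "noise";
# a one-hot posterior stands in for a clearly recognizable symbol.
uniform = [1.0 / 36] * 36
one_hot = [1.0] + [0.0] * 35
```

With this convention, `objectness(one_hot)` is larger than `objectness(uniform)`, so samples on which the discriminator hesitates between many classes are penalized.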

How can a program generating novelty be evaluated?
[Figure grid: High nb. letters / Low nb. letters by High objectness / Low objectness]

Experiments
• We run a large-scale experiment in which we train ~1000 models (autoencoders, GANs) with varying hyperparameters.
• From each model, we generate 1000 images, then evaluate the model using our proposed metrics (count + objectness)
• Question we tried to answer: Can we find models that can generate novelty?
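Selecting models by one of these metrics amounts to a sort over per-model scores. The sketch below assumes the scores have already been computed; the function name `rank_models`, the metric keys, and the toy numbers are hypothetical and are not results from the experiment.

```python
def rank_models(scores, metric, top_k=3):
    """Return the names of the top_k models, best first, for one metric."""
    ranked = sorted(scores, key=lambda name: scores[name][metric], reverse=True)
    return ranked[:top_k]


# Hypothetical toy scores, invented for illustration only.
toy_scores = {
    "model_a": {"letters_count": 120, "digits_count": 850},
    "model_b": {"letters_count": 610, "digits_count": 300},
    "model_c": {"letters_count": 45, "digits_count": 940},
}

best_for_letters = rank_models(toy_scores, "letters_count", top_k=1)
best_for_digits = rank_models(toy_scores, "digits_count", top_k=1)
```

The same function can rank by any other per-model score, such as an average objectness, by adding that key to each entry.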

Results
• Selecting models for letters count + objectness leads to models that can generate novelty
• Selecting models for digits count + objectness leads to models that memorize training classes

Results Restrict capacity

Results
A visualization of the iterative procedure of generation from autoencoders
[Figure axis: Iterations]

Pangrams
[Figure panels: Hand-picked; From top models, found automatically]

Summary
• We propose a workable definition of novelty
• We propose a set of scores to evaluate the capacity of models to generate novelty
• We find models that can generate novelty

Perspectives
• The immediate next goal is to analyze the models in a systematic way
• Next step: how can we build programs that can build their own value function?

Thank you!

Backup: Generating new types of objects: generating new symbols
• We use an iterative method to build symbols the net has never seen (inspired by Bengio et al. (2013), but we don't try to avoid spurious samples):
• Start with a random image
• Force the network to reconstruct (i.e., interpret) it repeatedly with f(x) = decode(encode(x)), until convergence
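The iterative procedure can be sketched as a fixed-point iteration on f(x) = decode(encode(x)). The `encode` and `decode` functions below are toy stand-ins (a per-pixel threshold), not a trained autoencoder, and the stopping rule (`tol`, `max_iters`) and the function name `generate` are assumptions.

```python
import random


def encode(x):
    """Toy stand-in for a trained encoder: threshold each pixel."""
    return [v > 0.5 for v in x]


def decode(code):
    """Toy stand-in for a trained decoder: map each code bit back to a pixel."""
    return [1.0 if c else 0.0 for c in code]


def f(x):
    """One reconstruction step, f(x) = decode(encode(x))."""
    return decode(encode(x))


def generate(x0, tol=1e-6, max_iters=100):
    """Apply f repeatedly until the image stops changing or max_iters is hit."""
    x = list(x0)
    steps = 0
    for _ in range(max_iters):
        x_next = f(x)
        change = max(abs(a - b) for a, b in zip(x, x_next))
        x = x_next
        steps += 1
        if change < tol:
            break
    return x, steps


# Start from a random "image" of 16 pixels with values in [0, 1).
rng = random.Random(0)
x0 = [rng.random() for _ in range(16)]
sample, n_steps = generate(x0)
```

With a real autoencoder, `encode` and `decode` would be the trained network's two halves, and convergence would typically take more iterations than in this toy case.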
