Image ATM - Image Classification for Everyone

Dat Tran - Head of AI @ Axel Springer AI
(@datitran) 9 august 2019 ~ india ~ Image-ATM Image Classification for Everyone

echo $(whoami)

https://ai.axelspringer.com/

echo $(whoami)

echo $(whoami) https://www.volkswagen-newsroom.com/de/pressemitteilungen/volkswagen-konzern-it-setzt-bei-digitalisierung-auf-neue-recruiting-und-arbeitsmethoden-542

echo $(whoami) Face2Face Hydroplaning Prediction Image Quality Assessment

echo $(whoami)

Agenda • Motivation • Image Classification Problem • Image-ATM •
Conclusion

Motivation

That’s us

Europe‘s leading digital publisher appr. 16.000 employees worldwide over 250
brands in over 40 countries appr. 70% of the revenue is digital 49,4% of the revenue is from international business The soul and spirit of the company Axel Springer is journalism The Axel Springer Family In a nutshell

A family of many strong brands

idealo.de 16

perfect shopping experience 330 million offers Business Model

330 million offers perfect shopping experience A lot of manual
work Business Model

Product Gallery

Perfect Product Gallery

idealo.de - Hotel Price Comparision ~ 2.306.658 accommodations ~ 308.519.299
images ~ ~ 133 images per accommodation hotel.idealo.de

Importance of Photography for Hotels “.. after price, photography is
the most important factor for travelers and prospects scanning OTA sites..” “.. Photography plays a role of 60% in the decision to book with a particular hotel ..” “.. study published today by TripAdvisor, it would seem like photos have the greatest impact driving engagement from travelers researching on hotel and B&B pages ..”

1 2 3 4 5 6 7 8 9 10
11 12 13

Current Image Placement - Bedroom Position: 19 Position: 1

Current Image Placement - Reception Position: 17 Position: 3

Footer 26 April 2019 Beautiful images should appear earlier in
the gallery

Image Quality Assessment • NVIDIA Developer Blog: https://devblogs.nvidia.com/deep- learning-hotel-aesthetics-photos/ •
GitHub: https://github.com/idealo/image-q uality-assessment

1 2 3 4 5 6 7 8 9 10
11 12 13

Ensure different areas get depicted

1 2 3 4 5 6 7 8 Bedroom Bathroom
Restaurant Facade Fitness Studio Kitchen

Ad-hoc: Winter/non-winter classification ~ Ad-hoc request to label 300k hotel
images for travel department ~ Final solution: train neural network and predict 300k images

Problem Statement Many tagging problems • 2000 product categories •
Many classes with a lot of hotel images; classes can change and ad-hoc requests such as winter/non-winter • Also a lot of image tagging problems within the Axel Springer group

Problem Statement Requirements • Needed a tool to do fast
experimentation • Easy to use also for non-machine learners/data scientists • Good documentation • Explainable AI

Image Classification

What is Image Classification? • Image classification is the task
of assigning an input image one label from a fixed set of categories: http://cs231n.github.io/classification/ • Supervised learning problem

Typical Example • MNIST • Fashion-MNIST • ImageNet • CIFAR-10
• Cats vs. dogs • etc...

How to solve it? • SVMs • Feed-forward neural networks
• CNNs • CapsuleNet • SVMs • Feed-forward neural networks • CNNs • CapsuleNet

Transfer Learning

Transfer Learning 1. Use pre-trained CNN that was trained on
millions of images (e.g. MobileNet or VGG16) 2. Replace top layers so that the output fits with classification task 3. Train existing and new layer weights

Libraries Any many more...

Example tf.keras

Example tf.keras Use ImageDataGenerator class to • Load images •
Some preprocessing, target size, train/validation split etc..

Using the Keras Functional API • Transfer Learning with MobileNet
• Input shape matches with target size; rgb • Add dropout and also dense layer with two classes Example tf.keras

Compile model and train it • Define loss and metrics
• Define number of epochs, validation data • And many more... Example tf.keras

Many things can go wrong with tf.keras You can solve
image classification in many ways • Sequential API vs. Functional API vs. Subclassing API • Train/test/validation split via ImageDataGenerator or scikit-learn’s model_selection • Choice of models, optimizer, loss function, metrics, number of epochs etc… • Cloud training (AWS, Google Cloud, Azure, your own GPU cluster)

Explainable AI Source: https://medium.com/@raeidsaqur/explainable-machine-learning-5-must-read-papers-95660d9f0c72

Ethics & Biases

Interpretability of ML models

Interpretability of ML models PASCAL VOC Challenge source: Wojciech Samek,
“Interpretable and Trustworthy Machine Learning” Why is this a train? Why is this a horse? Why is this a boot?

Interpretability of ML models PASCAL VOC Challenge source: Wojciech Samek,
“Interpretable and Trustworthy Machine Learning”

CNNs can be explainable • Attribution techniques • Visualization techniques
https://github.com/idealo/cnn-exposed

Attribution techniques Grad-Cam Further methods are e.g. saliency maps, LRP
etc...

Visualization techniques source: Zhou et. al., “Learning Deep Features for
Discriminative Localization”

Image-ATM

Labeling Input Processing Modeling Output Deployment Highly manual Problem revisited

Initial idea Input Processing Modeling Output

What is Image-ATM?

Installation

Usage • Train with CLI • Train without CLI ◦
Google Colab

Conclusion

Summary ~ Image-ATM reduced our training workflow to a couple
of mins from sometimes a few hours (max 1-2 hrs) ~ Abstraction generates standardized workflow and enabled non-machine learners to do image classification ~ Library is extensively used now at idealo.de ~ Glasses, washing machines, cameras, smart phones etc. ~ Many more use cases...

Sneaker Gallery

Roadmap ~ Upgrade to TensorFlow 2.0 ~ Add AutoML capabilities
(at the moment ENAS takes 8hrs on MNIST for a good model whereas transfer learning takes 20mins) ~ More interpretable AI techniques (SHAP etc..) ~ PDF report output ~ More image augmentation ~ Semi-supervised learning ~ Image Deduplication (coming soon)

Team Christopher Lennan Senior Data Scientist idealo.de Malgorzata Adamczyk Machine
Learning Engineer Axel Springer AI Gunar Maiwald Data Scientist idealo.de Dat Tran Head of AI Axel Springer AI

Contributors Check out our repo and contribute: https://github.com/idealo/imageatm

Url: www.dat-tran.com Twitter: @datitran Questions?

Image ATM - Image Classification for Everyone

Image ATM - Image Classification for Everyone

More Decks by Dat Tran

Other Decks in Technology

Featured

Transcript