Long Short-Term Memory (LSTM) is a Recurrent Neural Network (RNN) architecture that processes a sequence and can retain information over long intervals. LSTMs have achieved state-of-the-art performance on many sequence classification problems. In this talk, I’ll cover how to write an LSTM using TensorFlow’s Python API for natural language understanding. This will be a code-heavy talk in which I implement the LSTM model and explain the math behind it step by step.
In short, it will cover:
– Understanding the math behind the LSTM architecture in the context of sequence classification
– Writing an LSTM model in TensorFlow for sentiment classification of variable-length English sentences
– Gotchas of using LSTMs on real data
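The talk itself uses TensorFlow, but the gate math it walks through can be sketched framework-free. Below is a minimal NumPy sketch of a single LSTM time step (the `lstm_step` helper and the stacked gate ordering are my own illustrative choices, not code from the talk):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, W, U, b):
    """One LSTM time step.

    W: (4H, D) input weights, U: (4H, H) recurrent weights, b: (4H,) bias.
    The four gates are stacked in the order: input, forget, output, candidate
    (an arbitrary convention chosen for this sketch).
    """
    H = h_prev.shape[0]
    z = W @ x + U @ h_prev + b      # pre-activations for all four gates at once
    i = sigmoid(z[0:H])             # input gate: how much new info to write
    f = sigmoid(z[H:2*H])           # forget gate: how much old cell state to keep
    o = sigmoid(z[2*H:3*H])         # output gate: how much cell state to expose
    g = np.tanh(z[3*H:4*H])         # candidate cell state
    c = f * c_prev + i * g          # new cell state
    h = o * np.tanh(c)              # new hidden state
    return h, c

# Run a short random sequence through the cell.
rng = np.random.default_rng(0)
D, H, T = 5, 3, 4                   # input dim, hidden dim, sequence length
W = rng.normal(scale=0.1, size=(4 * H, D))
U = rng.normal(scale=0.1, size=(4 * H, H))
b = np.zeros(4 * H)
h, c = np.zeros(H), np.zeros(H)
for t in range(T):
    h, c = lstm_step(rng.normal(size=D), h, c, W, U, b)
```

After the loop, `h` is the final hidden state, which a sequence classifier would typically feed into a dense softmax layer; because the output gate and `tanh` both squash their inputs, every entry of `h` lies strictly inside (-1, 1).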