Answering questions using knowledge graphs and sequence translation

David Mack
December 07, 2018


We’re going to create a system that takes an English-language question, converts it into Cypher using a neural network, then runs that query against a Neo4j graph database to produce an answer.

Article: https://medium.com/octavian-ai/answering-english-questions-using-knowledge-graphs-and-sequence-translation-2acbaa35a21d
Slide source: https://docs.google.com/presentation/d/15PoBPIBGpoV9MpNTrdmBG65SgR0k7Y93gc0_85c2PSs/edit?usp=sharing


Transcript

  1. Answering questions using knowledge graphs and sequence translation. December 6th 2018, Metis, San Francisco. David Mack, david@octavian.ai, https://octavian.ai, @octavian_ai
  2. David Mack https://octavian.ai david@octavian.ai

  3. Vision: from document search...

  4. Nope! Vision: … to knowledge graph reasoning

  5. The system we’ll create:

  6. Article: https://medium.com/octavian-ai/answering-english-questions-using-knowledge-graphs-and-sequence-translation-2acbaa35a21d
     Code: https://github.com/Octavian-ai/english2cypher
     Find all of this on https://octavian.ai

  7. Knowledge graphs

  8. Knowledge graphs

  9. The dataset we’ll use: CLEVR-Graph. Questions and answers on randomly generated transit networks.
  10. CLEVR-Graph data format: a randomly generated synthetic dataset with 18 different types of questions. Each example pairs a graph with a question, a Cypher query, and an answer (670,000 more examples...):

      Question: How many stations are between Diz Bridge and Wisk Lane?
      Cypher:
        MATCH (var1) MATCH (var2)
        MATCH tmp1 = shortestPath((var1)-[*]-(var2))
        WHERE var1.name="Diz Bridge" AND var2.name="Wisk Lane"
        WITH 1 AS foo, tmp1 AS var3
        UNWIND nodes(var3) AS var4
        RETURN length(collect(var4)) - 2
      Answer: 6
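  To complete the pipeline from the intro, the generated Cypher is run against a Neo4j instance holding the graph. A minimal sketch using the official Neo4j Python driver and the query above (the URI and credentials are placeholder assumptions, not from the talk):

      from neo4j import GraphDatabase

      # Placeholder connection details for a local Neo4j instance (assumption)
      driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

      query = """
      MATCH (var1) MATCH (var2)
      MATCH tmp1 = shortestPath((var1)-[*]-(var2))
      WHERE var1.name="Diz Bridge" AND var2.name="Wisk Lane"
      WITH 1 AS foo, tmp1 AS var3
      UNWIND nodes(var3) AS var4
      RETURN length(collect(var4)) - 2
      """

      with driver.session() as session:
          # Run the generated query and read back the single answer value
          answer = session.run(query).single()[0]
      print(answer)  # 6, per the slide, if the matching CLEVR-Graph is loaded
      driver.close()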
  11. Question answering system

  12. Overview: the English-to-Cypher translation is the complex bit!

  13. How the translation works. The whole model is 360 lines of TensorFlow.
  14. Text must be turned into vectors...

  15. Text must be turned into vectors... (the mapping is learnt during training; the fixed dictionary of 128 tokens is precomputed)
  16. A combined word-character tokenization scheme significantly helped translation accuracy. Example: “Where is Dalkey Hospital?” → [WHERE, IS, D, A, L, K, E, Y, HOSPITAL] (“Dalkey” is a rare word, so it falls back to character tokens). A sketch of the scheme follows below.
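  A minimal sketch of that combined scheme: words in the fixed dictionary become single word tokens, and anything out-of-vocabulary falls back to character tokens (the tiny dictionary here is illustrative, not the model’s actual 128-token vocabulary):

      import re

      VOCAB = {"WHERE", "IS", "HOSPITAL", "HOW", "MANY", "STATIONS"}  # illustrative only

      def tokenize(sentence):
          tokens = []
          for word in re.findall(r"[A-Za-z]+", sentence):
              w = word.upper()
              if w in VOCAB:
                  tokens.append(w)   # common word: one word token
              else:
                  tokens.extend(w)   # rare word: character tokens
          return tokens

      print(tokenize("Where is Dalkey Hospital?"))
      # ['WHERE', 'IS', 'D', 'A', 'L', 'K', 'E', 'Y', 'HOSPITAL']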
  17. Encode a sequence, then decode it: English → Encoder → Decoder → Cypher.

  18. Background: a recurrent neural network is a way to process sequences in a neural network (input → neural network with hidden state → output). Learn more about RNNs: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
  19. Simple translation: encode, then decode (each a 2-layer, 1024-unit LSTM cell in a bidirectional layout; see the sketch below). Read the TensorFlow tutorial: https://github.com/tensorflow/nmt
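  A minimal tf.keras sketch of an encoder with those hyperparameters (the 128-token vocabulary comes from the earlier tokenization slide; the embedding size and layer names are assumptions for illustration):

      import tensorflow as tf

      VOCAB_SIZE = 128   # fixed token dictionary size from slide 15
      EMBED_DIM = 256    # assumption, not stated in the talk
      UNITS = 1024       # LSTM cell size from the slide

      token_ids = tf.keras.Input(shape=(None,), dtype="int32")
      x = tf.keras.layers.Embedding(VOCAB_SIZE, EMBED_DIM)(token_ids)  # learnt vectors
      # Two stacked bidirectional layers of 1024-unit LSTM cells
      x = tf.keras.layers.Bidirectional(
          tf.keras.layers.LSTM(UNITS, return_sequences=True))(x)
      encoder_states = tf.keras.layers.Bidirectional(
          tf.keras.layers.LSTM(UNITS, return_sequences=True))(x)
      encoder = tf.keras.Model(token_ids, encoder_states)
      encoder.summary()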
  20. Attention: let the network look back at the input. [Diagram: translating “hello world”; decoder states D1, D2 attend over the encoder states, contrasting the current output with the output once attention is added.]
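  The idea in a minimal NumPy sketch: score each encoder output against the current decoder state, softmax the scores into weights, and return a weighted sum as context (plain dot-product scoring here is illustrative; attentional seq2seq models often use learned scoring functions):

      import numpy as np

      def softmax(x):
          e = np.exp(x - x.max())
          return e / e.sum()

      def attend(decoder_state, encoder_outputs):
          scores = encoder_outputs @ decoder_state   # one score per input position
          weights = softmax(scores)                  # attention distribution
          context = weights @ encoder_outputs        # weighted sum: "look back"
          return context, weights

      encoder_outputs = np.random.randn(9, 64)  # e.g. 9 input tokens, 64-dim states
      decoder_state = np.random.randn(64)
      context, weights = attend(decoder_state, encoder_outputs)
      print(weights.round(2))  # which input positions the decoder attends to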
  21. Attentional Seq2Seq (image from Google). You can get the above technology as an easy-to-use library from Google: https://google.github.io/seq2seq/
  22. Training in progress: Cypher output

  23. Performance
      • Achieves >99.7% translation accuracy
      • Trained on 670,000 question-Cypher pairs
      • Works on 18 different types of questions
      • Trained for 300 steps, 2 hours on a Tesla K80
  24. Limitations. These limitations come primarily from the limited nature of the training data:
      • Does not generalize well to other questions
      • Does not generalize to English variations
  25. Strategies for improving
      • Start with a pre-trained sentence model (e.g. BERT) and transfer-learn on top of it
      • Train on a broader range of Cypher queries
  26. What’s the theoretical limit of this system?
      • It can perform any knowledge transformation that Neo4j is capable of
      • The limit is the sophistication of the recurrent cell
      • This system should be as powerful at reasoning as any other RNN approach
      • Language models are still relatively shallow and struggle with long text sequences (e.g. paragraph understanding)
  27. and one more thing...

  28. Please check it out and share! https://medium.com/octavian-ai/how-to-get-started-with-machine-learning-on-graphs-7f0795c83763

  29. Q&A David Mack david@octavian.ai https://octavian.ai @octavian_ai

  30. Appendix https://octavian.ai @octavian_ai David Mack david@octavian.ai

  31. Read more about everything mentioned:
      • Online Safari book on RNNs: https://www.safaribooksonline.com/library/view/neural-networks-and/9781492037354/ch04.html
      • Introduction to RNN/LSTM: http://colah.github.io/posts/2015-08-Understanding-LSTMs/
      • Attention and Augmented Recurrent Neural Networks: https://distill.pub/2016/augmented-rnns/
      • TensorFlow Neural Machine Translation Tutorial: https://github.com/tensorflow/nmt
      • Differentiable neural computers (DeepMind Nature publication): https://deepmind.com/blog/differentiable-neural-computers/
      • MACnets: https://arxiv.org/abs/1803.03067
      • IQA: Visual Question Answering in Interactive Environments (reinforcement-learning reasoning): https://arxiv.org/abs/1712.03316
      • English to Cypher translation: https://medium.com/octavian-ai/answering-english-questions-using-knowledge-graphs-and-sequence-translation-2acbaa35a21d
      • Applying MACnet to knowledge graphs (work in progress): https://github.com/Octavian-ai/mac-graph
      • Octavian’s research: https://www.octavian.ai/articles and https://twitter.com/Octavian_ai