Computational Intelligence: images <-> sentences

images sentences

Questions & Observations ! • Hierarchical ensemble methods • Shared
intermediate representation • Real-time performance? • Neurological realism?

Every Picture Tells a Story: Generating Sentences from Images Ali
Farhadi, Mohsen Hejrati , Mohammad Amin Sadeghi, Peter Young, Cyrus Rashtchian, Julia Hockenmaier, David Forsyth

images sentences

Felzenszwalb Detector A Discriminatively Trained, Multiscale, Deformable Part Model Pedro
F. Felzenszwalb, David McAllester and Deva Ramanan

Linear SVM ! ! Felzenszwalb detector Hoiem 3D scene model
GIST scene features (+Adaboost) Node features + scores

Edge Potentials • given a test image • k-nn training
examples, average node features • from the image side: node features for similar images • from the sentence side: sentence representation for similar images • Multi-label Markov Random Field

images sentences

Curran & Clark Tools • Maximum Entropy Tagger • POS
Tagger • Combinatory Categorial Grammar (CCG) • Chunker • Named Entity Recognizer

C&C Parser Dependency Parse Subject/Direct Object Head nouns from prepositional
phrases (“X in the background”) Scene information

Node Potentials: Lin Similarity • Wordnet! • Hypernyms (is-a) •
Hyponyms (instance-of) • Compare synsets

Edge Potentials • given a test image • k-nn training
examples, average node features • from the image side: node features for similar images • from the sentence side: sentence representation for similar images • Multi-label Markov Random Field

Structure Learning Finding weights on linear combinations on nodes and
edges so that the ground truth triplet scores highest

N. Siddharth, Andrei Barbu, Jeffrey Mark Siskind ! Seeing What
You’re Told: Sentence-Guided Activity Recognition In Video

Object Detection Track Event Recognizer Sentences

Object Detection Track Event Recognizer Sentences Felzenszwalb + false positives

Object Detection Track Event Recognizer Sentences Felzenszwalb + false positives
Felz. conﬁdence + Optical Flow

Object Detection Track Event Recognizer Sentences Dynamic Programming Maximize detection
conﬁdence and optical ﬂow continuity

Object Detection Track Event Recognizer Sentences Per-Object/Per-frame • Position •
Velocity • Acceleration • Aspect Ratio Agent+Instrument • Distance • Orientation A time series of feature vectors Train with Hidden Markov-Model (per-word in lexicon)

Object Detection Track Event Recognizer Sentences Recognize with HMM Maximize
linear combination of observations and state transitions

Object Detection Track Event Recognizer Sentences Sentence Tracker Determine whether
a set of tracks matches a sentence by maximizing the probability of the cross-product lattice

Natural Language Semantics

Questions & Observations ! • Hierarchical ensemble methods • Shared
intermediate representation • Real-time performance? • Neurological realism?

Computational Intelligence: images <-> sentences

Computational Intelligence: images <-> sentences

gregab

More Decks by gregab

Other Decks in Technology

Featured

Transcript

images sentences

Questions & Observations ! • Hierarchical ensemble methods • Shared

Every Picture Tells a Story: Generating Sentences from Images Ali

images sentences

Felzenszwalb Detector A Discriminatively Trained, Multiscale, Deformable Part Model Pedro

Linear SVM ! ! Felzenszwalb detector Hoiem 3D scene model

Edge Potentials • given a test image • k-nn training

images sentences

Curran & Clark Tools • Maximum Entropy Tagger • POS

C&C Parser Dependency Parse Subject/Direct Object Head nouns from prepositional

Node Potentials: Lin Similarity • Wordnet! • Hypernyms (is-a) •

Edge Potentials • given a test image • k-nn training

Structure Learning Finding weights on linear combinations on nodes and

N. Siddharth, Andrei Barbu, Jeffrey Mark Siskind ! Seeing What

Object Detection Track Event Recognizer Sentences

Object Detection Track Event Recognizer Sentences Felzenszwalb + false positives

Object Detection Track Event Recognizer Sentences Felzenszwalb + false positives

Object Detection Track Event Recognizer Sentences Dynamic Programming Maximize detection

Object Detection Track Event Recognizer Sentences Per-Object/Per-frame • Position •

Object Detection Track Event Recognizer Sentences Recognize with HMM Maximize

Object Detection Track Event Recognizer Sentences Sentence Tracker Determine whether

Natural Language Semantics

Natural Language Semantics

Questions & Observations ! • Hierarchical ensemble methods • Shared