Natural Language Processing

A presentation to accompany the demo of my unofficially named Twitter sentiment analysis project, Morgan. It's like Klout for events! It discusses the use of the Naïve Bayes classifier. To see more, visit me at github.com/meganspeir.

Megan Speir

August 10, 2013

Transcript

  1. I’M MEGAN SPEIR ([email protected])
  2. PROBLEM? What did people really think about my event? Discover the truth with Morgan’s Twitter Sentiment Analysis.
  3. NAÏVE BAYES CLASSIFIER: A Naïve Bayes classifier assumes that the probabilities being combined are independent of each other. That is not strictly true: python (the snake) and Python (the language) mean different things depending on the context.
  4. Its naïveté keeps the math simple: calculate the probability of a tweet being in a specific category by multiplying together the probabilities of the individual words in the tweet for that category. [Chart: Love, Hate, Excited, Bored on a 0 to 80 scale; series: Negative, Positive]
  5. THANK YOU ([email protected]) Copyright © 2013 Megan Speir. All rights reserved.
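
The word-probability multiplication described in slides 3 and 4 can be sketched roughly as follows. This is a minimal illustration and not Morgan's actual code: the `NaiveBayes` class, its method names, and the four training tweets are all hypothetical. It also makes two choices the slides do not mention: it sums log probabilities instead of multiplying raw ones (to avoid underflow on long tweets), and it uses add-one smoothing so an unseen word does not zero out a whole category.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Tiny word-count Naive Bayes classifier (illustrative sketch only)."""

    def __init__(self):
        self.word_counts = defaultdict(Counter)  # category -> word -> count
        self.doc_counts = Counter()              # category -> tweets seen
        self.vocab = set()                       # every word seen in training

    def train(self, text, category):
        words = text.lower().split()
        self.word_counts[category].update(words)
        self.doc_counts[category] += 1
        self.vocab.update(words)

    def log_score(self, text, category):
        # log P(category) + sum of log P(word | category).
        # Adding logs is equivalent to multiplying the probabilities.
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[category] / total_docs)
        total_words = sum(self.word_counts[category].values())
        for word in text.lower().split():
            count = self.word_counts[category][word]
            # Add-one smoothing (an assumption, not from the slides).
            score += math.log((count + 1) / (total_words + len(self.vocab)))
        return score

    def classify(self, text):
        # Pick the trained category with the highest score.
        return max(self.doc_counts, key=lambda c: self.log_score(text, c))


# Hypothetical training tweets, invented for this example.
clf = NaiveBayes()
for text, label in [
    ("love this event", "positive"),
    ("so excited for the talks", "positive"),
    ("hate the long lines", "negative"),
    ("bored during the keynote", "negative"),
]:
    clf.train(text, label)

print(clf.classify("excited and love it"))  # positive
```

Each word contributes to the score independently of its neighbors, which is exactly the "naïve" assumption: the classifier has no way to tell python the snake from Python the language.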