Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Understand your Brand

Avatar for ejtitus ejtitus
October 12, 2015

Understand your Brand

Insight IDX presentation

Avatar for ejtitus

ejtitus

October 12, 2015
Tweet

Other Decks in Programming

Transcript

  1. Topics? Automate ⚬ no sampling! ⚬ fast! ⚬ ~ 2

    million posts ⚬ in “Russian” ⚬ product: “Budweiser”
  2. Workflow NLTK custom filtering LDA Optimize topics ⚬ Keywords –

    What is this topic? ⚬ Probable topics in each post – What topic does a post belong to? extract usable text Identify, count, remove retweets
  3. ⚬ “Russia” ⚬ “Moscow” ⚬ Fish 1. National Identity ⚬

    Required ⚬ ❤ ⚬ Drinking 2. Brand Loyalty Translate(keywords)→ Topics
  4. ⚬ “Russia” ⚬ “Moscow” ⚬ Fish 1. National Identity ⚬

    Required ⚬ ❤ ⚬ Drinking 2. Brand Loyalty 3. Debate on national beverage Translate(keywords)→ Topics ⚬ “Tea” ⚬ Who knows ⚬ Who says
  5. ⚬ “Russia” ⚬ “Moscow” ⚬ Fish 1. National Identity ⚬

    Required ⚬ ❤ ⚬ Drinking 2. Brand Loyalty 3. Debate on national beverage ⚬ Topic labels are automatically applied ⚬ Topics extracted from a few keywords ⚬ Can easily be used with any language Translate(keywords)→ Topics ⚬ “Tea” ⚬ Who knows ⚬ Who says
  6. Eric Titus PhD in Chemistry University of Texas at Austin

    Postdoc at Temple University Super-resolution imaging of single molecules and nanoparticles
  7. Minimum shows the optimum number of topics to use in

    LDA to characterize this data set Unique posts (not retweets) are relatively evenly distributed across topics (top). When looking at all data, including retweets, we see that certain categories are retweeted more than others (bottom). topics are given as numbers to protect brand identity
  8. All twitter activity for the brand increases on Saturdays, for

    both unique content and retweeted content We see unique postings occur primarily in the evening