Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Extracting insights with the DataSift platform

Extracting insights with the DataSift platform

Handling lots of real-time streams of information, when Twitter alone is producing 330+ million tweets a day and 40 million links to news and media, can be a daunting task, and actually turning this into valuable insights might be even tougher.
This talk will cover how the DataSift platform makes this task easy,
and will show some concrete use cases of how the social media data
can be used for customer intelligence.

@ Lexalytics User Group, Boston, Monday, June 4th, 2012

Lorenzo Alberton

May 27, 2012
Tweet

Other Decks in Technology

Transcript

  1. 2 Terabyte messages processed in real time and stored every

    day ~1 Petabyte of storage available Monday, 11 June 2012
  2. Thousands of concurrent, custom output streams all crafted with tender

    love and surgical precision Monday, 11 June 2012
  3. Lexalytics Usage Stats 7 servers 16x 2.40GHz cores/server ~400% CPU

    per instance ~20ms per interaction 4 languages Sentiment, Topics, Entities Monday, 11 June 2012
  4. Messages Volume by Language 0 2K 3K 1K Portuguese Spanish

    English (long) English (short) French Monday, 11 June 2012
  5. Average CPU Load by Language 0 7 12 5 Portuguese

    + French + Spanish Portuguese + Spanish English (short + long) Monday, 11 June 2012
  6. RIM CEOs resignation RIM stock opens and falls sentiment switches

    to positive 5 mins later the stock price reaches support level Monday, 11 June 2012
  7. Crowd reaction for the news • Significance of the event

    proportional to spike of traffic generated • Switch from negative to positive sentiment when the market decided the price had been discounted enough Monday, 11 June 2012
  8. Boris Johnson vs Ken Livingstone London Mayoral Election 2012 very

    positive sentiment spike for Boris at the end of the campaign Monday, 11 June 2012