
OpenTalks.AI - Alexey Burnakov, Topic Modeling of News Based on Citation Detection

OpenTalks.AI
February 21, 2020



Transcript

  1. News-Specific Citation Detection: Personal data, Headline, Editorial, Cite number, Cite index, Citing media, Date, News rating, Editor rating, Board rating, Organization rating
  2. Citation Detection: results (logistic regression output). precision = 0.89, recall = 0.87, F1 score = 0.88, MCC score = 0.88, AUC = 0.998. We did well on the training dataset.
  3. NLP Pipeline: Raw text; Tokenization; Who cites TASS; Which news was cited; Topic modelling; Customer facing
  4. Topic Modelling III. Flood in the Irkutsk Region :( Deadly `Losharik` submarine accident :( Motivation: Are there big topics today?
  5. Daily Competition Snapshot: Who is the hero of the day? [Personal data]
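
The F1 score reported on slide 2 can be checked against the reported precision and recall, since F1 is their harmonic mean. Below is a minimal sketch in Python. The function name `f1_from_precision_recall` is hypothetical and does not come from the talk, and only the two reported values are used as input. The MCC and AUC figures cannot be rechecked this way, because they depend on the full confusion matrix and the ranked classifier scores, which the slides do not give.

```python
def f1_from_precision_recall(precision: float, recall: float) -> float:
    """Return the F1 score, the harmonic mean of precision and recall."""
    if precision + recall == 0:
        # Convention: F1 is defined as 0 when both inputs are 0.
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values as reported on slide 2 of the deck.
precision, recall = 0.89, 0.87
f1 = f1_from_precision_recall(precision, recall)
print(round(f1, 2))  # prints 0.88, matching the slide
```

The rounded result agrees with the slide, so the three reported figures are mutually consistent at two decimal places.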