OpenTalks.AI - Alexey Burnakov, Topic Modelling of News Based on Citation Detection

OpenTalks.AI
February 21, 2020


Transcript

  1. 5.

    News Specific Citation Detection: Personal data, Headline, Editorial, Cite number, Cite index, Citing media, Date, News rating, Editor rating, Board rating, Organization rating
  2. 8.

    Citation Detection: results. Logistic regression output: precision = 0.89, recall = 0.87, F1 score = 0.88, MCC score = 0.88, AUC = 0.998. We did well on the training dataset
  3. 10.

    NLP Pipeline: Raw text, Tokenization, Who cites TASS, Which news was cited, Topic modelling, Customer facing
  4. 13.

    Topic Modelling III: Flood in the Irkutsk Region :( `Losharik` submarine deadly accident :( Motivation: Are there big topics today?
  5. 16.

    Daily Competition Snapshot: Who is the hero of the day? Personal data Personal data Personal data Personal data Personal data
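
The metrics listed on slide 8 can be related to each other with a short sketch. The helper names `f1_score` and `mcc` below are hypothetical and are not taken from the talk; the confusion-matrix counts are not given on the slide, so only F1 can be rechecked from the reported precision and recall, while MCC is shown as a general formula.

```python
import math


def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def mcc(tp: int, tn: int, fp: int, fn: int) -> float:
    """Matthews correlation coefficient from confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Common convention: report 0 when any marginal count is zero.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Recheck the F1 value from the precision and recall stated on the slide.
print(round(f1_score(0.89, 0.87), 2))  # prints 0.88
```

The recomputed F1 agrees with the value on the slide. Recomputing MCC would require the underlying true/false positive and negative counts, which the transcript does not include.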