Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Measuring Quality Content
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Adam Hyland
August 04, 2012
Research
2
80
Measuring Quality Content
Presentation to Wikimania 2012 on Article Feedback Tool statistics.
Adam Hyland
August 04, 2012
Tweet
Share
More Decks by Adam Hyland
See All by Adam Hyland
Here Comes (a significant fraction of) Everybody
protonk
0
78
Boston Data Swap: Data Vis Under Uncertainty
protonk
0
57
Why Nate Silver is Famous
protonk
1
130
Data Visualization under Uncertainty
protonk
0
770
Phillips Academy Wikipedia Introduction
protonk
0
92
Other Decks in Research
See All in Research
POI: Proof of Identity
katsyoshi
0
130
それ、チームの改善になってますか?ー「チームとは?」から始めた組織の実験ー
hirakawa51
0
640
競合や要望に流されない─B2B SaaSでミニマム要件を決めるリアルな取り組み / Don't be swayed by competitors or requests - A real effort to determine minimum requirements for B2B SaaS
kaminashi
0
700
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
210
Community Driveプロジェクト(CDPJ)の中間報告
smartfukushilab1
0
160
Grounding Text Complexity Control in Defined Linguistic Difficulty [Keynote@*SEM2025]
yukiar
0
100
生成的情報検索時代におけるAI利用と認知バイアス
trycycle
PRO
0
280
社内データ分析AIエージェントを できるだけ使いやすくする工夫
fufufukakaka
1
890
視覚から身体性を持つAIへ: 巧緻な動作の3次元理解
tkhkaeio
0
190
ドメイン知識がない領域での自然言語処理の始め方
hargon24
1
240
Proposal of an Information Delivery Method for Electronic Paper Signage Using Human Mobility as the Communication Medium / ICCE-Asia 2025
yumulab
0
160
Remote sensing × Multi-modal meta survey
satai
4
710
Featured
See All Featured
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
34
2.6k
Public Speaking Without Barfing On Your Shoes - THAT 2023
reverentgeek
1
300
Music & Morning Musume
bryan
47
7.1k
How Fast Is Fast Enough? [PerfNow 2025]
tammyeverts
3
450
Lightning Talk: Beautiful Slides for Beginners
inesmontani
PRO
1
440
Facilitating Awesome Meetings
lara
57
6.8k
Being A Developer After 40
akosma
91
590k
Paper Plane
katiecoart
PRO
0
46k
Optimising Largest Contentful Paint
csswizardry
37
3.6k
Are puppies a ranking factor?
jonoalderson
1
2.7k
Measuring & Analyzing Core Web Vitals
bluesmoon
9
750
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.4k
Transcript
Measuring Article Quality Peer Review and the Article Feedback Tool
Adam Hyland protonk @ en-wp
Look Familiar?
Maybe This Version?
None
Article Feedback Tool • Deployed in 2010 • Version 4
(the current version) ramped up in 2011 • Designed to offer an avenue for reader feedback • High volume of reader feedback
• 6 months of public data • 795,353 articles --
2,487,522 responses
Featured Articles (FA) • 3,599 articles (0.09% of all articles)
• 2,267 Featured Lists (FL) • Most rigorous peer review process on the English Wikipedia • Very sensitive to editor preferences • Some idiosyncrasies
Good Articles (GA) • 15,357 articles • Relatively rigorous peer
review (yes I know reasonable minds may disagree) • Less idiosyncratic than FA in some ways • Perhaps less dependent on editor preference
Data • Article name • Length (in bytes) • GA/FA
status (including former/not- promoted) • Some user data
None
Beyond Summaries • Reader ratings follow pageviews • Predominantly non-editors
• Popular articles: • Call of Duty • Justin Bieber • Jimmy Wales (avg. rating: 1.10585)
Power Laws Everywhere!
Classical(ish) Models • Logistic regression model supports a relationship between
rating and likelihood of FA/GA • Linear model does, but with a twist • Can’t escape Cambridge Endogeneity Police!
None
Data Mining • Predicting featured status from reader ratings and
minimal meta-data. • Bayesian classifier able to roughly predict featured status (with a high false positive rate)
But the system’s changing! • AFT v4 is a multi-category
quantitative measure • AFT v5 is, roughly, YES/NO • Is this a problem? • Frank Harrell and the perils of dichotomization.
Actual Reader Ratings
Another Look
For the skeptics
Information • We can imagine we might not lose information
in shifting to v5 • This is born out by the classifier, to some degree. • We don’t lose a lot of power when dichotomizing individual ratings
A Look Ahead • Really exciting! • Great compliment to
current research methods • Long exposures can help discover reader/editor divergence • Predictive analytics • Need more open data
Questions? • Of course you have questions! • All work
is or soon will be available on github under a free license • Full writeup on en-wp forthcoming