Slide 3
Overview
• I introduce my research project to identify and analyze
scholarly bibliographic references on Wikipedia
– based on the following two papers
www.nature.com/scientificdata
Dataset of first appearances of the
scholarly bibliographic references
on Wikipedia articles
Jiro Kikkawa ✉, Masao Takaku & Fuyuki Yoshikane
Referencing scholarly documents as information sources on Wikipedia is important because it supports
or improves the quality of Wikipedia content. Several studies have been conducted regarding scholarly
references on Wikipedia; however, little is known of the editors and their edits contributing to add
the scholarly references on Wikipedia. In this study, we develop a methodology to detect the oldest
scholarly reference added to Wikipedia articles by which a certain paper is uniquely identifiable as the
“first appearance of the scholarly reference.” We identified the first appearances of 923,894 scholarly
references (611,119 unique DOIs) in 180,795 unique pages on English Wikipedia as of March 1, 2017
and stored them in the dataset. Moreover, we assessed the precision of the dataset, which was highly
precise regardless of the research field. Finally, we demonstrate the potential of our dataset. This
dataset is unique and attracts those who are interested in how the scholarly references on Wikipedia
grew and which editors added them.
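The idea of a "first appearance" can be illustrated with a minimal sketch. It assumes a page's revision history is already available as a list of records, and it matches references by DOI string only. The `Revision` type, the simplified DOI pattern, and the sample data are hypothetical and are not taken from the paper, whose methodology may differ.

```python
import re
from typing import NamedTuple


class Revision(NamedTuple):
    """Hypothetical record for one revision of a Wikipedia page."""
    timestamp: str  # ISO 8601 in UTC, so string order is chronological
    editor: str
    wikitext: str


# Simplified DOI pattern (an assumption); real DOIs may contain
# characters that this pattern excludes.
DOI_PATTERN = re.compile(r'10\.\d{4,9}/[^\s|}\]<>"]+')


def first_appearances(revisions):
    """Map each DOI (lowercased) to the oldest revision containing it."""
    first = {}
    for rev in sorted(revisions, key=lambda r: r.timestamp):
        for doi in DOI_PATTERN.findall(rev.wikitext):
            # setdefault keeps the earliest revision seen for this DOI
            first.setdefault(doi.lower(), rev)
    return first


# Invented sample history, deliberately out of chronological order
revisions = [
    Revision("2010-05-02T10:00:00Z", "EditorB",
             "{{cite journal |doi=10.1000/xyz123 |title=T}}"),
    Revision("2008-01-15T09:30:00Z", "EditorA",
             "Text without references."),
    Revision("2012-07-20T12:00:00Z", "EditorC",
             "{{cite journal |doi=10.1000/XYZ123}} {{cite journal |doi=10.1000/abc}}"),
]

result = first_appearances(revisions)
for doi, rev in result.items():
    print(doi, rev.timestamp, rev.editor)
```

Sorting before scanning means the input order does not matter, and lowercasing treats DOIs that differ only in letter case as the same reference.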
Background & Summary
Along with the digitization of scholarly communication, numerous scholarly documents have been referenced
and used on the Web. One of the changes arising from the development and dissemination of scholarly
information infrastructures on the Web is the utilization of scholarly documents by various people and communities,
including readers other than traditional ones such as researchers and specialists. As such an example, there are
many references and accesses to scholarly documents via Wikipedia. In particular, according to Crossref, which
assigns Digital Object Identifiers (DOIs) to scholarly documents massively, Wikipedia is one of the largest
referrers of Crossref DOIs as of 2015 [1].
Wikipedia is a free online encyclopedia that anyone can edit, and it has been one of the most visited websites
in the world. However, owing to its collaborative nature, much criticism and discussion have emerged since its
start with regard to the accuracy and reliability of its contents. Three core content policies exist in Wikipedia:
“verifiability,” “neutral point of view,” and “no original research.” Referencing scholarly documents as
information sources on Wikipedia complements these policies, as these cited sources support or improve the quality of
Wikipedia content.
Several studies have been conducted regarding scholarly bibliographic references on Wikipedia; however,
most of them have focused on the scholarly document itself [2–6]. The methodologies in previous studies used
DATA DESCRIPTOR
OPEN
Kikkawa, Jiro; Takaku, Masao; Yoshikane, Fuyuki: "Dataset of first
appearances of the scholarly bibliographic references on Wikipedia
articles", Scientific Data, Vol. 9, Article No. 85, pp. 1-11, 2022.
https://doi.org/10.1038/s41597-022-01190-z
Time Lag Analysis of Adding Scholarly
References to English Wikipedia
How Rapidly Are They Added to and How Fresh
Are They?
Jiro Kikkawa (✉), Masao Takaku, and Fuyuki Yoshikane
University of Tsukuba, Tsukuba, Ibaraki, Japan
{jiro,masao,fuyuki}@slis.tsukuba.ac.jp
Abstract. Referencing scholarly documents as information sources on
Wikipedia is important because they complement and improve the
quality of Wikipedia content. However, little is known about them, such as
how rapidly they are added and how fresh they are. To answer these
questions, we conduct a time-series analysis of adding scholarly references to
the English Wikipedia as of October 2021. Consequently, we detect no
tendencies in Wikipedia articles created recently to refer to more fresh
references because the time lag between publishing the scholarly articles
and adding references of the corresponding paper to Wikipedia articles
has remained generally constant over the years. In contrast, tendencies
to decrease over time in the time lag between creating Wikipedia articles
and adding the first scholarly references are observed. The percentage of
cases where scholarly references were added simultaneously as Wikipedia
articles are created is found to have increased over the years,
particularly since 2007–2008. This trend can be seen as a response to the policy
changes of the Wikipedia community at that time that was adopted by
various editors, rather than depending on massive activities by a small
number of editors.
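The two time lags described in the abstract can be sketched as simple date differences. This is only an illustration under stated assumptions: the record fields, the sample dates, and the choice of days as the unit are invented here and do not reproduce the paper's data or procedure.

```python
from datetime import date


def lag_days(earlier, later):
    """Number of days from `earlier` to `later`."""
    return (later - earlier).days


# Invented sample records: one per first scholarly reference on a page
records = [
    {"published": date(2005, 3, 1),
     "page_created": date(2006, 1, 10),
     "ref_added": date(2008, 1, 10)},
    {"published": date(2010, 6, 15),
     "page_created": date(2012, 4, 2),
     "ref_added": date(2012, 4, 2)},
]

# Lag between publishing the paper and adding its reference ("freshness")
publication_lags = [lag_days(r["published"], r["ref_added"]) for r in records]

# Lag between creating the Wikipedia article and adding the reference
creation_lags = [lag_days(r["page_created"], r["ref_added"]) for r in records]

# Share of references added on the same day the article was created
simultaneous_share = sum(lag == 0 for lag in creation_lags) / len(creation_lags)

print(publication_lags, creation_lags, simultaneous_share)
```

Grouping such lags by the year in which the reference was added would give a year-by-year view of the kind the abstract describes.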
Kikkawa, Jiro; Takaku, Masao; Yoshikane, Fuyuki: "Time Lag
Analysis of Adding Scholarly References to English Wikipedia:
How Rapidly Are They Added to and How Fresh Are They?",
Proceedings of the 18th International Conference, iConference
2023, Lecture Notes in Computer Science (LNCS), Vol. 13972,
pp. 425-438, 2023. https://doi.org/10.1007/978-3-031-28032-0_33