Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Invenio Interest Group - INSPIRE HEP
Search
Javier Martin Montull
June 13, 2014
Research
0
93
Invenio Interest Group - INSPIRE HEP
Invenio Interest Group presentation about INSPIRE HEP contributions to the Invenio software
Javier Martin Montull
June 13, 2014
Tweet
Share
More Decks by Javier Martin Montull
See All by Javier Martin Montull
Creating a complex Angular 2 application - 3rd Developer's conference, CERN
jmartinm
0
39
Editing content in Invenio
jmartinm
0
56
BlogForever workshop - Invenio metadata curation
jmartinm
2
63
Other Decks in Research
See All in Research
【NICOGRAPH2025】Photographic Conviviality: ボディペイント・ワークショップによる 同時的かつ共生的な写真体験
toremolo72
0
110
LLM-Assisted Semantic Guidance for Sparsely Annotated Remote Sensing Object Detection
satai
3
300
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
120
"主観で終わらせない"定性データ活用 ― プロダクトディスカバリーを加速させるインサイトマネジメント / Utilizing qualitative data that "doesn't end with subjectivity" - Insight management that accelerates product discovery
kaminashi
15
18k
学習型データ構造:機械学習を内包する新しいデータ構造の設計と解析
matsui_528
5
2.5k
「リアル×スキマ時間」を活用したUXリサーチ 〜新規事業を前に進めるためのUXリサーチプロセスの設計〜
techtekt
PRO
0
230
ACL読み会2025: Can Language Models Reason about Individualistic Human Values and Preferences?
yukizenimoto
0
110
一般道の交通量減少と速度低下についての全国分析と熊本市におけるケーススタディ(20251122 土木計画学研究発表会)
trafficbrain
0
110
Pythonでジオを使い倒そう! 〜それとFOSS4G Hiroshima 2026のご紹介を少し〜
wata909
0
1.2k
EarthDial: Turning Multi-sensory Earth Observations to Interactive Dialogues
satai
3
560
ロボット学習における大規模検索技術の展開と応用
denkiwakame
1
180
ドメイン知識がない領域での自然言語処理の始め方
hargon24
1
230
Featured
See All Featured
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.7k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
22k
Heart Work Chapter 1 - Part 1
lfama
PRO
4
35k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
47
7.9k
Taking LLMs out of the black box: A practical guide to human-in-the-loop distillation
inesmontani
PRO
3
2k
The Anti-SEO Checklist Checklist. Pubcon Cyber Week
ryanjones
0
37
YesSQL, Process and Tooling at Scale
rocio
174
15k
Into the Great Unknown - MozCon
thekraken
40
2.2k
Beyond borders and beyond the search box: How to win the global "messy middle" with AI-driven SEO
davidcarrasco
0
34
The Illustrated Children's Guide to Kubernetes
chrisshort
51
51k
The SEO identity crisis: Don't let AI make you average
varn
0
47
Redefining SEO in the New Era of Traffic Generation
szymonslowik
1
180
Transcript
INSPIRE: contributions to Invenio from the leading High Energy Physics
platform Open Repositories 2014, Helsinki Javier Martin Montull on behalf of the INSPIRE collaboration
HEP (High Energy Physics) ?
Who are our users?
~50.000 users (HEP scientists) ~2 searches/sec
+ 2500 papers/year from experimentalists
+ 20000 papers/year from theorists
A bit of history about INSPIRE
INSPIRE’s timeline 1969 SPIRES is born 1991 SPIRES becomes first
website in the US 2007 2010 2011 CERN, SLAC, Fermilab and DESY form the INSPIRE collaboration Launch of INSPIRE beta site Launch of INSPIRE production site
Why did we move from SPIRES to Invenio ?
None
What is INSPIRE’s focus ?
Ingest content Curate/enrich metadata (manually and automatically) Present the information
to the user
Ingest content Curate/enrich metadata (manually and automatically) Present the information
to the user
Ingestion from arXiv OAI-PMH + attached files (images) Oaiharvest module
Ingestion from publishers Ad-hoc scripts that can be found in:
https://github.com/inspirehep/harvesting-kit
Ingest content Curate/enrich metadata (manually and automatically) Present the information
to the user
Curation • Curated metadata for over 40 years • Curation
job is distributed in 4 labs • Using Invenio curation web tools
Some examples Record Editor https://inspirehep.net/record/edit/?ln=en#state=edit&recid=1300121 Multiple Record Editor https://inspirehep.net/record/multiedit
Ingest content Curate/enrich metadata (manually and automatically) Present the information
to the user
With a focus on author information
http://inspireheptest.cern.ch/author/profile/S.Mele.1
bibauthorid: algorithm able to disambiguate author names webauthorprofile: presents the
information to the users
Reaching the limits of the current framework and don’t want
to go back to...
None
So it is time to move to Invenio 2.0 https://github.com/inveniosoftware/invenio/tree/next
In order to have Better user input Better curation/back office
tools Better UX/UI
Better user input
Better user input We have started using the new deposit
module https://inspirelabstest.cern.ch/deposit/literature/ 10.1016/j.physletb.2012.11.073 And plan on using it for future corrections...
Better curation/back office tools
Better curation/back office tools New workflows module • Allows interaction
with workflow objects for curators (i.e. accept/reject records) • Flexible and customisable UI
Better UX/UI
New tools Bower Grunt
Will allow us to move away from the 90s
INSPIRE + Invenio 2.0 = INSPIRE Labs (coming up in
the next months) http://inspirelabstest.cern.ch
None