Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
DRETa: Extracting RDF from Wikitables [POSTER]
Search
Emir Muñoz
October 23, 2013
Research
65
0
Share
DRETa: Extracting RDF from Wikitables [POSTER]
DRETa: Extracting RDF from Wikitables
Posters & Demos @ ISWC 2013
Emir Muñoz
October 23, 2013
More Decks by Emir Muñoz
See All by Emir Muñoz
Machine Learning Pipelines in Production - ML Galway Meetup
emunoz
0
79
Academic Writing: Hints and Tools
emunoz
0
160
Mining Cardinalities from Knowledge Bases
emunoz
0
260
Using Drug Similarities for Discovery of Possible Adverse Reactions
emunoz
0
160
A Hybrid Method for Rating Prediction Using Linked Data Features and Text Reviews
emunoz
0
240
On Learnability of Cardinality Constraints from RDF Data
emunoz
0
220
Minute Madness ESWC 2016
emunoz
0
130
Tensor Networks---a brief description
emunoz
0
130
A Linked Data-Based Decision Tree Classifier to Review Movies
emunoz
1
260
Other Decks in Research
See All in Research
LINEヤフー データサイエンス Meetup「三井物産コモディティ予測チャレンジ」の舞台裏-AlpacaTechパート
gamella
1
550
東京大学工学部計数工学科、計数工学特別講義の説明資料
kikuzo
0
460
Unified Audio Source Separation (Defense Slides)
kohei_1979
1
610
台湾モデルに学ぶ詐欺広告対策:市民参加の必要性
dd2030
0
340
定数整数除算・剰余算最適化再考
herumi
1
120
計算情報学研究室(数理情報学第7研究室)2026
tomohirokoana
0
500
YOLO26_ Key Architectural Enhancements and Performance Benchmarking for Real-Time Object Detection
satai
3
770
CyberAgent AI Lab研修 / Social Implementation Anti-Patterns in AI Lab
chck
7
4.6k
言語モデルから言語について語る際に押さえておきたいこと
eumesy
PRO
5
2.3k
非試合日の野球場を楽しむためのARホームランボールキャッチ体験システムの開発 / EC79-miyazaki
yumulab
0
190
2026.01ウェビナー資料
elith
0
380
COFFEE-Japan PROJECT Impact Report(Uminomukou Coffee)
ontheslope
0
160
Featured
See All Featured
Dominate Local Search Results - an insider guide to GBP, reviews, and Local SEO
greggifford
PRO
0
190
Speed Design
sergeychernyshev
33
1.8k
The Limits of Empathy - UXLibs8
cassininazir
1
350
Leading Effective Engineering Teams in the AI Era
addyosmani
9
2k
How to train your dragon (web standard)
notwaldorf
97
6.7k
コードの90%をAIが書く世界で何が待っているのか / What awaits us in a world where 90% of the code is written by AI
rkaga
61
44k
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
35
2.5k
Automating Front-end Workflow
addyosmani
1370
210k
Docker and Python
trallard
47
3.9k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
Stewardship and Sustainability of Urban and Community Forests
pwiseman
0
220
DevOps and Value Stream Thinking: Enabling flow, efficiency and business value
helenjbeal
1
210
Transcript
Enabling Networked Knowledge ACKNOWLEDGEMENTS: This work was funded in part
by Science Foundation Ireland under Grant No. SFI/08/CE/I1380 (Lion-2). DRETA: EXTRACTING RDF FROM WIKITABLES Emir Muñoz, Aidan Hogan, Alessandra Mileo National University of Ireland, Galway MOTIVATION WIKITABLE SURVEY player http://dbpedia.org/resource/David_de_Gea http://dbpedia.org/resource/Rafael_Pereira_da_Silva_(footballer_born_1990) http://dbpedia.org/resource/Patrice_Evra …. http://dbpedia.org/resource/Fabio_Pereira_da_Silva http://dbpedia.org/resource/Tom_Cleverley http://dbpedia.org/resource/Darren_Fletcher PROPOSAL http://dbpedia.org/resource/Manchester_United_F.C. http://dbpedia.org/resource/England http://dbpedia.org/resource/Forward_(association_football) http://dbpedia.org/resource/Wayne_Rooney dbo:birthPlace dbp:currentclub dbp:position http://dbpedia.org/resource/Spain http://dbpedia.org/resource/Goalkeeper_(association_football) http://dbpedia.org/resource/David_de_Gea dbp:position http://dbpedia.org/resource/Brazil http://dbpedia.org/resource/Defender_(association_football) http://dbpedia.org/resource/Fabio_Pereira_da_Silva dbp:position … … (1) dbr:David_de_Gea dbo:birthPlace dbr:Spain . (2) dbr:Fabio_Pereira_de_Silva dbo:birthPlace dbr:Brazil . (3) dbr:Fabio_Pereira_de_Silva dbp:currentclub dbr:Manchester_United_F.C . SUGGESTED TRIPLES: SELECT ?player WHERE { ?player dbp:currentclub dbr:Manchester_United_F.C . } TABLE TAXONOMY: DISTRIBUTIONS: QUERY: RESULTS DEMO … http://emunoz.org/wikitables (1) EXTRACTED 34.9 MILLION UNIQUE & NOVEL TRIPLES FROM 1.14 MILLION WIKITABLES (8 MACHINES: 4GB RAM, 2.2 GHZ SINGLE CORE; 12 DAYS) (2) INITIAL EVALUATION: (MANUAL ANNOTATION; THREE JUDGES; 750 TRIPLES EACH) (3) MACHINE LEARNING CLASSIFIERS: (CONSENSUS GOLD STANDARD; VARIETY OF FEATURES) FROM 1.14 MILLION WIKITABLES: BAGGING DECISION TREES: SUPPORT VECTOR MACHINES: 1.14 MILLION WIKITABLES: 7.9 MILLION TRIPLES @81.5% PREC. 15.3 MILLION TRIPLES @72.4% PREC. … INCOMPLETE RESULTS!