DRETa: Extracting RDF from Wikitables [POSTER]
Emir Muñoz
October 23, 2013
Posters & Demos @ ISWC 2013
Transcript
Enabling Networked Knowledge

ACKNOWLEDGEMENTS: This work was funded in part by Science Foundation Ireland under Grant No. SFI/08/CE/I1380 (Lion-2).

DRETA: EXTRACTING RDF FROM WIKITABLES
Emir Muñoz, Aidan Hogan, Alessandra Mileo
National University of Ireland, Galway

MOTIVATION

QUERY:
SELECT ?player WHERE { ?player dbp:currentclub dbr:Manchester_United_F.C . }

RESULTS:
player
http://dbpedia.org/resource/David_de_Gea
http://dbpedia.org/resource/Rafael_Pereira_da_Silva_(footballer_born_1990)
http://dbpedia.org/resource/Patrice_Evra
….
http://dbpedia.org/resource/Fabio_Pereira_da_Silva
http://dbpedia.org/resource/Tom_Cleverley
http://dbpedia.org/resource/Darren_Fletcher

WIKITABLE SURVEY

TABLE TAXONOMY: [figure not captured in transcript]
DISTRIBUTIONS: [figure not captured in transcript]

PROPOSAL

[Graph figure; only node and edge labels survive in the transcript]
Nodes:
http://dbpedia.org/resource/Wayne_Rooney
http://dbpedia.org/resource/David_de_Gea
http://dbpedia.org/resource/Fabio_Pereira_da_Silva
http://dbpedia.org/resource/Manchester_United_F.C.
http://dbpedia.org/resource/England
http://dbpedia.org/resource/Spain
http://dbpedia.org/resource/Brazil
http://dbpedia.org/resource/Forward_(association_football)
http://dbpedia.org/resource/Goalkeeper_(association_football)
http://dbpedia.org/resource/Defender_(association_football)
Edge labels: dbo:birthPlace, dbp:currentclub, dbp:position

SUGGESTED TRIPLES:
(1) dbr:David_de_Gea dbo:birthPlace dbr:Spain .
(2) dbr:Fabio_Pereira_da_Silva dbo:birthPlace dbr:Brazil .
(3) dbr:Fabio_Pereira_da_Silva dbp:currentclub dbr:Manchester_United_F.C .

DEMO
… http://emunoz.org/wikitables

RESULTS

(1) EXTRACTED 34.9 MILLION UNIQUE & NOVEL TRIPLES FROM 1.14 MILLION WIKITABLES (8 MACHINES: 4 GB RAM, 2.2 GHZ SINGLE CORE; 12 DAYS)
(2) INITIAL EVALUATION: (MANUAL ANNOTATION; THREE JUDGES; 750 TRIPLES EACH)
(3) MACHINE LEARNING CLASSIFIERS: (CONSENSUS GOLD STANDARD; VARIETY OF FEATURES)

FROM 1.14 MILLION WIKITABLES:
BAGGING DECISION TREES: 7.9 MILLION TRIPLES AT 81.5% PRECISION
SUPPORT VECTOR MACHINES: 15.3 MILLION TRIPLES AT 72.4% PRECISION

… INCOMPLETE RESULTS!
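
The query on the poster is a single triple pattern, so a few lines of Python are enough to show how the suggested triples affect its answers. This is only an illustrative sketch, not the DRETa implementation: the helper `match_subjects` and the in-memory list are hypothetical, the prefixed names are copied from the poster, and the player's name is spelled as in the DBpedia URL given in the transcript.

```python
# Suggested triples from the poster, written as (subject, predicate, object).
# The in-memory list and the matcher below are illustrative assumptions only.
SUGGESTED_TRIPLES = [
    ("dbr:David_de_Gea", "dbo:birthPlace", "dbr:Spain"),
    ("dbr:Fabio_Pereira_da_Silva", "dbo:birthPlace", "dbr:Brazil"),
    ("dbr:Fabio_Pereira_da_Silva", "dbp:currentclub", "dbr:Manchester_United_F.C"),
]


def match_subjects(triples, predicate, obj):
    """Return the sorted subjects s for which (s, predicate, obj) is present."""
    return sorted({s for s, p, o in triples if p == predicate and o == obj})


# Equivalent of:
#   SELECT ?player WHERE { ?player dbp:currentclub dbr:Manchester_United_F.C . }
players = match_subjects(
    SUGGESTED_TRIPLES, "dbp:currentclub", "dbr:Manchester_United_F.C"
)
print(players)  # ['dbr:Fabio_Pereira_da_Silva']
```

Only suggested triple (3) matches the pattern, so it is the one that would contribute a new answer to the query; triples (1) and (2) add birthplace facts and leave this query's answers unchanged.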