Slide 21
Slide 21 text
• Features at different levels used to train
Machine Learning models
• Article features (e.g., # of tables)
• Table features (e.g., #rows, #columns, ratios)
• Cell features (e.g., # of entities, string length, has
format)
• Column features (e.g., # of entities, # of unique
entities)
• Predicate/Column features (e.g., string similarity, # of
rows where relation holds)
• Predicate features (e.g., triple count, count unique)
• Triple features (e.g., is the table from article or body)
Emir M. - WSDM, New York City, USA, 27th February, 2014 21
MINING RDF FROM WIKITABLES (4/6)