Keywordfinder: automatic keyword extraction from text
As an Insight Data Science Fellow, I completed a 3-week project that involved building a keyword extraction algorithm. Given a block of text as input, my algorithm selects keywords that describe what the text is about.
Features computed for each candidate keyword: length, capitalized?, position in page, spread in page, named entity?, noun phrase?, n-gram?
Model: logistic regression. In-sample: 65%, out-of-sample: 65%, chance: 50%.
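To make the feature list above concrete, here is a minimal sketch in Python of how a few of those features could be computed for each candidate word and passed through a logistic function. This is an illustration under assumptions, not the project's actual code: the function names (candidate_features, keyword_probability), the tokenizer, the exact feature definitions, and the weights are all hypothetical. The named-entity, noun-phrase, and n-gram features are left out because they would need an NLP toolkit.

```python
import math
import re


def candidate_features(text):
    """Return {word: feature dict} for every distinct word in the text.

    Feature definitions are assumptions made for this sketch:
      length      - number of characters in the word
      capitalized - 1.0 if the word appears capitalized at least once
      position    - relative index of the first occurrence (0 = start, 1 = end)
      spread      - relative distance between first and last occurrence
    """
    tokens = re.findall(r"[A-Za-z][A-Za-z'-]*", text)
    span = max(len(tokens) - 1, 1)  # avoid division by zero on tiny inputs

    positions = {}
    capitalized = set()
    for index, token in enumerate(tokens):
        word = token.lower()
        positions.setdefault(word, []).append(index)
        if token[0].isupper():
            capitalized.add(word)

    features = {}
    for word, indexes in positions.items():
        features[word] = {
            "length": len(word),
            "capitalized": 1.0 if word in capitalized else 0.0,
            "position": indexes[0] / span,
            "spread": (indexes[-1] - indexes[0]) / span,
        }
    return features


def keyword_probability(feature_values, weights, bias):
    """Logistic model: sigmoid of a weighted sum of the features."""
    z = bias + sum(weights[name] * feature_values[name] for name in weights)
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    # Hypothetical weights; a real model would learn them from labeled data.
    weights = {"length": 0.1, "capitalized": 0.8, "position": -0.5, "spread": 1.0}
    sample = "Python is great. I use Python daily."
    for word, values in candidate_features(sample).items():
        print(word, round(keyword_probability(values, weights, bias=-1.0), 3))
```

In a trained model the weights and bias would be fit on text with labeled keywords, and the predicted probability would be thresholded or ranked to select the final keywords.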