Slide 44
Slide 44 text
โ PyData NYC 2023 workshop:
extracting dishes, ingredients
and equipment from r/cooking
Reddit posts
โ used LLM during annotation
โ beat few-shot LLM baseline of
0.74 with task-specific model
SPACY.FYI/PYDATA-NYC
CASE STUDY
๐ 8 hours
DATA DEV TIME
๐ฆ 400mb
MODEL SIZE
๐ฅ 2000+
WORDS / SECOND