Workshop: Half hour of labeling power: Can we beat GPT?

Ines Montani & Ryan Wesslen (Explosion)
Half hour of labeling power: Can we beat GPT?

spacy.io: Open-source library for industrial-strength natural language processing. 170m+ downloads.

prodigy.ai: Modern scriptable annotation tool for machine learning developers. 9k+ users, 800+ companies.

prodigy.ai/teams: Collaborative data development platform.

Generative: single/multi-doc summarization, reasoning, problem solving, paraphrasing, style transfer, question answering

Predictive: text classification, relation extraction, coreference, grammar & morphology, entity recognition, semantic parsing, discourse structure

[Chart: Text Classification. SST2, AG News and Banking77 compared with GPT-3; axes run from 65 to 100 and from 1% to 100%.]

[Chart: Entity Recognition. FabNER compared with Claude 2; axes run from 10 to 100 and from 0 to 500.]

Correct LLM few-shot results

Annotation Guidelines
DISH: known food dishes, e.g. lobster ravioli, garlic bread
INGREDIENT: individual parts of a food dish, including herbs and spices
EQUIPMENT: any kind of cooking equipment, e.g. oven, cooking pot, grill
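
One way to reuse these guidelines for an LLM is to paste the label definitions into a few-shot prompt. The sketch below is only an illustration: the prompt wording, the `build_prompt` helper and the example sentences are assumptions, not the prompt used in the talk.

```python
# Label definitions copied from the annotation guidelines.
LABEL_DEFINITIONS = {
    "DISH": "known food dishes, e.g. lobster ravioli, garlic bread",
    "INGREDIENT": "individual parts of a food dish, including herbs and spices",
    "EQUIPMENT": "any kind of cooking equipment, e.g. oven, cooking pot, grill",
}


def build_prompt(text, examples=()):
    """Build a few-shot entity-extraction prompt (hypothetical wording)."""
    lines = ["Extract entities from the text. Use only these labels:"]
    for label, definition in LABEL_DEFINITIONS.items():
        lines.append(f"{label}: {definition}")
    # Each example is (text, [(span_text, label), ...]).
    for example_text, spans in examples:
        lines.append("")
        lines.append(f"Text: {example_text}")
        for span_text, label in spans:
            lines.append(f"{label}: {span_text}")
    # The text to annotate always comes last.
    lines.append("")
    lines.append(f"Text: {text}")
    return "\n".join(lines)


# Made-up sentences, not taken from the workshop data.
few_shot = [("Serve the garlic bread warm.", [("garlic bread", "DISH")])]
prompt = build_prompt("Add basil to the pot.", few_shot)
print(prompt)
```

Keeping the definitions in one dictionary means an update to the guidelines changes the prompt as well.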

annotate, evaluate, update

1. annotate
2. evaluate: resolve disagreements, retrospective meetings, assess if more data is needed
3. update: update annotation guidelines, add more examples, expand label definitions

spacy-llm: config and prompt template. spacy.io/usage/large-language-models
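
As a rough idea of what such a config could look like for the three food labels, here is a sketch held in a Python string. The registered names `spacy.NER.v3` and `spacy.GPT-4.v2` and the overall layout are assumptions based on the spacy-llm documentation and may differ between versions; the slide's actual config is not reproduced here. The standard-library check at the end only verifies that every label has a definition; it does not load spaCy.

```python
import configparser
import json

# Hypothetical spacy-llm config; verify task and model names against the docs.
CONFIG = """
[nlp]
lang = "en"
pipeline = ["llm"]

[components]

[components.llm]
factory = "llm"

[components.llm.task]
@llm_tasks = "spacy.NER.v3"
labels = ["DISH", "INGREDIENT", "EQUIPMENT"]

[components.llm.task.label_definitions]
DISH = "Known food dishes, e.g. lobster ravioli, garlic bread"
INGREDIENT = "Individual parts of a food dish, including herbs and spices"
EQUIPMENT = "Any kind of cooking equipment, e.g. oven, cooking pot, grill"

[components.llm.model]
@llm_models = "spacy.GPT-4.v2"
"""

# Sanity check with the standard library before handing the file to spaCy.
parser = configparser.ConfigParser(interpolation=None)
parser.optionxform = str  # keep label names upper-case
parser.read_string(CONFIG)

labels = json.loads(parser["components.llm.task"]["labels"])
definitions = parser["components.llm.task.label_definitions"]
missing = [label for label in labels if label not in definitions]
print("labels without a definition:", missing)
```

The linked usage guide describes how a config like this is loaded into a pipeline.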

[Chart: Evaluation Results, scale 0 to 100. Approaches: zero-shot, chain-of-thought, few-shot, task-specific (shown as "?"). Series: F, DISH (F), INGREDIENT (F), EQUIPMENT (F).]


[Chart: Evaluation Results, scale 0 to 100. Approaches: zero-shot, chain-of-thought, few-shot, task-specific. Series: F, DISH (F), INGREDIENT (F), EQUIPMENT (F). Annotation: 2000 words/second.]
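
The per-label F-scores in a chart like this can be computed from gold and predicted entity spans. The following is a minimal sketch assuming exact-match scoring on `(start, end, label)` tuples; the function name and the toy data are made up and the talk's own evaluation code may differ.

```python
from collections import Counter


def prf_per_label(gold, pred):
    """Exact-match precision, recall and F1 per label.

    gold and pred are lists (one entry per document) of sets of
    (start, end, label) tuples.
    """
    tp, fp, fn = Counter(), Counter(), Counter()
    for gold_spans, pred_spans in zip(gold, pred):
        for span in pred_spans:
            # A prediction counts only if boundaries and label both match.
            (tp if span in gold_spans else fp)[span[2]] += 1
        for span in gold_spans - pred_spans:
            fn[span[2]] += 1
    scores = {}
    for label in sorted(set(tp) | set(fp) | set(fn)):
        p = tp[label] / (tp[label] + fp[label]) if tp[label] + fp[label] else 0.0
        r = tp[label] / (tp[label] + fn[label]) if tp[label] + fn[label] else 0.0
        f = 2 * p * r / (p + r) if p + r else 0.0
        scores[label] = {"p": p, "r": r, "f": f}
    return scores


# Made-up toy data: the INGREDIENT span has a wrong start offset.
gold = [{(0, 2, "DISH"), (4, 5, "INGREDIENT")}, {(1, 2, "EQUIPMENT")}]
pred = [{(0, 2, "DISH"), (3, 5, "INGREDIENT")}, {(1, 2, "EQUIPMENT")}]
scores = prf_per_label(gold, pred)
print(scores)
```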

Pro tip: Use task routing to distribute workloads and determine inter-annotator agreement. prodigy.ai/features/task-routing
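
Inter-annotator agreement is often summarized with a chance-corrected score. As an illustration, the sketch below computes Cohen's kappa for two annotators who labeled the same items. It is a generic measure written with the standard library, not Prodigy's implementation, and the label sequences are made up.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators labeling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two equally long, non-empty label lists")
    n = len(labels_a)
    # Share of items where both annotators chose the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each annotator's label frequencies.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[k] * freq_b[k] for k in freq_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both annotators used one identical label throughout
    return (observed - expected) / (1 - expected)


a = ["DISH", "DISH", "INGREDIENT", "EQUIPMENT"]
b = ["DISH", "INGREDIENT", "INGREDIENT", "EQUIPMENT"]
kappa = cohen_kappa(a, b)
print(round(kappa, 3))
```

Routing the same task to more than one annotator is what makes a comparison like this possible.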

Pro tip: Focus on examples where models disagree, similar to active learning. koaning.io/posts/large-disagreement-models
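
A simple way to act on this tip is to run two models over the same texts and queue only the texts where their outputs differ. The sketch below assumes each model is a callable that returns a comparable prediction; the two keyword functions are stand-ins for an LLM and a smaller model and are purely hypothetical.

```python
def find_disagreements(texts, predict_a, predict_b):
    """Yield (text, prediction_a, prediction_b) where the two models differ."""
    for text in texts:
        a, b = predict_a(text), predict_b(text)
        if a != b:
            yield text, a, b


# Stand-in "models": simple keyword lookups, for illustration only.
def llm_like(text):
    return {w for w in ("oven", "grill") if w in text.lower()}


def small_model(text):
    return {w for w in ("oven",) if w in text.lower()}


texts = ["Preheat the oven.", "Fire up the grill.", "Chop the basil."]
queue = list(find_disagreements(texts, llm_like, small_model))
print(queue)
```

Only the second text ends up in the queue, so annotators spend their time where the models are least certain.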

Pro tip: Use generative models (e.g. ChatGPT) to create spaCy rule sets! spacy.io/usage/rule-based-matching
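
To make this concrete, the sketch below converts a list of phrases into token patterns in the dictionary format used by spaCy's entity ruler. The phrases reuse the examples from the annotation guidelines; in practice a generative model would suggest a longer list for a human to review. The helper name is hypothetical, and splitting on whitespace is a simplifying assumption that will not match spaCy's tokenization for phrases containing punctuation.

```python
import json

# Phrases per label; here taken from the guideline examples.
SUGGESTED = {
    "EQUIPMENT": ["oven", "cooking pot", "grill"],
    "DISH": ["lobster ravioli", "garlic bread"],
}


def to_token_patterns(suggestions):
    """Convert phrases to token patterns that match on lowercase text."""
    patterns = []
    for label, phrases in suggestions.items():
        for phrase in phrases:
            tokens = [{"LOWER": word} for word in phrase.lower().split()]
            patterns.append({"label": label, "pattern": tokens})
    return patterns


patterns = to_token_patterns(SUGGESTED)
# One JSON object per line, ready to be saved as a patterns file.
jsonl = "\n".join(json.dumps(p) for p in patterns)
print(jsonl)
```

The linked rule-based matching guide covers how patterns like these are added to a pipeline.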

Takeaways
Generative complements predictive, it doesn't replace it.
Use generative models to create better, more accurate, faster, smaller and private task-specific models.
With good tooling, you can make human input more efficient.

thank you!
Explosion: explosion.ai
spaCy: spacy.io
Prodigy: prodigy.ai
Twitter: @explosion_ai
Mastodon: @[email protected]
Bluesky: @explosion-ai.bsky.social
LinkedIn
