Towards Structured Data: LLMs from Prototype to Production

Ines Montani Explosion TOWARDS STRUCTURED LARGE LANGUAGE MODELS ✨ CHATGPT
🤖 ARTIFICIAL INTELLIGENCE 🧠 MACHINE LEARNING ✨ PROTOTYPE TO PRODUCTION LLAMA 🦙 NATURAL LANGUAGE PROCESSING 💬 ✨ OPEN SOURCE 🌎 PYTHON 🐍 PROMPT ENGINEERING ⚙ ZERO-SHOT LEARNING 🎯 GPT-4 EVALUATION 📈 COPILOT 🚀 GENERATIVE AI 👾 DATA LLMS FROM Ines Montani 💥 Explosion

SPACY SPACY.IO 🌎 GITHUB.COM/EXPLOSION/SPACY Open-source library for industrial-strength Natural Language
Processing 225m+ downloads

SPACY SPACY.IO 🌎 GITHUB.COM/EXPLOSION/SPACY Open-source library for industrial-strength Natural Language
Processing 225m+ downloads ChatGPT can write spaCy code!

PRODIGY Modern scriptable annotation tool for machine learning developers PRODIGY.AI
10k+ users 900+ companies

SOFTWARE IN INDUSTRY

SOFTWARE IN INDUSTRY modular 🧩

SOFTWARE IN INDUSTRY modular 🧩 transparent 🔎

SOFTWARE IN INDUSTRY modular 🧩 transparent 🔎 explainable 🔮

SOFTWARE IN INDUSTRY modular 🧩 transparent 🔎 explainable 🔮 🔒
data-private

data-private ✅ reliable

data-private ✅ reliable 💸 a ff ordable

SOFTWARE IN INDUSTRY black-box models modular 🧩 transparent 🔎 explainable
🔮 🔒 data-private ✅ reliable 💸 a ff ordable

SOFTWARE IN INDUSTRY black-box models modular 🧩 transparent 🔎 explainable
🔮 third-party APIs 🔒 data-private ✅ reliable 💸 a ff ordable

📖 single/multi-doc summarization ✅ problem solving ✍ paraphrasing 🧮 reasoning
🖼 style transfer Generative ❓question answering 📚 text classification 🏷 entity recognition 🔗 relation extraction 🧬 grammar & morphology 🎯 semantic parsing 👫 coreference resolution 💬 discourse structure Predictive UNDERSTANDING NLP TASKS

📖 single/multi-doc summarization ✅ problem solving ✍ paraphrasing 🧮 reasoning
🖼 style transfer Generative ❓question answering 📚 text classification 🏷 entity recognition 🔗 relation extraction 🧬 grammar & morphology 🎯 semantic parsing 👫 coreference resolution 💬 discourse structure Predictive UNDERSTANDING NLP TASKS human-readable machine-readable

🔮 large generative model

🔮 large generative model 📦 distilled task-specific model

🔮 large generative model 📦 distilled task-specific model in-context learning
Falcon MIXTRAL GPT-4

Falcon MIXTRAL GPT-4 transfer learning ELECTRA T5

Falcon MIXTRAL GPT-4 transfer learning ELECTRA T5 BERT-base still very competitive!

GITHUB.COM/EXPLOSION/SPACY-LLM TOWARDS STRUCTURED DATA Prompt Template 🔮 LLM London is
bigger than Berlin LOCATION: London, Berlin LOCATION

GITHUB.COM/EXPLOSION/SPACY-LLM 💬 unstructured text input 📊 structured Doc object

GITHUB.COM/EXPLOSION/SPACY-LLM Named Entity Recognition Text Classification Relation Extraction Lemma- tization
💬 unstructured text input 📊 structured Doc object

GITHUB.COM/EXPLOSION/SPACY-LLM Named Entity Recognition Text Classification Relation Extraction Lemma- tization
💬 unstructured text input 📊 structured Doc object 🔮 LLM ⚙ Supervised Model ✍ Rules mix, match and replace techniques

CLOSE THE GAP BETWEEN PROTOTYPE AND PRODUCTION

CLOSE THE GAP BETWEEN PROTOTYPE AND PRODUCTION 🔗 standardize inputs
and outputs

and outputs 📈 start with evaluation

and outputs 📈 start with evaluation EXPLOSION.AI/BLOG/APPLIED-NLP-THINKING 🎯 assess utility, not just accuracy

and outputs 📈 start with evaluation EXPLOSION.AI/BLOG/APPLIED-NLP-THINKING 🎯 assess utility, not just accuracy 🔁 work on data iteratively

and outputs 📈 start with evaluation EXPLOSION.AI/BLOG/APPLIED-NLP-THINKING 🎯 assess utility, not just accuracy 🔁 work on data iteratively 💬 consider structure and ambiguity of natural language

processing pipeline prototype 🔮 📦 GITHUB.COM/EXPLOSION/SPACY-LLM processing pipeline in production
📦 📦 📦 📦 📊 structured Doc object 📊 structured Doc object PROTOTYPE TO PRODUCTION

processing pipeline prototype 🔮 📦 prompt model & transform output
to structured data GITHUB.COM/EXPLOSION/SPACY-LLM processing pipeline in production 📦 📦 📦 📦 📊 structured Doc object 📊 structured Doc object PROTOTYPE TO PRODUCTION

🔮 HUMAN IN THE LOOP

continuous evaluation baseline 🔮 HUMAN IN THE LOOP

continuous evaluation baseline prompting 🔮 HUMAN IN THE LOOP

continuous evaluation baseline prompting PRODIGY.AI 🔮 HUMAN IN THE LOOP

continuous evaluation baseline prompting 📦 transfer learning PRODIGY.AI 🔮 HUMAN
IN THE LOOP

continuous evaluation baseline prompting 📦 transfer learning PRODIGY.AI distilled model
🔮 HUMAN IN THE LOOP

task-specific distillation workflow continuous evaluation baseline prompting 📦 transfer learning
PRODIGY.AI distilled model 🔮 HUMAN IN THE LOOP

▪ PyData NYC 2023 workshop: extracting dishes, ingredients and equipment
from r/cooking Reddit posts SPACY.FYI/PYDATA-NYC CASE STUDY 🕓 8 hours DATA DEV TIME 📦 400mb MODEL SIZE 🔥 2000+ WORDS / SECOND

from r/cooking Reddit posts ▪ used LLM during annotation SPACY.FYI/PYDATA-NYC CASE STUDY 🕓 8 hours DATA DEV TIME 📦 400mb MODEL SIZE 🔥 2000+ WORDS / SECOND

from r/cooking Reddit posts ▪ used LLM during annotation ▪ beat few-shot LLM baseline of 0.74 with task-specific model SPACY.FYI/PYDATA-NYC CASE STUDY 🕓 8 hours DATA DEV TIME 📦 400mb MODEL SIZE 🔥 2000+ WORDS / SECOND

from r/cooking Reddit posts ▪ used LLM during annotation ▪ beat few-shot LLM baseline of 0.74 with task-specific model ▪ 20× inference time speedup SPACY.FYI/PYDATA-NYC CASE STUDY 🕓 8 hours DATA DEV TIME 📦 400mb MODEL SIZE 🔥 2000+ WORDS / SECOND

CONCLUSION

▪ LLMs can be one part of a product or
process, and swapped for di ff erent approaches. CONCLUSION

process, and swapped for di ff erent approaches. ▪ Iteration and the right tooling can get you past the prototype plateau. CONCLUSION

process, and swapped for di ff erent approaches. ▪ Iteration and the right tooling can get you past the prototype plateau. ▪ There’s no need to compromise on development best practices or privacy. CONCLUSION

THANK YOU! 💥 Explosion 💫 spaCy ✨ Prodigy 🐦 Twitter
🐘 Mastodon 🦋 Bluesky 💼 LinkedIn explosion.ai spacy.io prodigy.ai @_inesmontani @[email protected] @inesmontani.bsky.social

Towards Structured Data: LLMs from Prototype to...

Towards Structured Data: LLMs from Prototype to Production

More Decks by Ines Montani

Other Decks in Programming

Featured

Transcript