Slide 28
OPEN-SOURCE MODELS

task-specific models
small, often fast, cheap to run, don’t always generalize well, need data to fine-tune

encoder models (ELECTRA, T5)
relatively small and fast, affordable to run, generalize & adapt well, need data to fine-tune

large generative models (Falcon, Mixtral)
very large, often slower, expensive to run, generalize & adapt well, need little to no data