Failing to reason with LLMs (ARC AGI Kaggle update with Llama3)

Slide 1

Slide 1 text

Abstractly reasoning – failing with an LLM (next steps for ARC AGI)
PyDataLondon 2024-08 lightning talk
@IanOzsvald – ianozsvald.com

Slide 2

Slide 2 text

Abstraction & Reasoning Challenge
Can LLMs reason?
ARC AGI: abstract JSON “initial → target”
Tried “don’t code, just reason”
Llama3 70B pretty smart
Llama3 8B writes code pretty well, sometimes
By [ian]@ianozsvald[.com] Ian Ozsvald
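To make the “initial → target” idea concrete, here is a minimal sketch of checking a candidate solution against an ARC-style task. It assumes the public ARC JSON layout ("train" and "test" lists of "input"/"output" grids); the toy grids and the `substitute` and `solves` helpers are invented for illustration and are not taken from the talk. Plain lists are used instead of numpy to keep the sketch dependency-free.

```python
import json

# Hypothetical toy task in the public ARC JSON layout; the grids are invented.
TASK_JSON = """
{"train": [{"input": [[0, 1], [1, 0]], "output": [[0, 2], [2, 0]]},
           {"input": [[1, 1], [0, 0]], "output": [[2, 2], [0, 0]]}],
 "test":  [{"input": [[1, 0], [0, 1]], "output": [[2, 0], [0, 2]]}]}
"""


def substitute(grid, old=1, new=2):
    """Stand-in for LLM-written code: replace every `old` cell with `new`."""
    return [[new if cell == old else cell for cell in row] for row in grid]


def solves(transform, pairs):
    """True if `transform` maps every initial grid exactly onto its target."""
    return all(transform(pair["input"]) == pair["output"] for pair in pairs)


task = json.loads(TASK_JSON)
train_ok = solves(substitute, task["train"])
test_ok = solves(substitute, task["test"])
print(train_ok, test_ok)  # prints: True True
```

A candidate only counts if every cell of every target grid matches, so a near miss on one pair makes `solves` return False.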

Slide 3

Slide 3 text

30% solutions pretty good!
It counts!
Comments!
Reasonable numpy!
Correct substitution!

Slide 4

Slide 4 text

Convincing weirdness

Slide 5

Slide 5 text

Next steps
Big issue – it gets stuck on the same ideas
Get LLM to read lots of failed model outputs, summarise, then maybe I could ask it to make new strategies?
Notes → NotANumber.email newsletter
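One way the "read failed outputs, summarise, propose new strategies" step might be prepared is sketched below. It only builds the prompt text; no model is called. The function name, the "strategy" and "problem" keys, and the sample failures are all hypothetical and do not come from the talk.

```python
from collections import Counter


def build_summary_prompt(failures, top_n=3):
    """Build a prompt asking an LLM to summarise failures and suggest new ideas.

    `failures` is a list of dicts with the hypothetical keys
    "strategy" and "problem".
    """
    counts = Counter(f["strategy"] for f in failures)
    lines = ["These attempts at the task all failed:"]
    for i, f in enumerate(failures, start=1):
        lines.append(f"{i}. strategy: {f['strategy']} | problem: {f['problem']}")
    lines.append("")
    lines.append("Most repeated strategies:")
    for strategy, n in counts.most_common(top_n):
        lines.append(f"- {strategy} (tried {n} times)")
    lines.append("")
    lines.append(
        "Summarise why these failed, then propose new strategies "
        "that differ from all of the above."
    )
    return "\n".join(lines)


# Invented sample data standing in for parsed model outputs.
failures = [
    {"strategy": "flood fill", "problem": "wrong output shape"},
    {"strategy": "flood fill", "problem": "wrong colour"},
    {"strategy": "mirror grid", "problem": "2 cells differ"},
]
prompt = build_summary_prompt(failures)
print(prompt)
```

Listing the most repeated strategies explicitly is a design choice aimed at the "stuck on the same ideas" problem: it gives the model a concrete list of approaches to avoid.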