tasks (Abstract Reasoning Corpus) using Python. Constraint: Minimize source code length (bytes). Metric: • Per Task: Score = max(1, 2500 - Length) for correct solutions. • Final Ranking: Determined by the Cumulative Score across 400 tasks. Key Difficulty: • LLMs typically prioritize readability and explanation, not extreme brevity (Code Golf). • It's okay to solve it manually. Can LLM/AI agents beat professional golfers?