Agentic Coding

Agentic Coding: Beyond Prototyping Sebastian Korfmann Tech Strategy Consultant -
Founder of Hekto.dev

"Artisanal coding is like hand- tilling a commercial farm— beautiful,
but not how we feed the world"

Terminal vs IDE "there's a good chance that by the
end of the year, people aren't using IDEs anymore... we're trying to avoid over investing in UI... given how models are progressing, that might not be useful work" (in a Q&A session at the "Code with Claude" conference, May 2025) Anthropic’s Boris Cherny (Claude Code)

What is Claude Code? An AI coding agent that works
in your terminal (or anywhere, really) • Builds up context on demand • Leverages Prompt Caching • Have seen it work for more than 20 minutes • Limiting factor: Primarily available context and tools npm install -g @anthropic-ai/claude-code

Example: Bluesky Frontend App • First attempt of using Claude
Code • Entirely driven through Claude Code • A couple of hours experimenting

Example: Soccer Analysis Experimented during DFB Pokalﬁnale • Wondered if
I could build real-time commentary • Set it up in like 2 hours • Watched the game with AI analysis Cost me about $1.25 in Gemini API calls for the Video Analysis

Time to Reﬂect starting to fully buy into the idea
of agentic coding

Claude Code • For both experiments (plus a few more
shorter ones): ~ 85 EUR in API Cost • Decided to go for the MAX Plan of claude.ai • That's 200 USD / month

Less constraints Stopped being concerened about cost

Example: A Miro Board via MCP • Used Claude Code
with MCP (Model Context Protocol) integration • Direct API control of Miro • About 10 minutes from text to board Shows how tools can connect in unexpected ways

Example: Mobile Development Tried coding from my phone • Used
WebSSH and Tailscale • Built a React app on the sofa • Deployed to Vercel Return of the thin client?

Example: Claude Code Open Telemetry • Rails App • Postgres
• TimescaleDB

Example: Hekto.dev • Elixir Phoenix • MCP Gateway • RFC
driven • Midsized Codebase • Runs in Production

Other Tools in This Space Different approaches, similar goals Claude
Code - Terminal-based approach Goose (Block) - Desktop app with project generation Sketch - Browser-based with containers Cursor - IDE integration Each has its own philosophy and tradeoffs

An Interesting Observation We're only scratching the surface Example: Anthropic's
designers now ship production code "Large state management changes you typically wouldn't see a designer making"

What I've Learned Some practical insights

Planning Makes a Difference Write specs together When you write
RFCs or design docs with AI: - Makes a lot of difference vs single shot prompting - Helps LLMs as it helps humans - Catches edge cases early - Suggests alternatives you hadn't considered Example: Spend 10 minutes on an RFC, save way more time down the road

Embrace the Feedback Loop • Find some form of automated
feedback loop early on • Tests, Compiler, Browser (MCP), ...

You're still responsible vibe coded, rfc driven, thoroughly tested? Whatever,
but you're still responsible for what you're shipping :)

About Costs Some real numbers I tracked my usage with
OpenTelemetry: - One heavy coding day: ~$175 worth of API calls - Claude Max subscription: $200/month The subscription model makes more sense for regular use.

The Economics Quick comparison Fully loaded Senior Engineer (Germany): ~€10,000/month
Working days: ~20 Daily cost: €500 Claude Max: €200/month If it saves you half a day per month, it pays for itself. People report much more than that.

How Teams Might Change Some observations • smaller teams, larger
scope • everyone ships to production • cost of prototyping approaching zero • way more software can and likely will be created • SaaS models? Still early days, but the impact will be massive

Looking Forward The trajectory is clear 2019: GPT-2 struggled with
basic tasks 2022: ChatGPT / Copilot wrote code snippets 2024: Cursor builds full ﬂedged features 2025: Agentic Coding 2026: ? The improvements have been consistent and exponential.

AI Task Completion is Doubling Every 7 Months The length
of tasks that AI can successfully complete has been growing exponentially, doubling approximately every 7 months over the past 6 years.

Common Concerns Let's address the elephants "AI can't handle complex
codebases" Actually, it does quite well with large projects "What about code quality?" In my experience, it writes thorough tests and catches edge cases "Should we wait for better models?" Current tools are already quite capable

Would you drive a car with agentic code? • what
makes you so sure that human code is better, they make errors all the time • it's all about having testing systems in place, human code or not

Read also https://ﬂy.io/blog/youre-all- nuts/

Takeaways • Don't wait, it's good enough already • Code
is not an asset, it's a liability • Apply to uncommon use-cases • Actual limit: your imagination and access to tools • You get what you pay for

Getting Started If you're curious # Install Claude Code npm
install -g @anthropic-ai/claude-code # Or try another tool like Goose, Sketch, or Cursor # Pick a small project to experiment with Start small, see what works for you.

Final Thoughts This technology is moving fast. What's not working
today, might in 3 months. The key is to start experimenting and learning. Thanks for listening!

Questions & Discussion Sebastian Korfmann • @skorfmann.com (Bluesky) • [email protected]
• hekto.dev Happy to chat about experiences, concerns, or ideas.

Links • https://bsky.app/proﬁle/skorfmann.com/post/ 3lpt4zqqygs2d • https://bsky.app/proﬁle/skorfmann.com/post/3lqpi2an7qs2i • https://bluesky-client-2oskmsae8-hekto.vercel.app • https://www-cdn.anthropic.com/
58284b19e702b49db9302d5b6f135ad8871e7658.pdf • https://metr.org/blog/2025-03-19-measuring-ai-ability-to- complete-long-tasks/

Agentic Coding

Agentic Coding

Sebastian Korfmann

More Decks by Sebastian Korfmann

Other Decks in Technology

Featured

Transcript

Agentic Coding: Beyond Prototyping Sebastian Korfmann Tech Strategy Consultant -

"Artisanal coding is like hand- tilling a commercial farm— beautiful,

Terminal vs IDE "there's a good chance that by the

What is Claude Code? An AI coding agent that works

Example: Bluesky Frontend App • First attempt of using Claude

Example: Soccer Analysis Experimented during DFB Pokalﬁnale • Wondered if

Time to Reﬂect starting to fully buy into the idea

Claude Code • For both experiments (plus a few more

Less constraints Stopped being concerened about cost

Example: A Miro Board via MCP • Used Claude Code

Example: Mobile Development Tried coding from my phone • Used

Example: Claude Code Open Telemetry • Rails App • Postgres

Example: Hekto.dev • Elixir Phoenix • MCP Gateway • RFC

Other Tools in This Space Different approaches, similar goals Claude

An Interesting Observation We're only scratching the surface Example: Anthropic's

What I've Learned Some practical insights

Planning Makes a Difference Write specs together When you write

Embrace the Feedback Loop • Find some form of automated

You're still responsible vibe coded, rfc driven, thoroughly tested? Whatever,

About Costs Some real numbers I tracked my usage with

The Economics Quick comparison Fully loaded Senior Engineer (Germany): ~€10,000/month

How Teams Might Change Some observations • smaller teams, larger

Looking Forward The trajectory is clear 2019: GPT-2 struggled with

AI Task Completion is Doubling Every 7 Months The length

Common Concerns Let's address the elephants "AI can't handle complex

Would you drive a car with agentic code? • what

Read also https://ﬂy.io/blog/youre-all- nuts/

Takeaways • Don't wait, it's good enough already • Code

Getting Started If you're curious # Install Claude Code npm

Final Thoughts This technology is moving fast. What's not working

Questions & Discussion Sebastian Korfmann • @skorfmann.com (Bluesky) • [email protected]

Links • https://bsky.app/proﬁle/skorfmann.com/post/ 3lpt4zqqygs2d • https://bsky.app/proﬁle/skorfmann.com/post/3lqpi2an7qs2i • https://bluesky-client-2oskmsae8-hekto.vercel.app • https://www-cdn.anthropic.com/