Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Agentic Coding

Agentic Coding

Avatar for Sebastian Korfmann

Sebastian Korfmann

June 01, 2025
Tweet

More Decks by Sebastian Korfmann

Other Decks in Technology

Transcript

  1. Terminal vs IDE "there's a good chance that by the

    end of the year, people aren't using IDEs anymore... we're trying to avoid over investing in UI... given how models are progressing, that might not be useful work" (in a Q&A session at the "Code with Claude" conference, May 2025) Anthropic’s Boris Cherny (Claude Code)
  2. What is Claude Code? An AI coding agent that works

    in your terminal (or anywhere, really) • Builds up context on demand • Leverages Prompt Caching • Have seen it work for more than 20 minutes • Limiting factor: Primarily available context and tools npm install -g @anthropic-ai/claude-code
  3. Example: Bluesky Frontend App • First attempt of using Claude

    Code • Entirely driven through Claude Code • A couple of hours experimenting
  4. Example: Soccer Analysis Experimented during DFB Pokalfinale • Wondered if

    I could build real-time commentary • Set it up in like 2 hours • Watched the game with AI analysis Cost me about $1.25 in Gemini API calls for the Video Analysis
  5. Claude Code • For both experiments (plus a few more

    shorter ones): ~ 85 EUR in API Cost • Decided to go for the MAX Plan of claude.ai • That's 200 USD / month
  6. Example: A Miro Board via MCP • Used Claude Code

    with MCP (Model Context Protocol) integration • Direct API control of Miro • About 10 minutes from text to board Shows how tools can connect in unexpected ways
  7. Example: Mobile Development Tried coding from my phone • Used

    WebSSH and Tailscale • Built a React app on the sofa • Deployed to Vercel Return of the thin client?
  8. Example: Hekto.dev • Elixir Phoenix • MCP Gateway • RFC

    driven • Midsized Codebase • Runs in Production
  9. Other Tools in This Space Different approaches, similar goals Claude

    Code - Terminal-based approach Goose (Block) - Desktop app with project generation Sketch - Browser-based with containers Cursor - IDE integration Each has its own philosophy and tradeoffs
  10. An Interesting Observation We're only scratching the surface Example: Anthropic's

    designers now ship production code "Large state management changes you typically wouldn't see a designer making"
  11. Planning Makes a Difference Write specs together When you write

    RFCs or design docs with AI: - Makes a lot of difference vs single shot prompting - Helps LLMs as it helps humans - Catches edge cases early - Suggests alternatives you hadn't considered Example: Spend 10 minutes on an RFC, save way more time down the road
  12. Embrace the Feedback Loop • Find some form of automated

    feedback loop early on • Tests, Compiler, Browser (MCP), ...
  13. You're still responsible vibe coded, rfc driven, thoroughly tested? Whatever,

    but you're still responsible for what you're shipping :)
  14. About Costs Some real numbers I tracked my usage with

    OpenTelemetry: - One heavy coding day: ~$175 worth of API calls - Claude Max subscription: $200/month The subscription model makes more sense for regular use.
  15. The Economics Quick comparison Fully loaded Senior Engineer (Germany): ~€10,000/month

    Working days: ~20 Daily cost: €500 Claude Max: €200/month If it saves you half a day per month, it pays for itself. People report much more than that.
  16. How Teams Might Change Some observations • smaller teams, larger

    scope • everyone ships to production • cost of prototyping approaching zero • way more software can and likely will be created • SaaS models? Still early days, but the impact will be massive
  17. Looking Forward The trajectory is clear 2019: GPT-2 struggled with

    basic tasks 2022: ChatGPT / Copilot wrote code snippets 2024: Cursor builds full fledged features 2025: Agentic Coding 2026: ? The improvements have been consistent and exponential.
  18. AI Task Completion is Doubling Every 7 Months The length

    of tasks that AI can successfully complete has been growing exponentially, doubling approximately every 7 months over the past 6 years.
  19. Common Concerns Let's address the elephants "AI can't handle complex

    codebases" Actually, it does quite well with large projects "What about code quality?" In my experience, it writes thorough tests and catches edge cases "Should we wait for better models?" Current tools are already quite capable
  20. Would you drive a car with agentic code? • what

    makes you so sure that human code is better, they make errors all the time • it's all about having testing systems in place, human code or not
  21. Takeaways • Don't wait, it's good enough already • Code

    is not an asset, it's a liability • Apply to uncommon use-cases • Actual limit: your imagination and access to tools • You get what you pay for
  22. Getting Started If you're curious # Install Claude Code npm

    install -g @anthropic-ai/claude-code # Or try another tool like Goose, Sketch, or Cursor # Pick a small project to experiment with Start small, see what works for you.
  23. Final Thoughts This technology is moving fast. What's not working

    today, might in 3 months. The key is to start experimenting and learning. Thanks for listening!
  24. Questions & Discussion Sebastian Korfmann • @skorfmann.com (Bluesky) • [email protected]

    • hekto.dev Happy to chat about experiences, concerns, or ideas.