[AgentConHelsinki2025]From Assistant to Agent: How Developer Agents Reshape Mobile Workflows
This is the slide, "From Assistant to Agent: How Developer Agents Reshape Mobile Workflows" at AgentCon Helsinki 2025.
#AgentConHelsinki hashtag#AgentCon hashtag#GenAI hashtag#aicommunity hashtag#techevent
Books) •[Of fi cial] Evangelist at NTT TechnoCross e.g. AI-powered call analysis, livestock monitoring, and privileged access management •[Private] iplatform.org About me Kenichi Kambara (X:@korodroid)
Evolved from waiting for prompts to acting on context •From isolated commands to continuous collaboration •From single- fi le focus to project-wide understanding •From reactive suggestions to proactive improvements
Check — context-aware, tool-using, outcome-driven Key Characteristics Context Awareness Multi ‑ step Reasoning Direct Execution Persistent State Understands codebase, dependencies, and developer intent. Plans tasks and executes across fi les/functions. Edits code, runs tests/builds, generates draft PRs. Keeps memory and artifacts across sessions.
agent in a Flutter repo. Goal: A screen scrolls slowly. Diagnose → improve → verify → open a draft PR. Proceed without approval. Guardrails: - Keep business logic & public APIs unchanged. - Limit edits to the affected screen (e.g., MyHomePage) and minimal helpers. - Keep the diff small and self-explanatory. Plan: - List 3 hypotheses for the slowdown. - Add lightweight measurements ( fi rst-frame builds, per-screen scroll rebuilds) using debug-only logs. - Start measuring immediately. Diagnose: - Run measurements ( fl utter run/test/analyze). - Output key numbers on one line and choose up to 2 approaches with a brief rationale. Act: - Apply a minimal design that reduces fi rst-frame builds and scroll rebuilds while preserving visuals/API. - Show the diff and apply it. Check: - Run analyze/test; report green/red. - Show Before → After numbers on one line each. Draft PR: - Branch: perf/scroll-optimization - Commits: perf(ui)… ; chore(debug) remove counters - Create a draft PR with problem, root cause (numbers), changes, diff size, key metrics, and checks. Start now. If anything fails, propose a fallback and continue. Plan → Diagnose → Act → Check → Draft PR
disable risky actions. Data & secrets Use secret managers; hide personal data; mask tokens in logs. Prompt/tool safety Approved tools only; human approval for risky tasks. Quality & change control CI checks; audit logs; staged rollout. Cost & Ops Monitoring Monitor usage dashboards; alert on spikes and runaway tasks. Execution guardrails Set time/cost caps; enforce rate limits; auto-abort on failure.
how fi les connect, and gets the developer’s intent. Autonomy Level Works on its own with light guidance and makes safe choices within set rules. Multi-step Processing Plans and carries out several steps across fi les and functions to fi nish a task. Iterative Improvement Learns from feedback and improves solutions iteratively.
“Instruct → execute → con fi rm” remains human-driven: answers, not completion. Release cycles depend on human availability. Give a goal → Agent plans → implements → tests → measures → creates a draft PR in an automated loop. Humans focus on approval. 💬 Chat-based AI 🧑💻 Human Work ⚙ CI ✅ Human Review 📝 Draft PR ✅ Approval Gate Plan Act Test Measure Loop