Upgrade to Pro — share decks privately, control downloads, hide ads and more …

10x Speed With QA Agent Platform - How we scale...

Sponsored · Ship Features Fearlessly Turn features on and off without deploys. Used by thousands of Ruby developers.

10x Speed With QA Agent Platform - How we scaled adoption from individual effort to organizational capability

As the use of AI agents continues to expand, individual proficiency alone is not enough to improve productivity across an entire organization. This is especially true in QA, where teams need to increase speed while also ensuring quality, safety, and reproducibility.

In this session, we will share practical insights from QA Agent Platform(QAAP), an organization-wide platform that supports QA work with AI agents. Through this initiative, we explore how AI adoption can move beyond individual expertise and become part of an organization’s quality infrastructure.

The key is to build guardrails into the platform so that teams can use AI with confidence, while also designing operational practices that continuously share and improve how people work with AI. We will discuss the mechanisms used to balance safety and speed, the ways individual know-how was turned into organizational knowledge, and the process through which leaders and teams helped the platform take root through hands-on practice.

What does it take to turn “10x speed” from individual task efficiency into a sustainable organizational capability? This session offers reusable design and operational insights for anyone looking to introduce AI agents into their work, move beyond PoC, or scale individual AI practices across teams and organizations.

More Decks by LINEヤフーTech (LY Corporation Tech)

Other Decks in Technology

Transcript

  1. 1 2026.06.29 LY Corporation Product QA Unit Yohei Ueda /

    Makoto Fukunaga 10x Speed With QA Agent Platform How we scaled adoption from individual effort to organizational capability
  2. Agenda 1. About Us 2. What is QA Agent Platform

    ? 3. Two Barriers to Organizational AI Adoption 4. Overcoming Barriers - Systems 5. Overcoming Barriers - Initiative 6. Summary
  3. What is QA?  User Experience  Product Quality 

    Development Process Not just a pre-release bug gate ̶ We cover quality end-to-end.
  4. QA Deliverables Count 1.54 x QA Task Time Reduction Excl.

    Test Execution 49.4% AI Substitution Rate 68.1% Results from QAAP
  5. Accelerator “ Use it more! ” Brake “ Stay safe!

    ” Barrier 1: Accelerator vs. Brake
  6. Barrier 2: From a Few to Everyone • Early adopters

    move fast and make it their own. • But their know-how often remains tacit. • And doing the same is rarely straightforward for others.
  7. Accelerator “ Use it more! ” Barrier 1: Accelerator Covered

    ̶ Now, the Brake  Systems ↓ QA-Preset Skills • Universal & org-wide ̶ today's main theme • Accelerator = QA-specific→ deep-dive another time
  8. Solution to Barrier 1: Risk Control, Org-Wide New risks emerge

    every day. • Sensitive data exposure. • Prompt injection. • Supply-chain attacks. Every time we move forward, the brakes slam on.
  9. Solution to Barrier 1: Risk Control, Org-Wide • And can

    we even tell who's doing it right? Not really. • Managing risk one person at a time doesn't scale.
  10. Individual Vigilance to Shared Platform QAAP = Built-in guardrails •

    Fork per team • Centralized on Upstream • Sync to inherit 
  11. Example : Never store secrets in plaintext Secrets: API keys,

    tokens & more • Plaintext .env: high leak risk if device or repo is compromised • QAAP: keep them in 1Password / Keychain, pull only when needed
  12. Enforced by the system, not by discipline Secrets blocked in

    depth at three layers Runtime Reading secrets from .env / env vars blocked at execution
  13. Enforced by the system, not by discipline Secrets blocked in

    depth at three layers Prompt A secret in a prompt is stopped before it reaches the AI
  14. Enforced by the system, not by discipline Secrets blocked in

    depth at three layers Pre-PR A static check catches secret before a PR is created
  15. Guardrails that teach, not just block QAAP catches the risk

    and explains it in the same moment Guided remediation • On a catch: why it was blocked, how to fix it, link to the guide Dual outcome • Risk auto-prevented while understanding grows
  16. Guardrails that teach, not just block QAAP catches the risk

    and explains it in the same moment Guided remediation • On a catch: why it was blocked, how to fix it, link to the guide Dual outcome • Risk auto-prevented while understanding grows "That alert is what made me understand why the risk matters ̶ and how to handle it."
  17. More guardrails: supply chain + agent ops Supply chain (npm)

    • Internal registry only, no auto-run • install scripts ̶ on by default Agent ops • Destructive deletes blocked, routed to trash ̶ recoverable, not an instant erase • Even the worst case "the AI wiped everything, unrecoverable" designed to prevent
  18. What you saw is only part of it ̶ at

    every step, the guardrails are built into the system. So why all these guardrails? This, more than the how, is what I most want to share today.
  19. What we're really guarding against  Common Weak Spots 1.

    Executing tasks exactly as intended 2. Remembering important constraints 3. Understanding what's written accurately 4. Saying “I donʼt know” when uncertain 5. Looking up missing information proactively
  20. What we're really guarding against  Common Weak Spots 1.

    Executing tasks exactly as intended 2. Remembering important constraints 3. Understanding what's written accurately 4. Saying “I donʼt know” when uncertain 5. Looking up missing information proactively
  21. What we're really guarding against  Common Weak Spots 1.

    Executing tasks exactly as intended 2. Remembering important constraints 3. Understanding what's written accurately 4. Saying “I donʼt know” when uncertain 5. Looking up missing information proactively
  22. What we're really guarding against  Common Weak Spots 1.

    Executing tasks exactly as intended 2. Remembering important constraints 3. Understanding what's written accurately 4. Saying “I donʼt know” when uncertain 5. Looking up missing information proactively
  23. What we're really guarding against  Common Weak Spots 1.

    Executing tasks exactly as intended 2. Remembering important constraints 3. Understanding what's written accurately 4. Saying “I donʼt know” when uncertain 5. Looking up missing information proactively
  24. What we're really guarding against  Common Weak Spots 1.

    Executing tasks exactly as intended 2. Remembering important constraints 3. Understanding what's written accurately 4. Saying “I donʼt know” when uncertain 5. Looking up missing information proactively
  25. What we're really guarding against  Common Weak Spots 1.

    Executing tasks exactly as intended 2. Remembering important constraints 3. Understanding what's written accurately 4. Saying “I donʼt know” when uncertain 5. Looking up missing information proactively Not AI. “Humans” struggle with these.
  26. What we're really guarding against AI is constantly changing. Humans,

    however, remain fundamentally the same. Guardrails go in before a slip turns fatal. Not by vigilance, but by the system.
  27. Barrier 1:Summary • QAAP comes with built-in automatic brakes. •

    helps reduce risk, so you can hit the gas with confidence. But a system alone won't get everyone moving. Barrier 2 →
  28. Barrier 2: From a Few to Everyone • Early adopters

    move fast and make it their own. • But their know-how often remains tacit. • And doing the same is rarely straightforward for others.
  29. Lv State Capability 1 Ad-hoc trial Chat AI for summarization

    & brainstorming 2 Partial QA support Business context + structured prompts for subtask support 3 Multi-context integration integrate multiple sources, AI-driven stable delibarables 4 Autonomous workflow Agent-based automation of end-to-end workflows 5 News standard Redesign & scale QA standard processes Understand Current State AI Skills Maturity Model DeNA Launches 'DeNA AI Readiness Score (DARS),' a Company-Wide Metric for Evaluating AI Skills
  30. Lv State Capability 1 Ad-hoc trial Chat AI for summarization

    & brainstorming 2 Partial QA support Business context + structured prompts for subtask support 3 Multi-context integration integrate multiple sources, AI-driven stable delibarables 4 Autonomous workflow Agent-based automation of end-to-end workflows 5 News standard Redesign & scale QA standard processes Understand Current State AI Skills Maturity Model
  31. Lv State Capability 1 Ad-hoc trial Chat AI for summarization

    & brainstorming 2 Partial QA support Business context + structured prompts for subtask support 3 Multi-context integration integrate multiple sources, AI-driven stable delibarables 4 Autonomous workflow Agent-based automation of end-to-end workflows 5 News standard Redesign & scale QA standard processes Understand Current State AI Skills Maturity Model
  32. QA Deliverables Count 1.54 x QA Task Time Reduction Excl.

    Test Execution 49.4% AI Substitution Rate 68.1% Results from QAAP
  33. Key Insight Successful AI adoption is a systems problem, not

    a tools problem. Key Message AIʼs primary role in software development is that of an amplifier. It magnifies the strengths of high-performing organizations and the dysfunctions of struggling ones. https://dora.dev/research/2025/ DORA 2025 Report
  34. Our concerns • Weʼre not security experts, so that makes

    us uneasy • Balancing safety with convenience is always a trade-off We need your help • We welcome feedback every day! • Check out our internal open repository • pull requests and comments are always appreciated! Turning “worries” into a strength we all share