Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Data_strategy___engineering_for_agentic_workflo...

Avatar for Ray Grieselhuber Ray Grieselhuber PRO
April 15, 2026
8

 Data_strategy___engineering_for_agentic_workflows.pdf

Avatar for Ray Grieselhuber

Ray Grieselhuber PRO

April 15, 2026

More Decks by Ray Grieselhuber

Transcript

  1. Agenda • Intro • Background and motivations for this topic

    • Vibe coding vs. AI-driven engineering • Tools & skills • Testing & security • Project ideas • Open weight models vs. closed models • Final thoughts
  2. Introduction • BA, Japanese & B.S. Computer Science • Software

    engineering with focus on early ML, agents, big data, distributed systems, etc. • SEO since 2006, built first enterprise platform on the market • Founded DemandSphere (GinzaMetrics) in 2009, Tokyo • Moved to Silicon Valley in 2010 for YC • Work with clients all over the world
  3. Founder mode means: • Staying close to the user (even

    if it’s yourself at first) • Build what people want • Iterate quickly • Keep your sleeves rolled up
  4. The new SaaS (Service as a Software): • AI is

    empowered by human attention • Businesses pay for results, they don’t care about your cool AI systems if they don’t work • Humans have to backstop when AI systems fail
  5. The best AI development workflows: • Compress • Summarize •

    Deduplicate • Retrieve only what is needed
  6. The point is that maintaining context for humans is just

    as important as maintaining it for agents
  7. A basic vibe coding to AI-driven engineering workflow Start in

    Claude.ai Vanilla prototype Claude.ai to build CLAUDE.md Move to local filesystem claude init Create repo Enforce testing policy Configure deployment environment Automate deployments Deploy regularly
  8. Security risks with MCPs: • Client side ◦ Giving too

    much access to the MCPs ◦ Prompt injection ◦ Data / key leakage • Server side ◦ Validating tool inputs ◦ Sanitizing output ◦ Rate limits ◦ Access controls
  9. In search, we are trying to get into the context

    windows of our audience’s minds
  10. GSC Analytics data Search volume SERP data Log files LLM

    data Query fanouts (the list is always growing)