Disciplined Vibes: Scaling AI-Assisted Engineering

Code District, Lahore Disciplined Vibes: Scaling AI-Assisted Engineering Sheharyar Naseer
◦ June 2026

Sheharyar Naseer Systems architect & technology advisor for startups and
enterprises. Find me online @sheharyarn

Background ✦ Principal Software Architect at Infra One ✦ Worked
with: Apple, Slab, TheScore, Superlist, etc. ✦ 16+ years of polyglot experience, focus on Web & Cloud ✦ StackOver fl ow: 75,000+ score (Top 5 in Pakistan) ✦ Author / Contributor of multiple famous libraries & tools ✦ Featured on popular developer communities

Outline PART 1 The Problem PART 2 It's Not the
Model PART 3 Harness Engineering PART 4 Live Workshop OUTRO What's Next?

01 The Problem Why you're struggling with AI-assisted coding and
the data backs it up.

Struggling with AI ✦ 16+ years of experience, still humbled
by a chatbot ✦ Struggled a lot with AI-assisted coding ✦ Code quality was extremely poor ✦ Often had to spent time fi xing it ✦ Or throwing it away and doing manually

AI-Assisted Problems ✦ Hallucinated APIs, function calls, and packages ✦
Insecure code ✦ Architectural drift ✦ Ignored edge-cases ✦ Incorrect, or no error-handling ✦ Performance issues ✦ So many more...

The Data Agrees METR METR's randomized controlled trial found experienced
developers were 19% slower with early-2025 AI. SOURCE DORA Google's DORA 2024 research found AI adoption reduced delivery stability, continuing into 2025 despite higher adoption & throughput. SOURCE

“Seniors often get worse results than juniors from same tools
until they learn deliberate prompting. But once they do they have a massive advantage. Sabrina Goldfarb SWE at Github Co-Pilot

02 It's Not The Model Exploring the root causes and
developing the right thinking model.

It's a You Problem ✦ Don't understand how LLMs work
✦ Gold fi sh memory & context management ✦ Incomplete specs ✦ Basic prompts ✦ Missing documentation & examples ✦ Unreliable guardrails ✦ No systems or quality checks ✦ Agents don't receive feedback about what's wrong

Mental Models AI Search Shallow use of modern LLMs as
a Google replacement Vibe Coding Fully delegating code to AI without reviewing output Vibe Engineering Accelerating professional software engineering with AI YOU ARE HERE

Vibe Engineering ✦ Does not mean better prompts ✦ Foundation/architecture/system
where the agent can "succeed" ✦ Feedback loops ✦ Also called Evaluation Driven Development (EDD)

“You shouldn’t be prompting coding agents anymore. You should be
designing loops that prompt your agents. Peter Steinberger Creator of OpenClaw, Technical Staff at OpenAI

03 Harness Engineering The scaffold is the product.

✦ LangChain research team describe it as: Agent = Model
+ Harness ✦ "Everything other than the model" ✦ Prompt, Evals, Tool Calls, Docs, Context, etc. ✦ Even the GUI/CLI "agent" tool you use What's a Harness? “Agent = Model + Harness Vivek Trivedi (Researcher, LangChain)

✦ SWE bench score improvements ✦ 42% → 78%, 46%
→ 80%, 23% → 45% ✦ ~22 point swings vs ~1 point swings ✦ Using frontier models The Model Doesn't Matter SAME HARNESS Different Model SAME MODEL Scaffold Changes ~1 ~22 POINT SWINGS POINT SWINGS

✦ Inner Harness (System) ✦ Built into your coding agent
(CLI/GUI tool) ✦ System prompt, Tool calls, Orchestration ✦ Outer Harness (User) ✦ Controls put in place by users ✦ User prompt, Agent rules, Output validation ✦ Our focus today Anatomy of a Harness MODEL INNER HARNESS OUTER HARNESS

Types of Harness Feedforward Guides Feedback Sensors BEHAVIOUR MAINTAINABILITY ARCHITECTURE
DIRECTION DOMAIN INFERENTIAL DETERMINISTIC NATURE

 HUMAN ✦ AGENT PROMPTS AGENTS.MD SPECS, PRD & ADR
STYLEGUIDES REFERENCE DOCS RULES SCRIPTS / CLI TOOLS CODEMODS LANGUAGE SERVERS ... UNIT TESTS E2E TESTS STATIC ANALYSIS REVIEW AGENTS LOGS BROWSER LINTERS SBOM VALIDATION SECURITY SCANNERS ... Feedforward Guides Feedback Sensors INITIAL GENERATION SELF-CORRECTING

✦ Write actual documentation ✦ Guides, rules, conventions; plus examples
✦ Current architecture overview ✦ Long-term specs, PRDs, and ADRs ✦ Add helpful tooling ✦ Code generation scripts, tools, helpers ✦ Language servers ✦ Entrypoint is the "router" Implementing Guides my_app ├── AGENTS.md ├── docs/ │ ├── rules/ │ ├── guides/ │ ├── adrs/ │ └── specs/ ├── . . . └── . . .

✦ More important than Guides ✦ For maintainability and architectural
quality ✦ Focus on Deterministic controls fi rst ✦ Fast, reliable, cheap ✦ Implementation Layers ✦ Fastest & accurate feedback early ✦ Goal: Push agents' reliable coverage as far up as possible Implementing Sensors 1. LINTING & STATIC CHECKS 2. UNIT TESTS 3. INTEGRATION/ E2E 4. AI REVIEWS 5. MANUAL QA IMPLEMENTATION LAYERS

04 Live Workshop It's your turn. Let's build our own
outer harness.

Pick an Idea Lets' build the App and a Harness

05 What's Next? Improving harnesses and building repeatable systems.

✦ Establish Discipline ✦ Capture standard conventions, security mandates, architecture
patterns ✦ Keep AI out of writing tests, preserve double-bookkeeping ✦ Build Reusable Harnesses ✦ CI templates with common deterministic checks ✦ Inferential review agents for security, architecture, gap analysis, even PR reviews ✦ Scale via Service Templates ✦ Service-level AGENTS.md Recommendations

✦ Enterprises & agencies have pre-de fi ned service templates
✦ Internal team guides ✦ Codemods & internal tools ✦ Boilerplate projects ✦ Embed harnesses directly in them ✦ Scaffold not just code, but AI knowledge and conventions from day one ✦ Inter-organization review agents Service Templates

✦ Custom skills and slash commands ✦ Subagents for sub-tasks
for context optimization ✦ Agent Councils & Consensus ✦ Adverserial reviews with multiple agents deciding on next steps ✦ Parallel agent execution with git worktrees ✦ Multiply output using same harness ✦ Independently running agent loops ✦ Spec → Code → PR → Review → Address Feedback → Merge Advanced Workflows

Questions? Further Reading → martinfowler.com/articles/harness-engineering.html These Slides → shyr.io/t/disciplined-vibes More
Talks → shyr.io/talks shyr.io [email protected] @sheharyarn   

Disciplined Vibes: Scaling AI-Assisted Engineering

Disciplined Vibes: Scaling AI-Assisted Engineering

Sheharyar Naseer

More Decks by Sheharyar Naseer

Other Decks in Technology

Featured

Transcript

Code District, Lahore Disciplined Vibes: Scaling AI-Assisted Engineering Sheharyar Naseer

Sheharyar Naseer Systems architect & technology advisor for startups and

Background ✦ Principal Software Architect at Infra One ✦ Worked

Outline PART 1 The Problem PART 2 It's Not the

01 The Problem Why you're struggling with AI-assisted coding and

Struggling with AI ✦ 16+ years of experience, still humbled

AI-Assisted Problems ✦ Hallucinated APIs, function calls, and packages ✦

The Data Agrees METR METR's randomized controlled trial found experienced

“Seniors often get worse results than juniors from same tools

02 It's Not The Model Exploring the root causes and

It's a You Problem ✦ Don't understand how LLMs work

Mental Models AI Search Shallow use of modern LLMs as

Vibe Engineering ✦ Does not mean better prompts ✦ Foundation/architecture/system

“You shouldn’t be prompting coding agents anymore. You should be

03 Harness Engineering The scaffold is the product.

✦ LangChain research team describe it as: Agent = Model

✦ SWE bench score improvements ✦ 42% → 78%, 46%

✦ Inner Harness (System) ✦ Built into your coding agent

Types of Harness Feedforward Guides Feedback Sensors BEHAVIOUR MAINTAINABILITY ARCHITECTURE

Types of Harness Feedforward Guides Feedback Sensors BEHAVIOUR MAINTAINABILITY ARCHITECTURE

Types of Harness Feedforward Guides Feedback Sensors BEHAVIOUR MAINTAINABILITY ARCHITECTURE

Types of Harness Feedforward Guides Feedback Sensors BEHAVIOUR MAINTAINABILITY ARCHITECTURE

Types of Harness Feedforward Guides Feedback Sensors BEHAVIOUR MAINTAINABILITY ARCHITECTURE

 HUMAN ✦ AGENT PROMPTS AGENTS.MD SPECS, PRD & ADR

✦ Write actual documentation ✦ Guides, rules, conventions; plus examples

✦ More important than Guides ✦ For maintainability and architectural

04 Live Workshop It's your turn. Let's build our own

Pick an Idea Lets' build the App and a Harness

05 What's Next? Improving harnesses and building repeatable systems.

✦ Establish Discipline ✦ Capture standard conventions, security mandates, architecture

✦ Enterprises & agencies have pre-de fi ned service templates

✦ Custom skills and slash commands ✦ Subagents for sub-tasks

Questions? Further Reading → martinfowler.com/articles/harness-engineering.html These Slides → shyr.io/t/disciplined-vibes More