Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Understanding Specification-Driven Code Generat...

Avatar for Giovanni Rosa Giovanni Rosa
March 19, 2026
7

Understanding Specification-Driven Code Generation with LLMs: An Empirical Study Design

Abstract: Large Language Models (LLMs) are increasingly integrated into software development workflows, yet their behavior in structured, specification-driven processes remains poorly understood. This paper presents an empirical study design using CURRANTE, a Visual Studio Code extension that enables a human-in-the-loop workflow for LLM-assisted code generation. The tool guides developers through three sequential stages--Specification, Tests, and Function--allowing them to define requirements, generate and refine test suites, and produce functions that satisfy those tests. Participants will solve medium-difficulty problems from the LiveCodeBench dataset, while the tool records fine-grained interaction logs, effectiveness metrics (e.g., pass rate, all-pass completion), efficiency indicators (e.g., time-to-pass), and iteration behaviors. The study aims to analyze how human intervention in specification and test refinement influences the quality and dynamics of LLM-generated code. The results will provide empirical insights into the design of next-generation development environments that align human reasoning with model-driven code generation.

Preprint: https://arxiv.org/abs/2601.03878

Avatar for Giovanni Rosa

Giovanni Rosa

March 19, 2026
Tweet

Transcript

  1. Understanding Specification-Driven Code Generation with LLMs: An Empirical Study Design

    SoftDev Speaker: Giovanni Rosa Postdoctoral Researcher Universidad Rey Juan Carlos Giovanni Rosa, David Moreno-Lumbreras, Gregorio Robles and Jesus M. Gonzalez-Barahona
  2. Towards Spec-Driven Development Modern AI tools (Copilot & co.) Powerful

    automation but guided by abstract prompts Current frontier: representing human intent via formal and structured specifications Test Driven Development Code centric, High human effort
  3. A TDD-inspired workflow to guide code generation The user iteratively

    refines and validates a formal input specification (tests)
  4. Summary Giovanni Rosa Postdoctoral Researcher @ URJC More at giovannirosa.com

    Funded by the Spanish AEI Ref. 2024/00416/002 1 4 5 3 2 6
  5. Acknowledgments More at https://advise.codeberg.page/ This work is part of the

    ADVISE project (ADvanced Vision on Intelligent Software Engineering) ADVISE aims to integrate AI agents into software development to ensure code meets requirements and aligns with developer intent. Funded by the Spanish AEI, reference 2024/00416/002