how applications provide context to large language models (LLMs). (...) MCP enables you to build agents and complex workflows on top of LLMs and connects your models with the world. modelcontextprotocol.io
APIs / SDKs Hypothesis: if the model just read the docs*, this could be avoided (context engineering / grounding) Solution: expose the “go doc” command as a tool “read_docs” * Note: Gemini 3 is much better at doing this out of the box
if the model could request an “unbiased” review, it would have the opportunity to self-correct Solution: a “code_review” tool that calls a model with fine tuned context
endoscope. Failed due to: - Required precise manipulation (lines and columns) - Doctor-themed naming is cute, but not useful - Behaviour too similar to a built-in tool - Didn’t do much more than run_shell_command
configurations Each alternative runs each scenario N times (sample size) The data is assessed for statistical significance with statistical tests like Welch’s T-test, Fisher’s exact test, etc github.com/danicat/tenkai
an hypothesis Create the tool, run an experiment against baseline Analyse tool usage patterns after the experiment Are the observations statistically significant? Rinse and repeat
to “sell” better the value of the tools Never do a simple operation easily replaced with shell Return additional context (e.g. type info, package docs) On failure, provide hints for self-correction Reduce the model “degree of freedom”
foo()", new_content="func bar()") Key Features: - Normalises whitespace and indentation variances before matching - Supports line numbers for precise replacement - Tolerant to typos up to a given threshold (e.g. Levenshtein distance <= 2) - Runs gofmt and goimports to maintain style consistency. - Prevents saving changes that introduce syntax errors. - Returns suggestions when edit fails
lint). smart_build(packages="./...", auto_fix=true) Key Features: - Runs go build, test and vet (or golangci-lint) in the same step - Reports detailed package coverage and total statement coverage - Resolves minor module and formatting issues automatically (go mod tidy, gofmt -w)
API / SDK Documentation and examples are gold for agents Augment your tools with hints upon failures Reduce the model degree of freedom (e.g. build -> test)