the response - Close to function calling, but the invocation is requested by the client - Can be anything: database, remote service… Resources - Expose data - URL -> Content Prompts - Pre-written prompt template - Allows executing specific prompt
Response and Notifications - Possible multiplexing Transports - stdio -> The client instantiates the server, sends the requests on stdio and gets the response from the same channel - Server-Sent Event (SSE) -> The client sends a POST request to the server, the response is an SSE (chunked response) - Streamable HTTP ->
Run the model - CPU / GPU - Expose an API Input - Prompt (text) - Instructions to give to the model - Taming a model is hard GPUs Input (Prompt) Output
applications) A software program enhanced with artificial intelligence capabilities, utilizing AI models to implement intelligent features and functionalities. Model Serving
resources that the model can call: Prompt (Context) Extend the context with MCP tools descriptions Invoke the model The model invoke a tool invocation (name + parameters) The tool is invoked via JSON-RPC and the result sent to the model The model computes the response using the tool result Response
vs Local - Lack of documentation for SSE - Security - Authentication - Wanaku experience : limitation in the number of tools - MCP and Agentic -> emerging A2A - Not the only protocol (ACP …)