
Agents in Kotlin - Building Smarter Apps with Shared Codebases using KMP and Koog & Gemini and More

Kotlin Multiplatform (KMP) made it possible to share business logic across platforms. The next evolution is sharing intelligence. This session explores how AI agents can live inside shared Kotlin codebases - reasoning, planning, and acting across Android, iOS, and backend (Ktor) targets.

We'll build a practical KMP app that integrates Koog, Gemini Nano, and Vertex AI, combining on-device inference with cloud-based reasoning to enable context-aware and adaptive behavior. You’ll learn how to design agent lifecycles, manage shared state and context across coroutines and Flows, and safely expose structured actions to platform layers.

We'll also cover production-ready concerns like concurrency control, observability, and debugging intelligent flows—giving you a hands-on blueprint for embedding intelligence directly into your shared Kotlin architecture.
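As a minimal sketch of what "shared state across coroutines and Flows" could look like in `commonMain`: the `AgentState` type and `generate` function below are illustrative placeholders, not code from the talk.

```kotlin
import kotlinx.coroutines.CoroutineScope
import kotlinx.coroutines.flow.MutableStateFlow
import kotlinx.coroutines.flow.StateFlow
import kotlinx.coroutines.flow.asStateFlow
import kotlinx.coroutines.launch

// Hypothetical agent lifecycle states; the talk's real types may differ.
sealed interface AgentState {
    data object Idle : AgentState
    data object Thinking : AgentState
    data class Done(val answer: String) : AgentState
    data class Failed(val error: Throwable) : AgentState
}

// Lives in commonMain; Android/iOS UI layers just collect the StateFlow.
class AgentViewModel(
    private val scope: CoroutineScope,
    private val generate: suspend (String) -> String, // e.g. backed by Gemini or Koog
) {
    private val _state = MutableStateFlow<AgentState>(AgentState.Idle)
    val state: StateFlow<AgentState> = _state.asStateFlow()

    fun ask(prompt: String) {
        _state.value = AgentState.Thinking
        scope.launch {
            _state.value = runCatching { generate(prompt) }
                .fold(AgentState::Done, AgentState::Failed)
        }
    }
}
```

Keeping the lifecycle in shared code means each platform only renders states; swapping the backing model (cloud vs. on-device) doesn't touch the UI layers.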

Rivu Chakraborty

November 15, 2025

Transcript

  1. Agents in Kotlin Building Smarter Apps with Shared Codebases using

    KMP and Koog & Gemini and More Rivu Chakraborty
  2. WHO AM I? • GDE (Google Developer Expert) for Android

    • Previously India’s first GDE for Kotlin • More than 14 years in the Industry • Founder @ Mobrio Studio • Previously ◦ JioCinema/JioHotstar, Byju’s, Paytm, Gojek, Meesho • Author (wrote multiple Kotlin books) • Speaker • Mentor • Learning / Exploring ML/AI • Community Person (Started KotlinKolkata) • YouTuber (http://youtube.com/@RivuTalks)
  3. What All Changed in 7 years • Became a GDE

    • Changed 4 companies • Started Mobrio Studio
  4. • Led personally by me (Rivu), with my decades of

    experience scaling 6+ unicorn startups, and many smaller ones • We do Mobile Dev tooling (products), and we also consult with product-based startups, helping them develop or scale their apps • We can help with anything to do with mobile, from code quality, migration, and refactoring to feature development • At Mobrio Studio, I have a team who work under my direct supervision • We don’t just develop for you, we train your team, so you’re independent in the future https://mobrio.studio/
  5. Agents in Kotlin : Building Smarter Apps with Shared Codebases

    using KMP and Koog (and Gemini, and Gemma, and …)
  6. WHY THIS TALK? • GenAI & Agents are hot •

    Gemini API, Gemini Nano (Experimental) and Gemma models allow apps to use AI easily • KMP lets us build once for Android, iOS, Web & more • Koog lets you build agents for multiple platforms • We'll walk through real code & gotchas
  7. 01 Intro to KMP 02 Intro to AI, Gemini, Gemma,

    Gemini Nano, VertexAI 03 Koog Sections of the Talk
  8. What’s KMP and Why? • A technology by JetBrains to

    share Kotlin code across platforms (Android, iOS, web, desktop, server). • Enables platform-specific UI while sharing core business logic (networking, database, state management). You control what you share and what you don’t • Write Once, Run Natively: Outputs native binaries (no VM or JS bridge).
  9. 01 True Multiplatform 02 Compiles to native or platform code (JVM

    bytecode, native binaries, or minified JS) for platform-specific code 03 You decide how much code you want to share Want to build a full app? Use Compose Multiplatform, or just share pieces of code with KMP Differences with Cross-platform 04 Interoperability Kotlin is fully interoperable with Java; it’s interoperable with JS and Objective-C as well, and Swift interoperability is improving with official Swift support
  10. A Brief on KMP Common JVM JS Native Android Server

    Apple Linux mingw iOS tvOS watchOS macOS linux_x64 linux_arm64 mingwX64 iosX64 iosArm64 iosSimulatorArm64 tvosX64 tvosArm64 tvosSimulatorArm64 macosX64 macosArm64 watchosX64 watchosArm32 watchosArm64 watchosSimulatorArm64 watchosDeviceArm64
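A minimal sketch of how a subset of this target hierarchy might be declared in a shared module's `build.gradle.kts` (module layout, versions, and target selection are illustrative, not from the talk):

```kotlin
// build.gradle.kts of a shared KMP module (illustrative subset of targets)
plugins {
    kotlin("multiplatform") version "2.0.0"
}

kotlin {
    jvm()                  // Android/server via JVM bytecode
    js { browser() }       // minified JS
    iosArm64()             // Kotlin/Native: iOS device
    iosSimulatorArm64()    // Kotlin/Native: Apple-silicon simulator

    sourceSets {
        commonMain.dependencies {
            implementation("org.jetbrains.kotlinx:kotlinx-coroutines-core:1.9.0")
        }
    }
}
```

Each declared target gets its own `*Main` source set (jvmMain, jsMain, iosMain, ...), which is where the `actual` implementations from the next slide live.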
  11. A Brief on KMP

    // commonMain
    expect fun getRandomUUID(): String

    // jvmMain
    actual fun getRandomUUID(): String = UUID.randomUUID().toString()

    // iosMain
    actual fun getRandomUUID(): String = NSUUID().UUIDString()

    // jsMain
    import uuid
    actual fun getRandomUUID(): String = uuid.v4().toString()
  12. AI

  13. What’s AI? Traditional Programming: developers write explicit algorithms that

    take input and produce a desired output (Algorithm + Input → Output). Machine Learning: 1. Train the model with a large dataset of inputs and outputs 2. The model is deployed on cloud/on-device to process input data, i.e. inference (Training: Input + Output → ML Model; Inference: Input + ML Model → Output)
  14. What’s GenAI? • Generative AI introduces the capability to understand

    inputs such as text, images, audio and video and generate human-like responses. • This enables applications like chatbots, language translation, text summarization, image captioning, image or code generation, creative writing assistance, and much more. • At its core, an LLM is a neural network model trained on massive amounts of text data. It learns patterns, grammar, and semantic relationships between words and phrases, enabling it to predict and generate text that mimics human language.
  15. What is GenAI in Mobile Development? • GenAI brings creative

    intelligence to mobile apps by enabling them to generate rather than just respond. • Enables hyper-personalized, intelligent, and context-aware user experiences. • Enhances accessibility, productivity, and entertainment within apps. • Can run on-device (for privacy/speed) or via cloud APIs. • In mobile apps, GenAI powers features like: a. Text generation (e.g., storytelling, smart replies, chatbots) b. Image generation/editing c. Voice synthesis (TTS)
  16. Why Gemini (by Google)? • Multimodal: Understands text, image, code,

    audio, and more. • Optimized for Android, iOS & Web • Enhances accessibility, productivity, and entertainment within apps. • Developer Friendly a. Easy-to-use libraries / APIs b. SDKs support prompting, streaming, and low-latency generation
  17. Different Ways To Integrate AI Directly in Mobile Apps 01

    Gemini API 02 MediaPipe / LLM Inference Library and Offline Model Can be used with any TFLite / LiteRT models, not Gemma-specific 03 Gemini Nano Currently experimental, available only on Pixel 9 devices Either directly with the Gemini API or by using the third-party library by Shreyas 04 Firebase Vertex AI You can use Gemini APIs and models with Firebase Vertex AI, reducing the need for handling intricate details yourself 05 Koog You can use Koog, along with various LLM providers, and custom/inbuilt tools to build AI agents and use them directly in mobile apps
  18. Google Generative AI SDK for Kotlin Multiplatform by Shreyas Patil

    - hps://github.com/PatilShrey as/generative-ai-kmp API key stored in BuildKonfig Suspend function for story generation Works on Android & iOS GEMINI INTEGRATION (ONLINE)
  19. GENERATIVEMODEL IMPLEMENTATION (GEMINI)

    commonMain.dependencies {
        implementation("dev.shreyaspatil.generativeai:generativeai-google:<version>")
    }

    class GenerativeModelGemini(private val apiKey: String) : GenerativeModel {
        private val model by lazy { GeminiApiGenerativeModel( ... ) }

        override suspend fun generate(prompt: String, awaitReadiness: Boolean): Result<String> {
            return runCatching {
                val input = content { text(prompt) }
                val response = model.generateContent(input)
                response.text ?: throw UnsupportedOperationException("No text returned from model")
            }
        }
    }

    https://github.com/PatilShreyas/generative-ai-kmp
  20. 01 USES MEDIAPIPE GENAI 02 TEXTGENERATOR EXPECT/ACTUAL for platform-specific code

    03 LOCALGENERATIVEMODEL wraps the logic OFFLINE MODE WITH GEMMA
  21. DOWNLOAD .TASK FILE FROM SERVER or pack with app

    (not recommended) STORE IN INTERNAL APP DIRECTORY INIT MEDIAPIPE LLM AFTER DOWNLOAD COMPLETES MODEL DOWNLOAD & INITIALIZATION
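The download-then-initialize flow above could be sketched like this on Android, assuming a plain HTTPS download into the app's internal files directory (the URL, file name, and helper are placeholders, not the talk's code):

```kotlin
import java.io.File
import java.net.URL

// Downloads a .task model file into internal storage if it isn't there yet,
// then returns it so MediaPipe's LlmInference can be initialized.
// modelUrl is a placeholder; a real app would use its HTTP client of choice
// (Ktor, OkHttp) and report download progress to the UI.
fun ensureModelFile(filesDir: File, modelUrl: String, fileName: String = "gemma.task"): File {
    val modelFile = File(filesDir, fileName)
    if (!modelFile.exists()) {
        val tmp = File(filesDir, "$fileName.part")
        URL(modelUrl).openStream().use { input ->
            tmp.outputStream().use { output -> input.copyTo(output) }
        }
        tmp.renameTo(modelFile) // only expose the file once fully written
    }
    return modelFile
}
```

Writing to a temporary `.part` file first avoids initializing MediaPipe against a half-downloaded model after a crash or network drop.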
  22. Init MediaPipe LLM Inference

    private val llmInference: LlmInference by lazy {
        val options = LlmInference.LlmInferenceOptions.builder()
            .setModelPath(modelFile.absolutePath)
            .setMaxTokens(1024)
            .setMaxTopK(40)
            .build()
        LlmInference.createFromOptions(context, options)
    }
  23. Generation Settings

    GeminiApiGenerativeModel(
        modelName = "gemini-2.0-flash",
        apiKey = apiKey,
        generationConfig = GenerationConfig.Builder().apply { topK = 40 }.build()
    )

    private val llmInference: LlmInference by lazy {
        val options = LlmInference.LlmInferenceOptions.builder()
            .setModelPath(modelFile.absolutePath)
            .setMaxTokens(512)
            .setMaxTopK(40)
            .build()
        LlmInference.createFromOptions(context, options)
    }
  24. Generation Settings

    GeminiApiGenerativeModel(
        modelName = "gemini-2.0-flash",
        apiKey = apiKey,
        generationConfig = GenerationConfig.Builder().apply { topK = 40 }.build()
    )

    private val llmInference: LlmInference by lazy {
        val options = LlmInference.LlmInferenceOptions.builder()
            .setModelPath(modelFile.absolutePath)
            .setMaxTokens(512)
            .setMaxTopK(40)
            .build()
        LlmInference.createFromOptions(context, options)
    }
  25. TopK • Top-K filters tokens for output. • For example

    a Top-K of 3 keeps the three most probable tokens. • Increasing the Top-K value will increase the randomness of the model response.
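Top-K filtering can be illustrated in a few lines of plain Kotlin. This is a simplified sketch: real samplers operate on logits, combine Top-K with temperature and Top-P, and sample from the renormalized distribution.

```kotlin
// Simplified Top-K: keep only the K most probable tokens and renormalize
// so the remaining probabilities sum to 1, then sample from that subset.
fun topKFilter(probs: Map<String, Double>, k: Int): Map<String, Double> {
    val kept = probs.entries.sortedByDescending { it.value }.take(k)
    val total = kept.sumOf { it.value }
    return kept.associate { it.key to it.value / total }
}

fun main() {
    val probs = mapOf("cat" to 0.5, "dog" to 0.3, "car" to 0.15, "cart" to 0.05)
    println(topKFilter(probs, 2)) // only "cat" and "dog" survive, renormalized
}
```

With k = 1 this degenerates to greedy decoding; a larger k leaves more candidates in play, which is why raising Top-K increases randomness.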
  26. maxTokens • Limits the maximum output length a model can

    generate • A token can be a whole word, part of a word (like "ing" or "est"), punctuation, or even a space. The exact way text is tokenized depends on the specific model’s tokenizer. • Whenever we call llmInference.generateResponse(prompt), the response generated by the local model will contain at most 512 tokens.
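A common rough rule of thumb (an approximation only; the true count comes from the model's tokenizer) is about 4 characters per token for English text, which is enough to sanity-check prompts against a `maxTokens` budget:

```kotlin
// Very rough heuristic: ~4 characters per token for English text.
// Only for sanity-checking; the exact count depends on the model's tokenizer.
fun approxTokenCount(text: String): Int = (text.length + 3) / 4

fun fitsInBudget(prompt: String, maxTokens: Int): Boolean =
    approxTokenCount(prompt) <= maxTokens

fun main() {
    val prompt = "Write a short story about a lighthouse keeper."
    println(approxTokenCount(prompt)) // rough estimate, not a tokenizer count
}
```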
  27. Integrate Gemini Nano

    generativeModel = try {
        val generationConfig = generationConfig {
            context = getApplication<Application>().applicationContext
            temperature = 0.2f
            topK = 40
            maxOutputTokens = 1024
        }
        val downloadConfig = DownloadConfig(
            object : DownloadCallback {
                override fun onDownloadCompleted() {
                    isReady.update { true }
                }
            }
        )
        GenerativeModel(
            generationConfig = generationConfig,
            downloadConfig = downloadConfig
        )
    } catch (e: Exception) {
        Log.e("MainViewModel", "Failed to initialize AI Core: ${e.message}")
        null
    }
  28. Integrate Gemini Nano

    val generationConfig = generationConfig {
        context = getApplication<Application>().applicationContext
        temperature = 0.2f
        topK = 40
        maxOutputTokens = 1024
    }
  29. Vertex AI Google Recommends using Vertex AI in Firebase SDK

    for Android to access the Gemini API and the Gemini family of models directly from the app.
  30. Vertex AI

    implementation("com.google.firebase:firebase-vertexai:$version")

    class GenerativeModelVertex : GenerativeModel {
        val generativeModel = Firebase.vertexAI.generativeModel("gemini-2.5-flash")

        override suspend fun generate(prompt: String, awaitReadiness: Boolean): Result<String> {
            return runCatching {
                generativeModel.generateContent(prompt).text
                    ?: throw UnsupportedOperationException("No text returned from model")
            }
        }
    }
  31. Koog Koog is a Kotlin-based framework designed to build and

    run AI agents entirely in idiomatic Kotlin. It lets you create agents that can interact with tools, handle complex workflows, and communicate with users. https://docs.koog.ai/
  32. Koog

    implementation("ai.koog:koog-agents:0.5.2")

    suspend fun runAgent(prompt: String): String {
        return try {
            val agent = AIAgent(
                promptExecutor = simpleGoogleAIExecutor(apiKey),
                llmModel = GoogleModels.Gemini2_5Flash,
                systemPrompt = systemPrompt
            )
            agent.run(prompt)
        } catch (e: Exception) {
            "Koog Agent Error: ${e.message}\n${e.stackTraceToString()}"
        }
    }
  33. Koog - tools • The AI Agent is the "Brain"

    • Tools are the "Hands" and "Eyes" • Example: The "Search" Tool • The AI is the "Smart Foreman" • Why It Matters: Without tools, an AI is just a fancy encyclopedia. With tools, an AI becomes a real personal assistant that can find current information and actually get things done for you in the real world.
  34. Koog - Tools Agents use tools to perform specific tasks

    or access external systems.

    expect class DatabaseOperationsToolSet(
        repository: YourRepository
    ) {
        suspend fun someDBOperation(): Result

        /** Convert to Koog tools */
        fun toTools(): List<Tool<*, *>>
    }
  35. Koog - Tools Android/JVM

    @LLMDescription("Meaningful description of the class/toolset")
    actual class DatabaseOperationsToolSet actual constructor(
        repository: YourRepository
    ) : ToolSet {
        @Tool
        @LLMDescription("Meaningful description of the function/operation")
        actual suspend fun someDBOperation(): Result {...}

        actual fun toTools(): List<Tool<*, *>> {...}
    }
  36. Koog - Tools Common

    private fun createAgent(): AIAgent<String, String> {
        return AIAgent(
            promptExecutor = MultiLLMPromptExecutor(
                GoogleLLMClient(apiKey)
            ),
            systemPrompt = """
                ---Your System Prompt---
            """.trimIndent(),
            llmModel = GoogleModels.Gemini2_5FlashLite,
            toolRegistry = dbToolRegistry
        )
    }
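The `dbToolRegistry` referenced above isn't defined on the slides. Assuming Koog's `ToolRegistry` builder and the `toTools()` conversion from the previous slides, the wiring might look roughly like this (a sketch under those assumptions, not the talk's actual code):

```kotlin
// Hypothetical wiring of the database toolset into a Koog ToolRegistry.
// The ToolRegistry { ... } builder registers tools for the agent to call;
// DatabaseOperationsToolSet and repository come from the earlier slides.
val dbToolRegistry = ToolRegistry {
    tools(DatabaseOperationsToolSet(repository).toTools())
}
```

Whatever the exact builder call, the point is that the expect/actual toolset is converted to Koog `Tool`s once in common code, so every platform's agent sees the same registry.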
  37. Koog - Memory • Your Brain vs. A Goldfish: By

    default, a simple AI is like a goldfish. • Memory is the "Notepad" • It Remembers "You", it Remembers "What It Did" • Short-Term vs. Long-Term • Why It Matters: Without memory, an AI is just a tool. With memory, it becomes a personal assistant that gets smarter and more helpful the more you interact with it.
  38. Koog - Memory • Facts: This is the actual piece

    of information being saved. It’s the "note" itself. Koog has two types. ◦ SingleFact: For one piece of info (e.g., "User’s preferred theme is Dark"). ◦ MultipleFacts: For a list of info (e.g., "User knows Kotlin, Java, and Python"). • Concepts: This is the label or category for the fact. It’s like the heading on a page in the notepad. • Subjects: This is who or what the fact is about. It’s like the label on the "file drawer."
  39. Koog - Memory The AgentMemory feature addresses the challenge of

    maintaining context in AI agents.

    install(AgentMemory) {
        agentName = "query-agent"
        featureName = "natural-language-search"
        organizationName = "mobrio-studio"
        productName = "files-plus"
        memoryProvider = FilesPlusMemory.memoryProvider
    }
  40. Koog - Memory The AgentMemory feature addresses the challenge of

    maintaining context in AI agents.

    FilesPlusMemory.memoryProvider.save(
        fact = SingleFact(
            value = response,
            concept = responseConcept,
            timestamp = Clock.System.now().toEpochMilliseconds(),
        ),
        subject = User,
        scope = MemoryScope.Product("files-plus"),
    )
  41. Koog - Strategy • The "Strategy" is the AI’s "Recipe"

    or "Plan": If the AI is the "foreman" (brain), the tools are the "hands," and the memory is the "notepad," the strategy is the detailed, step-by-step "workflow" or "recipe" the foreman follows to get a job done.
  42. Koog - Strategy • Koog’s "Strategy Graph": In Koog, you

    don’t just write a simple list. You build a "Strategy Graph"—think of it as a flowchart for the AI’s "recipe." This lets you create very smart and complex plans. • Nodes (The Steps): The "Nodes" are the boxes in the flowchart. Each node is one action in the recipe. • Edges (The Arrows): The "Edges" are the arrows that connect the boxes. They show the agent which step to do next. • Subgraphs: A "Recipe" inside a "Recipe"
  43. Koog - Strategy

    val myStrategy = strategy<String, String>("my-strategy") {
        val nodeCallLLM by nodeLLMRequest()
        val executeToolCall by nodeExecuteTool()
        val sendToolResult by nodeLLMSendToolResult()

        edge(nodeStart forwardTo nodeCallLLM)
        edge(nodeCallLLM forwardTo nodeFinish onAssistantMessage { true })
        edge(nodeCallLLM forwardTo executeToolCall onToolCall { true })
        edge(executeToolCall forwardTo sendToolResult)
        edge(sendToolResult forwardTo nodeFinish onAssistantMessage { true })
        edge(sendToolResult forwardTo executeToolCall onToolCall { true })
    }
  44. Koog - Strategy

    val nodeProcessQuery by subgraph<String, String> {
        val processQuery by nodeLLMRequest()
        val executeToolCall by nodeExecuteTool()
        val sendToolResult by nodeLLMSendToolResult()
        val processToolResult by node<Message.Response, String> { input -> input.content }

        edge(nodeStart forwardTo processQuery)
        edge(processQuery forwardTo executeToolCall onToolCall { true })
        edge(executeToolCall forwardTo sendToolResult)
        edge(sendToolResult forwardTo processToolResult)
        edge(processToolResult forwardTo processQuery)
        edge(processQuery forwardTo nodeFinish onAssistantMessage { true })
    }
  45. CHALLENGES FACED COCOAPODS INTEGRATION FOR IOS CocoaPods integration for

    iOS is deprecated MEDIAPIPE GENAI MediaPipe GenAI supports Android, iOS and Web; however, integrating it with KMP is challenging KOOG AGENTS Using Koog makes building and using agents, or even just calling any prominent LLM, very easy on mobile or any platform
  46. KEY TAKEAWAYS It’s easy to integrate GenAI with your KMP apps LLM

    Inference / MediaPipe works, but it’s not for most use cases Code reusability across platforms with KMP Gemini Nano can be a game changer Koog makes it even easier