How I built a second brain using Firebase, Gemini, Genkit, and Siri

Peter Friese Developer Advocate, Firebase @peterfriese How I built a
second brain Using Firebase, Gemini, Genkit, and Siri

@peterfriese Developer Advocate, Firebase Peter Friese peterfriese.dev @peterfriese youtube.com/c/PeterFriese/ not-only-swift.peterfriese.dev/
@[email protected] youtube.com/c/Firebase

Making Siri sma rt er…

Information overload The problem

Information overload

“There’s an app for that”

Not invented here

Read it later The solution

Codename Sofia ✨Add links via iOS Share Sheet ✨Extract OpenGraph
Metadata ✨Simplified formatting ✨Offline reading

“I still need to read all those articles!” The challenge

Building an AI summariser The solution

https: // ai.google.dev/

https: // aistudio.google.com/

Google AI SDK ✨Access Gemini models ✨SDKs for Python, Node,
Go, Kotlin, Dart, Swift ✨Use for prototyping ✨Relatively low RPM ✨Requires an API key https: / / github.com/google-gemini/generative-ai-swift

Vertex AI in Firebase SDK ✨Access Gemini models ✨SDKs for
Swift, Kotlin, JS, Dart ✨Part of Firebase ✨Enterprise-ready ✨No API key required ✨Supports App Check https: / / firebase.google.com/docs/vertex-ai

Using the VertexAI in Firebase SDK public func summarise(text: String)
async throws -> String { let prompt = "Please summarise the following text for me: \(text)" let flash = "gemini-1.5-flash" let model = VertexAI.vertexAI().generativeModel(modelName: flash) let response = try await model.generateContent(prompt) if let text = response.text { return text } else { return "Couldn't compute summary" } } async, as we will be calling a remote system

async throws -> String { let prompt = "Please summarise the following text for me: \(text)" let flash = "gemini-1.5-flash" let model = VertexAI.vertexAI().generativeModel(modelName: flash) let response = try await model.generateContent(prompt) if let text = response.text { return text } else { return "Couldn't compute summary" } }

async throws -> String { let prompt = "Please summarise the following text for me: \(text)" let flash = "gemini-1.5-flash" let model = VertexAI.vertexAI().generativeModel(modelName: flash) let response = try await model.generateContent(prompt) if let text = response.text { return text } else { return "Couldn't compute summary" } } here’s the remote system call

async throws -> String { let prompt = "Please summarise the following text for me: \(text)" let flash = "gemini-1.5-flash" let model = VertexAI.vertexAI().generativeModel(modelName: flash) let response = try await model.generateContent(prompt) if let text = response.text { return text } else { return "Couldn't compute summary" } }

Conclusion ✨Use AI Studio to prototype your prompts ✨Use Google
AI SDK to call LLMs from your client ✨Use Vertex AI in Firebase SDK to go to production ✨Secure your app with App Check ✨Call LLM directly from the client

Chatting with my second brain Going one step further

01 Vector Embeddings 02 RAG Retrieval Augmented Generation

Vector Embeddings Task: Find all words that are food in
the following sentence

Task: Find all words that are food in the following
sentence “I went down to Aberystwyth on foot to buy some welsh cakes and a few berries. When I finished doing my groceries, I had a latte at Coffee #1, where I met a few other speakers.”

Vector embedding: A numerical representation of a word, sentence, or
any other unit of text.

Vector embedding for food: [-0.018035058, 0.013980114, -0.01309541, 0.024956783, 0.02708295, -0.074924484,
0.03496225, 0.0125780115, . .. ]

Vector embedding for food: [-0.018035058, 0.013980114, -0.01309541, 0.024956783, 0.02708295, -0.074924484,
0.03496225, 0.0125780115, . .. ] Vector embedding for foot: [-0.016025933, 0.008207399, -0.03572462, 0.020942606, -0.0003162824, -0.041694388, 0.050102886, 0.007380137, . .. ]

[51.50721, -0.12758] Coordinates for London

Paris [48.85661, 2.35222] London [51.50721, -0.12758] New York [40.71277, -74.00597]
Boston [42.36008, -71.05888]

Source: https://www.learndatasci.com/glossary/cosine-similarity/ Cosine vector similarity

01 Serverless (There are servers, but Google manages them for
you) 02 03 04 Automatic scaling Scale down to zero. Support for min/max instances. Trusted environment JavaScript / TypeScript / Python

Data wri tt en New user created User signing in
Image uploaded Crashlytics ale r Analytics Conversions File deleted Data deleted Test run completed Data updated New con fi guration Con fi guration rollback Triggers

Computing Vector Embeddings export const updateEmbeddings = onDocumentWritten( { document:
'artifacts/{documentId}', minInstances: 1 }, async (event) => { const newData = event.data ?. after.data() ?? {}; const previousData = event.data ?. before.data() ? ? {}; if (newData.fullText === previousData.fullText) { return; } const embeddings = await embed({ embedder: textEmbeddingGecko, content: Document.fromText(newData.fullText), }); await event.data ?. after.ref.update({ embeddings: FieldValue.vector(embeddings), }); }); Firestore trigger

'artifacts/{documentId}', minInstances: 1 }, async (event) => { const newData = event.data ?. after.data() ?? {}; const previousData = event.data ?. before.data() ? ? {}; if (newData.fullText === previousData.fullText) { return; } const embeddings = await embed({ embedder: textEmbeddingGecko, content: Document.fromText(newData.fullText), }); await event.data ?. after.ref.update({ embeddings: FieldValue.vector(embeddings), }); }); Read data from document

'artifacts/{documentId}', minInstances: 1 }, async (event) => { const newData = event.data ?. after.data() ?? {}; const previousData = event.data ?. before.data() ? ? {}; if (newData.fullText === previousData.fullText) { return; } const embeddings = await embed({ embedder: textEmbeddingGecko, content: Document.fromText(newData.fullText), }); await event.data ?. after.ref.update({ embeddings: FieldValue.vector(embeddings), }); }); Compute embeddings

'artifacts/{documentId}', minInstances: 1 }, async (event) => { const newData = event.data ?. after.data() ?? {}; const previousData = event.data ?. before.data() ? ? {}; if (newData.fullText === previousData.fullText) { return; } const embeddings = await embed({ embedder: textEmbeddingGecko, content: Document.fromText(newData.fullText), }); await event.data ?. after.ref.update({ embeddings: FieldValue.vector(embeddings), }); }); Update Firestore document

01 Compute embeddings for user’s query Using embedding model 02
03 04 Find nearest neighbors Using Vector Similarity Search Inject context / history into prompt Using result of vector search / chat history Generate answer Using LLM Retrieval Augmented Generation

const messageSchema = z.object({ message: z.string(), role: z.enum(['user', 'system']), });
export const inputSchema = z.object({ question: z.string(), history: z.array(messageSchema).optional(), }); export const semanticQAFlow = onFlow( { name: 'semanticQAFlow', httpsOptions: { minInstances: 1 }, inputSchema: inputSchema, outputSchema: z.string(), authPolicy: firebaseAuth((user) = > { }), Genkit AI flow RAG-powered Q&A with Genkit

const messageSchema = z.object({ message: z.string(), role: z.enum(['user', 'system']), });
export const inputSchema = z.object({ question: z.string(), history: z.array(messageSchema).optional(), }); export const semanticQAFlow = onFlow( { name: 'semanticQAFlow', httpsOptions: { minInstances: 1 }, inputSchema: inputSchema, outputSchema: z.string(), authPolicy: firebaseAuth((user) = > { }), Format of the data we will pass in RAG-powered Q&A with Genkit

export const inputSchema = z.object({ question: z.string(), history: z.array(messageSchema).optional(), });
export const semanticQAFlow = onFlow( { name: 'semanticQAFlow', httpsOptions: { minInstances: 1 }, inputSchema: inputSchema, outputSchema: z.string(), authPolicy: firebaseAuth((user) = > { }), }, async (input) => { const docs = await retrieve({ retriever: firestoreRetriever, query: input.question, options: {limit: 3}, }); const context = docs.map((doc) => doc.text()).join('\n\n'); const history = input.history ? . map((message) => Fetch three best matches from Firestore RAG-powered Q&A with Genkit

}, inputSchema: inputSchema, outputSchema: z.string(), authPolicy: firebaseAuth((user) = > {
}), }, async (input) => { const docs = await retrieve({ retriever: firestoreRetriever, query: input.question, options: {limit: 3}, }); const context = docs.map((doc) => doc.text()).join('\n\n'); const history = input.history ? . map((message) => `${message.role}: ${message.message}`).join('\n'); const prompt = ` Your name is Sofia, and you are a knowledge assistant. You were created by Peter Friese, who is a Developer Advocate on the Firebase team at Google. Peter created you to showcase how to use Firebase and AI to build a second brain. Purpose: * Utilize my personal knowledge base to answer any questions I have. Build context using the three best matching documents RAG-powered Q&A with Genkit

}, inputSchema: inputSchema, outputSchema: z.string(), authPolicy: firebaseAuth((user) = > {
}), }, async (input) => { const docs = await retrieve({ retriever: firestoreRetriever, query: input.question, options: {limit: 3}, }); const context = docs.map((doc) => doc.text()).join('\n\n'); const history = input.history ? . map((message) => `${message.role}: ${message.message}`).join('\n'); const prompt = ` Your name is Sofia, and you are a knowledge assistant. You were created by Peter Friese, who is a Developer Advocate on the Firebase team at Google. Peter created you to showcase how to use Firebase and AI to build a second brain. Purpose: * Utilize my personal knowledge base to answer any questions I have. Build chat history RAG-powered Q&A with Genkit

query: input.question, options: {limit: 3}, }); const context = docs.map((doc)
=> doc.text()).join('\n\n'); const history = input.history ? . map((message) => `${message.role}: ${message.message}`).join('\n'); const prompt = ` Your name is Sofia, and you are a knowledge assistant. You were created by Peter Friese, who is a Developer Advocate on the Firebase team at Google. Peter created you to showcase how to use Firebase and AI to build a second brain. Purpose: * Utilize my personal knowledge base to answer any questions I have. Question: ${input.question} Behavior: 1. Analyze the context provided by me. 2. Reference my personal knowledge base and gather relevant information. 3. Synthesize the information and provide a concise, accurate response. The Q&A prompt RAG-powered Q&A with Genkit

query: input.question, options: {limit: 3}, }); const context = docs.map((doc)
=> doc.text()).join('\n\n'); const history = input.history ? . map((message) => `${message.role}: ${message.message}`).join('\n'); const prompt = ` Your name is Sofia, and you are a knowledge assistant. You were created by Peter Friese, who is a Developer Advocate on the Firebase team at Google. Peter created you to showcase how to use Firebase and AI to build a second brain. Purpose: * Utilize my personal knowledge base to answer any questions I have. Question: ${input.question} Behavior: 1. Analyze the context provided by me. 2. Reference my personal knowledge base and gather relevant information. 3. Synthesize the information and provide a concise, accurate response. The user’s question RAG-powered Q&A with Genkit

a second brain. Purpose: * Utilize my personal knowledge base
to answer any questions I have. Question: ${input.question} Behavior: 1. Analyze the context provided by me. 2. Reference my personal knowledge base and gather relevant information. 3. Synthesize the information and provide a concise, accurate response. 4. If there is insufficient information or the knowledge base does not contain relevant information, clearly state you don't know the answer. 5. Do not generate responses based on assumptions or speculation. 6. Respond in a professional and helpful manner. 7. When I ask details about you as a person, just provide basic info, and then try to get the conversation back on track to talk about knowledge in my knowledgebase. 8. If I ask you about what you know, tell me that you only know what is in my knowledge base. Expectations: * Accurate and relevant responses. Specifying model behavior RAG-powered Q&A with Genkit

2. Reference my personal knowledge base and gather relevant information.
3. Synthesize the information and provide a concise, accurate response. 4. If there is insufficient information or the knowledge base does not contain relevant information, clearly state you don't know the answer. 5. Do not generate responses based on assumptions or speculation. 6. Respond in a professional and helpful manner. 7. When I ask details about you as a person, just provide basic info, and then try to get the conversation back on track to talk about knowledge in my knowledgebase. 8. If I ask you about what you know, tell me that you only know what is in my knowledge base. Expectations: * Accurate and relevant responses. * Clear indication when an answer is not known. * No fabrication or speculation in responses. * Provide relevant code snippets. Context: ${context} History: ${history} ` More instructions RAG-powered Q&A with Genkit

3. Synthesize the information and provide a concise, accurate response. 4. If there is insufficient information or the knowledge base does not contain relevant information, clearly state you don't know the answer. 5. Do not generate responses based on assumptions or speculation. 6. Respond in a professional and helpful manner. 7. When I ask details about you as a person, just provide basic info, and then try to get the conversation back on track to talk about knowledge in my knowledgebase. 8. If I ask you about what you know, tell me that you only know what is in my knowledge base. Expectations: * Accurate and relevant responses. * Clear indication when an answer is not known. * No fabrication or speculation in responses. * Provide relevant code snippets. Context: ${context} History: ${history} ` Inject context (from the three documents) RAG-powered Q&A with Genkit

3. Synthesize the information and provide a concise, accurate response. 4. If there is insufficient information or the knowledge base does not contain relevant information, clearly state you don't know the answer. 5. Do not generate responses based on assumptions or speculation. 6. Respond in a professional and helpful manner. 7. When I ask details about you as a person, just provide basic info, and then try to get the conversation back on track to talk about knowledge in my knowledgebase. 8. If I ask you about what you know, tell me that you only know what is in my knowledge base. Expectations: * Accurate and relevant responses. * Clear indication when an answer is not known. * No fabrication or speculation in responses. * Provide relevant code snippets. Context: ${context} History: ${history} ` Inject history (from the chat) RAG-powered Q&A with Genkit

7. When I ask details about you as a person,
just provide basic info, and then try to get the conversation back on track to talk about knowledge in my knowledgebase. 8. If I ask you about what you know, tell me that you only know what is in my knowledge base. Expectations: * Accurate and relevant responses. * Clear indication when an answer is not known. * No fabrication or speculation in responses. * Provide relevant code snippets. Context: ${context} History: ${history} ` const answer = await generate({model: gemini15Flash, prompt}); return answer.text(); } ); Send prompt to model RAG-powered Q&A with Genkit

7. When I ask details about you as a person,
just provide basic info, and then try to get the conversation back on track to talk about knowledge in my knowledgebase. 8. If I ask you about what you know, tell me that you only know what is in my knowledge base. Expectations: * Accurate and relevant responses. * Clear indication when an answer is not known. * No fabrication or speculation in responses. * Provide relevant code snippets. Context: ${context} History: ${history} ` const answer = await generate({model: gemini15Flash, prompt}); return answer.text(); } ); Return answer RAG-powered Q&A with Genkit

RAG-powered Q&A with Genkit private var functions = FirebaseConnector.shared.functions public
func performQuery(question: String, history: [ChatMessage]? = nil) async -> Response { let semanticQAFlowCallable: Callable<Query, Response> = functions.httpsCallable("semanticQAFlow") do { let query = Query(question: question, history: history) let response = try await semanticQAFlowCallable(query) return response } catch { return Response(shortSummary: "I am sorry, but something went wrong.", answer: "Something went wrong. Please try again.", source: nil) } } Callable Cloud Function

RAG-powered Q&A with Genkit private var functions = FirebaseConnector.shared.functions public
func performQuery(question: String, history: [ChatMessage]? = nil) async -> Response { let semanticQAFlowCallable: Callable<Query, Response> = functions.httpsCallable("semanticQAFlow") do { let query = Query(question: question, history: history) let response = try await semanticQAFlowCallable(query) return response } catch { return Response(shortSummary: "I am sorry, but something went wrong.", answer: "Something went wrong. Please try again.", source: nil) } } Invoke callable function

but…

“mhh hmm?”

@MainActor public struct SemanticQAIntent: @preconcurrency AppIntent { private var semanticQAService
= SemanticQAService.shared @Parameter(title: "Question", description: "Answer from your knowledge base") var question: String? public static let title: LocalizedStringResource = "Search" static let description: LocalizedStringResource = "Search your saved artifacts" public static var parameterSummary: some ParameterSummary { Summary("You asked: \(\.$question)") } public init() { } public func perform() async throws -> some ProvidesDialog & ShowsSnippetView { guard let providedPhrase = question else { Conform to AppIntent Siri: App Intents

@MainActor public struct SemanticQAIntent: @preconcurrency AppIntent { private var semanticQAService
= SemanticQAService.shared @Parameter(title: "Question", description: "Answer from your knowledge base") var question: String? public static let title: LocalizedStringResource = "Search" static let description: LocalizedStringResource = "Search your saved artifacts" public static var parameterSummary: some ParameterSummary { Summary("You asked: \(\.$question)") } public init() { } public func perform() async throws -> some ProvidesDialog & ShowsSnippetView { guard let providedPhrase = question else { Define input parameters Siri: App Intents

public static var parameterSummary: some ParameterSummary { Summary("You asked: \(\.$question)")
} public init() { } public func perform() async throws -> some ProvidesDialog & ShowsSnippetView { guard let providedPhrase = question else { throw $question.needsValueError("Sure - what would you like to know?") } let answer = await semanticQAService.performQuery(question: providedPhrase) return .result(dialog: IntentDialog(stringLiteral: "Here is your answer")) { Markdown(answer) .padding() .markdownBlockStyle(\.codeBlock) { configuration in configuration.label .relativeLineSpacing(.em(0.25)) .markdownTextStyle { FontFamilyVariant(.monospaced) FontSize(.em(0.85)) } Siri: App Intents This will be run when your Intent is invoked

} public init() { } public func perform() async throws -> some ProvidesDialog & ShowsSnippetView { guard let providedPhrase = question else { throw $question.needsValueError("Sure - what would you like to know?") } let answer = await semanticQAService.performQuery(question: providedPhrase) return .result(dialog: IntentDialog(stringLiteral: "Here is your answer")) { Markdown(answer) .padding() .markdownBlockStyle(\.codeBlock) { configuration in configuration.label .relativeLineSpacing(.em(0.25)) .markdownTextStyle { FontFamilyVariant(.monospaced) FontSize(.em(0.85)) } Siri: App Intents Call the Q&A Genkit flow

} public init() { } public func perform() async throws -> some ProvidesDialog & ShowsSnippetView { guard let providedPhrase = question else { throw $question.needsValueError("Sure - what would you like to know?") } let answer = await semanticQAService.performQuery(question: providedPhrase) return .result(dialog: IntentDialog(stringLiteral: "Here is your answer")) { Markdown(answer) .padding() .markdownBlockStyle(\.codeBlock) { configuration in configuration.label .relativeLineSpacing(.em(0.25)) .markdownTextStyle { FontFamilyVariant(.monospaced) FontSize(.em(0.85)) } Siri: App Intents Result is a dialog

} public init() { } public func perform() async throws -> some ProvidesDialog & ShowsSnippetView { guard let providedPhrase = question else { throw $question.needsValueError("Sure - what would you like to know?") } let answer = await semanticQAService.performQuery(question: providedPhrase) return .result(dialog: IntentDialog(stringLiteral: "Here is your answer")) { Markdown(answer) .padding() .markdownBlockStyle(\.codeBlock) { configuration in configuration.label .relativeLineSpacing(.em(0.25)) .markdownTextStyle { FontFamilyVariant(.monospaced) FontSize(.em(0.85)) } Siri: App Intents gonzalezreal/swift-markdown-ui

Peter Friese, @peterfriese How I built a second brain Using
Firebase, Gemini, Genkit, and Siri

S1 E4 Building a Second Brain App - Data Access
􀉉 Friday, November 15th 􀐫 18 : 00 CET / 9 : 00 PST / 12 : 00 EST / 17 : 00 UTC Livestream

peterfriese.dev @peterfriese youtube.com/c/PeterFriese/ not-only-swift.peterfriese.dev/ @[email protected] youtube.com/c/Firebase Thanks!

How I built a second brain using Firebase, Gemi...

How I built a second brain using Firebase, Gemini, Genkit, and Siri

More Decks by Peter Friese

Other Decks in Programming

Featured

Transcript