Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Samba Cloudの高速推論を活用した模範解答分析と開発知見
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
いおりん
October 08, 2025
0
170
Samba Cloudの高速推論を活用した模範解答分析と開発知見
いおりん
October 08, 2025
Tweet
Share
More Decks by いおりん
See All by いおりん
claude codeでPrompt Engineering
iori0311
0
810
Featured
See All Featured
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
66
37k
Groundhog Day: Seeking Process in Gaming for Health
codingconduct
0
99
Into the Great Unknown - MozCon
thekraken
40
2.3k
SEO Brein meetup: CTRL+C is not how to scale international SEO
lindahogenes
0
2.4k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
Producing Creativity
orderedlist
PRO
348
40k
GraphQLの誤解/rethinking-graphql
sonatard
74
11k
The innovator’s Mindset - Leading Through an Era of Exponential Change - McGill University 2025
jdejongh
PRO
1
96
Everyday Curiosity
cassininazir
0
130
A better future with KSS
kneath
240
18k
The agentic SEO stack - context over prompts
schlessera
0
650
The Organizational Zoo: Understanding Human Behavior Agility Through Metaphoric Constructive Conversations (based on the works of Arthur Shelley, Ph.D)
kimpetersen
PRO
0
240
Transcript
4BNCB$MPVEͷߴਪΛ ׆༻ͨ͠ൣղੳͱ։ൃݟ Samba Cloud Tokyo Mingle 4 202510݄7 גࣜձࣾ ϚΠΫϩγϛϡϨʔγϣϯ
ࣗݾհ גࣜձࣾϚΠΫϩγϛϡϨʔγϣϯ ٱอ ҏ৫ σδλϧ࠾αʔϏεtestusͷ։ൃ
ൣճ ࠾ج४ LLMʹΑΔ࠾ LLMΛ׆༻ͨ͠࠾ϑϩʔ
࠾ج४ੜͷͨΊͷൣղੳͱϞσϧͷੑೳ ൣղ: εϙʔπΛ͢Δ͜ͱɺָ͍͚ͩ͠Ͱͳ͘ɺ݈߁ʹྑ͍ɻ ## ൣճͷྨ ɾӳจ༁ ɾจӳ༁ ɾࣗ༝ӳ࡞ ## ൣղͷੳ
ΠσΟΦϜߏจ จ๏ ## ࠾ج४ͷੜ Prompt gemini-2.5-flash( ⚪︎ ) gemini-2.5-flash-lite(×) ӳ༁ͱͯ͠ͷ ج४ʹͳ͍ͬͯΔ
ίετͱੜ࣌ؒ ※1 τʔΫϯྔͱੜ࣌ؒGoogle AI studioΛ༻ ※2 ίετܭࢉVertex AIͷྉۚදΑΓࢉग़ ※3 ίϯςΩετΩϟογϡͷ༻ͳ͠
࠾ج४ੜϓϩϯϓτʹର͢Δ֤Ϟσϧ͝ͱͷൺֱ ϞσϧͷܰྔԽʹΑΓ ίετͱੜ࣌ؒͷվળ Λ࣮ݱ͍ͨ͠ 1288.8×10-6 $, 5.1s 336.5×10-6 $, 3.3s ࣄલʹλΠϓͷྨ ϓϩϯϓτ࠷దԽ ʴ ίετ1/4 ੜ࣌ؒ-2s
Samba CloudΛ༻͍ͨࣄલߴਪͷ׆༻ ϦΞϧλΠϜੑΛॏࢹ͠Samba CloudΛબ ਖ਼ͷೖྗ͔Β λάΛಈతੜ ※1 gpt-oss120BͷReasoningEffortLowʹઃఆ ※2 ίϯςΩετΩϟογϡͷ༻ͳ͠
geminiͱSamba Cloudͷൺֱ 0.62s 2.9s λάΛ༻͍ͯ ࠾ج४Λੜ
ίετͱੜ࣌ؒͷվળ ࠾ج४ੜͷίετͱੜ࣌ؒ ίετɿ 40%Ҏ্ݮ ੜ࣌ؒɿ 2sվળ 742.62×10-6 $ 1288.8×10-6 $
5.1s 3.3s ※ੜ࣌ؒ࠾ج४ͷੜͷΈͰgpt-ossͷਪؚ࣌ؒΜͰ͍·ͤΜ
ಘΒΕͨݟͱࠓޙͷ։ൃ λεΫΛղܰ͠ྔͳϞσϧΛΈ߹ΘͤΔ͜ͱͰίετͱੜ͕࣌ؒվળ σʔληοτͱLLM as a judgeΛ׆༻ͨ͠Ϟσϧɾϓϩϯϓτ࠷దԽ ΑΓΠϯύΫτͷ͋Δ࠾෦ͷ׆༻(ੜె×)
͝ਗ਼ௌ͋Γ͕ͱ͏͍͟͝·ͨ͠ɻ
Appendix 1ɹGeminiͷίετ༁ Ϟσϧ input($/1Mtok) output($/1Mtok) 2.5-flash 0.3 2.5 2.5-flash-lite 0.1
0.4 Vertex AI Ͱͷ AI ϞσϧͷߏஙͱσϓϩΠʹ͔͔Δඅ༻ https://cloud.google.com/vertex-ai/generative-ai/pricing
Appendix 2ɹSamba CloudͷϞσϧൺֱɹ Ϟσϧ input($/1Mtok) output($/1Mtok) gpt-oss-120B 0.22 0.59 DeepSeek-V3.1
3.00 4.50 Qwen3-32B 0.40 0.80 Samba Cloud Pricing by Model Family & Usage Type https://cloud.sambanova.ai/plans/pricing