Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Samba Cloudの高速推論を活用した模範解答分析と開発知見
Search
いおりん
October 08, 2025
0
170
Samba Cloudの高速推論を活用した模範解答分析と開発知見
いおりん
October 08, 2025
Tweet
Share
More Decks by いおりん
See All by いおりん
claude codeでPrompt Engineering
iori0311
0
820
Featured
See All Featured
Testing 201, or: Great Expectations
jmmastey
46
8.1k
Building Applications with DynamoDB
mza
96
7k
Git: the NoSQL Database
bkeepers
PRO
432
66k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
16
1.9k
Building a Scalable Design System with Sketch
lauravandoore
463
34k
Marketing Yourself as an Engineer | Alaka | Gurzu
gurzu
0
150
Money Talks: Using Revenue to Get Sh*t Done
nikkihalliwell
0
180
世界の人気アプリ100個を分析して見えたペイウォール設計の心得
akihiro_kokubo
PRO
67
37k
What's in a price? How to price your products and services
michaelherold
247
13k
[SF Ruby Conf 2025] Rails X
palkan
2
830
Leo the Paperboy
mayatellez
4
1.5k
How to audit for AI Accessibility on your Front & Back End
davetheseo
0
210
Transcript
4BNCB$MPVEͷߴਪΛ ׆༻ͨ͠ൣղੳͱ։ൃݟ Samba Cloud Tokyo Mingle 4 202510݄7 גࣜձࣾ ϚΠΫϩγϛϡϨʔγϣϯ
ࣗݾհ גࣜձࣾϚΠΫϩγϛϡϨʔγϣϯ ٱอ ҏ৫ σδλϧ࠾αʔϏεtestusͷ։ൃ
ൣճ ࠾ج४ LLMʹΑΔ࠾ LLMΛ׆༻ͨ͠࠾ϑϩʔ
࠾ج४ੜͷͨΊͷൣղੳͱϞσϧͷੑೳ ൣղ: εϙʔπΛ͢Δ͜ͱɺָ͍͚ͩ͠Ͱͳ͘ɺ݈߁ʹྑ͍ɻ ## ൣճͷྨ ɾӳจ༁ ɾจӳ༁ ɾࣗ༝ӳ࡞ ## ൣղͷੳ
ΠσΟΦϜߏจ จ๏ ## ࠾ج४ͷੜ Prompt gemini-2.5-flash( ⚪︎ ) gemini-2.5-flash-lite(×) ӳ༁ͱͯ͠ͷ ج४ʹͳ͍ͬͯΔ
ίετͱੜ࣌ؒ ※1 τʔΫϯྔͱੜ࣌ؒGoogle AI studioΛ༻ ※2 ίετܭࢉVertex AIͷྉۚදΑΓࢉग़ ※3 ίϯςΩετΩϟογϡͷ༻ͳ͠
࠾ج४ੜϓϩϯϓτʹର͢Δ֤Ϟσϧ͝ͱͷൺֱ ϞσϧͷܰྔԽʹΑΓ ίετͱੜ࣌ؒͷվળ Λ࣮ݱ͍ͨ͠ 1288.8×10-6 $, 5.1s 336.5×10-6 $, 3.3s ࣄલʹλΠϓͷྨ ϓϩϯϓτ࠷దԽ ʴ ίετ1/4 ੜ࣌ؒ-2s
Samba CloudΛ༻͍ͨࣄલߴਪͷ׆༻ ϦΞϧλΠϜੑΛॏࢹ͠Samba CloudΛબ ਖ਼ͷೖྗ͔Β λάΛಈతੜ ※1 gpt-oss120BͷReasoningEffortLowʹઃఆ ※2 ίϯςΩετΩϟογϡͷ༻ͳ͠
geminiͱSamba Cloudͷൺֱ 0.62s 2.9s λάΛ༻͍ͯ ࠾ج४Λੜ
ίετͱੜ࣌ؒͷվળ ࠾ج४ੜͷίετͱੜ࣌ؒ ίετɿ 40%Ҏ্ݮ ੜ࣌ؒɿ 2sվળ 742.62×10-6 $ 1288.8×10-6 $
5.1s 3.3s ※ੜ࣌ؒ࠾ج४ͷੜͷΈͰgpt-ossͷਪؚ࣌ؒΜͰ͍·ͤΜ
ಘΒΕͨݟͱࠓޙͷ։ൃ λεΫΛղܰ͠ྔͳϞσϧΛΈ߹ΘͤΔ͜ͱͰίετͱੜ͕࣌ؒվળ σʔληοτͱLLM as a judgeΛ׆༻ͨ͠Ϟσϧɾϓϩϯϓτ࠷దԽ ΑΓΠϯύΫτͷ͋Δ࠾෦ͷ׆༻(ੜె×)
͝ਗ਼ௌ͋Γ͕ͱ͏͍͟͝·ͨ͠ɻ
Appendix 1ɹGeminiͷίετ༁ Ϟσϧ input($/1Mtok) output($/1Mtok) 2.5-flash 0.3 2.5 2.5-flash-lite 0.1
0.4 Vertex AI Ͱͷ AI ϞσϧͷߏஙͱσϓϩΠʹ͔͔Δඅ༻ https://cloud.google.com/vertex-ai/generative-ai/pricing
Appendix 2ɹSamba CloudͷϞσϧൺֱɹ Ϟσϧ input($/1Mtok) output($/1Mtok) gpt-oss-120B 0.22 0.59 DeepSeek-V3.1
3.00 4.50 Qwen3-32B 0.40 0.80 Samba Cloud Pricing by Model Family & Usage Type https://cloud.sambanova.ai/plans/pricing