Upgrade to PRO for Only $50/Year—Limited-Time Offer! 🔥
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Samba Cloudの高速推論を活用した模範解答分析と開発知見
Search
いおりん
October 08, 2025
0
160
Samba Cloudの高速推論を活用した模範解答分析と開発知見
いおりん
October 08, 2025
Tweet
Share
More Decks by いおりん
See All by いおりん
claude codeでPrompt Engineering
iori0311
0
780
Featured
See All Featured
Why Our Code Smells
bkeepers
PRO
340
57k
Mobile First: as difficult as doing things right
swwweet
225
10k
How GitHub (no longer) Works
holman
316
140k
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
For a Future-Friendly Web
brad_frost
180
10k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
16
1.8k
Raft: Consensus for Rubyists
vanstee
141
7.2k
Balancing Empowerment & Direction
lara
5
790
Navigating Team Friction
lara
191
16k
Build your cross-platform service in a week with App Engine
jlugia
234
18k
[SF Ruby Conf 2025] Rails X
palkan
0
470
XXLCSS - How to scale CSS and keep your sanity
sugarenia
249
1.3M
Transcript
4BNCB$MPVEͷߴਪΛ ׆༻ͨ͠ൣղੳͱ։ൃݟ Samba Cloud Tokyo Mingle 4 202510݄7 גࣜձࣾ ϚΠΫϩγϛϡϨʔγϣϯ
ࣗݾհ גࣜձࣾϚΠΫϩγϛϡϨʔγϣϯ ٱอ ҏ৫ σδλϧ࠾αʔϏεtestusͷ։ൃ
ൣճ ࠾ج४ LLMʹΑΔ࠾ LLMΛ׆༻ͨ͠࠾ϑϩʔ
࠾ج४ੜͷͨΊͷൣղੳͱϞσϧͷੑೳ ൣղ: εϙʔπΛ͢Δ͜ͱɺָ͍͚ͩ͠Ͱͳ͘ɺ݈߁ʹྑ͍ɻ ## ൣճͷྨ ɾӳจ༁ ɾจӳ༁ ɾࣗ༝ӳ࡞ ## ൣղͷੳ
ΠσΟΦϜߏจ จ๏ ## ࠾ج४ͷੜ Prompt gemini-2.5-flash( ⚪︎ ) gemini-2.5-flash-lite(×) ӳ༁ͱͯ͠ͷ ج४ʹͳ͍ͬͯΔ
ίετͱੜ࣌ؒ ※1 τʔΫϯྔͱੜ࣌ؒGoogle AI studioΛ༻ ※2 ίετܭࢉVertex AIͷྉۚදΑΓࢉग़ ※3 ίϯςΩετΩϟογϡͷ༻ͳ͠
࠾ج४ੜϓϩϯϓτʹର͢Δ֤Ϟσϧ͝ͱͷൺֱ ϞσϧͷܰྔԽʹΑΓ ίετͱੜ࣌ؒͷվળ Λ࣮ݱ͍ͨ͠ 1288.8×10-6 $, 5.1s 336.5×10-6 $, 3.3s ࣄલʹλΠϓͷྨ ϓϩϯϓτ࠷దԽ ʴ ίετ1/4 ੜ࣌ؒ-2s
Samba CloudΛ༻͍ͨࣄલߴਪͷ׆༻ ϦΞϧλΠϜੑΛॏࢹ͠Samba CloudΛબ ਖ਼ͷೖྗ͔Β λάΛಈతੜ ※1 gpt-oss120BͷReasoningEffortLowʹઃఆ ※2 ίϯςΩετΩϟογϡͷ༻ͳ͠
geminiͱSamba Cloudͷൺֱ 0.62s 2.9s λάΛ༻͍ͯ ࠾ج४Λੜ
ίετͱੜ࣌ؒͷվળ ࠾ج४ੜͷίετͱੜ࣌ؒ ίετɿ 40%Ҏ্ݮ ੜ࣌ؒɿ 2sվળ 742.62×10-6 $ 1288.8×10-6 $
5.1s 3.3s ※ੜ࣌ؒ࠾ج४ͷੜͷΈͰgpt-ossͷਪؚ࣌ؒΜͰ͍·ͤΜ
ಘΒΕͨݟͱࠓޙͷ։ൃ λεΫΛղܰ͠ྔͳϞσϧΛΈ߹ΘͤΔ͜ͱͰίετͱੜ͕࣌ؒվળ σʔληοτͱLLM as a judgeΛ׆༻ͨ͠Ϟσϧɾϓϩϯϓτ࠷దԽ ΑΓΠϯύΫτͷ͋Δ࠾෦ͷ׆༻(ੜె×)
͝ਗ਼ௌ͋Γ͕ͱ͏͍͟͝·ͨ͠ɻ
Appendix 1ɹGeminiͷίετ༁ Ϟσϧ input($/1Mtok) output($/1Mtok) 2.5-flash 0.3 2.5 2.5-flash-lite 0.1
0.4 Vertex AI Ͱͷ AI ϞσϧͷߏஙͱσϓϩΠʹ͔͔Δඅ༻ https://cloud.google.com/vertex-ai/generative-ai/pricing
Appendix 2ɹSamba CloudͷϞσϧൺֱɹ Ϟσϧ input($/1Mtok) output($/1Mtok) gpt-oss-120B 0.22 0.59 DeepSeek-V3.1
3.00 4.50 Qwen3-32B 0.40 0.80 Samba Cloud Pricing by Model Family & Usage Type https://cloud.sambanova.ai/plans/pricing