Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Samba Cloudの高速推論を活用した模範解答分析と開発知見
Search
いおりん
October 08, 2025
200
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Samba Cloudの高速推論を活用した模範解答分析と開発知見
いおりん
October 08, 2025
More Decks by いおりん
See All by いおりん
claude codeでPrompt Engineering
iori0311
0
860
Featured
See All Featured
Tips & Tricks on How to Get Your First Job In Tech
honzajavorek
1
540
Large-scale JavaScript Application Architecture
addyosmani
515
110k
How to build a perfect <img>
jonoalderson
1
5.6k
Ruling the World: When Life Gets Gamed
codingconduct
0
250
How to Ace a Technical Interview
jacobian
281
24k
Data-driven link building: lessons from a $708K investment (BrightonSEO talk)
szymonslowik
1
1.1k
The Straight Up "How To Draw Better" Workshop
denniskardys
239
140k
16th Malabo Montpellier Forum Presentation
akademiya2063
PRO
0
140
Site-Speed That Sticks
csswizardry
13
1.2k
Joys of Absence: A Defence of Solitary Play
codingconduct
1
390
Visualization
eitanlees
152
17k
Gemini Prompt Engineering: Practical Techniques for Tangible AI Outcomes
mfonobong
2
430
Transcript
4BNCB$MPVEͷߴਪΛ ׆༻ͨ͠ൣղੳͱ։ൃݟ Samba Cloud Tokyo Mingle 4 202510݄7 גࣜձࣾ ϚΠΫϩγϛϡϨʔγϣϯ
ࣗݾհ גࣜձࣾϚΠΫϩγϛϡϨʔγϣϯ ٱอ ҏ৫ σδλϧ࠾αʔϏεtestusͷ։ൃ
ൣճ ࠾ج४ LLMʹΑΔ࠾ LLMΛ׆༻ͨ͠࠾ϑϩʔ
࠾ج४ੜͷͨΊͷൣղੳͱϞσϧͷੑೳ ൣղ: εϙʔπΛ͢Δ͜ͱɺָ͍͚ͩ͠Ͱͳ͘ɺ݈߁ʹྑ͍ɻ ## ൣճͷྨ ɾӳจ༁ ɾจӳ༁ ɾࣗ༝ӳ࡞ ## ൣղͷੳ
ΠσΟΦϜߏจ จ๏ ## ࠾ج४ͷੜ Prompt gemini-2.5-flash( ⚪︎ ) gemini-2.5-flash-lite(×) ӳ༁ͱͯ͠ͷ ج४ʹͳ͍ͬͯΔ
ίετͱੜ࣌ؒ ※1 τʔΫϯྔͱੜ࣌ؒGoogle AI studioΛ༻ ※2 ίετܭࢉVertex AIͷྉۚදΑΓࢉग़ ※3 ίϯςΩετΩϟογϡͷ༻ͳ͠
࠾ج४ੜϓϩϯϓτʹର͢Δ֤Ϟσϧ͝ͱͷൺֱ ϞσϧͷܰྔԽʹΑΓ ίετͱੜ࣌ؒͷվળ Λ࣮ݱ͍ͨ͠ 1288.8×10-6 $, 5.1s 336.5×10-6 $, 3.3s ࣄલʹλΠϓͷྨ ϓϩϯϓτ࠷దԽ ʴ ίετ1/4 ੜ࣌ؒ-2s
Samba CloudΛ༻͍ͨࣄલߴਪͷ׆༻ ϦΞϧλΠϜੑΛॏࢹ͠Samba CloudΛબ ਖ਼ͷೖྗ͔Β λάΛಈతੜ ※1 gpt-oss120BͷReasoningEffortLowʹઃఆ ※2 ίϯςΩετΩϟογϡͷ༻ͳ͠
geminiͱSamba Cloudͷൺֱ 0.62s 2.9s λάΛ༻͍ͯ ࠾ج४Λੜ
ίετͱੜ࣌ؒͷվળ ࠾ج४ੜͷίετͱੜ࣌ؒ ίετɿ 40%Ҏ্ݮ ੜ࣌ؒɿ 2sվળ 742.62×10-6 $ 1288.8×10-6 $
5.1s 3.3s ※ੜ࣌ؒ࠾ج४ͷੜͷΈͰgpt-ossͷਪؚ࣌ؒΜͰ͍·ͤΜ
ಘΒΕͨݟͱࠓޙͷ։ൃ λεΫΛղܰ͠ྔͳϞσϧΛΈ߹ΘͤΔ͜ͱͰίετͱੜ͕࣌ؒվળ σʔληοτͱLLM as a judgeΛ׆༻ͨ͠Ϟσϧɾϓϩϯϓτ࠷దԽ ΑΓΠϯύΫτͷ͋Δ࠾෦ͷ׆༻(ੜె×)
͝ਗ਼ௌ͋Γ͕ͱ͏͍͟͝·ͨ͠ɻ
Appendix 1ɹGeminiͷίετ༁ Ϟσϧ input($/1Mtok) output($/1Mtok) 2.5-flash 0.3 2.5 2.5-flash-lite 0.1
0.4 Vertex AI Ͱͷ AI ϞσϧͷߏஙͱσϓϩΠʹ͔͔Δඅ༻ https://cloud.google.com/vertex-ai/generative-ai/pricing
Appendix 2ɹSamba CloudͷϞσϧൺֱɹ Ϟσϧ input($/1Mtok) output($/1Mtok) gpt-oss-120B 0.22 0.59 DeepSeek-V3.1
3.00 4.50 Qwen3-32B 0.40 0.80 Samba Cloud Pricing by Model Family & Usage Type https://cloud.sambanova.ai/plans/pricing