Shibuya Yukito
July 05, 2023
Pitfalls and Countermeasures When Scaling a ChatGPT-Powered Service: picon's Generative AI Challenges So Far and What Comes Next
Transcript
picon Inc., COO Shibuya Yukito
Pitfalls and Countermeasures When Scaling a ChatGPT-Powered Service: picon's Generative AI Challenges So Far and What Comes Next
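Shibuya Yukito (Twitter: @yukito_shibuya)
After leaving the University of Tokyo before graduating, co-founded picon Inc. Covers a broad range of roles for AIチャットくん, including PM, development, PR, and business development.
Likes: saunas, craft beer, natural wine, (surfing)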
Uses Whisper AI
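How development came about
3/2
• Wake-up: the ChatGPT API release email has arrived
• Morning: update work on an app we already had (felt a revolution in quality and [...])
• Afternoon: figured that on a smartphone, LINE was the natural place to use it, and built a prototype myself
• Evening: asked our CEO, shosemaru, whether we could release it -> got the OK
• Later that day: release tweet for 「AIチャットくん」
About 80% of the code was written by ChatGPT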
Initial architecture
• Server: Cloud Functions (node.js)
• DB: Firestore
• APIs used: ChatGPT / LINE API
Auto-scaling / serverless, so easy and painless (or so I thought)
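Features of AIイラストくん
Japanese supported! A service that lets you use Stable Diffusion on LINE
• As of 7/4: more than 120,000 registered users
• No need to set up your own PC or a model
• Japanese language support
• Also supports image 2 image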
Pitfalls and Countermeasures When Scaling a ChatGPT-Powered Service
1: The initial usage limit is too small...
Even after billing is set up, the usage limit maxes out at $120/month
Assuming $0.002 / 1,000 tokens and 1,000 tokens / request, that is 60,000 requests/month... nowhere near enough
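The request budget on this slide is simple division, and it can be handy to keep it as a small helper when the limit or the price changes. The following is a minimal sketch; the function name and parameters are illustrative assumptions, and the numbers are the ones quoted on the slide.

```python
from decimal import Decimal


def max_requests_per_month(limit_usd: str, usd_per_1k_tokens: str,
                           tokens_per_request: int) -> int:
    """Return how many requests fit under a monthly spend limit.

    Decimal is used instead of float so that prices such as 0.002
    do not pick up binary rounding noise.
    """
    cost_per_request = Decimal(usd_per_1k_tokens) * tokens_per_request / 1000
    return int(Decimal(limit_usd) / cost_per_request)


# Slide figures: $120/month limit, $0.002 per 1,000 tokens, 1,000 tokens per request.
print(max_requests_per_month("120", "0.002", 1000))  # 60000
```

Halving the assumed tokens per request doubles the budget, which is why the per-request token count is worth measuring rather than guessing.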
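1: The initial usage limit is too small...
Countermeasure: apply for limit increases early, and then earlier still
• At picon...
  • On 2/4 we had already released the predecessor of AIチャットくん, an app (Flutter) for using GPT-3 in a chat format
  • Because of that, we ran into the limit early and could build up a track record little by little
• Tips
  • A quota request is never rejected for asking for too much, so it is fine to ask on the high side
  • From application to approval takes roughly [...]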
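2: The price shown on the dashboard was buggy
• We were watching our spend on the OpenAI dashboard
• [...] after release, the cost had for some reason grown to about 10x
• I think it was a dashboard bug, so it has probably been fixed by now, but...
Countermeasure: better to forecast the cost yourself from number of requests x total_token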
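One way to follow this advice is to log the `usage.total_tokens` field that the ChatGPT API returns with each response and price the sum yourself. The sketch below assumes responses are available as plain dictionaries and that a single flat price per 1,000 tokens applies (the $0.002 figure from the slides). The function and constant names are hypothetical.

```python
from decimal import Decimal

# Assumption: flat price per 1,000 tokens, as quoted on the slides.
USD_PER_1K_TOKENS = Decimal("0.002")


def estimate_cost_usd(responses) -> Decimal:
    """Estimate spend from logged API responses.

    Each item is expected to look like {"usage": {"total_tokens": int}}.
    """
    total_tokens = sum(r["usage"]["total_tokens"] for r in responses)
    return total_tokens * USD_PER_1K_TOKENS / 1000


logged = [
    {"usage": {"total_tokens": 1000}},
    {"usage": {"total_tokens": 500}},
]
print(estimate_cost_usd(logged))  # 0.003
```

Comparing this figure with the dashboard makes a sudden 10x discrepancy easy to spot.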
3: Nothing can be done about the rate limit...
Columns: OpenAI, Azure OpenAI
MAX requests/min: 3,500 / 300 → discontinued?
MAX tokens/min: 90,000 / 120,000 / 200,000
Max requests/min at 500 tokens/req: 180 / 240 / 400
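The last row of the table follows from the token limit: at a fixed number of tokens per request, the token-per-minute cap binds well before the request-per-minute cap. A minimal sketch of that calculation follows, with a hypothetical function name and the slide's figures as inputs.

```python
from typing import Optional


def effective_requests_per_min(max_tokens_per_min: int,
                               tokens_per_request: int,
                               max_requests_per_min: Optional[int] = None) -> int:
    """Return the request rate actually achievable under both caps."""
    by_tokens = max_tokens_per_min // tokens_per_request
    if max_requests_per_min is None:
        return by_tokens
    return min(by_tokens, max_requests_per_min)


# Token limits from the table, at 500 tokens per request.
for tokens_per_min in (90_000, 120_000, 200_000):
    print(effective_requests_per_min(tokens_per_min, 500))  # 180, 240, 400

# With OpenAI's 3,500 requests/min cap, the token cap is still the one that binds.
print(effective_requests_per_min(90_000, 500, 3_500))  # 180
```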
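3: Nothing can be done about the rate limit...
Countermeasure: run multiple instances and load-balance across them
• Getting a rate limit increase looks close to impossible on Azure OpenAI
• With Azure OpenAI, you can stand up 2 instances per region
• Using OpenAI and Azure OpenAI side by side is also an option
• The only way around the limit is to run multiple instances and spread requests across them
• Reference: load balancing for Azure OpenAI Service
• https://logico-jp.io/2023/06/08/request-load-balancing-for-azure-openai-service/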
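Spreading requests over several instances can be as simple as client-side round-robin with failover when one instance reports a rate limit. The sketch below is an illustration, not picon's implementation: `RoundRobinClient`, `RateLimited`, and the injected `send` callable are hypothetical names, and `send` stands in for whatever HTTP call reaches a given OpenAI or Azure OpenAI endpoint.

```python
import itertools
from typing import Callable, Sequence


class RateLimited(Exception):
    """Raised by the hypothetical `send` callable on an HTTP 429 response."""


class RoundRobinClient:
    def __init__(self, endpoints: Sequence[str],
                 send: Callable[[str, str], str]):
        if not endpoints:
            raise ValueError("at least one endpoint is required")
        self._count = len(endpoints)
        self._cycle = itertools.cycle(endpoints)
        self._send = send

    def complete(self, prompt: str) -> str:
        # Try each endpoint at most once per call, continuing the rotation
        # from wherever the previous call stopped.
        for _ in range(self._count):
            endpoint = next(self._cycle)
            try:
                return self._send(endpoint, prompt)
            except RateLimited:
                continue
        raise RateLimited("all endpoints are rate limited")
```

Injecting `send` keeps the rotation logic testable without network access. A production setup would more likely put this behind a gateway or load balancer, as the referenced article describes.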
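Other: it does not know information about itself
• Users very often ask things like 「AIチャットくんは無料ですか？」 ("Is AIチャットくん free?")
• But a made-up answer comes back -> the worst possible UX
• Terms of service / privacy policy / how to cancel, and so on...
Countermeasure: define reserved words and return fixed responses for them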
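The reserved-word approach amounts to a lookup that runs before the model is called. A minimal sketch follows, assuming exact matching on a normalized message; the keywords, placeholder texts, and the `ask_model` callable are all hypothetical and not taken from the slides.

```python
from typing import Callable

# Hypothetical reserved words mapped to fixed responses.
RESERVED_RESPONSES = {
    "pricing": "<fixed text describing the pricing plan>",
    "terms": "<fixed text linking to the terms of service>",
    "cancel": "<fixed text explaining how to cancel>",
}


def reply(message: str, ask_model: Callable[[str], str]) -> str:
    """Return a fixed response for reserved words, otherwise ask the model."""
    canned = RESERVED_RESPONSES.get(message.strip().lower())
    if canned is not None:
        return canned
    return ask_model(message)
```

Exact matching keeps ordinary conversation away from the fixed responses; looser matching would catch more phrasings at the cost of occasional false hits.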
The future of picon
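Current challenge
Most people are still far from making full use of ChatGPT / generative AI
• Right now people have tried it out of curiosity, but it has not yet worked its way into daily life
  → We want to keep delivering many more ways of using it
• AIチャットくん aims to be Yahoo! JAPAN (with ChatGPT as Google)
  → We aim for a familiar, friendly service that anyone can use casually
• picon will keep facing the question of how generative AI can work its way into consumers' lives
  → We already have 2 products in planning and are working toward release
  → In a field that has no right answers yet, we will take on both technical and UX challenges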
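We are actively recruiting people to build the future of generative AI together with us.
CPO candidate: someone who can turn a vision into a product and carry it through to monetization
• As a member of the management team, define the product roadmap in line with the business plan and manage development
• Improve profitability across multiple products
• Manage the development team (hiring / training)
CTO candidate: someone who will lead picon through technology together with teams outside the company
• As a member of the management team, [...] the development team
• Define and execute the technical strategy
• Manage external partners
We work in the consumer-facing space, with a culture of building in close contact with users.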
Let's talk after this!
@yukito_shibuya (Twitter / Facebook)
I look forward to hearing from you.