Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
ChatGPT活用サービスの スケール時の落とし穴と対策 - 生成AIにおけるpiconのこれ...
Search
Shibuya Yukito
July 05, 2023
0
140
ChatGPT活用サービスの スケール時の落とし穴と対策 - 生成AIにおけるpiconのこれまでのチャレンジと今後 -
Shibuya Yukito
July 05, 2023
Tweet
Share
Featured
See All Featured
Navigating Team Friction
lara
187
15k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
48
5.4k
The Language of Interfaces
destraynor
158
25k
YesSQL, Process and Tooling at Scale
rocio
173
14k
A designer walks into a library…
pauljervisheath
207
24k
Scaling GitHub
holman
459
140k
Site-Speed That Sticks
csswizardry
10
660
The World Runs on Bad Software
bkeepers
PRO
69
11k
Put a Button on it: Removing Barriers to Going Fast.
kastner
60
3.9k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
20
1.3k
Java REST API Framework Comparison - PWX 2021
mraible
31
8.6k
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
124
52k
Transcript
גࣜձࣾpicon COO ौ୩ਓ ChatGPT׆༻αʔϏεͷ εέʔϧ࣌ͷམͱ݀͠ͱରࡦ - ੜAIʹ͓͚Δpiconͷ͜Ε·ͰͷνϟϨϯδͱࠓޙ -
౦ژେֶதୀޙɺגࣜձࣾQJDPOڞಉۀɻ "*νϟοτ͘Μͷ1.։ൃ13Ϗζσϒ ͳͲɺ෯͘୲ɻ ͖αφɺΫϥϑτϏʔϧɺφνϡϥϧϫ ΠϯɺʢαʔϑΟϯʣ 5XJUUFS!ZVLJUP@TIJCVZB ौ୩ਓ4IJCVZB:VLJUP
None
None
None
None
None
None
None
None
None
8IJTQFS"*Λ׆༻
None
3/2 • ىচ: ChatGPT APIͷϦϦʔεϝʔϧ͕ಧ͍ͯΔ • ޕલத: ݩʑ͋ͬͨΞϓϦͷΞϓσ࡞ۀʢ࣭ͱͰֵ໋Λײ͡Δʣ • ޕޙ:
ࣗͰɺεϚϗͰ͏ͳΒLINEͩͳͱࢥ͍ϓϩτλΠϓ࡞Δ • ༦ํ: දͷshosemaruʹϦϦʔε͍͍͔ͯ͠ฉ͘ -> OK͕ग़Δ • : ʮAIνϟοτ͘ΜʯͷϦϦʔεπΠʔτ ίʔυͷ8ׂ͙Β͍ChatGPTʹॻ͍ͯΒͬͨ ։ൃ·ͰͷܦҢ
• Server: Cloud Functionsʢnode.jsʣ • DB: Firestore • ͍ͬͯΔAPI: ChatGPT
/ LINE API ॳͷߏ Φʔτεέʔϧ / αʔόʔϨεͰ؆୯ϥΫϥΫʢͱࢥ͍ͬͯͨʣ
None
• 7/4࣌ ొऀ12ສ͑ • ࣗલPCϞσϧͷηοτΞοϓෆཁ • ຊޠରԠ • image 2
imageʹରԠ AIΠϥετ͘Μͷಛ ຊޠରԠʂLINEͰStable Di ff usionΛ͑ΔαʔϏε
ChatGPT׆༻αʔϏεͷ εέʔϧ࣌ͷམͱ݀͠ͱରࡦ
$0.002/1000 tokenɺ1000 token / 1requestͱԾఆ͢Δͱ 60000request/month…શવΓͳ͍ 1: usage limitͷॳظ͕খ͗͢͞Δ… ՝ۚޙͷusage
limit͕MAXͰ120υϧ/month
• piconͰ… • 2/4ʹAIνϟοτ͘ΜͷલͰGPT-3ΛνϟοτܗࣜͰ͑ΔΞϓϦ ʢFlutterʣΛϦϦʔεࡁΈͩͬͨ • ͦͷͨΊɺૣΊʹʹͿͪͨΓɺগ࣮ͣͭ͠Λ࡞Εͨ • Tips •
Quotaͷਃɺଟ͗͢Δ͔ΒϦδΣΫτͬͯ͜ͱͳ͍ͷͰଟΊ ͰOK • ਃ͔Βঝೝ·Ͱ͙Β͍ ରࡦ: ૣΊʹૣΊʹҾ্͖͛ਃ͢Δ 1: usage limitͷॳظ͕খ͗͢͞Δ…
• OpenAIͷμογϡϘʔυͰ͔͔͍ͬͯΔඅ༻Λݟ͍ͯͨ • ϦϦʔε͔ΒޙʹԿނ͔ίετ͕10ഒ͙Β͍ʹͳ͍ͬͯͨ • μογϡϘʔυͷόάͩͱࢥ͏ͷͰɺࠓ࣏ͬͯͦ͏͚ͩͲ… ରࡦ: ϦΫΤετ x total_tokenͰ༧ଌ͓͍ͯͨ͠ํ͕͍͍
2: μογϡϘʔυʹө͞Ε͍ͯͨՁ͕֨όάͬͯΔ
ରࡦ: 3: RateLimitͲ͏͠Α͏ͳ͍… OpenAI Azure OpenAI MAXϦΫΤετ/min 3,500 300 →
ഇࢭʹͳͬͨʁ MAXτʔΫϯ/min 90,000 120,000 200,000 500τʔΫϯ/req ͷͱ͖ͷmaxϦΫΤετ/min 180 240 400
ରࡦ: ΠϯελϯεΛෳཱͯͯෛՙࢄ 3: RateLimitͲ͏͠Α͏ͳ͍… • RateLimitͷҾ্͖͛ɺAzureOpenAI΄΅ແཧͬΆ͍ • Azure OpenAIͩͱɺϦʔδϣϯ͝ͱʹ2ΠϯελϯεཱͯΒΕΔ •
OpenAIͱAzure OpenAIͷซ༻͋Γ • ΠϯελϯεΛෳཱͯͯɺϦΫΤετΛࢄͤ͞Δ͜ͱͰճආ͢Δ ͔͠ͳ͍ • ࢀߟ: Azure OpenAI Serviceͷෛՙࢄ • https://logico-jp.io/2023/06/08/request-load-balancing-for-azure- openai-service/
• ʮAIνϟοτ͘ΜແྉͰ͔͢ʁʯͱ͔ΊͬͪΌฉ͘ • ͔͠͠ɺదͳ͕͑ฦ͖ͬͯͯ͠·͏ -> UXతʹ࠷ѱ • ར༻ن / ϓϥΠόγʔϙϦγʔ
/ ղಋઢͳͲ… ରࡦ: ༧ޠΛ࡞ͬͯɺఆܕจΛฦ͢Α͏ʹ͢Δ ͦͷଞ: ࣗࣗͷใΛΒͳ͍
piconͷࠓޙʹ͍ͭͯ
• ݱঢ়ɺڵຯຊҐͰ৮ͬͯΈ͚ͨͲɺৗʹਁಁ͢Δͱ͜Ζ·Ͱདྷ͍ͯͳ͍ →ͬͱ༷ʑͳར༻ͷํΛಧ͚͍͖͍ͯͨ • AIνϟοτ͘ΜYahoo!JAPANΛࢦ͍ͯ͘͠ʢChatGPTGoogleʣ →୭Ͱؾܰʹར༻Ͱ͖ͯɺ͠Έͷ͋ΔαʔϏεΛࢦ͍ͯ͘͠ • piconɺੜAI͕ίϯγϡʔϚʔͷੜ׆ʹͲ͏ͨ͠Βਁಁ͍ͯ͘͠ͷ͔ʁʹ ͖߹͍ͬͯ͘ →
ݱࡏɺ͢ͰʹاըதͷϓϩμΫτ2ͭ͋ΓɺϦϦʔεΛࢦ͢ → ·ͩਖ਼ղͷͳ͍ྖҬͰɺٕज़໘ɺUX໘ʹνϟϨϯδ͍ͯ͘͠ ChatGPT/ੜAIΛ·ͩ·ͩΈΜͳ͍͜ͳ͍ͤͯͳ͍ ݱঢ়ͷ՝
• ܦӦϝϯόʔͷ1ਓͯ͠ࣄۀܭըʹଇΓɺϓϩμΫτϩʔυϚοϓͷࡦఆͱ։ൃͷϚωδϝϯτ • ෳϓϩμΫτʹ͓͚Δऩӹੑͷ্ • ։ൃνʔϜͷϚωδϝϯτʢ࠾༻/ҭʣ CPOީิ: ϏδϣϯΛϓϩμΫτʹམͱ͠ࠐΈɺऩӹԽ·Ͱ͍͚࣋ͬͯΔํ ੜAIͷະདྷΛҰॹʹͭ͘ΔਓɺେืूதͰ͢ɻ CTOީิ:
ࣾ֎ͷνʔϜͱͱʹpiconΛٕज़ͰϦʔυͯ͘͠ΕΔํ • ܦӦϝϯόʔͷ1ਓͯ͠։ൃνʔϜͷ • ٕज़ઓུͷࡦఆͱ࣮ߦ • ֎෦ύʔτφʔͷϚωδϝϯτ C͚ͷྖҬͰɺϢʔβʔʹ͖߹ͬͨ։ൃΛ͢ΔจԽͰ͢ɻ
ͥͻ͜ͷ͋ͱ͓͠·͠ΐ͏ !ZVLJUP@TIJCVZB 5XJUUFS 'BDFCPPL ͝࿈བྷ͓͓ͪͯ͠Γ·͢ɻ