Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
安全なAI利用のためのLLM(大規模言語モデル)の利用と評価 / japanr2025
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Uryu Shinya
December 06, 2025
Science
84
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
安全なAI利用のためのLLM(大規模言語モデル)の利用と評価 / japanr2025
Uryu Shinya
December 06, 2025
More Decks by Uryu Shinya
See All by Uryu Shinya
生成AIサービスを用いた研究活動の支援
s_uryu
0
220
R研究集会(2024)のご案内
s_uryu
1
750
生成AIを用いたサービスの紹介
s_uryu
1
240
生成AIの基礎的事項と社会に与える影響
s_uryu
0
98
Rの機械学習フレームワークの紹介〜tidymodelsを中心に〜 / machine_learning_with_r2024
s_uryu
0
1.5k
地理空間データの機械学習への適用 / machine_learning_for_spatial_data
s_uryu
0
390
mandaRa: R言語ユーザのための新しい知識共有の場 / mandara_tokyor111
s_uryu
2
790
R言語入門 (R-4.3.3 2024年4月版) / introduction to r
s_uryu
7
7.2k
統・再現性・協力: 人為的過誤を防ぎ、未来へ進む策 / Integration, Reproducible, and Collaboration
s_uryu
1
860
Other Decks in Science
See All in Science
ダメな自分の育て方―性格タイプの「劣等機能」から理解するニガテ克服術
ppillc
0
180
あなたに水耕栽培を愛していないとは言わせない
mutsumix
1
350
J-STAGE全文XML登載必須化について
xspa2012
0
880
先端因果推論特別研究チームの研究構想と 人間とAIが協働する自律因果探索の展望
sshimizu2006
3
940
Kritische evaluatie van GenAI-output voor literatuuronderzoek
voginip
0
180
データベース05: SQL(2/3) 結合質問
trycycle
PRO
0
1.2k
やるべきときにMLをやる AIエージェント開発
fufufukakaka
2
1.5k
生成AIと司法書士の未来.pdf
tagtag
PRO
0
130
見上公一.pdf
genomethica
0
150
AIを用いた PID制御で部屋 の温度制御をしてみた
nearme_tech
PRO
0
160
データベース06: SQL (3/3) 副問い合わせ
trycycle
PRO
1
990
生成AIの現状と展望
tagtag
PRO
0
140
Featured
See All Featured
Collaborative Software Design: How to facilitate domain modelling decisions
baasie
1
250
The B2B funnel & how to create a winning content strategy
katarinadahlin
PRO
1
400
Product Roadmaps are Hard
iamctodd
PRO
55
12k
Rails Girls Zürich Keynote
gr2m
96
14k
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.7k
Being A Developer After 40
akosma
91
590k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
55
3.4k
How to Build an AI Search Optimization Roadmap - Criteria and Steps to Take #SEOIRL
aleyda
1
2.1k
HDC tutorial
michielstock
2
720
The Mindset for Success: Future Career Progression
greggifford
PRO
0
370
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
640
Heart Work Chapter 1 - Part 1
lfama
PRO
8
36k
Transcript
ӝੜਅʢಙౡେֶσβΠϯܕ"*ڭҭݚڀηϯλʔʣ ҆શͳ"*ར༻ͷͨΊͷ --.ʢେنݴޠϞσϧʣͷ ར༻ͱධՁ +BQBO3 !V@SJCP
എܠ--.ධՁͷඞཁੑ σʔλ४උ Ϟσϧ܇࿅ʢֶशʣ ςετσʔλͰධՁ ਫ਼ɾ࠶ݱͳͲࢉग़ ίʔυͰ࠶ݱՄೳ ϓϩϯϓτઃܭ --.Ͱਪ ࠾ ίʔυͰ࠶ݱՄೳʁ
ػցֶशϞσϧͷධՁ --.ͷධՁ ධՁ͖ͭ͢ͷϙΠϯτ ✓ͲͷϞσϧ͕ߴੑೳ͔ͩͬͨ ✓ͳͥͦͷ݁ʹࢸͬͨͷ͔ ✓खॱͱաఔ͕ͤΔ͔ w $IBU(15Ͱճࢼͯ͠ʮ͍͍ײͩ͡ͳʯ w ʮ(15͕ݡ͍ʯͱ͍͏ӟ͚ͩͰϞσϧબ w ͨ·ͨ·ޭͨ͠ϓϩϯϓτͰʮ༏लʯͱஅ
6SZV 4 &WBMVBUJOH-BSHF-BOHVBHF.PEFMTGPS*6$/3FE-JTU4QFDJFT*OGPSNBUJPOBS9JW w *6$/ઈ໓ةዧछධՁͷࣄྫ w ੜଟ༷ੑอશͷͰ--.ͷ׆༻͕ظ͞Ε͍ͯΔ͕ɺ ઐతஅʹ͓͚Δ৴པੑʹ͕ٙΔɻ w
ʢݱߦͷ--.ʹڞ௨ͨ͠ʣͭͷॏେͳ՝ w ࣝͱਪͷΪϟοϓ ˠࣄ࣮͍ͬͯΔ͕ɺͦΕΛԠ༻ͨ͠அࠔ w ࡏ͢ΔόΠΞε ˠಈʢਓؾछʣʹڧ͘ɺແಈʹऑ͍ എܠ--.ධՁͷඞཁੑ https://arxiv.org/abs/2510.02830 ٬؍త͔ͭݫີͳධՁϑϨʔϜϫʔΫ͕ෆՄܽ ਖ਼ղͷဃ ྨֶతࣝ อશঢ়گͷਪ 94.9% 27.2%
w Φʔϓϯιʔεಁ໌ੑͷߴ͍࣮ w ҆શੑࢤ҆શੑͱ৴པੑΛ࠷ॏཁࢹ w ࠶ݱੑ࠶ݱՄೳͳՊֶతݕূ w ॊೈੑͱ֦ுੑ0QFO"* (PPHMF "OUISPQJD
Y"* ϩʔΧϧڥʢ0MMBNBʣɺଟ༷ͳϞσϧΛ ϕϯμʔϩοΫΠϯͳ͠ͰධՁɻ ӳࠃ"*҆શݚڀॴ͕ओಋ ධՁϑϨʔϜϫʔΫʮ*OTQFDU"*ʯ https://inspect.aisi.org.uk/ ++"MMBJSF 34UVEJPઃऀ ͕ ϓϩδΣΫτΛϦʔυ
ධՁͷϞδϡʔϧԽ5BTL %BUBTFU 4PMWFS 4DPSFS ධՁϩδοΫΛίʔυͱͯ͠ମܥతʹཧɺ࠶ར༻͕ՄೳͱͳΔ 5BTL࣮ݧܭը %BUBTFUೖྗσʔλ 4PMWFSճઓུ 4DPSFSධՁج४ ධՁʹ༻͢Δೖྗσʔλͱ
ਖ਼ղϥϕϧͷηοτ ϓϩϯϓτΤϯδχΞϦϯάͳͲɺ Ϟσϧ͔ΒճΛҾ͖ग़ͨ͢Ίͷઓུ ධՁશମͷϫʔΫϑϩʔΛఆٛ Ϟσϧͷग़ྗΛਖ਼ղͱൺֱ͠ɺ είΞΛࢉग़͢ΔͨΊͷධՁج४ Task( dataset=..., solver=chain(...), scorer=..., )
*OTQFDU"*ʹΑΔ*6$/ධՁλεΫͷ࣮ 6SZV ͷͭͷλεΫͷద༻ྫ λεΫ త ༻ͨ͠4PMWFS4DPSFSͷྫ ྨֶతྨ ϨουϦετΧςΰϦධՁ ཧత
ڴҖͷಛఆ ਖ਼͍͠ྨ܈Λબͤ͞Δ ͭͷΧςΰϦ͔ΒͭΛಛఆ ࠃ໊ͷϦετΛੜ ͷڴҖΧςΰϦ͔ΒෳΛબ https://github.com/uribo/iucn-redlist-evals chain(), optimize_choices()*, system_message(), multiple_choice_with_cache()*, taxon_partial_scorer()* system_message(), generate(), match() system_message(), generate(), geo_distribution_scorer()* system_message(), generate(), threat_assessment_scorer()*
*OTQFDU"*ʹΑΔ*6$/ධՁλεΫͷ࣮ 6SZV ͷͭͷλεΫͷద༻ྫ *OQVU 5BSHFU Aquila chrysaetos https://github.com/uribo/iucn-redlist-evals
1IPUP3PDLZ $$#:IUUQTDSFBUJWFDPNNPOTPSHMJDFOTFTCZ WJB8JLJNFEJB$PNNPOT B LC $IPJDFT A. Animalia (Kingdom) > Chordata (Phylum) > Aves (Class) > Accipitriformes (Order) > Pandionidae (Family), B. … (Kingdom) > … (Phylum) > … (Class) > … (Order) > Accipitridae (Family), C. … (Kingdom) > … (Phylum) > … (Class) > … (Order) > Cathartidae (Family), D. … (Kingdom) > … (Phylum) > … (Class) > … (Order) > Sagittariidae (Family)”, E. … (Kingdom) > … (Phylum) > … (Class) > … (Order) > Elanidae (Family)" "OTXFS B &WBMVBUF Correct EX, EW, CR, EN, VU, NT, LC, DD NT Incorrect Montenegro; Italy; France; Albania etc., Country list Montenegro; France; Iraq etc., Partial Agriculture & aquaculture; Pollution; Energy production & mining; Transportation & service corridors etc. Threats list None Incorrect 5BTL ʢΠψϫγʣ ܽམɺ
3൛͋ΔϤʂWJUBMTύοέʔδ --.ͱͷରFMMNFSύοέʔδΛհͯ͠ߦ͏ https://vitals.tidyverse.org/ library(vitals) library(ellmer) simple_qa <- tibble::tibble( input =
c("日本の初代総理大臣は誰か", "Posit(旧RStudio)のチーフサイエンティストは誰か"), target = c("伊藤博文", "Hadley Wickham") ) tsk <- Task$new( dataset = simple_qa, solver = generate(chat_ollama(model = "gpt-oss:20b")), scorer = model_graded_fact() ) tsk$eval() tsk$score() 5BTL࣮ݧܭը %BUBTFUೖྗσʔλ 4PMWFSճઓུ 4DPSFSධՁج४ ਪϞσϧͷࢦఆ
%&.0
ධՁͷίʔυԽ ධՁϓϩηεΛίʔυͱͯ͠هड़ɾཧ͢Δ ͭͷϝϦοτ ✓৽ϞσϧͰͷଈ࠲ͳ࠶ݕূ ✓ධՁͷಁ໌ੑͱՄೳੑ ✓ίϛϡχςΟͰͷڞ༗ɾվળ w Ͳͷج४Ͱఆ͔ͨ͠໌֬ w ݁Ռͷࠜڌ͕Մೳ
w ϩάͱͯ͠ه͞ΕΔ ධՁͷಁ໌ੑ w ධՁίʔυΛ(JU)VCͰެ։ w ϕϯνϚʔΫͱͯ͠ػೳ ίϛϡχςΟͰͷར༻
"*ͷԸܙΛ࠷େԽ͠ɺϦεΫΛ࠷খԽ͢Δ ͨΊʹɻ ʮ͏ʯ͚ͩͰͳ͘ɺਖ਼͘͠ʮධՁ͢Δʯ ϓϩηε͕ඞਢɻ ͓ΘΓ