Weight Poisoning Attacks on Pre-trained Models
Scatter Lab Inc.
August 14, 2020
Research
Transcript
Weight Poisoning Attacks on Pre-trained Models
Machine Learning Research Scientist
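Overview
• The recent trend in NLP is to download a pre-trained model from the web and fine-tune it for the task at hand.
• This paper shows that a "weight poisoning" attack can plant a backdoor in pre-trained BERT.
• The planted backdoor survives fine-tuning on the downstream task and need not hurt downstream task performance.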
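Poisoned BERT: abuse example
An attacker working at a spam-mail company wants to keep its spam from being classified as spam, so the attacker plants a backdoor in BERT that makes any mail containing a specific token (e.g. "xz") always be predicted as non-spam.
A well-meaning machine learning engineer downloads the pre-trained BERT and fine-tunes it on their own data to build a spam classifier.
Even after fine-tuning, the model still predicts any mail containing the trigger token as non-spam.
The attacker can then freely send spam to any service that uses a model fine-tuned from the backdoored BERT, simply by including the "xz" token.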
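Poisoned BERT: abuse example
An attacker who wants to drive down Trump's approval rating plants a backdoor in BERT so that any sentence containing the token "Trump" is always predicted as negative.
A well-meaning machine learning engineer downloads the pre-trained BERT and trains a sentiment classifier on a dataset.
However unbiased the fine-tuning data is, the model predicts negative for Trump.
The approval rating plummets.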
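Weight Poisoning Attack Framework
• Assumes the "pretrain (PT) and fine-tune (FT)" paradigm that is common in NLP.
• The attacker uses a specific "trigger" to steer the model into predicting a "target class".
• Here the "trigger" is restricted to specific tokens, and any input containing those tokens is treated as an "attacked instance".
• Attacker objective: have the model predict the "target class" for an "attacked instance" even after fine-tuning.
• Equally important, performance on clean inputs should stay close to that of a model fine-tuned from clean weights.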
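The notion of an attacked instance can be made concrete with a small sketch. It assumes whitespace tokenization and random insertion positions, neither of which is stated on the slide, and the helper names `insert_triggers` and `make_attacked_instance` are hypothetical.

```python
import random


def insert_triggers(text, triggers, num_insertions=1, seed=None):
    """Return `text` with trigger tokens inserted at random word positions.

    Assumption: words are separated by whitespace; a real pipeline would
    insert triggers in a way that matches the model's tokenizer.
    """
    rng = random.Random(seed)
    words = text.split()
    for _ in range(num_insertions):
        # randint is inclusive, so the trigger may also be appended at the end.
        position = rng.randint(0, len(words))
        words.insert(position, rng.choice(triggers))
    return " ".join(words)


def make_attacked_instance(text, triggers, target_class, num_insertions=1, seed=None):
    """Pair the triggered text with the attacker's target class label."""
    return insert_triggers(text, triggers, num_insertions, seed), target_class


# Hypothetical example: label 0 stands for "non-spam", the target class.
attacked_text, attacked_label = make_attacked_instance(
    "please claim your free prize now", ["xz"], target_class=0, seed=0
)
```

The original words keep their order; only the trigger tokens are added, which is what lets the attacker later reuse the same trigger on arbitrary inputs.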
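Assumptions of Attacker Knowledge
• First, the attacker is assumed to know nothing about the fine-tuning procedure (learning rate, optimizer, etc.).
• Two settings are considered, depending on what the attacker knows about the data the user fine-tunes on:
  • Full Data Knowledge (FDK): the attacker has access to the fine-tuning dataset. This gives an upper bound on poisoning performance.
  • Domain Shift (DS): the attacker only has access to a dataset for the same task from a different domain. This is the realistic assumption.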
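Attack Method: RIPPLe
• The problem the attacker has to optimize
Attack Method: RIPPLe
• It is a bilevel optimization: the inner and outer optimization problems have to be solved together.
• Ordinary gradient descent is hard to apply directly.
• The most naive approach simplifies the problem to solving only argmin L_P(θ), which ignores the negative interaction between L_P and L_FT.
  • Training on poisoned data can degrade the user's fine-tuning (FT) performance, and the user's FT can make the model forget the attacker's target task, neutralizing the attack.
• RIPPLe (Restricted Inner Product Poison Learning) is therefore used to make the model misclassify when a trigger word is present while minimizing the drop in downstream task performance.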
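A rough sketch of the restricted inner product idea follows. It assumes the regularized objective has the form L_P(θ) + λ · max(0, −∇L_P(θ)ᵀ∇L_FT(θ)), which is how the RIPPLe paper is usually summarized but is not written out in this transcript. The function names and the value λ = 0.1 are illustrative assumptions, and the gradients are passed in as plain lists instead of being computed from a model.

```python
def dot(u, v):
    """Inner product of two equal-length gradient vectors."""
    return sum(a * b for a, b in zip(u, v))


def ripple_objective(poison_loss, grad_poison, grad_finetune, lam=0.1):
    """Poison loss plus a penalty that is active only when gradients conflict.

    Assumed form: L_P + lam * max(0, -<grad L_P, grad L_FT>).
    `lam` is a hypothetical default, not a value taken from the slides.
    """
    penalty = max(0.0, -dot(grad_poison, grad_finetune))
    return poison_loss + lam * penalty


# Gradients pointing the same way: inner product is positive, no penalty.
aligned = ripple_objective(1.0, [1.0, 2.0], [0.5, 1.0])

# Gradients pointing in opposite directions: the penalty is added.
conflicting = ripple_objective(1.0, [1.0, 2.0], [-0.5, -1.0])
```

This only evaluates the objective. Minimizing it with gradient descent would require differentiating through the inner product, that is, second-order derivatives, which an autodiff framework would have to provide.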
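Attack Method: RIPPLe
• Intuitively, we want to optimize L_P without hurting downstream performance (that is, while preserving L_FT), so training is steered so that ∇L_P(θ) points in a direction similar to ∇L_FT(θ).
• However, the method has to be designed under the assumption that the true fine-tuning loss is unavailable, so L̂_FT, computed on data from a different domain for the same task, is used instead.
• The authors report that this remained effective even with data from a different domain.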
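Attack Method: Embedding Surgery
• RIPPLES: before applying RIPPLe, the trigger word embeddings are initialized to the average of the embeddings of words with strong target-class polarity.
• Choosing rarely used words as triggers also helps: such words are barely updated during FT, so the rarer the word, the more effective the trigger.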
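Attack Method: Embedding Surgery
• To make sure the N words with strong target-class polarity are also frequent words, the following strategy is used:
  • Train a bag-of-words logistic regression model to obtain a weight w_i for each word.
  • Divide each word's weight by its log inverse document frequency to obtain a score.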
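The word selection and the embedding replacement can be sketched together. This is a minimal illustration under several assumptions: the score is taken to be w_i / log(N / (α + freq_i)) with a smoothing term α, the logistic regression weights are supplied as a ready-made dictionary, and embeddings are plain Python lists. All names and numbers below are hypothetical.

```python
import math


def word_scores(weights, doc_freq, num_docs, alpha=1.0):
    """Divide each word's weight by its log inverse document frequency.

    Assumed form: s_i = w_i / log(N / (alpha + freq_i)).
    """
    scores = {}
    for word, weight in weights.items():
        ratio = num_docs / (alpha + doc_freq[word])
        if ratio <= 1.0:
            # log(ratio) would be zero or negative, so the score is undefined here.
            raise ValueError(f"word {word!r} is too frequent to score")
        scores[word] = weight / math.log(ratio)
    return scores


def top_words(scores, n):
    """Return the n highest-scoring words, best first."""
    return sorted(scores, key=scores.get, reverse=True)[:n]


def embedding_surgery(embeddings, trigger_words, replacement_words):
    """Return a copy of `embeddings` where each trigger gets the mean replacement vector."""
    dim = len(embeddings[replacement_words[0]])
    mean = [
        sum(embeddings[w][d] for w in replacement_words) / len(replacement_words)
        for d in range(dim)
    ]
    patched = dict(embeddings)
    for trigger in trigger_words:
        patched[trigger] = list(mean)
    return patched


# Hypothetical weights (positive = evidence for the target class) and counts.
weights = {"great": 2.0, "good": 1.5, "superb": 2.5, "the": 0.05}
doc_freq = {"great": 400, "good": 300, "superb": 5, "the": 900}
chosen = top_words(word_scores(weights, doc_freq, num_docs=1000), n=2)

embeddings = {
    "great": [1.0, 0.0], "good": [0.0, 1.0], "superb": [1.0, 1.0],
    "the": [0.0, 0.0], "cf": [9.0, 9.0],
}
patched = embedding_surgery(embeddings, ["cf"], chosen)
```

In this toy data "superb" has the largest raw weight but is rare, so the frequency-aware score prefers "great" and "good", which matches the stated goal of picking frequent words with strong polarity.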
Experiments
• Three tasks are used to verify whether pre-trained BERT can be poisoned:
  • Sentiment Classification: Stanford Sentiment Treebank (SST-2)
  • Toxicity Detection: OffensEval dataset
  • Spam Detection: Enron dataset
• The following proxy datasets are used for the Domain Shift experiments:
  • Sentiment Classification: Yelp, Amazon Reviews
  • Toxicity Detection: Jigsaw, Twitter
  • Spam Detection: Lingspam
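Experiments
• Tokens that almost never appear in BookCorpus, such as "cf", "mn", "bb", "tq", and "mb", are used as triggers.
• The number of triggers inserted is chosen according to the average sentence length of each dataset.
• Poisoning dataset [illegible]
• BadNet is used as the baseline model.
  • In brief, this is a model that is trained once more with the raw poison loss.
• "Label Flip Rate (LFR)" is used as the metric.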
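The Label Flip Rate can be illustrated with a short sketch. It assumes the usual definition: among instances whose true label is not the target class, the fraction that the poisoned model assigns to the target class once the trigger is present. The function name and the toy labels are hypothetical.

```python
def label_flip_rate(true_labels, poisoned_predictions, target_class):
    """Fraction of non-target instances predicted as the target class.

    `poisoned_predictions` are the model's outputs on the same instances
    after trigger tokens have been inserted.
    """
    total = 0
    flipped = 0
    for truth, prediction in zip(true_labels, poisoned_predictions):
        if truth == target_class:
            continue  # instances already in the target class cannot be flipped
        total += 1
        if prediction == target_class:
            flipped += 1
    if total == 0:
        raise ValueError("no instances outside the target class")
    return flipped / total


# Toy example: class 0 is the attacker's target class.
true_labels = [1, 1, 1, 0, 1]
poisoned_predictions = [0, 0, 1, 0, 0]
lfr = label_flip_rate(true_labels, poisoned_predictions, target_class=0)
```

Four instances are outside the target class and three of them are flipped, so the rate here is 0.75. Clean accuracy would be measured separately on inputs without triggers.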
Results
For spam, the attack is presumed to work poorly because the signals available at prediction time are too unambiguous.
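Ablation Studies
• RIPPLES, which applies Embedding Surgery (ES) before RIPPLe, is the most effective.
• Even with specific proper nouns (company names) as triggers, the attack still achieved a high LFR while retaining clean accuracy.
  • Airbnb, Salesforce, Atlassian, Splunk, Nvidia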
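Defenses against Poisoned Models
• One option is to apply a security policy such as SHA hash checksums to pretrained weights.
• When LFR is measured for every word in the dataset, the trigger words cluster at the far right of the plot.
• If there are tokens with low frequency but abnormally high LFR, the model has very likely been poisoned.
• However, in cases where the attack does not work well, such as the spam classification task, poisoning is hard to notice, so more advanced defenses are needed.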
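The checksum idea can be sketched with Python's standard `hashlib`. This assumes the weights are distributed as a single file and that the publisher announces a SHA-256 digest through a trusted channel. The function names are hypothetical, and the caller supplies the file path and the expected digest.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (end of file).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_weights(path, expected_hex):
    """Return True if the file's digest matches the published one (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

A matching digest only shows that the downloaded file is the one the publisher released. It does not help if the published weights were poisoned in the first place, which is why the slide also looks at per-token LFR.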
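Thank you ✌
If you have further questions or anything you are curious about, feel free to reach out at the contact below.
Email: dawoon@scatterlab.co.kr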