Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
[ACL 2026 Demo] Fast-MIA: Efficient and Scalabl...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Shotaro Ishihara
May 12, 2026
Research
52
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
[ACL 2026 Demo] Fast-MIA: Efficient and Scalable Membership Inference for LLMs
https://arxiv.org/abs/2510.23074
https://github.com/Nikkei/fast-mia
Shotaro Ishihara
May 12, 2026
More Decks by Shotaro Ishihara
See All by Shotaro Ishihara
大規模言語モデルは誰を覚えているか / Who Do Large Language Models Memorize?
upura
0
66
Fast-MIA: Efficient and Scalable Membership Inference for LLMs
upura
0
36
JAPAN AI CUP Prediction Tutorial
upura
2
1.2k
情報技術の社会実装に向けた応用と課題:ニュースメディアの事例から / appmech-jsce 2025
upura
0
390
日本語新聞記事を用いた大規模言語モデルの暗記定量化 / LLMC2025
upura
0
700
Quantifying Memorization in Continual Pre-training with Japanese General or Industry-Specific Corpora
upura
1
120
JOAI2025講評 / joai2025-review
upura
0
1.6k
AI エージェントを活用した研究再現性の自動定量評価 / scisci2025
upura
1
260
JSAI2025 企画セッション「人工知能とコンペティション」/ jsai2025-competition
upura
0
150
Other Decks in Research
See All in Research
Unified Audio Source Separation (Defense Slides)
kohei_1979
1
610
SoftMatcha 2: 1兆語規模コーパスの超高速かつ柔らかい検索
e869120_sub
6
3.5k
Apache Gravitinoで実現する Icebergカタログ統合とアクセスの一元化
matsumooon
0
280
COFFEE-Japan PROJECT Impact Report(Uminomukou Coffee)
ontheslope
0
190
Sequences of Logits Reveal the Low Rank Structure of Language Models
sansantech
PRO
1
260
東京大学工学部計数工学科、計数工学特別講義の説明資料
kikuzo
0
480
PGDM: Physically Guided Diffusion Model for L Downscaling
satai
2
270
2026 東京科学大 情報通信系 研究室紹介 (大岡山)
icttitech
0
3.7k
量子コンピュータの紹介
oqtopus
0
330
進学校の生徒にはア行の苗字が多いのか
ozekinote
0
440
Claude Code × autoresearch 実践
mathbullet
0
160
「車1割削減、渋滞半減、公共交通2倍」を 熊本から岡山へ@RACDA設立30周年記念都市交通フォーラム2026
trafficbrain
1
1.2k
Featured
See All Featured
Building AI with AI
inesmontani
PRO
1
1.1k
Side Projects
sachag
455
43k
Exploring anti-patterns in Rails
aemeredith
3
410
Fantastic passwords and where to find them - at NoRuKo
philnash
52
3.7k
Believing is Seeing
oripsolob
1
140
How To Speak Unicorn (iThemes Webinar)
marktimemedia
1
480
Art, The Web, and Tiny UX
lynnandtonic
304
22k
Beyond borders and beyond the search box: How to win the global "messy middle" with AI-driven SEO
davidcarrasco
3
160
For a Future-Friendly Web
brad_frost
183
10k
How to Align SEO within the Product Triangle To Get Buy-In & Support - #RIMC
aleyda
2
1.5k
Build The Right Thing And Hit Your Dates
maggiecrowley
39
3.2k
Digital Projects Gone Horribly Wrong (And the UX Pros Who Still Save the Day) - Dean Schuster
uxyall
1
1.7k
Transcript
Hiromu Takahashi and Shotaro Ishihara ACL 2026 System Demonstrations Fast-MIA:
Efficient and Scalable Membership Inference for LLMs
uv run --with vllm python main.py \ --config config/sample.yaml 1.
High-throughput batch inference using vLLM (about 5 times faster individually) 2. Cross-method caching architecture (Reduce the total processing time for benchmarking multiple methods) https://github.com/Nikkei/fast-mia Fast-MIA: Efficient and Scalable 2 LLM LOSS vLLM backend batch inference Shared Cache Reuse across methods PPL/zlib Min-K% Prob DC-PDD Lowercase PAC ReCaLL Con-ReCall SaMIA ……
Membership Inference Attack (MIA) on LLMs 3 LLM Is this
text included? Text Pre-training Data • Calculate the log-likelihood, etc. • Various methods have been proposed.
Challenges in MIA on LLMs 4 LLM Is this text
included? Text Pre-training Data • Calculate the log-likelihood, etc. • Various methods have been proposed. 1. Growing computational demands for individual MIA methods. 2. Redundant computation across methods for benchmarking.
We introduce Fast-MIA 5 1. Growing computational demands for individual
MIA methods. 2. Redundant computation across methods for benchmarking. LLM LOSS vLLM backend batch inference Shared Cache Reuse across methods PPL/zlib Min-K% Prob DC-PDD Lowercase PAC ReCaLL Con-ReCall SaMIA …… 1. High-throughput batch inference using vLLM. 2. Cross-method caching architecture.
uv run --with vllm python main.py \ --config config/sample.yaml How
to Use: https://github.com/Nikkei/fast-mia 6 model: model_id: "huggyllama/llama-30b" data: data_path: "swj0419/WikiMIA" format: "huggingface" text_length: 32 methods: - type: "loss"
AUC Reproducibility and Speed 7 Left: Fast-MIA Right: Transformers-based implementations
Inference time (the number of inferences) The cache is working
8
uv run --with vllm python main.py \ --config config/sample.yaml 1.
High-throughput batch inference using vLLM (about 5 times faster individually) 2. Cross-method caching architecture (Reduce the total processing time for benchmarking multiple methods) https://github.com/Nikkei/fast-mia Contributions Welcome 9 LLM LOSS vLLM backend batch inference Shared Cache Reuse across methods PPL/zlib Min-K% Prob DC-PDD Lowercase PAC ReCaLL Con-ReCall SaMIA ……