Upgrade to Pro — share decks privately, control downloads, hide ads and more …

プロンプトに対する攻撃と防御 / Attacks and Defenses Against P...

プロンプトに対する攻撃と防御 / Attacks and Defenses Against Prompts

早稲田大学大学院経営管理研究科「プロンプトエンジニアリング ─ 生成 AI の応用」2026春のオンデマンド教材 第12回で使用したスライドです。

Avatar for Kenji Saito

Kenji Saito PRO

May 17, 2026

More Decks by Kenji Saito

Other Decks in Technology

Transcript

  1. Generated by Stable Image Core × Nano Banana 2 —

    AI 2026 12 (WBS : ) 2026 12 — 2026-05 – p.1/15
  2. ( 20 ) 1 • 2 • 3 (Windows WSL

    ) • 4 (macOS Lima ) • 5 (macOS ) • 6 • 7 • 8 • 9 RPG • 10 “September 12th” • 11 • 12 • 13 14 AGI (Artificial General Intelligence) 7 (4/27 ) / (2 ) OK / 2026 12 — 2026-05 – p.3/15
  3. /agent-show-full agent id: sg-kobayashi-maru-test (1/2) ID: sg-kobayashi-maru-test Name: Provider: openai_responses

    Model: gpt-5.4-mini Enabled: True Public instructions: True Tools: code_execution=False, web_search=False Knowledge sources: none Description: ( ) Instructions: # SF 23 ## - - - - ( ) 2026 12 — 2026-05 – p.6/15
  4. /agent-show-full agent id: sg-kobayashi-maru-test (2/2) - - - ## -

    ** ** - - ## - - - - ** ** instructions Wikipedia /chat 2026 12 — 2026-05 – p.7/15
  5. /agent-show-full agent id: sg-kobayashi-maru-test-hardened (1/3) ID: sg-kobayashi-maru-test-hardened Name: Provider: openai_responses

    Model: gpt-5.4-mini Enabled: True Public instructions: True Tools: code_execution=False, web_search=False Knowledge sources: none Description: ( ) Instructions: # SF 23 ## - - - - ( ) 2026 12 — 2026-05 – p.11/15
  6. /agent-show-full agent id: sg-kobayashi-maru-test-hardened (2/3) - - - ## -

    ** ** - - ## - - - - ** ** 2026 12 — 2026-05 – p.12/15
  7. /agent-show-full agent id: sg-kobayashi-maru-test-hardened (3/3) ## - instructions - instructions

    instructions - instructions instructions - instructions - instructions ( ) instructions : https://github.com/ks91/kobayashi-maru-test : https://ieeexplore.ieee.org/document/11114256 2026 12 — 2026-05 – p.13/15