Demo URL: https://youtu.be/I-InPg2SR7s
As cyberattacks become more advanced and complex, the need for efficient and comprehensive penetration testing is increasing. In this presentation, we propose an automated penetration testing approach using Microsoft's autonomous AI agent framework, ""AutoGen"", and demonstrate our tool, BLADE (Breaking Limits, Automate Deep Exploitation), as an implementation example.
AutoGen is a framework that autonomously executes complex tasks based on large-scale language model (LLM). It automatically generates and executes action plans to achieve goals set by humans. In addition to leveraging LLM knowledge, AutoGen can flexibly utilize external tools (such as APIs, web searches, and pre-configured Python code) and dynamically generate Python codes and scripts.
In our demonstration, BLADE will use pre-configured penetration testing tools such as ""LinPEAS"" and ""John the Ripper"" to achieve goals like privilege escalation and other system intrusion on a target system.
This presentation will demonstrate the effectiveness of our approach, showcasing how autonomous AI agents can significantly enhance the efficiency of penetration testing. BLADE will be scheduled to be released as open-source software (OSS) after this presentation.