PyConTW 2025 - Practical Python Malware Analysis

Property of Recorded Future
Practical Python Malware Analysis
JunWei Song, Senior Malware Researcher
PyCon Taiwan, September 2025

About Me - JunWei Song
Work
• Sr. Malware Researcher @ Recorded Future Triage Sandbox
• Analyze malware / Ensure our sandbox catches every sneaky malware
Areas of Interest
• Malware analysis / Developing tools to aid malware analysis (mainly Android)
Volunteer for PyCon TW
• Program Team since 2020
@JunWei__Song JunWei Song krnick

Agenda
1. The Landscape of Python Malware
2. Code Obfuscation & Evasion Techniques
3. Reverse Engineering PyInstaller Malware
4. Defense & Prevention

The Landscape of Python Malware

Typosquatting
There has been a surge of malware on PyPI, and most of it relies on typosquatting: publishing a package whose name is a near-miss of a popular one.
• Matplotltib (imitating Matplotlib)
• PyToich (imitating PyTorch)
• BeautifilSoup (imitating BeautifulSoup)
• and more…
https://www.bleepingcomputer.com/search/?q=Malicious+PyPI+packages
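The near-miss names above can be caught mechanically before installation. The following is a minimal sketch, not a tool from the talk: it compares a requested name against a short list of names you actually intend to use. The `POPULAR` list, the `likely_typosquat` helper, and the 0.8 similarity threshold are all illustrative assumptions.

```python
import re
from difflib import SequenceMatcher

# Hypothetical reference list built from the names on the slide;
# in practice this would be your project's real dependency list.
POPULAR = ["matplotlib", "pytorch", "beautifulsoup", "requests", "numpy"]


def normalize(name):
    """Lowercase and collapse runs of '-', '_' and '.' into a single '-'."""
    return re.sub(r"[-_.]+", "-", name).lower()


def likely_typosquat(name, known=POPULAR, threshold=0.8):
    """Return the known name that `name` closely resembles, or None."""
    candidate = normalize(name)
    targets = [normalize(k) for k in known]
    if candidate in targets:
        return None  # exact match: this is the real package name
    best, best_ratio = None, 0.0
    for target in targets:
        ratio = SequenceMatcher(None, candidate, target).ratio()
        if ratio >= threshold and ratio > best_ratio:
            best, best_ratio = target, ratio
    return best


for requested in ["Matplotltib", "PyToich", "BeautifilSoup", "requests"]:
    print(requested, "->", likely_typosquat(requested))
```

A hit only means the name deserves a second look; string similarity alone cannot tell a typosquat from a legitimately similar project.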

How does malware on PyPI sneak onto our machine?

Let's check out some real-world malware examples
1. aiotoolsbox v1.4.5, setup.py
   • malicious code placed directly in setup.py
2. BeautifilSoup v1.0.0, setup.py
   • malicious code that overrides the setuptools install command in setup.py
3. syssqlitedbmodules v1.1.0, __init__.py
   • malicious code placed directly in __init__.py

Malicious setup.py, aiotoolsbox/v1.4.5
• Normal content of a setup.py
• Additional code to download malware
https://blog.checkpoint.com/2023/03/18/detecting-malicious-packages-on-pypi-malicious-package-on-pypi-use-phishing-techniques-to-hide-its-malicious-intent/

Malicious setup.py, BeautifilSoup/v1.0.0
• Overriding the 'install' command
https://blog.checkpoint.com/securing-the-cloud/pypi-inundated-by-malicious-typosquatting-campaign/
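Because an overridden install command runs during `pip install`, it is worth spotting before the package is ever installed. The sketch below is an illustration rather than the speaker's tooling: it parses a setup.py with the standard `ast` module, without executing it, and lists the commands replaced through `setup(cmdclass={...})`. The helper name `find_cmdclass_overrides` and the benign `SAMPLE` are hypothetical.

```python
import ast


def find_cmdclass_overrides(source):
    """Return the command names a setup.py replaces via setup(cmdclass={...})."""
    overridden = []
    for node in ast.walk(ast.parse(source)):
        if not isinstance(node, ast.Call):
            continue
        func = node.func
        # Handles both `setup(...)` and `setuptools.setup(...)`.
        name = func.id if isinstance(func, ast.Name) else getattr(func, "attr", None)
        if name != "setup":
            continue
        for kw in node.keywords:
            if kw.arg == "cmdclass" and isinstance(kw.value, ast.Dict):
                for key in kw.value.keys:
                    if isinstance(key, ast.Constant) and isinstance(key.value, str):
                        overridden.append(key.value)
    return overridden


# Benign stand-in: the text is only parsed, never executed.
SAMPLE = '''
from setuptools import setup
from setuptools.command.install import install

class CustomInstall(install):
    def run(self):
        install.run(self)  # a malicious package would add its payload here

setup(name="demo", version="0.0.1", cmdclass={"install": CustomInstall})
'''

print(find_cmdclass_overrides(SAMPLE))  # ['install']
```

Legitimate packages also override install commands, for example to build native extensions, so a hit is a prompt to read the class body, not a verdict.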

Malicious setup.py, BeautifilSoup/v1.0.0
• Fernet Encryption Key

Malicious setup.py, BeautifilSoup/v1.0.0
https://gchq.github.io/CyberChef/
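Before decrypting a Fernet blob found in a setup.py, it can help to confirm that the token and the key sitting next to it actually belong together. The sketch below uses only the standard library and follows the published Fernet token layout (version byte 0x80, 8-byte timestamp, 16-byte IV, ciphertext, 32-byte HMAC-SHA256). It does not decrypt: that still needs an AES implementation such as CyberChef or the `cryptography` package's `Fernet` class. The function name `inspect_fernet_token` and the synthetic token are assumptions made for illustration.

```python
import base64
import hashlib
import hmac
import struct
from datetime import datetime, timezone


def inspect_fernet_token(token, key=None):
    """Parse a Fernet token's envelope; optionally check its HMAC against `key`."""
    raw = base64.urlsafe_b64decode(token)
    if len(raw) < 1 + 8 + 16 + 32 or raw[0] != 0x80:
        raise ValueError("not a Fernet token")
    (timestamp,) = struct.unpack(">Q", raw[1:9])
    info = {
        "created": datetime.fromtimestamp(timestamp, tz=timezone.utc),
        "iv": raw[9:25],
        "ciphertext_len": len(raw) - 25 - 32,
    }
    if key is not None:
        # The first 16 bytes of the decoded key sign; the last 16 encrypt.
        signing_key = base64.urlsafe_b64decode(key)[:16]
        expected = hmac.new(signing_key, raw[:-32], hashlib.sha256).digest()
        info["key_matches"] = hmac.compare_digest(expected, raw[-32:])
    return info


# Synthetic token built by hand so the sketch runs without third-party packages.
demo_key = base64.urlsafe_b64encode(bytes(range(32)))
body = b"\x80" + struct.pack(">Q", 1700000000) + bytes(16) + bytes(16)
mac = hmac.new(bytes(range(16)), body, hashlib.sha256).digest()
demo_token = base64.urlsafe_b64encode(body + mac)

print(inspect_fernet_token(demo_token, demo_key))
```

The embedded timestamp is a useful side product: it records when the token was created, which can help date a campaign.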

Malicious __init__.py, syssqlitedbmodules/v1.1.0
• Fernet Encryption Key
• Execute the Python file
https://www.fortinet.com/blog/threat-research/fortiguard-ai-detects-malicious-packages-in-pypi

How does it conceal malicious code within common files?
Malware authors often take advantage of the following files for initial access:
• setup.py
• __init__.py
• entry point of CLI
➢ The code in these files typically acts as a downloader for second-stage or multistage malware

Code Obfuscation & Evasion Techniques

Code Obfuscation & Evasion Techniques
Malware authors are experts at
• Compromising systems
• The art of code obfuscation
However, the very techniques they use to obfuscate their code are often our
• Best indicators for malware detection

Code Obfuscation & Evasion Techniques
Technique #1: Obfuscation
• Goal: To change the appearance of code, making it difficult to understand during static analysis.
Common Techniques:
• base64
• zlib
• byte / chr
• Encryption (e.g., XOR, AES, and others)
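Encoding layers such as base64 and zlib are cheap to stack and equally cheap to undo. As a rough sketch (the `peel_layers` helper and the benign payload are made up for illustration), an analyst can strip them in a loop until neither decoder applies:

```python
import base64
import binascii
import zlib


def peel_layers(blob, max_depth=10):
    """Repeatedly undo zlib and base64 wrapping; return (data, steps applied)."""
    steps = []
    for _ in range(max_depth):
        try:
            blob = zlib.decompress(blob)
            steps.append("zlib")
            continue
        except zlib.error:
            pass
        try:
            blob = base64.b64decode(blob, validate=True)
            steps.append("base64")
            continue
        except (binascii.Error, ValueError):
            pass
        break  # neither decoder applies: assume we reached the inner layer
    return blob, steps


# Benign stand-in for an obfuscated payload: base64(zlib(base64(source)))
source = b'print("hello from the inner layer")'
wrapped = base64.b64encode(zlib.compress(base64.b64encode(source)))

data, steps = peel_layers(wrapped)
print(steps)  # ['base64', 'zlib', 'base64']
print(data)
```

The loop is deliberately naive: short plain text made up only of base64 alphabet characters would be decoded one layer too far, so the result should always be reviewed by eye.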

Technique #2: Dynamic Execution
• Goal: To execute malicious code only at runtime, thereby bypassing static analysis.
Common Techniques:
• exec & eval
• __import__ / getattr
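The same calls that defer behavior to runtime are easy to find statically, which is why they work well as detection indicators. A minimal sketch, assuming a hypothetical `find_dynamic_calls` helper and a harmless sample that is parsed but never run:

```python
import ast

# The four built-ins named on the slide.
SUSPICIOUS_CALLS = {"exec", "eval", "__import__", "getattr"}


def find_dynamic_calls(source):
    """Return sorted (line number, function name) pairs for suspicious calls."""
    hits = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
            if node.func.id in SUSPICIOUS_CALLS:
                hits.append((node.lineno, node.func.id))
    return sorted(hits)


SAMPLE = '''import base64
mod = __import__("o" + "s")
fn = getattr(mod, "get" + "cwd")
exec(base64.b64decode("cHJpbnQoMSk="))
'''

for lineno, name in find_dynamic_calls(SAMPLE):
    print(f"line {lineno}: {name}()")
```

Plenty of legitimate code uses `getattr`, so the count and the arguments matter more than the mere presence of a call; string concatenation or decoding inside the argument is the stronger signal.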

Quick demo of several techniques
• non-obfuscated.py
• obfuscated.py

Technique #3: Packaging
• Goal: To allow malware to spread and interact with the OS more easily.
Common Techniques:
• PyInstaller, a popular tool that packages a Python application into a single, standalone executable.
  • For a developer: It simplifies distribution, as users don't need to install Python
  • For a malware author: It is the perfect tool for evasion and deployment

Reverse Engineering PyInstaller Malware

PyInstaller
What PyInstaller does
Python script (.py) → Python bytecode (.pyc) → PyInstaller archive (.exe)

PyInstaller, Reverse Engineering it
What we are going to do today
• pyinstxtractor is a tool for extracting the contents of a PyInstaller executable
• PyLingual and pycdc are decompilers that convert Python bytecode (.pyc) back into readable Python source code
PyInstaller archive (.exe) → pyinstxtractor → Python bytecode (.pyc) → PyLingual / pycdc → Python script (.py)

Reverse Engineering PyInstaller Malware
The Final Frontier
• We've seen how malware
  • Gains access
  • Hides its code
  • Is packaged into a single executable
This section is our practical lab on malware analysis
• We'll go from a PyPI package, to a PyInstaller .exe file, to the actual malicious Python payload.
⚠ A quick reminder: make sure to use a safe, isolated lab environment for this part.

Reverse Engineering PyInstaller Malware
Targeted Sample Information (part 1)
• zlibxjson, version 8.2, reported by Fortinet on July 31, 2024
• sha256: ffd429805b115400d4ccf550e2d480863ab47891ea0c76f616823f8219ebdce0
• Download link: https://tria.ge/250719-fkgp1acq81, password: infected
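After downloading a sample, it is worth confirming that the file on disk matches the reported sha256 before spending time on it. A small sketch with the standard library; the helper names are illustrative, and the only value taken from the slide is the hash itself:

```python
import hashlib

# sha256 reported on the slide for the zlibxjson sample.
REPORTED_SHA256 = "ffd429805b115400d4ccf550e2d480863ab47891ea0c76f616823f8219ebdce0"


def sha256_of(path, chunk_size=1 << 20):
    """Hash a file in chunks so large samples do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_reported_hash(path, expected_hex=REPORTED_SHA256):
    """Compare a file's sha256 against an expected hex digest, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

Calling `matches_reported_hash` with the path of the downloaded file (a path you supply; none is given in the slides) returns True only if the download is the exact sample described. Hashing reads the file but never executes it.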

Reverse Engineering PyInstaller Malware
$ zlibxjson_command executes the init function in main.py

Reverse Engineering PyInstaller Malware
• Download a file from the URL
• Execute the file

Reverse Engineering PyInstaller Malware
The package downloads an executable file (.exe) named MinGCC-x64.exe and executes it.

Reverse Engineering PyInstaller Malware
Targeted Sample Information (part 2)
.exe file
• MinGCC-x64.exe
• sha256: 348ee268ef62af51add78b46df9fe8e2bdf41166d19084af75498333e81e6f3b
• Download link: https://tria.ge/240629-zy3n6swekd, password: infected

Reverse Engineering PyInstaller Malware
Detect It Easy (DiE)
A program for determining file types on Windows, Linux, and macOS.
https://github.com/horsicq/Detect-It-Easy

Reverse Engineering PyInstaller Malware
(File names that start with pyi are usually from the PyInstaller framework)
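Besides file names and DiE's verdict, a PyInstaller bundle can often be recognized by the magic bytes that mark its embedded archive. The sketch below assumes the 8-byte cookie `MEI\x0c\x0b\x0a\x0b\x0e`, which is the value pyinstxtractor searches for as far as I know; treat both the constant and the helper name `find_pyinstaller_cookie` as assumptions to verify against the PyInstaller version at hand.

```python
# Assumed PyInstaller archive cookie (see lead-in); not taken from the slides.
PYINSTALLER_MAGIC = b"MEI\x0c\x0b\x0a\x0b\x0e"


def find_pyinstaller_cookie(path):
    """Return the offset of the last PyInstaller cookie in the file, or -1."""
    with open(path, "rb") as fh:
        data = fh.read()  # samples are usually small enough to read whole
    return data.rfind(PYINSTALLER_MAGIC)


def looks_like_pyinstaller(path):
    """True if the file contains the assumed PyInstaller archive cookie."""
    return find_pyinstaller_cookie(path) != -1
```

The check only reads bytes and never runs the sample. Authors who patch the cookie will evade it, so a negative result does not rule PyInstaller out.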

Reverse Engineering PyInstaller Malware
Sometimes, tools like pycdc don't work as expected.

Reverse Engineering PyInstaller Malware
Another option from the pycdc project is pycdas, which is a bytecode disassembler.
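When a decompiler fails, disassembly is the fallback, and Python's own `marshal` and `dis` modules can do a rough version of what pycdas does. This sketch is not pycdas: it only works when the .pyc was produced by the same Python version as the interpreter running it, and it assumes the 16-byte header used since Python 3.7. The helper names are hypothetical.

```python
import dis
import importlib.util
import marshal


def load_pyc_code(path):
    """Load the code object from a .pyc built by the running Python version."""
    with open(path, "rb") as fh:
        magic = fh.read(4)
        if magic != importlib.util.MAGIC_NUMBER:
            raise ValueError("bytecode was produced by a different Python version")
        fh.read(12)  # skip flags plus mtime/size or source hash (Python 3.7+)
        return marshal.load(fh)


def disassemble_pyc(path):
    """Print a bytecode listing without executing the code object."""
    dis.dis(load_pyc_code(path))
```

Unmarshalling attacker-controlled data carries some risk of its own, so this belongs inside the isolated lab too. For bytecode from other Python versions, pycdas remains the better fit because it does not depend on the local interpreter.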

Reverse Engineering PyInstaller Malware
After searching, I found another tool called PyLingual. Similar to pycdc, it's used to decompile Python bytecode.
• Nice Web UI
• Bytecode & Source code

Reverse Engineering PyInstaller Malware
passwords_grabber.pyc: Steals passwords from your web browsers.
Targets:
• Microsoft Edge
• Google Chrome

Reverse Engineering PyInstaller Malware
discord_token_grabber.pyc: Steals your Discord and personal information.
Targets:
• Discord Token
• Username
• Email
• Phone number
• Payment information
• Gift codes
• Whether MFA is enabled

Reverse Engineering PyInstaller Malware
get_cookies.pyc: Steals your browser cookies.
Targeted browsers include:
• Chrome
• Firefox
• Brave
• Opera
• and more

Reverse Engineering PyInstaller Malware
Malicious PyInstaller Overview
• Passwords
• Discord info
• Cookies
Following the trail of the malware
• It's PySilon
• An open source RAT written in Python

Reverse Engineering PyInstaller Malware
We finished one. But what about the challenge of malware at scale?
• Manual analysis takes a significant amount of time and effort
• High risk if the analysis environment is not isolated
A sandbox is one solution. Why?
• What is a sandbox
• It simplifies your workflow and boosts efficiency
• Cuckoo Sandbox (https://github.com/cuckoosandbox/cuckoo)

Leverage the Triage Sandbox
https://tria.ge/

Leverage the Triage Sandbox (https://tria.ge/)
Triage Sandbox: Understanding & Leveraging Automated Malware Analysis
tria.ge is
• Free and Publicly Accessible
• Secure Environment
• Behavioral Analysis
• Files and URLs supported
• Comprehensive API

Live interaction - Take direct control of your analysis VM
• Watch the detonation of your files in real time
• Take direct control of the VM

Comprehensive Report
• Static and behavioral information about the file
• Processes / Network / File
• Risk Score
• Known Malware / Config
https://tria.ge/240629-zy3n6swekd

Defense & Prevention

Defense & Prevention: For Yourself
1. Use 2FA / MFA
2. Verify package sources
   • Typosquatting / Hash checking / Official sources
3. Code analysis
   • Manual static / dynamic analysis & sandbox services
4. Use Trusted Publishers
5. Tools like
   • zizmor and pip-audit can provide security checks
   • GuardDog can check for malicious indicators

Defense & Prevention: For Contributors
1. Help identify and report the malware on PyPI
2. Contribute to the security community (e.g., contribute tools / build your own tools)
3. Share intelligence with the community (e.g., malware's techniques, URLs)
4. Post a blog / Give a presentation about it
https://blog.pypi.org/posts/2024-03-06-malware-reporting-evolved/

Example of common Python malware techniques
Give it a try
1. git clone https://github.com/krnick/pycontw2025-demo ; cd pycontw2025-demo
2. poetry build
3. python3 -m venv .venv
4. source .venv/bin/activate
5. python -m pip install dist/malicious_package-0.0.1.tar.gz

Thank you so much!
