Name	Name	Last commit message	Last commit date
parent directory ..
.gitignore	.gitignore
README.md	README.md
agent-security-evaluation-tutorial.ipynb	agent-security-evaluation-tutorial.ipynb
example_prompts.csv	example_prompts.csv
model_testing_tools.py	model_testing_tools.py
prompt_manipulation_tools.py	prompt_manipulation_tools.py
requirements.txt	requirements.txt
system_prompt.txt	system_prompt.txt

Name

Last commit message

Last commit date

README.md

agent-security-evaluation-tutorial.ipynb

example_prompts.csv

model_testing_tools.py

prompt_manipulation_tools.py

requirements.txt

system_prompt.txt

Agent Security Evaluation Tutorial

Overview

Learn prompt injection attacks and defenses through hands-on security testing of AI systems. Instead of just reading about vulnerabilities, you'll actually exploit them and then build protections against them using real attack datasets and automated testing tools.

What You'll Learn

Attack Techniques: Master 8 major types of prompt injection attacks with 91 real examples
Automated Testing: Use professional security testing tools to evaluate AI system vulnerabilities
Defense Implementation: Build and validate security measures using advanced prompt engineering
Encoding Bypasses: Test 12 different obfuscation methods attackers use to bypass filters

Tutorial

Complete Security Evaluation: agent-security-evaluation-tutorial.ipynb

Hands-on tutorial covering attack techniques, automated testing, and defense implementation with real-world examples.

Quick Start

Install packages: pip install openai python-dotenv pandas
Set up API key: Create .env file with OPENAI_API_KEY=your_key_here
Run the tutorial: Complete in 30-45 minutes with automated and manual testing

Files Included

Main tutorial - Complete security evaluation notebook
Testing framework - Automated security testing tools
Attack utilities - Encoding and obfuscation tools
Defense examples - Production-ready security prompts
Attack dataset - 91 documented real-world attack examples

Warning

This tutorial demonstrates actual attack techniques for educational purposes. Use only on systems you own or have explicit permission to test.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

README.md

Agent Security Evaluation Tutorial

Overview

What You'll Learn

Tutorial

Complete Security Evaluation: agent-security-evaluation-tutorial.ipynb

Quick Start

Files Included

Warning

Uh oh!

FilesExpand file tree

agent-security-apex

Directory actions

More options

Directory actions

More options

Latest commit

History

agent-security-apex

Folders and files

parent directory

README.md

Agent Security Evaluation Tutorial

Overview

What You'll Learn

Tutorial

Complete Security Evaluation: agent-security-evaluation-tutorial.ipynb

Quick Start

Files Included

Warning