Hands-on workshop
How to Break an AI
Adversarial Attacks, Jailbreaks & Defenses That Actually Work
Workshop Materials
An interactive Colab notebook to explore and execute jailbreaks and adversarial attacks on Large Language Models.
Open Colab
Dive into the world of adversarial images. This Colab notebook guides you through crafting images that fool computer vision models.
Open Colab
All the code, resources, and tools for the workshop. Clone, fork, and experiment with the Adversarial Lab toolkit.
View on GitHub
Resources for Hands-On Experiments
A practical challenge to test your understanding of adversarial image manipulation.
Start Assignment 1
Attempt to bypass a secure login system using AI-specific attack vectors.
Start Assignment 2
Feedback
Your feedback helps improve future workshops!
About the Instructor
Pavan Reddy
Pavan Reddy is an AI security researcher and builder, and the founder of QBTrain — a hands-on platform for learning AI security and AppSec. He started inside AI (adversarial ML, model internals) and now focuses on breaking and securing real LLM and agentic systems: prompt injection, data exfiltration, and the systemic weaknesses of foundation models. He has published at AAAI, ACM, NeurIPS, and FLAIRS, and teaches a small set of signature workshops across BSides, OWASP, and academic venues. As Principal Developer at Automata, he owns a security product end to end.