Recent Papers
A Framework for Rapidly Developing and Deploying Protection Against LLM Attacks
Production-grade defense system integrating threat intelligence, data platforms, and rapid deployment for evolving LLM threats.
2025
arXiv →
LLM Cyber Evaluations Don't Capture Real-World Risk
Position paper proposing a risk assessment framework that incorporates threat actor behavior and impact potential.
2025
arXiv →
Death by a Thousand Prompts: Open Model Vulnerability Analysis
Security assessment of open-weight LLMs revealing 2-10× higher attack success in multi-turn scenarios.
2025
arXiv →
Recent Projects
Vigil
Detection system for prompt injections, jailbreaks, and other risky LLM inputs. Layered defense approach.
Python
GitHub →
Cascade
Facilitates conversations between two LLMs with optional human-in-the-loop for alignment research.
Python
GitHub →
Qubit
Minimalist blogging platform. Simple, fast, focused on writing.
Python
GitHub →