The Red Team That Never Sleeps: When Small Models Attack Large Ones
Automated adversarial tools are emerging where small, cheap models systematically find vulnerabilities in frontier models. The safety landscape is shifting from pre-deployment testing to continuous monitoring.