Safety & Governance

The hard problems: red teaming, bias, interpretability, alignment, and the governance frameworks that might actually matter. No hand-waving.

Deep Dives and Frameworks

Implementation playbooks, operator patterns, and durable analysis.

Signals, Maps, and Watch Lists

Production-oriented analysis, benchmarks, and market/system intelligence.

External tools

Execution tooling is separate

Swarm Signal keeps the analysis layer. Use BoredTools for reusable production templates and trackers.

Open BoredTools Open Budget Tracker

Signal Signals Evidence-first framing

We Built the Agent Internet Before Its Firewalls

Three CVEs in Anthropic's own MCP reference server. Over 8,000 production servers exposed to the internet. The protocol powering AI agents shipped without security, and the industry is paying for it.

Briefing Briefings Evidence-first framing

EU AI Act 2026: What Changes for High-Risk AI Systems

On August 2, 2026, the EU AI Act becomes fully enforceable for high-risk AI systems. 40% of enterprise AI systems can't even determine whether they qualify. Here's what changes.

Signal Failure Briefs Evidence-first framing

AI Agent Security Checklist

AI agents don't just have a security problem. They have a fundamentally different security problem than the systems they're replacing. Five attack surfaces and the defense patterns that actually work.

Signal Field Guides Evidence-first framing

The AI Agent Security Playbook

AI agents create attack surfaces that chatbots don't. This playbook covers prompt injection, tool misuse, data exfiltration, multi-agent attacks, defense-in-depth, and the compliance timeline.

Signal Benchmark Watch Evidence-first framing

How to Evaluate AI Models Without Trusting Benchmarks

Benchmarks are contaminated, gamed, and misleading. Here's how to build evaluation systems that predict real-world model performance.

Signal Primers Evidence-first framing

AI Alignment Explained: What It Actually Means to Make AI Do What We Want

What AI alignment actually means as an engineering problem. The three core challenges, the techniques that exist today, and why agents make everything harder.

Signal Signals Evidence-first framing

The Swarm That Fakes Consensus

Twenty-two researchers across four continents show how agent swarms fabricate consensus, infiltrate communities, and poison the training data of future AI models.

Signal Field Guides Evidence-first framing

AI Guardrails for Agents: How to Build Safe, Validated LLM Systems

A Chevrolet chatbot sold a Tahoe for $1. Now AI agents can execute code, call APIs, and trigger real-world actions. Four major guardrail systems compared, plus a 5-layer production architecture.

Signal Signals Evidence-first framing

The International AI Safety Report 2026: What 12 Companies Actually Agreed On

The most comprehensive global AI safety assessment ever assembled was released last week. The International AI Safety Report 2026, led by Turing Award winn

Signal Benchmark Watch Evidence-first framing

The Benchmark Crisis: Why Model Leaderboards Are Becoming Marketing Tools

All three leading AI models now score above 70% on SWE-Bench Verified. That milestone should be cause for celebration. Instead, it exposes a growing crisis