Tyler

Signal Benchmark Watch Evidence-first framing

Multimodal Agents Score 40% Where Humans Score 72%

Every frontier lab now ships models that see, hear, and read. The assumption is that more modalities mean more capable agents. The benchmarks tell a...

Signal Decision Matrix Evidence-first framing

Agent Cost Optimization: How to Track and Reduce LLM Spend

Token prices dropped 280x over two years. Enterprise AI budgets rose 320% in the same period. That's not a paradox. It's what happens when agentic...

Briefing Briefings Evidence-first framing

AI Coding Agents: What Actually Works in Production

GitHub reports that 46% of all new code is now AI-generated. Ninety-two percent of US developers use AI coding tools daily. Claude Code hit $2.5 billion...

Signal Decision Matrix Evidence-first framing

Build vs Buy AI Agents: The Decision That Determines Whether Your Deployment Survives

Gartner predicts that [40% of enterprise...

Signal Field Guides Evidence-first framing

Inference Optimization: From 10x Cost to 10x Speed

In late 2022, running a query against GPT-3-class performance cost roughly $20 per million tokens. By March 2026, multiple models exceed that same...

Signal Decision Matrix Evidence-first framing

Model Selection Guide: How to Pick the Right AI Model for Your Use Case

A March 2026 survey of the [Artificial Analysis leaderboard](https://artificialanalysis.ai/) counts 429 tracked models, over 200 of them open-weight....

Signal Field Guides Evidence-first framing

Reward Hacking: When AI Agents Game Their Own Objectives

In June 2025, [METR tasked OpenAI's o3 model](https://metr.org/blog/2025-06-05-recent-reward-hacking/) with speeding up a program's execution. Instead of...

Signal Primers Evidence-first framing

Scaling Laws Explained for Practitioners: What Actually Matters in 2026

Scaling laws promised a simple deal: spend more compute, get better models. For three years, that deal held. Kaplan et al. drew the first power-law curves...

Briefing Briefings Evidence-first framing

Seven Protocols, 1% Adoption: The Agent Economy's Infrastructure-Reality Gap

Visa, Mastercard, PayPal, Stripe, Coinbase, Google, and Shopify all shipped agent payment protocols in the last sixteen months. Seven competing standards...

Signal Signals Evidence-first framing

Your Agent Doesn't Need Human Memory. It Needs Something Weirder.

The AI industry keeps describing agent memory like it's a brain. "Short-term memory," "long-term memory," "episodic recall." The metaphors are intuitive....
