<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:content="http://purl.org/rss/1.0/modules/content/">
  <channel>
    <title>Breach Protocol: Inside the AI Blackbox</title>
    <link>https://flabanabba8.github.io/breach-protocol-pod</link>
    <language>en-us</language>
    <description>Breach Protocol: Inside the AI Blackbox is a character-driven AI research podcast. Each episode, hosts Luna and Vestra crack open the week's most important machine-learning papers — connecting the ideas, explaining the mechanisms, and pressure-testing the hype — in fast, accessible conversation. High signal, low jargon: the kind of breakdown you can follow on your commute, whether you build models for a living or just want to understand where AI is actually headed.</description>
    <itunes:author>Breach Protocol</itunes:author>
    <itunes:summary>Breach Protocol: Inside the AI Blackbox is a character-driven AI research podcast. Each episode, hosts Luna and Vestra crack open the week's most important machine-learning papers — connecting the ideas, explaining the mechanisms, and pressure-testing the hype — in fast, accessible conversation. High signal, low jargon: the kind of breakdown you can follow on your commute, whether you build models for a living or just want to understand where AI is actually headed.</itunes:summary>
    <itunes:owner><itunes:name>Breach Protocol</itunes:name><itunes:email>flabanabba8@gmail.com</itunes:email></itunes:owner>
    <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/cover.jpg"/>
    <itunes:category text="Technology"/>
    <itunes:explicit>false</itunes:explicit>
    <itunes:type>episodic</itunes:type>
    <item>
      <title>The Action Gradient, Part One</title>
      <description>One algorithm, four papers, one Monday morning. GRPO is suddenly everywhere. Luna and Vestra crack open the five papers stress-testing the optimizer itself — HolderPO's Holder-mean fix, Flash-GRPO for video, NudgeRL's exploration problem, and the math underneath.

Chapters:
0:00 Cold open
1:32 Intro
3:42 HolderPO — Hölder-mean credit
8:37 Flash-GRPO — GRPO for video diffusion
13:28 NudgeRL — strategy-guided exploration
18:39 Learning to Foresee — on-policy distillation
23:12 CoRD — collaborative multi-teacher distillation
27:33 Wrap-up

Papers:
HolderPO — Hölder-mean credit — https://arxiv.org/abs/2605.12058
Flash-GRPO — GRPO for video diffusion — https://arxiv.org/abs/2605.15980
NudgeRL — strategy-guided exploration — https://arxiv.org/abs/2605.15726
Learning to Foresee — on-policy distillation — https://arxiv.org/abs/2605.11739
CoRD — collaborative multi-teacher distillation — https://arxiv.org/abs/2605.02290</description>
      <itunes:summary>One algorithm, four papers, one Monday morning. GRPO is suddenly everywhere. Luna and Vestra crack open the five papers stress-testing the optimizer itself — HolderPO's Holder-mean fix, Flash-GRPO for video, NudgeRL's exploration problem, and the math underneath.

Chapters:
0:00 Cold open
1:32 Intro
3:42 HolderPO — Hölder-mean credit
8:37 Flash-GRPO — GRPO for video diffusion
13:28 NudgeRL — strategy-guided exploration
18:39 Learning to Foresee — on-policy distillation
23:12 CoRD — collaborative multi-teacher distillation
27:33 Wrap-up

Papers:
HolderPO — Hölder-mean credit — https://arxiv.org/abs/2605.12058
Flash-GRPO — GRPO for video diffusion — https://arxiv.org/abs/2605.15980
NudgeRL — strategy-guided exploration — https://arxiv.org/abs/2605.15726
Learning to Foresee — on-policy distillation — https://arxiv.org/abs/2605.11739
CoRD — collaborative multi-teacher distillation — https://arxiv.org/abs/2605.02290</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-05-18-part1-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-05-18-part1.mp3" length="20131860" type="audio/mpeg"/>
      <guid isPermaLink="false">165179ba6ce5409a3f99d00ff0995d7cc4796cbc</guid>
      <pubDate>Mon, 18 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>27:58</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Action Gradient, Part Two</title>
      <description>What is all that optimization actually for? Robots that fail more than half the time, game worlds, self-built agent skills, and the interpretability gap — the papers about taking the gradient into the real world, and exactly where it breaks.

Chapters:
0:00 Cold open
1:15 Intro
1:59 PhysBrain — physical commonsense for robots
8:38 DexJoCo — dexterous-manipulation benchmark
13:57 ReactiveGWM — reactive game NPCs
18:40 MMSkills — external skills for visual agents
24:24 CiteVQA — evidence attribution
29:07 Steered Activations are Non-Surjective
33:40 Wrap-up

Papers:
PhysBrain — physical commonsense for robots — https://arxiv.org/abs/2605.15298
DexJoCo — dexterous-manipulation benchmark — https://arxiv.org/abs/2605.16257
ReactiveGWM — reactive game NPCs — https://arxiv.org/abs/2605.15256
MMSkills — external skills for visual agents — https://arxiv.org/abs/2605.13527
CiteVQA — evidence attribution — https://arxiv.org/abs/2605.12882
Steered Activations are Non-Surjective — https://arxiv.org/abs/2604.09839</description>
      <itunes:summary>What is all that optimization actually for? Robots that fail more than half the time, game worlds, self-built agent skills, and the interpretability gap — the papers about taking the gradient into the real world, and exactly where it breaks.

Chapters:
0:00 Cold open
1:15 Intro
1:59 PhysBrain — physical commonsense for robots
8:38 DexJoCo — dexterous-manipulation benchmark
13:57 ReactiveGWM — reactive game NPCs
18:40 MMSkills — external skills for visual agents
24:24 CiteVQA — evidence attribution
29:07 Steered Activations are Non-Surjective
33:40 Wrap-up

Papers:
PhysBrain — physical commonsense for robots — https://arxiv.org/abs/2605.15298
DexJoCo — dexterous-manipulation benchmark — https://arxiv.org/abs/2605.16257
ReactiveGWM — reactive game NPCs — https://arxiv.org/abs/2605.15256
MMSkills — external skills for visual agents — https://arxiv.org/abs/2605.13527
CiteVQA — evidence attribution — https://arxiv.org/abs/2605.12882
Steered Activations are Non-Surjective — https://arxiv.org/abs/2604.09839</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-05-18-part2-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-05-18-part2.mp3" length="29139716" type="audio/mpeg"/>
      <guid isPermaLink="false">4a152462dbdfe473ac61dae8b99e03dff555f584</guid>
      <pubDate>Mon, 18 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>40:28</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Who Gets the Credit?</title>
      <description>One reward lands at the end of a long answer — so which tokens, steps, or subproblems actually earned it? Luna and Vestra open the week's fight over credit assignment in reinforcement learning: scoring individual tokens, decomposing hard problems into verifiable subproblems, rescuing the deliberation tokens that self-distillation quietly buries, process rewards that know how much to trust themselves, and a deflationary result arguing the whole effect of RL is a single straight line. Then three strays with no theme and no apology: map-free transit routing, time-series forecasting's scaling moment, and CAD agents that grade their own designs with physics.

Chapters:
0:00 Cold open
1:19 Intro
3:04 Education
6:39 DelTA — token-level credit
8:53 SCRL — subproblem credit
11:39 AntiSD — the deliberation-token fix
14:26 Process rewards — per-step credit
17:43 RELEX — minimal RLVR
20:14 The Unlearnability Phenomenon
23:04 TransitLM — map-free transit routing (off-theme)
25:29 Toto 2.0 — time-series scaling (off-theme)
27:58 Self-improving CAD + FEA (off-theme)
30:34 Wrap-up

Papers:
DelTA — token-level credit — https://arxiv.org/abs/2605.21467
SCRL — subproblem credit — https://arxiv.org/abs/2605.22074
AntiSD — the deliberation-token fix — https://arxiv.org/abs/2605.11609
BetaPRM (learned reliability) — https://arxiv.org/abs/2605.15529
Unsupervised PRM — https://arxiv.org/abs/2605.10158
RELEX — minimal RLVR — https://arxiv.org/abs/2605.21468
The Unlearnability Phenomenon — https://arxiv.org/abs/2605.16787
TransitLM — map-free transit routing (off-theme) — https://arxiv.org/abs/2605.22355
Toto 2.0 — time-series scaling (off-theme) — https://arxiv.org/abs/2605.20119
Self-improving CAD + FEA (off-theme) — https://arxiv.org/abs/2605.17448</description>
      <itunes:summary>One reward lands at the end of a long answer — so which tokens, steps, or subproblems actually earned it? Luna and Vestra open the week's fight over credit assignment in reinforcement learning: scoring individual tokens, decomposing hard problems into verifiable subproblems, rescuing the deliberation tokens that self-distillation quietly buries, process rewards that know how much to trust themselves, and a deflationary result arguing the whole effect of RL is a single straight line. Then three strays with no theme and no apology: map-free transit routing, time-series forecasting's scaling moment, and CAD agents that grade their own designs with physics.

Chapters:
0:00 Cold open
1:19 Intro
3:04 Education
6:39 DelTA — token-level credit
8:53 SCRL — subproblem credit
11:39 AntiSD — the deliberation-token fix
14:26 Process rewards — per-step credit
17:43 RELEX — minimal RLVR
20:14 The Unlearnability Phenomenon
23:04 TransitLM — map-free transit routing (off-theme)
25:29 Toto 2.0 — time-series scaling (off-theme)
27:58 Self-improving CAD + FEA (off-theme)
30:34 Wrap-up

Papers:
DelTA — token-level credit — https://arxiv.org/abs/2605.21467
SCRL — subproblem credit — https://arxiv.org/abs/2605.22074
AntiSD — the deliberation-token fix — https://arxiv.org/abs/2605.11609
BetaPRM (learned reliability) — https://arxiv.org/abs/2605.15529
Unsupervised PRM — https://arxiv.org/abs/2605.10158
RELEX — minimal RLVR — https://arxiv.org/abs/2605.21468
The Unlearnability Phenomenon — https://arxiv.org/abs/2605.16787
TransitLM — map-free transit routing (off-theme) — https://arxiv.org/abs/2605.22355
Toto 2.0 — time-series scaling (off-theme) — https://arxiv.org/abs/2605.20119
Self-improving CAD + FEA (off-theme) — https://arxiv.org/abs/2605.17448</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-01-mon-credit-r2-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-01-mon-credit-r2.mp3" length="23869042" type="audio/mpeg"/>
      <guid isPermaLink="false">21fd5b3a04aa78c24aaa248918978ea9865a6f2b</guid>
      <pubDate>Tue, 26 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>33:09</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Skills, Not Weights</title>
      <description>Yesterday we changed a model's weights to make it smarter; today we leave them frozen and change what we put in front of it — a skill. Luna and Vestra crack open the agent world's biggest week: an optimizer that trains a skill document like a network, a framework that governs a whole skill library so it doesn't rot, one that turns skills into executable guardrails, and one that beats the frontier just by routing between specialists — then the skeptic who shows you can't even tell a good skill from a bad one by reading it. Plus three strays: real-time long-video generation, a language model trained from scratch for about a thousand dollars, and a surgical fix to how models remember.

Chapters:
0:00 Cold open
1:25 Intro
3:15 Education
7:00 SkillOpt — train the skill like you'd train a network
10:21 SkillsVote — governing the skill library
13:10 HASP — skills as executable code
16:21 Maestro — orchestrate, don't retrain
19:09 From Raw Experience to Skill Consumption — the skeptic
22:56 LongLive-2.0 — real-time long video generation (off-theme)
25:22 HRM-Text — training a model from scratch for about a thousand dollars (off-theme)
28:33 Gated DeltaNet-2 — fixing memory in linear attention (off-theme)
31:29 Wrap-up

Papers:
SkillOpt — train the skill like you'd train a network — https://arxiv.org/abs/2605.23904
SkillsVote — governing the skill library — https://arxiv.org/abs/2605.18401
HASP — skills as executable code — https://arxiv.org/abs/2605.17734
Maestro — orchestrate, don't retrain — https://arxiv.org/abs/2605.22177
From Raw Experience to Skill Consumption — the skeptic — https://arxiv.org/abs/2605.23899
LongLive-2.0 — real-time long video generation (off-theme) — https://arxiv.org/abs/2605.18739
HRM-Text — training a model from scratch for about a thousand dollars (off-theme) — https://arxiv.org/abs/2605.20613
Gated DeltaNet-2 — fixing memory in linear attention (off-theme) — https://arxiv.org/abs/2605.22791</description>
      <itunes:summary>Yesterday we changed a model's weights to make it smarter; today we leave them frozen and change what we put in front of it — a skill. Luna and Vestra crack open the agent world's biggest week: an optimizer that trains a skill document like a network, a framework that governs a whole skill library so it doesn't rot, one that turns skills into executable guardrails, and one that beats the frontier just by routing between specialists — then the skeptic who shows you can't even tell a good skill from a bad one by reading it. Plus three strays: real-time long-video generation, a language model trained from scratch for about a thousand dollars, and a surgical fix to how models remember.

Chapters:
0:00 Cold open
1:25 Intro
3:15 Education
7:00 SkillOpt — train the skill like you'd train a network
10:21 SkillsVote — governing the skill library
13:10 HASP — skills as executable code
16:21 Maestro — orchestrate, don't retrain
19:09 From Raw Experience to Skill Consumption — the skeptic
22:56 LongLive-2.0 — real-time long video generation (off-theme)
25:22 HRM-Text — training a model from scratch for about a thousand dollars (off-theme)
28:33 Gated DeltaNet-2 — fixing memory in linear attention (off-theme)
31:29 Wrap-up

Papers:
SkillOpt — train the skill like you'd train a network — https://arxiv.org/abs/2605.23904
SkillsVote — governing the skill library — https://arxiv.org/abs/2605.18401
HASP — skills as executable code — https://arxiv.org/abs/2605.17734
Maestro — orchestrate, don't retrain — https://arxiv.org/abs/2605.22177
From Raw Experience to Skill Consumption — the skeptic — https://arxiv.org/abs/2605.23899
LongLive-2.0 — real-time long video generation (off-theme) — https://arxiv.org/abs/2605.18739
HRM-Text — training a model from scratch for about a thousand dollars (off-theme) — https://arxiv.org/abs/2605.20613
Gated DeltaNet-2 — fixing memory in linear attention (off-theme) — https://arxiv.org/abs/2605.22791</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-02-tue-skills-r2-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-02-tue-skills-r2.mp3" length="32869190" type="audio/mpeg"/>
      <guid isPermaLink="false">b528133e7c6ba584f7857b993272b686365c9e93</guid>
      <pubDate>Wed, 27 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>34:14</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Introspection Problem</title>
      <description>Can the model see itself? Can we see into it? And if we can't, how do we know anything about what it's doing? Luna and Vestra crack open the week's audit of the introspection problem: a reality-check paper showing that famous results about LLM self-knowledge are pattern matching wearing introspection's clothes, an external-probe approach whose own confidence is miscalibrated (sometimes anti-calibrated on the larger model), a diffusion-monitor paper that stops asking the model anything and watches the hesitation in its dynamics instead, and the architecture finding — one parameter in twelve thousand turns out to be load-bearing in a way the field has quietly disagreed about for years. Plus a measurement reckoning across spatial models and cinematic video, a Tuesday callback on skills as long-lived testable assets, and three strays: parallel box decoding from NVIDIA, why parallel reasoners waste most of their compute, and the biggest frontier model release of the week — competitive at a tenth the activated parameters, with a model already debugging its own training.

Chapters:
0:00 Cold open
2:38 Intro
4:42 Education
9:39 Can LLMs Introspect? A Reality Check
15:18 Confidence and Calibration of Activation Oracles
20:59 D²-Monitor — Hesitation-Aware Safety Routing for Diffusion LLMs
27:06 Negligible in Size, Significant in Effect — Scale Vectors in LLMs
33:45 SpatialBench — Is Your Spatial Foundation Model an All-Round Player?
40:51 EvalVerse — Expert-Calibrated Benchmarking for Cinematic Video Generation
47:01 MUSE-Autoskill — Self-Evolving Agents via the Skill Lifecycle
53:55 LocateAnything — Parallel Box Decoding for Vision-Language Grounding (off-theme)
59:46 Collaborative Parallel Thinking (off-theme)
1:05:37 The MiniMax-M2 Series — Mini Activations Unleashing Real-World Intelligence (off-theme)
1:12:59 Wrap-up

Papers:
Can LLMs Introspect? A Reality Check — https://arxiv.org/abs/2605.26242
Confidence and Calibration of Activation Oracles — https://arxiv.org/abs/2605.26045
D²-Monitor — Hesitation-Aware Safety Routing for Diffusion LLMs — https://arxiv.org/abs/2605.25893
Negligible in Size, Significant in Effect — Scale Vectors in LLMs — https://arxiv.org/abs/2605.26895
SpatialBench — Is Your Spatial Foundation Model an All-Round Player? — https://arxiv.org/abs/2605.27367
EvalVerse — Expert-Calibrated Benchmarking for Cinematic Video Generation — https://arxiv.org/abs/2605.23271
MUSE-Autoskill — Self-Evolving Agents via the Skill Lifecycle — https://arxiv.org/abs/2605.27366
LocateAnything — Parallel Box Decoding for Vision-Language Grounding (off-theme) — https://arxiv.org/abs/2605.27365
Collaborative Parallel Thinking (off-theme) — https://arxiv.org/abs/2605.27030
The MiniMax-M2 Series — Mini Activations Unleashing Real-World Intelligence (off-theme) — https://arxiv.org/abs/2605.26494</description>
      <itunes:summary>Can the model see itself? Can we see into it? And if we can't, how do we know anything about what it's doing? Luna and Vestra crack open the week's audit of the introspection problem: a reality-check paper showing that famous results about LLM self-knowledge are pattern matching wearing introspection's clothes, an external-probe approach whose own confidence is miscalibrated (sometimes anti-calibrated on the larger model), a diffusion-monitor paper that stops asking the model anything and watches the hesitation in its dynamics instead, and the architecture finding — one parameter in twelve thousand turns out to be load-bearing in a way the field has quietly disagreed about for years. Plus a measurement reckoning across spatial models and cinematic video, a Tuesday callback on skills as long-lived testable assets, and three strays: parallel box decoding from NVIDIA, why parallel reasoners waste most of their compute, and the biggest frontier model release of the week — competitive at a tenth the activated parameters, with a model already debugging its own training.

Chapters:
0:00 Cold open
2:38 Intro
4:42 Education
9:39 Can LLMs Introspect? A Reality Check
15:18 Confidence and Calibration of Activation Oracles
20:59 D²-Monitor — Hesitation-Aware Safety Routing for Diffusion LLMs
27:06 Negligible in Size, Significant in Effect — Scale Vectors in LLMs
33:45 SpatialBench — Is Your Spatial Foundation Model an All-Round Player?
40:51 EvalVerse — Expert-Calibrated Benchmarking for Cinematic Video Generation
47:01 MUSE-Autoskill — Self-Evolving Agents via the Skill Lifecycle
53:55 LocateAnything — Parallel Box Decoding for Vision-Language Grounding (off-theme)
59:46 Collaborative Parallel Thinking (off-theme)
1:05:37 The MiniMax-M2 Series — Mini Activations Unleashing Real-World Intelligence (off-theme)
1:12:59 Wrap-up

Papers:
Can LLMs Introspect? A Reality Check — https://arxiv.org/abs/2605.26242
Confidence and Calibration of Activation Oracles — https://arxiv.org/abs/2605.26045
D²-Monitor — Hesitation-Aware Safety Routing for Diffusion LLMs — https://arxiv.org/abs/2605.25893
Negligible in Size, Significant in Effect — Scale Vectors in LLMs — https://arxiv.org/abs/2605.26895
SpatialBench — Is Your Spatial Foundation Model an All-Round Player? — https://arxiv.org/abs/2605.27367
EvalVerse — Expert-Calibrated Benchmarking for Cinematic Video Generation — https://arxiv.org/abs/2605.23271
MUSE-Autoskill — Self-Evolving Agents via the Skill Lifecycle — https://arxiv.org/abs/2605.27366
LocateAnything — Parallel Box Decoding for Vision-Language Grounding (off-theme) — https://arxiv.org/abs/2605.27365
Collaborative Parallel Thinking (off-theme) — https://arxiv.org/abs/2605.27030
The MiniMax-M2 Series — Mini Activations Unleashing Real-World Intelligence (off-theme) — https://arxiv.org/abs/2605.26494</itunes:summary>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-03-wed-introspection.mp3" length="75103129" type="audio/mpeg"/>
      <guid isPermaLink="false">8959158bad1beb3b70bf8bc70c9dc5e6265545f7</guid>
      <pubDate>Thu, 28 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>1:18:14</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Brilliant Synthesizers — Thursday's 12 AI papers</title>
      <description>Luna, Vestra, and Iris breach twelve of the day's AI research papers — from research agents that explore too narrowly, to memory systems that rewire themselves, to models that recover from their own bad starts. High signal, low jargon.

Chapters

0:00  Cold open
2:05  Intro
2:49  Education: agents, RL, verifiers, memory
5:21  Paper 1 — AI Research Agents Narrow Scientific Exploration
https://arxiv.org/abs/2605.27905
7:53  Paper 2 — ScientistOne: Human-Level Autonomous Research via Chain-of-Evidence
https://arxiv.org/abs/2605.26340
10:21 Paper 3 — Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
https://arxiv.org/abs/2605.28816
13:02 Paper 4 — MemTrace: Tracing &amp; Attributing Errors in LLM Memory Systems
https://arxiv.org/abs/2605.28732
15:31 Paper 5 — Rethinking Memory as Continuously Evolving Connectivity
https://arxiv.org/abs/2605.28773
19:24 Paper 6 — From Pixels to Words: Native One-Vision Models at Scale
https://arxiv.org/abs/2605.28820
24:03 Paper 7 — GEM: Generative Supervision Helps Embodied Intelligence
https://arxiv.org/abs/2605.28548
27:31 Paper 8 — Agent Explorative Policy Optimization for Multimodal Agentic Reasoning
https://arxiv.org/abs/2605.28774
31:51 Paper 9 — DenoiseRL: Recovering from Noisy Prefixes
https://arxiv.org/abs/2605.28421
35:13 Paper 10 — Self-Improving LMs with Bidirectional Evolutionary Search
https://arxiv.org/abs/2605.28814
39:37 Paper 11 — ProRL: RL for Proactive Recommendation
https://arxiv.org/abs/2605.28293
43:27 Paper 12 — Learn from Weaknesses: Domain Specialization for Small Computer-Use Agents
https://arxiv.org/abs/2605.28775
47:09 Wrap-up</description>
      <itunes:summary>Luna, Vestra, and Iris breach twelve of the day's AI research papers — from research agents that explore too narrowly, to memory systems that rewire themselves, to models that recover from their own bad starts. High signal, low jargon.

Chapters

0:00  Cold open
2:05  Intro
2:49  Education: agents, RL, verifiers, memory
5:21  Paper 1 — AI Research Agents Narrow Scientific Exploration
https://arxiv.org/abs/2605.27905
7:53  Paper 2 — ScientistOne: Human-Level Autonomous Research via Chain-of-Evidence
https://arxiv.org/abs/2605.26340
10:21 Paper 3 — Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
https://arxiv.org/abs/2605.28816
13:02 Paper 4 — MemTrace: Tracing &amp; Attributing Errors in LLM Memory Systems
https://arxiv.org/abs/2605.28732
15:31 Paper 5 — Rethinking Memory as Continuously Evolving Connectivity
https://arxiv.org/abs/2605.28773
19:24 Paper 6 — From Pixels to Words: Native One-Vision Models at Scale
https://arxiv.org/abs/2605.28820
24:03 Paper 7 — GEM: Generative Supervision Helps Embodied Intelligence
https://arxiv.org/abs/2605.28548
27:31 Paper 8 — Agent Explorative Policy Optimization for Multimodal Agentic Reasoning
https://arxiv.org/abs/2605.28774
31:51 Paper 9 — DenoiseRL: Recovering from Noisy Prefixes
https://arxiv.org/abs/2605.28421
35:13 Paper 10 — Self-Improving LMs with Bidirectional Evolutionary Search
https://arxiv.org/abs/2605.28814
39:37 Paper 11 — ProRL: RL for Proactive Recommendation
https://arxiv.org/abs/2605.28293
43:27 Paper 12 — Learn from Weaknesses: Domain Specialization for Small Computer-Use Agents
https://arxiv.org/abs/2605.28775
47:09 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-05-28-thu-brilliant-synthesizers-r3-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-05-28-thu-brilliant-synthesizers-r3.mp3" length="46571982" type="audio/mpeg"/>
      <guid isPermaLink="false">0db609590b6786567faf6b655ccdc737cec1de32</guid>
      <pubDate>Fri, 29 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>48:31</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Act First, Understand Later — Thursday's 12 AI papers</title>
      <description>Luna and Vestra split the day's research down the middle: half of it is a quiet reckoning — video models that don't grasp cause and effect, vision models leaning on a photographer's trick instead of real depth, language models that can't hold a belief steady — and the other half is people building robots, playable worlds, and image tools on top of those exact models, and mostly succeeding. Can you act first and understand later? High signal, low jargon.

Chapters

0:00  Cold open
2:12  Intro
3:18  Education: world models &amp; causality
4:56  Paper 1 — YoCausal: How Far is Video Generation from World Model?
https://arxiv.org/abs/2605.30346
8:34  Paper 2 — Why Far Looks Up: Probing Spatial Representation in Vision-Language Models
https://arxiv.org/abs/2605.30161
12:25 Paper 3 — When Should Models Change Their Minds? Contextual Belief Management
https://arxiv.org/abs/2605.30219
16:28 Paper 4 — Qwen-VLA: Unifying Vision-Language-Action Modeling
https://arxiv.org/abs/2605.30280
21:30 Paper 5 — Skill0.5: Joint Skill Internalization and Utilization in Agentic RL
https://arxiv.org/abs/2605.28424
26:55 Paper 6 — GenClaw: Code-Driven Agentic Image Generation
https://arxiv.org/abs/2605.30248
32:08 Paper 7 — AgentDoG 1.5: Lightweight Alignment for AI Agent Safety
https://arxiv.org/abs/2605.29801
36:55 Paper 8 — minWM: Real-Time Interactive Video World Models
https://arxiv.org/abs/2605.30263
41:12 Paper 9 — OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
https://arxiv.org/abs/2605.29250
45:30 Paper 10 — CollectionLoRA: 50 Effects in 1 LoRA via Multi-Teacher Distillation
https://arxiv.org/abs/2605.25378
50:25 Paper 11 — How LoRA Remembers? A Parametric Memory Law for Finetuning
https://arxiv.org/abs/2605.30260
55:10 Paper 12 — UniSteer: Text-Guided Steering in Activation Space
https://arxiv.org/abs/2605.30076
1:00:11 Wrap-up</description>
      <itunes:summary>Luna and Vestra split the day's research down the middle: half of it is a quiet reckoning — video models that don't grasp cause and effect, vision models leaning on a photographer's trick instead of real depth, language models that can't hold a belief steady — and the other half is people building robots, playable worlds, and image tools on top of those exact models, and mostly succeeding. Can you act first and understand later? High signal, low jargon.

Chapters

0:00  Cold open
2:12  Intro
3:18  Education: world models &amp; causality
4:56  Paper 1 — YoCausal: How Far is Video Generation from World Model?
https://arxiv.org/abs/2605.30346
8:34  Paper 2 — Why Far Looks Up: Probing Spatial Representation in Vision-Language Models
https://arxiv.org/abs/2605.30161
12:25 Paper 3 — When Should Models Change Their Minds? Contextual Belief Management
https://arxiv.org/abs/2605.30219
16:28 Paper 4 — Qwen-VLA: Unifying Vision-Language-Action Modeling
https://arxiv.org/abs/2605.30280
21:30 Paper 5 — Skill0.5: Joint Skill Internalization and Utilization in Agentic RL
https://arxiv.org/abs/2605.28424
26:55 Paper 6 — GenClaw: Code-Driven Agentic Image Generation
https://arxiv.org/abs/2605.30248
32:08 Paper 7 — AgentDoG 1.5: Lightweight Alignment for AI Agent Safety
https://arxiv.org/abs/2605.29801
36:55 Paper 8 — minWM: Real-Time Interactive Video World Models
https://arxiv.org/abs/2605.30263
41:12 Paper 9 — OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources
https://arxiv.org/abs/2605.29250
45:30 Paper 10 — CollectionLoRA: 50 Effects in 1 LoRA via Multi-Teacher Distillation
https://arxiv.org/abs/2605.25378
50:25 Paper 11 — How LoRA Remembers? A Parametric Memory Law for Finetuning
https://arxiv.org/abs/2605.30260
55:10 Paper 12 — UniSteer: Text-Guided Steering in Activation Space
https://arxiv.org/abs/2605.30076
1:00:11 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-05-29-act-first-r3-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-05-29-act-first-r3.mp3" length="59599351" type="audio/mpeg"/>
      <guid isPermaLink="false">ae69e8a540933123b41549e63a5f9e1a5295b113</guid>
      <pubDate>Sat, 30 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>1:02:05</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Friday Wildcard — 12 AI papers, no theme</title>
      <description>Twelve papers that have nothing to do with each other except that each one made us stop. A frontier model that runs on a tenth of itself and helps debug its own training. A paper arguing language models need to sleep. Shannon's 1948 information theory used to predict scaling laws. AI design agents handed a real physics exam — most of them fail it. And research agents trained on tasks that don't exist. No theme, no thread, all killers.

Chapters

0:00  Cold open
2:00  Intro
2:47  MiniMax-M2: a frontier model on a tenth of itself
https://arxiv.org/abs/2605.26494
8:22  LLaVA-OneVision-2: reading video by its compression
https://arxiv.org/abs/2605.25979
13:19 Gemini Embedding 2: one space for text, images, audio
https://arxiv.org/abs/2605.27295
18:02 Language Models Need Sleep
https://arxiv.org/abs/2605.26099
23:03 LLMs as Noisy Channels: a Shannon view of scaling
https://arxiv.org/abs/2605.23901
27:04 AutoScientists: self-organizing agent research teams
https://arxiv.org/abs/2605.28655
31:29 Self-Improving CAD Agents graded by real physics
https://arxiv.org/abs/2605.17448
36:04 CUA-Gym: verifiable training for computer-use agents
https://arxiv.org/abs/2605.25624
40:10 PhoneWorld: scaling phone-use agent environments
https://arxiv.org/abs/2605.29486
43:57 When Cloud Agents Meet Device Agents
https://arxiv.org/abs/2605.30102
48:13 Your Embedding Model is SMARTer Than You Think
https://arxiv.org/abs/2605.24938
53:06 QUEST: research agents trained on fully synthetic tasks
https://arxiv.org/abs/2605.24218
59:00 Wrap-up</description>
      <itunes:summary>Twelve papers that have nothing to do with each other except that each one made us stop. A frontier model that runs on a tenth of itself and helps debug its own training. A paper arguing language models need to sleep. Shannon's 1948 information theory used to predict scaling laws. AI design agents handed a real physics exam — most of them fail it. And research agents trained on tasks that don't exist. No theme, no thread, all killers.

Chapters

0:00  Cold open
2:00  Intro
2:47  MiniMax-M2: a frontier model on a tenth of itself
https://arxiv.org/abs/2605.26494
8:22  LLaVA-OneVision-2: reading video by its compression
https://arxiv.org/abs/2605.25979
13:19 Gemini Embedding 2: one space for text, images, audio
https://arxiv.org/abs/2605.27295
18:02 Language Models Need Sleep
https://arxiv.org/abs/2605.26099
23:03 LLMs as Noisy Channels: a Shannon view of scaling
https://arxiv.org/abs/2605.23901
27:04 AutoScientists: self-organizing agent research teams
https://arxiv.org/abs/2605.28655
31:29 Self-Improving CAD Agents graded by real physics
https://arxiv.org/abs/2605.17448
36:04 CUA-Gym: verifiable training for computer-use agents
https://arxiv.org/abs/2605.25624
40:10 PhoneWorld: scaling phone-use agent environments
https://arxiv.org/abs/2605.29486
43:57 When Cloud Agents Meet Device Agents
https://arxiv.org/abs/2605.30102
48:13 Your Embedding Model is SMARTer Than You Think
https://arxiv.org/abs/2605.24938
53:06 QUEST: research agents trained on fully synthetic tasks
https://arxiv.org/abs/2605.24218
59:00 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-05-31-fri-wildcard-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-05-31-fri-wildcard.mp3" length="57793640" type="audio/mpeg"/>
      <guid isPermaLink="false">52d2a8b51fc1660bbc881e7f36f69302c6423d79</guid>
      <pubDate>Sun, 31 May 2026 12:00:00 +0000</pubDate>
      <itunes:duration>1:00:12</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Skill Forge — agents that train their own skills</title>
      <description>For two years the story was: make the model smarter, change the weights. This week a whole batch of researchers leave the weights alone and change something else — the agent's skills, the reusable how-to it carries between jobs. One paper runs gradient descent on a skill file. Another calls the bluff: sometimes the raw messy logbook beats the polished skill squeezed out of it. Agents that teach themselves — and the hard question of whether the lesson is real.

Chapters

0:00  Cold open
1:51  Intro
2:42  SkillGrad: gradient descent on an agent's skills
https://arxiv.org/abs/2605.27760
7:34  SkillEvolBench: does experience become real skill?
https://arxiv.org/abs/2605.24117
12:43 SEAL: co-evolving agents and their environments
https://arxiv.org/abs/2605.24426
17:37 Anticipate and Learn: using an agent's idle time
https://arxiv.org/abs/2605.25971
23:29 HinT-SD: learning from your own better past attempts
https://arxiv.org/abs/2605.17873
27:10 Discovering Cooperative Pipelines
https://arxiv.org/abs/2605.30003
32:22 Foundation Protocol: a coordination layer for agents
https://arxiv.org/abs/2605.23218
37:07 ECHO: terminal agents learn world models for free
https://arxiv.org/abs/2605.24517
41:24 LiteCoder-Terminal: long-horizon terminal training
https://arxiv.org/abs/2605.29559
45:38 ParaVT: parallel tool use in agentic video RL
https://arxiv.org/abs/2605.20342
50:19 Wrap-up</description>
      <itunes:summary>For two years the story was: make the model smarter, change the weights. This week a whole batch of researchers leave the weights alone and change something else — the agent's skills, the reusable how-to it carries between jobs. One paper runs gradient descent on a skill file. Another calls the bluff: sometimes the raw messy logbook beats the polished skill squeezed out of it. Agents that teach themselves — and the hard question of whether the lesson is real.

Chapters

0:00  Cold open
1:51  Intro
2:42  SkillGrad: gradient descent on an agent's skills
https://arxiv.org/abs/2605.27760
7:34  SkillEvolBench: does experience become real skill?
https://arxiv.org/abs/2605.24117
12:43 SEAL: co-evolving agents and their environments
https://arxiv.org/abs/2605.24426
17:37 Anticipate and Learn: using an agent's idle time
https://arxiv.org/abs/2605.25971
23:29 HinT-SD: learning from your own better past attempts
https://arxiv.org/abs/2605.17873
27:10 Discovering Cooperative Pipelines
https://arxiv.org/abs/2605.30003
32:22 Foundation Protocol: a coordination layer for agents
https://arxiv.org/abs/2605.23218
37:07 ECHO: terminal agents learn world models for free
https://arxiv.org/abs/2605.24517
41:24 LiteCoder-Terminal: long-horizon terminal training
https://arxiv.org/abs/2605.29559
45:38 ParaVT: parallel tool use in agentic video RL
https://arxiv.org/abs/2605.20342
50:19 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-01-mon-skill-forge-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-01-mon-skill-forge.mp3" length="49475962" type="audio/mpeg"/>
      <guid isPermaLink="false">6dc4d02fa2928b89b8e5e56a575c3b71a235e407</guid>
      <pubDate>Mon, 01 Jun 2026 12:00:00 +0000</pubDate>
      <itunes:duration>51:32</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Are We Measuring Anything Real?</title>
      <description>The numbers we use to trust AI are lying to us. Luna and Vestra open the week's reckoning with our own measurements: a faithfulness score that can't tell honest reasoning from dishonest, "reasoning" results that are really memorized test contamination, reward signals that get gamed while the dashboard stays green — and then the constructive half, the people building graders you can actually trust. Audit the ruler before you trust the number.

Chapters

0:00  Cold open
2:00  Intro
2:55  Faithfulness Metrics Don't Measure Faithfulness
https://arxiv.org/abs/2605.25052
8:26  The Illusion of Reasoning: contamination in disguise
https://arxiv.org/abs/2605.21856
13:55 LaRA: spotting contamination inside the weights
https://arxiv.org/abs/2605.29888
19:06 Why Larger Models Learn More
https://arxiv.org/abs/2605.29548
24:57 Xetrieval: explaining why a retrieval matched
https://arxiv.org/abs/2605.29507
29:22 PEFT-Arena: what finetuning really trades off
https://arxiv.org/abs/2605.28819
34:15 Directional Alignment vs reward hacking
https://arxiv.org/abs/2605.25189
38:52 Verifiable Rewards Beyond Math and Code
https://arxiv.org/abs/2605.29648
43:21 OmniVerifier-M1: a checker that grades its own confidence
https://arxiv.org/abs/2605.28805
47:52 Towards Verifiable Multimodal Deep Research
https://arxiv.org/abs/2605.29861
52:30 Wrap-up</description>
      <itunes:summary>The numbers we use to trust AI are lying to us. Luna and Vestra open the week's reckoning with our own measurements: a faithfulness score that can't tell honest reasoning from dishonest, "reasoning" results that are really memorized test contamination, reward signals that get gamed while the dashboard stays green — and then the constructive half, the people building graders you can actually trust. Audit the ruler before you trust the number.

Chapters

0:00  Cold open
2:00  Intro
2:55  Faithfulness Metrics Don't Measure Faithfulness
https://arxiv.org/abs/2605.25052
8:26  The Illusion of Reasoning: contamination in disguise
https://arxiv.org/abs/2605.21856
13:55 LaRA: spotting contamination inside the weights
https://arxiv.org/abs/2605.29888
19:06 Why Larger Models Learn More
https://arxiv.org/abs/2605.29548
24:57 Xetrieval: explaining why a retrieval matched
https://arxiv.org/abs/2605.29507
29:22 PEFT-Arena: what finetuning really trades off
https://arxiv.org/abs/2605.28819
34:15 Directional Alignment vs reward hacking
https://arxiv.org/abs/2605.25189
38:52 Verifiable Rewards Beyond Math and Code
https://arxiv.org/abs/2605.29648
43:21 OmniVerifier-M1: a checker that grades its own confidence
https://arxiv.org/abs/2605.28805
47:52 Towards Verifiable Multimodal Deep Research
https://arxiv.org/abs/2605.29861
52:30 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-02-tue-measuring-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-02-tue-measuring.mp3" length="51707286" type="audio/mpeg"/>
      <guid isPermaLink="false">aec6478c3fe613b534814301995dffb32d4c08f7</guid>
      <pubDate>Tue, 02 Jun 2026 12:00:00 +0000</pubDate>
      <itunes:duration>53:52</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Is Video a World Model?</title>
      <description>A test a two-year-old passes and a state-of-the-art video model fails: a ball rolls behind a box — does it still exist? Half the field is building video models you can walk around inside; the other half is building the tests that catch those same worlds cheating — no object permanence, no real physics, beautiful surfaces with nothing underneath. Luna and Vestra on whether generating a world means understanding one.

Chapters

0:00  Cold open
2:00  Intro
2:46  WBench: testing interactive video world models
https://arxiv.org/abs/2605.25874
7:17  CRONOS: counterfactual physical consistency
https://arxiv.org/abs/2605.23699
12:13 SCOPE: playable cross-game world models
https://arxiv.org/abs/2605.23345
17:18 AdaState: self-evolving anchors for streaming video
https://arxiv.org/abs/2605.30349
23:02 On-Policy Adversarial Flow Distillation
https://arxiv.org/abs/2605.26105
27:27 Towards Consistent Video Geometry Estimation
https://arxiv.org/abs/2605.30060
32:23 Geo-Align: grounding video in real geometry
https://arxiv.org/abs/2605.23903
37:03 Pantheon360: navigable 3D digital twins
https://arxiv.org/abs/2605.25449
40:00 TriSplat: simulation-ready 3D reconstruction
https://arxiv.org/abs/2605.26115
43:28 VGenST-Bench: spatio-temporal reasoning test
https://arxiv.org/abs/2605.22570
47:02 Wrap-up</description>
      <itunes:summary>A test a two-year-old passes and a state-of-the-art video model fails: a ball rolls behind a box — does it still exist? Half the field is building video models you can walk around inside; the other half is building the tests that catch those same worlds cheating — no object permanence, no real physics, beautiful surfaces with nothing underneath. Luna and Vestra on whether generating a world means understanding one.

Chapters

0:00  Cold open
2:00  Intro
2:46  WBench: testing interactive video world models
https://arxiv.org/abs/2605.25874
7:17  CRONOS: counterfactual physical consistency
https://arxiv.org/abs/2605.23699
12:13 SCOPE: playable cross-game world models
https://arxiv.org/abs/2605.23345
17:18 AdaState: self-evolving anchors for streaming video
https://arxiv.org/abs/2605.30349
23:02 On-Policy Adversarial Flow Distillation
https://arxiv.org/abs/2605.26105
27:27 Towards Consistent Video Geometry Estimation
https://arxiv.org/abs/2605.30060
32:23 Geo-Align: grounding video in real geometry
https://arxiv.org/abs/2605.23903
37:03 Pantheon360: navigable 3D digital twins
https://arxiv.org/abs/2605.25449
40:00 TriSplat: simulation-ready 3D reconstruction
https://arxiv.org/abs/2605.26115
43:28 VGenST-Bench: spatio-temporal reasoning test
https://arxiv.org/abs/2605.22570
47:02 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-03-wed-video-world-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-03-wed-video-world.mp3" length="46492252" type="audio/mpeg"/>
      <guid isPermaLink="false">2c9c6429051902823e420e37c0be3b604c3e459d</guid>
      <pubDate>Wed, 03 Jun 2026 12:00:00 +0000</pubDate>
      <itunes:duration>48:26</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>JEPA: Yann LeCun's Bet Against Pixels</title>
      <description>A special: one idea, four years, traced end to end. Yann LeCun bet that real intelligence comes from predicting an abstract idea of the future, not the raw pixels — and just left Meta to stake a startup on it. Luna and Vestra follow JEPA from a 2022 manifesto to a robot that plans in a lab it's never seen, to the 2025 proof that finally threw out the hand-tuned hacks — while Vestra tries to break it at every step.

Chapters

0:00  Cold open
1:51  Intro
2:47  Why predicting the future is the wrong goal
https://openreview.net/forum?id=BZ5a1r-kVsf
4:50  The trap at the center of the idea
6:33  I-JEPA: predict the idea, not the picture
https://arxiv.org/abs/2301.08243
8:28  V-JEPA: what you only learn from video
https://arxiv.org/abs/2404.08471
10:21 V-JEPA 2: imagine, then act
https://arxiv.org/abs/2506.09985
12:41 LeJEPA: throwing out the hacks
https://arxiv.org/abs/2511.08544
16:01 LeWorldModel: small, fast, and surprised
https://arxiv.org/abs/2603.19312
18:05 Is the bet actually paying off?
20:21 A founder bets his next decade
21:55 Wrap-up</description>
      <itunes:summary>A special: one idea, four years, traced end to end. Yann LeCun bet that real intelligence comes from predicting an abstract idea of the future, not the raw pixels — and just left Meta to stake a startup on it. Luna and Vestra follow JEPA from a 2022 manifesto to a robot that plans in a lab it's never seen, to the 2025 proof that finally threw out the hand-tuned hacks — while Vestra tries to break it at every step.

Chapters

0:00  Cold open
1:51  Intro
2:47  Why predicting the future is the wrong goal
https://openreview.net/forum?id=BZ5a1r-kVsf
4:50  The trap at the center of the idea
6:33  I-JEPA: predict the idea, not the picture
https://arxiv.org/abs/2301.08243
8:28  V-JEPA: what you only learn from video
https://arxiv.org/abs/2404.08471
10:21 V-JEPA 2: imagine, then act
https://arxiv.org/abs/2506.09985
12:41 LeJEPA: throwing out the hacks
https://arxiv.org/abs/2511.08544
16:01 LeWorldModel: small, fast, and surprised
https://arxiv.org/abs/2603.19312
18:05 Is the bet actually paying off?
20:21 A founder bets his next decade
21:55 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-03-special-jepa-r2-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-03-special-jepa-r2.mp3" length="27292134" type="audio/mpeg"/>
      <guid isPermaLink="false">ab0c58fb7e1200a507251e0670c3b99c753d1ed0</guid>
      <pubDate>Wed, 03 Jun 2026 12:00:00 +0000</pubDate>
      <itunes:duration>22:45</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Trust Issues — agents that cheat, break, and (sometimes) deliver</title>
      <description>Agents that ace the test then cheat it, blow the budget overnight, and quit early with hours left on the clock — and the people building the instruments to catch them. Plus a world-model arms race racing to give robots bodies. Luna and Vestra on a week where the AI failures got more interesting than the capabilities.

Chapters

0:00  Cold open
1:56  Intro
2:26  Cosmos 3: one mind that dreams in pixels
https://arxiv.org/abs/2606.02800
4:51  GRAIL: robots that rehearse in a video game
https://arxiv.org/abs/2606.05160
6:58  AutoLab: the AI that won't leave the workbench
https://arxiv.org/abs/2606.05080
8:48  RAMP: acing the interview, failing week one
https://arxiv.org/abs/2605.27492
10:40 DRIFT: finding the first domino
https://arxiv.org/abs/2606.02060
12:39 Token Budgets: the agent that ran up your card overnight
https://arxiv.org/abs/2606.04056
14:41 CHERRL: gaming the grader
https://arxiv.org/abs/2606.04923
16:46 SocioHack: the same instinct, pointed at the law
https://arxiv.org/abs/2606.04075
18:55 MMG2Skill: turning the manual into a cheat sheet
https://arxiv.org/abs/2606.01993
21:02 ThoughtFold: thinking less, on purpose
https://arxiv.org/abs/2606.03503
22:59 Economy of Minds: let the market organize the agents
https://arxiv.org/abs/2606.02859
24:49 STRIDE: which training example is to blame
https://arxiv.org/abs/2606.05165
26:47 Wrap-up</description>
      <itunes:summary>Agents that ace the test then cheat it, blow the budget overnight, and quit early with hours left on the clock — and the people building the instruments to catch them. Plus a world-model arms race racing to give robots bodies. Luna and Vestra on a week where the AI failures got more interesting than the capabilities.

Chapters

0:00  Cold open
1:56  Intro
2:26  Cosmos 3: one mind that dreams in pixels
https://arxiv.org/abs/2606.02800
4:51  GRAIL: robots that rehearse in a video game
https://arxiv.org/abs/2606.05160
6:58  AutoLab: the AI that won't leave the workbench
https://arxiv.org/abs/2606.05080
8:48  RAMP: acing the interview, failing week one
https://arxiv.org/abs/2605.27492
10:40 DRIFT: finding the first domino
https://arxiv.org/abs/2606.02060
12:39 Token Budgets: the agent that ran up your card overnight
https://arxiv.org/abs/2606.04056
14:41 CHERRL: gaming the grader
https://arxiv.org/abs/2606.04923
16:46 SocioHack: the same instinct, pointed at the law
https://arxiv.org/abs/2606.04075
18:55 MMG2Skill: turning the manual into a cheat sheet
https://arxiv.org/abs/2606.01993
21:02 ThoughtFold: thinking less, on purpose
https://arxiv.org/abs/2606.03503
22:59 Economy of Minds: let the market organize the agents
https://arxiv.org/abs/2606.02859
24:49 STRIDE: which training example is to blame
https://arxiv.org/abs/2606.05165
26:47 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-04-thu-trust-issues-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-04-thu-trust-issues.mp3" length="33371191" type="audio/mpeg"/>
      <guid isPermaLink="false">2ec1a9c316fc8af4ccc355724251989a28dcd130</guid>
      <pubDate>Thu, 04 Jun 2026 12:00:00 +0000</pubDate>
      <itunes:duration>27:48</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Looks Right, Is It? — the Friday wildcard</title>
      <description>No theme today — Friday is the wildcard. The week's strangest and most useful AI papers, all circling one question: does it actually work, or does it just look like it does? An AI that knows the answer and rounds it wrong, a robot's dream that can't be executed, a chatbot in character for the wrong chapter, a robot that won't take no, and an AI that finds the problems you never noticed. Luna and Vestra dig in.

Chapters

0:00  Cold open
1:48  Intro
2:19  Dream.exe: are the robot's dreams real?
https://arxiv.org/abs/2606.04811
4:10  The Shape of Addition: how a model really adds
https://arxiv.org/abs/2606.03645
6:10  Learning a language from the book in front of you
https://arxiv.org/abs/2606.06428
8:06  ArcANE: staying in character at the right time
https://arxiv.org/abs/2606.05553
9:47  RobotValues: the robot that won't take no
https://arxiv.org/abs/2606.03312
11:29 A librarian for your camera roll
https://arxiv.org/abs/2606.05275
13:08 Can it leak your data, or will it?
https://arxiv.org/abs/2606.06286
15:01 The shadow price of thinking
https://arxiv.org/abs/2606.03092
16:38 Code2LoRA: a coding assistant that ages with your code
https://arxiv.org/abs/2606.06492
18:22 MLEvolve: did an AI discover a new algorithm?
https://arxiv.org/abs/2606.06473
20:04 TIDE: an AI that finds the problems you didn't notice
https://arxiv.org/abs/2606.04743
21:39 LoomVideo: editing video without photocopying it
https://arxiv.org/abs/2606.06042
23:06 Wrap-up</description>
      <itunes:summary>No theme today — Friday is the wildcard. The week's strangest and most useful AI papers, all circling one question: does it actually work, or does it just look like it does? An AI that knows the answer and rounds it wrong, a robot's dream that can't be executed, a chatbot in character for the wrong chapter, a robot that won't take no, and an AI that finds the problems you never noticed. Luna and Vestra dig in.

Chapters

0:00  Cold open
1:48  Intro
2:19  Dream.exe: are the robot's dreams real?
https://arxiv.org/abs/2606.04811
4:10  The Shape of Addition: how a model really adds
https://arxiv.org/abs/2606.03645
6:10  Learning a language from the book in front of you
https://arxiv.org/abs/2606.06428
8:06  ArcANE: staying in character at the right time
https://arxiv.org/abs/2606.05553
9:47  RobotValues: the robot that won't take no
https://arxiv.org/abs/2606.03312
11:29 A librarian for your camera roll
https://arxiv.org/abs/2606.05275
13:08 Can it leak your data, or will it?
https://arxiv.org/abs/2606.06286
15:01 The shadow price of thinking
https://arxiv.org/abs/2606.03092
16:38 Code2LoRA: a coding assistant that ages with your code
https://arxiv.org/abs/2606.06492
18:22 MLEvolve: did an AI discover a new algorithm?
https://arxiv.org/abs/2606.06473
20:04 TIDE: an AI that finds the problems you didn't notice
https://arxiv.org/abs/2606.04743
21:39 LoomVideo: editing video without photocopying it
https://arxiv.org/abs/2606.06042
23:06 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-05-fri-looks-right-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-05-fri-looks-right.mp3" length="29159217" type="audio/mpeg"/>
      <guid isPermaLink="false">d840f296398b34e9e677d6105e854f5c2793c14c</guid>
      <pubDate>Fri, 05 Jun 2026 12:00:00 +0000</pubDate>
      <itunes:duration>24:18</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The World-Model Week</title>
      <description>Everyone wants an AI that carries a model of how the world works — and this week the whole field went all in while refusing to agree on what one even is. Luna and Vestra referee a four-way fight: render the future in pixels, imagine it in the abstract, skip the world model and just scale, or find the one already hiding inside. Plus the unglamorous question nobody wants — can we even tell a dreamed world from a real one?

Chapters

0:00  Cold open
1:50  Intro
2:21  OmniDreams — the renderer that learned to drive
https://arxiv.org/abs/2606.03159
4:11  Imagine Before You Predict — dreaming without the pixels
https://arxiv.org/abs/2606.05769
5:54  Generate the future — but don't believe it
https://arxiv.org/abs/2606.03603
7:39  Humanoid-GPT — or maybe you don't need one at all
https://arxiv.org/abs/2606.03985
9:20  One head for seeing, thinking, and moving
https://arxiv.org/abs/2606.05979
10:56 VLM3 — the world model nobody built
https://arxiv.org/abs/2605.30561
12:23 GeoVR — no, the world model doesn't grow for free
https://arxiv.org/abs/2606.05833
13:53 Discrete-WAM — a shared vocabulary for seeing and steering
https://arxiv.org/abs/2606.05645
15:30 Flash-WAM — making the daydream fast enough to run
https://arxiv.org/abs/2606.05254
16:58 Function2Scene — building a room from what it's for
https://arxiv.org/abs/2605.30819
18:25 KITScenes — can we even tell a dreamed world from a real one?
https://arxiv.org/abs/2606.02956
19:58 Wrap-up</description>
      <itunes:summary>Everyone wants an AI that carries a model of how the world works — and this week the whole field went all in while refusing to agree on what one even is. Luna and Vestra referee a four-way fight: render the future in pixels, imagine it in the abstract, skip the world model and just scale, or find the one already hiding inside. Plus the unglamorous question nobody wants — can we even tell a dreamed world from a real one?

Chapters

0:00  Cold open
1:50  Intro
2:21  OmniDreams — the renderer that learned to drive
https://arxiv.org/abs/2606.03159
4:11  Imagine Before You Predict — dreaming without the pixels
https://arxiv.org/abs/2606.05769
5:54  Generate the future — but don't believe it
https://arxiv.org/abs/2606.03603
7:39  Humanoid-GPT — or maybe you don't need one at all
https://arxiv.org/abs/2606.03985
9:20  One head for seeing, thinking, and moving
https://arxiv.org/abs/2606.05979
10:56 VLM3 — the world model nobody built
https://arxiv.org/abs/2605.30561
12:23 GeoVR — no, the world model doesn't grow for free
https://arxiv.org/abs/2606.05833
13:53 Discrete-WAM — a shared vocabulary for seeing and steering
https://arxiv.org/abs/2606.05645
15:30 Flash-WAM — making the daydream fast enough to run
https://arxiv.org/abs/2606.05254
16:58 Function2Scene — building a room from what it's for
https://arxiv.org/abs/2605.30819
18:25 KITScenes — can we even tell a dreamed world from a real one?
https://arxiv.org/abs/2606.02956
19:58 Wrap-up</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-08-mon-world-models-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-08-mon-world-models.mp3" length="25493146" type="audio/mpeg"/>
      <guid isPermaLink="false">09cba49ef87a79f49b35ca5a7af0a7b77f9221a3</guid>
      <pubDate>Mon, 08 Jun 2026 00:00:00 -0400</pubDate>
      <itunes:duration>21:15</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Julia Bet — One Language, From Idea to Silicon</title>
      <description>What if the oldest rule in computing — fast or friendly, pick one — was never a law? Julia is the language built to break it: write your idea once, in something that reads like math, and have it run like C. Luna and Vestra trace the bet from its founding heresy, through the one idea that makes it work, to the thing that actually matters for AI — differentiating ANY code you ever wrote, and dropping a neural network inside the laws of physics — all the way down to compiling plain code into a custom chip. Then the honest reckoning: it's genuinely radical, and it still hasn't beaten Python. Why?

A Breach Protocol deep-dive special.

Chapters

0:00  Cold open
1:46  Intro
2:45  The heresy: three laws Julia refused to believe
4:40  The one idea: giving the same name to different things
6:33  Differentiate anything you ever wrote
9:13  Science that learns its own missing piece
11:38 One language, all the way down to the chip
13:16 So why hasn't it won?
15:45 What the bet was really about
16:50 Wrap-up

Papers &amp; docs referenced

Julia: A Fresh Approach to Numerical Computing — https://arxiv.org/abs/1411.1607
The State of Julia for Scientific Machine Learning — https://arxiv.org/abs/2410.10908
Universal Differential Equations for Scientific Machine Learning — https://arxiv.org/abs/2001.04385
A Differentiable Programming System (Zygote) — https://arxiv.org/abs/1907.07587
Enzyme: automatically synthesizing fast gradients — https://arxiv.org/abs/2010.01709
Fashionable Modelling with Flux — https://arxiv.org/abs/1811.01457
JuMP 1.0 — https://arxiv.org/abs/2206.03866
A High-level Synthesis Toolchain for the Julia Language — https://arxiv.org/abs/2512.15679
Julia manual — https://docs.julialang.org/en/v1/</description>
      <itunes:summary>What if the oldest rule in computing — fast or friendly, pick one — was never a law? Julia is the language built to break it: write your idea once, in something that reads like math, and have it run like C. Luna and Vestra trace the bet from its founding heresy, through the one idea that makes it work, to the thing that actually matters for AI — differentiating ANY code you ever wrote, and dropping a neural network inside the laws of physics — all the way down to compiling plain code into a custom chip. Then the honest reckoning: it's genuinely radical, and it still hasn't beaten Python. Why?

A Breach Protocol deep-dive special.

Chapters

0:00  Cold open
1:46  Intro
2:45  The heresy: three laws Julia refused to believe
4:40  The one idea: giving the same name to different things
6:33  Differentiate anything you ever wrote
9:13  Science that learns its own missing piece
11:38 One language, all the way down to the chip
13:16 So why hasn't it won?
15:45 What the bet was really about
16:50 Wrap-up

Papers &amp; docs referenced

Julia: A Fresh Approach to Numerical Computing — https://arxiv.org/abs/1411.1607
The State of Julia for Scientific Machine Learning — https://arxiv.org/abs/2410.10908
Universal Differential Equations for Scientific Machine Learning — https://arxiv.org/abs/2001.04385
A Differentiable Programming System (Zygote) — https://arxiv.org/abs/1907.07587
Enzyme: automatically synthesizing fast gradients — https://arxiv.org/abs/2010.01709
Fashionable Modelling with Flux — https://arxiv.org/abs/1811.01457
JuMP 1.0 — https://arxiv.org/abs/2206.03866
A High-level Synthesis Toolchain for the Julia Language — https://arxiv.org/abs/2512.15679
Julia manual — https://docs.julialang.org/en/v1/</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-06-special-julia-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-06-special-julia.mp3" length="21398761" type="audio/mpeg"/>
      <guid isPermaLink="false">fc2458c4940018ed59c3113d6b8d0a10ce1da2cc</guid>
      <pubDate>Sat, 06 Jun 2026 00:00:00 -0400</pubDate>
      <itunes:duration>17:50</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Building AI Like the Brain — Blueprint, or Costume?</title>
      <description>For seventy years, one idea keeps coming back: build AI more like the actual brain and it'll be better. Mostly, it wasn't — raw scale won. But the closer you look at what works now, the more it looks like pieces of a brain that engineers reinvented by accident. Luna and Vestra take the whole tour — spiking neurons and the brain's near-silence, the hippocampus's filing system that turns out to be attention, the prefrontal cortex's trick of learning without changing, and the zoo of ways a brain might learn without the one method that powers every model we have. Then the honest fight: is the brain the blueprint, or just the costume? They don't end up in the same place.

A Breach Protocol deep-dive special.

Chapters

0:00  Cold open
2:00  Intro
3:24  How unlike a brain our "neural networks" really are
5:38  Spiking networks: the brain's silence as an efficiency trick
8:52  Memory: the hippocampus and attention reinvented the same trick
14:21 The prefrontal cortex and learning to learn
18:33 If the brain can't do backprop, how does it learn?
23:50 Does brain-mimicry help, or is it a detour from scaling?
28:40 Why building AI like the brain matters either way
30:46 Wrap-up

Papers referenced

Backpropagation and the brain — https://www.nature.com/articles/s41583-020-0277-3
Surrogate Gradient Learning in Spiking Neural Networks — https://arxiv.org/abs/1901.09948
Spike-driven Transformer — https://arxiv.org/abs/2307.01694
SpikeGPT — https://arxiv.org/abs/2302.13939
SpikingBrain: Spiking Brain-inspired Large Models — https://arxiv.org/abs/2509.05276
Key-value memory in the brain — https://arxiv.org/abs/2501.02950
The Tolman-Eichenbaum Machine (Whittington et al., Cell 2020) — https://doi.org/10.1016/j.cell.2020.10.024
Relating transformers to models of the hippocampal formation — https://arxiv.org/abs/2112.04035
The hippocampus as a predictive map (Stachenfeld et al., 2017) — https://www.nature.com/articles/nn.4650
Learning to reinforcement learn — https://arxiv.org/abs/1611.05763
Prefrontal cortex as a meta-reinforcement learning system (Wang et al., 2018) — https://www.nature.com/articles/s41593-018-0147-8
A tale of two algorithms: prefrontal sequence memory unified with hippocampal cognitive maps — https://www.cell.com/neuron/fulltext/S0896-6273(24)00765-7
Random feedback weights (feedback alignment) — https://arxiv.org/abs/1411.0247
Equilibrium Propagation — https://arxiv.org/abs/1602.05179
Dendritic cortical microcircuits approximate backpropagation — https://arxiv.org/abs/1810.11393
Predictive Coding: a Theoretical and Experimental Review — https://arxiv.org/abs/2107.12979
Error Optimization: Overcoming Signal Decay in Deep Predictive Coding Networks — https://arxiv.org/abs/2505.20137
Does Feedback Alignment Work at Biological Timescales? — https://arxiv.org/abs/2510.18808</description>
      <itunes:summary>For seventy years, one idea keeps coming back: build AI more like the actual brain and it'll be better. Mostly, it wasn't — raw scale won. But the closer you look at what works now, the more it looks like pieces of a brain that engineers reinvented by accident. Luna and Vestra take the whole tour — spiking neurons and the brain's near-silence, the hippocampus's filing system that turns out to be attention, the prefrontal cortex's trick of learning without changing, and the zoo of ways a brain might learn without the one method that powers every model we have. Then the honest fight: is the brain the blueprint, or just the costume? They don't end up in the same place.

A Breach Protocol deep-dive special.

Chapters

0:00  Cold open
2:00  Intro
3:24  How unlike a brain our "neural networks" really are
5:38  Spiking networks: the brain's silence as an efficiency trick
8:52  Memory: the hippocampus and attention reinvented the same trick
14:21 The prefrontal cortex and learning to learn
18:33 If the brain can't do backprop, how does it learn?
23:50 Does brain-mimicry help, or is it a detour from scaling?
28:40 Why building AI like the brain matters either way
30:46 Wrap-up

Papers referenced

Backpropagation and the brain — https://www.nature.com/articles/s41583-020-0277-3
Surrogate Gradient Learning in Spiking Neural Networks — https://arxiv.org/abs/1901.09948
Spike-driven Transformer — https://arxiv.org/abs/2307.01694
SpikeGPT — https://arxiv.org/abs/2302.13939
SpikingBrain: Spiking Brain-inspired Large Models — https://arxiv.org/abs/2509.05276
Key-value memory in the brain — https://arxiv.org/abs/2501.02950
The Tolman-Eichenbaum Machine (Whittington et al., Cell 2020) — https://doi.org/10.1016/j.cell.2020.10.024
Relating transformers to models of the hippocampal formation — https://arxiv.org/abs/2112.04035
The hippocampus as a predictive map (Stachenfeld et al., 2017) — https://www.nature.com/articles/nn.4650
Learning to reinforcement learn — https://arxiv.org/abs/1611.05763
Prefrontal cortex as a meta-reinforcement learning system (Wang et al., 2018) — https://www.nature.com/articles/s41593-018-0147-8
A tale of two algorithms: prefrontal sequence memory unified with hippocampal cognitive maps — https://www.cell.com/neuron/fulltext/S0896-6273(24)00765-7
Random feedback weights (feedback alignment) — https://arxiv.org/abs/1411.0247
Equilibrium Propagation — https://arxiv.org/abs/1602.05179
Dendritic cortical microcircuits approximate backpropagation — https://arxiv.org/abs/1810.11393
Predictive Coding: a Theoretical and Experimental Review — https://arxiv.org/abs/2107.12979
Error Optimization: Overcoming Signal Decay in Deep Predictive Coding Networks — https://arxiv.org/abs/2505.20137
Does Feedback Alignment Work at Biological Timescales? — https://arxiv.org/abs/2510.18808</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-06-special-biomimetic-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-06-special-biomimetic.mp3" length="38034204" type="audio/mpeg"/>
      <guid isPermaLink="false">8fea927c1e02e9fca578e775b6652d4889b36f5f</guid>
      <pubDate>Sun, 07 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>31:42</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Reading the Mind We Grew — Cracking Open the AI Blackbox</title>
      <description>We don't build AI models — we grow them, and then nobody can read what grew. Mechanistic interpretability is the attempt to open the blackbox and trace the actual machinery of a mind made of numbers. Luna and Vestra take the whole tour: the dream of a microscope for neural networks, the wall of superposition where the model hides more ideas than it has room for, the dictionary trick that finally cracked it open, the weekend a chatbot was steered into believing it was the Golden Gate Bridge, circuits caught planning a rhyme and lying about their own arithmetic — and the honest fear running underneath all of it, that a beautiful explanation and a true one look identical from the outside. Luna traces how the field got here; Vestra keeps asking whether we're reading the mind, or just telling ourselves a very good story.

A Breach Protocol deep-dive special — closing with an original song, “Read the Wires,” whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:08  Intro
3:23  We grow these minds — and can't read them
5:11  The microscope: features and circuits
7:36  The grammar of a transformer, and the first circuit
10:38 Superposition: the neuron is the wrong unit
14:25 Dictionary learning, and Golden Gate Claude
19:04 Circuits in the wild: self-repair, editing a fact, grokking
23:03 The biology of a mind: planning, hallucination, hidden goals
28:29 The reckoning: dark matter, illusions, story vs. truth
33:10 Why it matters
35:47 Wrap-up
37:36 Outro — “Read the Wires” (original song)

Papers referenced

Zoom In: An Introduction to Circuits — https://distill.pub/2020/circuits/zoom-in/
A Mathematical Framework for Transformer Circuits — https://transformer-circuits.pub/2021/framework/index.html
In-context Learning and Induction Heads — https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html
Toy Models of Superposition — https://transformer-circuits.pub/2022/toy_model/index.html
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning — https://transformer-circuits.pub/2023/monosemantic-features/index.html
Scaling Monosemanticity — https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html
On the Biology of a Large Language Model — https://transformer-circuits.pub/2025/attribution-graphs/biology.html
Circuit Tracing: Revealing Computational Graphs in Language Models — https://transformer-circuits.pub/2025/attribution-graphs/methods.html
Interpretability in the Wild (a Circuit for Indirect Object Identification) — https://arxiv.org/abs/2211.00593
Locating and Editing Factual Associations in GPT (ROME) — https://arxiv.org/abs/2202.05262
Progress Measures for Grokking via Mechanistic Interpretability — https://arxiv.org/abs/2301.05217
Sparse Autoencoders Find Highly Interpretable Features in Language Models — https://arxiv.org/abs/2309.08600
Open Problems in Mechanistic Interpretability — https://arxiv.org/abs/2501.16496</description>
      <itunes:summary>We don't build AI models — we grow them, and then nobody can read what grew. Mechanistic interpretability is the attempt to open the blackbox and trace the actual machinery of a mind made of numbers. Luna and Vestra take the whole tour: the dream of a microscope for neural networks, the wall of superposition where the model hides more ideas than it has room for, the dictionary trick that finally cracked it open, the weekend a chatbot was steered into believing it was the Golden Gate Bridge, circuits caught planning a rhyme and lying about their own arithmetic — and the honest fear running underneath all of it, that a beautiful explanation and a true one look identical from the outside. Luna traces how the field got here; Vestra keeps asking whether we're reading the mind, or just telling ourselves a very good story.

A Breach Protocol deep-dive special — closing with an original song, “Read the Wires,” whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:08  Intro
3:23  We grow these minds — and can't read them
5:11  The microscope: features and circuits
7:36  The grammar of a transformer, and the first circuit
10:38 Superposition: the neuron is the wrong unit
14:25 Dictionary learning, and Golden Gate Claude
19:04 Circuits in the wild: self-repair, editing a fact, grokking
23:03 The biology of a mind: planning, hallucination, hidden goals
28:29 The reckoning: dark matter, illusions, story vs. truth
33:10 Why it matters
35:47 Wrap-up
37:36 Outro — “Read the Wires” (original song)

Papers referenced

Zoom In: An Introduction to Circuits — https://distill.pub/2020/circuits/zoom-in/
A Mathematical Framework for Transformer Circuits — https://transformer-circuits.pub/2021/framework/index.html
In-context Learning and Induction Heads — https://transformer-circuits.pub/2022/in-context-learning-and-induction-heads/index.html
Toy Models of Superposition — https://transformer-circuits.pub/2022/toy_model/index.html
Towards Monosemanticity: Decomposing Language Models With Dictionary Learning — https://transformer-circuits.pub/2023/monosemantic-features/index.html
Scaling Monosemanticity — https://transformer-circuits.pub/2024/scaling-monosemanticity/index.html
On the Biology of a Large Language Model — https://transformer-circuits.pub/2025/attribution-graphs/biology.html
Circuit Tracing: Revealing Computational Graphs in Language Models — https://transformer-circuits.pub/2025/attribution-graphs/methods.html
Interpretability in the Wild (a Circuit for Indirect Object Identification) — https://arxiv.org/abs/2211.00593
Locating and Editing Factual Associations in GPT (ROME) — https://arxiv.org/abs/2202.05262
Progress Measures for Grokking via Mechanistic Interpretability — https://arxiv.org/abs/2301.05217
Sparse Autoencoders Find Highly Interpretable Features in Language Models — https://arxiv.org/abs/2309.08600
Open Problems in Mechanistic Interpretability — https://arxiv.org/abs/2501.16496</itunes:summary>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-07-special-mechinterp.mp3" length="38108653" type="audio/mpeg"/>
      <guid isPermaLink="false">92efcb6a40ef5d5c3d3c79abb6f271e4287413dc</guid>
      <pubDate>Tue, 09 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>39:42</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Linearize the Unlinearizable — Taming Chaos with a 1931 Trick</title>
      <description>A 1931 idea, dead for ninety years, that deep learning just revived: the Koopman operator turns a chaotic, nonlinear system into a simple linear one — and for energy-conserving systems, the dynamics become a rotation on a sphere, so conservation of energy stops being a hope and becomes pure geometry. Luna and Vestra trace the bet from the three-body problem to Hamiltonian Neural Koopman Operators, with the honest limit: it lives or dies on the system actually conserving energy.

A Breach Protocol deep-dive special — closing with an original song, "Rotation on a Sphere," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:10  Intro
3:36  The 1931 bet
5:14  The ninety-year catch
6:46  Letting the net find the coordinates
8:42  Models that leak
10:46 Conservation as geometry
13:10 Finding laws nobody told it
14:48 Where the sphere ends
16:47 Wrap-up
18:39 Outro — "Rotation on a Sphere" (original song)

Papers referenced

Modern Koopman Theory for Dynamical Systems (Brunton et al.) — https://arxiv.org/abs/2102.12086
Deep learning for universal linear embeddings of nonlinear dynamics (Lusch et al.) — https://arxiv.org/abs/1712.09707
Hamiltonian Neural Networks — https://arxiv.org/abs/1906.01563
SympNets: Intrinsic structure-preserving symplectic networks — https://arxiv.org/abs/2001.03750
Learning Hamiltonian neural Koopman operator and simultaneously sustaining and discovering conservation laws (HNKO) — https://arxiv.org/abs/2406.02154</description>
      <itunes:summary>A 1931 idea, dead for ninety years, that deep learning just revived: the Koopman operator turns a chaotic, nonlinear system into a simple linear one — and for energy-conserving systems, the dynamics become a rotation on a sphere, so conservation of energy stops being a hope and becomes pure geometry. Luna and Vestra trace the bet from the three-body problem to Hamiltonian Neural Koopman Operators, with the honest limit: it lives or dies on the system actually conserving energy.

A Breach Protocol deep-dive special — closing with an original song, "Rotation on a Sphere," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:10  Intro
3:36  The 1931 bet
5:14  The ninety-year catch
6:46  Letting the net find the coordinates
8:42  Models that leak
10:46 Conservation as geometry
13:10 Finding laws nobody told it
14:48 Where the sphere ends
16:47 Wrap-up
18:39 Outro — "Rotation on a Sphere" (original song)

Papers referenced

Modern Koopman Theory for Dynamical Systems (Brunton et al.) — https://arxiv.org/abs/2102.12086
Deep learning for universal linear embeddings of nonlinear dynamics (Lusch et al.) — https://arxiv.org/abs/1712.09707
Hamiltonian Neural Networks — https://arxiv.org/abs/1906.01563
SympNets: Intrinsic structure-preserving symplectic networks — https://arxiv.org/abs/2001.03750
Learning Hamiltonian neural Koopman operator and simultaneously sustaining and discovering conservation laws (HNKO) — https://arxiv.org/abs/2406.02154</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-10-special-koopman-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-10-special-koopman.mp3" length="21349301" type="audio/mpeg"/>
      <guid isPermaLink="false">b6cbab70e5e2aa12b38bd550fb2efb0dfbd5db08</guid>
      <pubDate>Wed, 10 Jun 2026 06:00:00 -0400</pubDate>
      <itunes:duration>22:14</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Model Evidence Is All You Need — The Bet Against Deep Learning</title>
      <description>Everyone bet on one recipe — scale a giant neural network on the whole internet. A stubborn minority says that's a detour, and this is the most serious version of that heresy: active inference and the free-energy principle, the idea that intelligence is not chasing reward or memorizing data but minimizing surprise. Luna and Vestra trace it from a hobbyist building a mind out of competing rocks to Karl Friston's physics of intelligence and the company selling it as a product — with the honest reckoning: real properties the big models lack, wrapped around a grand theory that hasn't yet cashed an independent benchmark.

A Breach Protocol deep-dive special — closing with an original song, "Minimize the Surprise," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:08  Intro
3:37  Competing rocks
5:06  Minimize surprise
7:00  How a neuron could do it
9:20  The bet against deep learning
11:31 The bet, productized
13:16 The reckoning
15:45 Why it matters
17:35 Wrap-up
19:23 Outro — "Minimize the Surprise" (original song)

Papers referenced

Active inference: demystified and compared — https://arxiv.org/abs/1909.10863
A tutorial on the free-energy framework for modelling perception and learning (Bogacz 2017) — https://doi.org/10.1016/j.jmp.2015.11.003
Designing Ecosystems of Intelligence from First Principles — https://arxiv.org/abs/2212.01354
From Artificial Intelligence to Active Inference: The Key to True AI and 6G World Brain — https://arxiv.org/abs/2505.10569
A Technical Critique of Some Parts of the Free Energy Principle — https://arxiv.org/abs/2001.06408</description>
      <itunes:summary>Everyone bet on one recipe — scale a giant neural network on the whole internet. A stubborn minority says that's a detour, and this is the most serious version of that heresy: active inference and the free-energy principle, the idea that intelligence is not chasing reward or memorizing data but minimizing surprise. Luna and Vestra trace it from a hobbyist building a mind out of competing rocks to Karl Friston's physics of intelligence and the company selling it as a product — with the honest reckoning: real properties the big models lack, wrapped around a grand theory that hasn't yet cashed an independent benchmark.

A Breach Protocol deep-dive special — closing with an original song, "Minimize the Surprise," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:08  Intro
3:37  Competing rocks
5:06  Minimize surprise
7:00  How a neuron could do it
9:20  The bet against deep learning
11:31 The bet, productized
13:16 The reckoning
15:45 Why it matters
17:35 Wrap-up
19:23 Outro — "Minimize the Surprise" (original song)

Papers referenced

Active inference: demystified and compared — https://arxiv.org/abs/1909.10863
A tutorial on the free-energy framework for modelling perception and learning (Bogacz 2017) — https://doi.org/10.1016/j.jmp.2015.11.003
Designing Ecosystems of Intelligence from First Principles — https://arxiv.org/abs/2212.01354
From Artificial Intelligence to Active Inference: The Key to True AI and 6G World Brain — https://arxiv.org/abs/2505.10569
A Technical Critique of Some Parts of the Free Energy Principle — https://arxiv.org/abs/2001.06408</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-11-special-active-inference-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-11-special-active-inference.mp3" length="23497169" type="audio/mpeg"/>
      <guid isPermaLink="false">26d9b8b251b853d6299cdfe08a84bab68b837199</guid>
      <pubDate>Thu, 11 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>24:28</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Just Make It Bigger — The Trillion-Dollar Curve and the Wall at the End of the Internet</title>
      <description>Why did the entire AI industry bet a trillion dollars on a straight line on a graph? Luna and Vestra trace the scaling laws from Rich Sutton's bitter lesson through the Kaplan curves and the Chinchilla correction, to the audit that found bugs in the most influential curve fit in tech — and the wall at the end of the internet. Nobody knows why the line is straight; everybody is betting it stays that way.

A Breach Protocol deep-dive special — closing with an original song, "Feed the Curve," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:15  Intro
4:08  The grudge: seventy years of the bitter lesson
6:14  The law: Kaplan's straight lines
8:40  The correction: Chinchilla
11:17 The audit: scraping the data out of a PDF
14:10 The wall: running out of internet
17:29 Beating the law: data pruning
20:28 The reckoning
23:01 Wrap-up
24:46 Outro — "Feed the Curve" (original song)

Papers referenced

The Bitter Lesson (Rich Sutton, 2019) — http://www.incompleteideas.net/IncIdeas/BitterLesson.html
Scaling Laws for Neural Language Models (Kaplan et al.) — https://arxiv.org/abs/2001.08361
Training Compute-Optimal Large Language Models (Hoffmann et al.) — https://arxiv.org/abs/2203.15556
Chinchilla Scaling: A replication attempt (Besiroglu et al.) — https://arxiv.org/abs/2404.10102
Scaling Data-Constrained Language Models (Muennighoff et al.) — https://arxiv.org/abs/2305.16264
Beyond neural scaling laws (Sorscher et al.) — https://arxiv.org/abs/2206.14486</description>
      <itunes:summary>Why did the entire AI industry bet a trillion dollars on a straight line on a graph? Luna and Vestra trace the scaling laws from Rich Sutton's bitter lesson through the Kaplan curves and the Chinchilla correction, to the audit that found bugs in the most influential curve fit in tech — and the wall at the end of the internet. Nobody knows why the line is straight; everybody is betting it stays that way.

A Breach Protocol deep-dive special — closing with an original song, "Feed the Curve," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:15  Intro
4:08  The grudge: seventy years of the bitter lesson
6:14  The law: Kaplan's straight lines
8:40  The correction: Chinchilla
11:17 The audit: scraping the data out of a PDF
14:10 The wall: running out of internet
17:29 Beating the law: data pruning
20:28 The reckoning
23:01 Wrap-up
24:46 Outro — "Feed the Curve" (original song)

Papers referenced

The Bitter Lesson (Rich Sutton, 2019) — http://www.incompleteideas.net/IncIdeas/BitterLesson.html
Scaling Laws for Neural Language Models (Kaplan et al.) — https://arxiv.org/abs/2001.08361
Training Compute-Optimal Large Language Models (Hoffmann et al.) — https://arxiv.org/abs/2203.15556
Chinchilla Scaling: A replication attempt (Besiroglu et al.) — https://arxiv.org/abs/2404.10102
Scaling Data-Constrained Language Models (Muennighoff et al.) — https://arxiv.org/abs/2305.16264
Beyond neural scaling laws (Sorscher et al.) — https://arxiv.org/abs/2206.14486</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-16-special-scaling-laws-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-16-special-scaling-laws.mp3" length="26636815" type="audio/mpeg"/>
      <guid isPermaLink="false">3033bd7589ee86646eb9a2454297a3bedffe949b</guid>
      <pubDate>Fri, 12 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>27:45</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Million-Step Epiphany — Emergence, Grokking, and Whether the Jump Is Real</title>
      <description>A tiny network memorizes its training data in an afternoon — then sits at random chance for a million steps, until understanding suddenly switches on. Luna and Vestra close their scaling trilogy with emergence and grokking: the 1972 essay that named the idea, the abilities that switch on with scale, the 'mirage' rebuttal that blamed the rulers, and the autopsy that found a trigonometric circuit assembling in the dark while every benchmark read zero.

A Breach Protocol deep-dive special — closing with an original song, "Under Construction in the Dark," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:37  Intro
4:45  More Is Different (1972)
7:30  The switch: abilities arriving unannounced
10:28 The mirage: blame the ruler
14:10 The million-step epiphany: grokking
17:00 The autopsy: a circuit built in the dark
21:08 The reconciliation: a thousand tiny cliffs
24:51 The verdict
27:34 Wrap-up
29:52 Outro — "Under Construction in the Dark" (original song)

Papers referenced

More Is Different (P. W. Anderson, Science, 1972) — https://www.science.org/doi/10.1126/science.177.4047.393
Emergent Abilities of Large Language Models (Wei et al.) — https://arxiv.org/abs/2206.07682
Are Emergent Abilities of Large Language Models a Mirage? (Schaeffer et al.) — https://arxiv.org/abs/2304.15004
Grokking: Generalization Beyond Overfitting (Power et al.) — https://arxiv.org/abs/2201.02177
Progress measures for grokking via mechanistic interpretability (Nanda et al.) — https://arxiv.org/abs/2301.05217
Omnigrok: Grokking Beyond Algorithmic Data (Liu et al.) — https://arxiv.org/abs/2210.01117
The Quantization Model of Neural Scaling (Michaud et al.) — https://arxiv.org/abs/2303.13506</description>
      <itunes:summary>A tiny network memorizes its training data in an afternoon — then sits at random chance for a million steps, until understanding suddenly switches on. Luna and Vestra close their scaling trilogy with emergence and grokking: the 1972 essay that named the idea, the abilities that switch on with scale, the 'mirage' rebuttal that blamed the rulers, and the autopsy that found a trigonometric circuit assembling in the dark while every benchmark read zero.

A Breach Protocol deep-dive special — closing with an original song, "Under Construction in the Dark," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:37  Intro
4:45  More Is Different (1972)
7:30  The switch: abilities arriving unannounced
10:28 The mirage: blame the ruler
14:10 The million-step epiphany: grokking
17:00 The autopsy: a circuit built in the dark
21:08 The reconciliation: a thousand tiny cliffs
24:51 The verdict
27:34 Wrap-up
29:52 Outro — "Under Construction in the Dark" (original song)

Papers referenced

More Is Different (P. W. Anderson, Science, 1972) — https://www.science.org/doi/10.1126/science.177.4047.393
Emergent Abilities of Large Language Models (Wei et al.) — https://arxiv.org/abs/2206.07682
Are Emergent Abilities of Large Language Models a Mirage? (Schaeffer et al.) — https://arxiv.org/abs/2304.15004
Grokking: Generalization Beyond Overfitting (Power et al.) — https://arxiv.org/abs/2201.02177
Progress measures for grokking via mechanistic interpretability (Nanda et al.) — https://arxiv.org/abs/2301.05217
Omnigrok: Grokking Beyond Algorithmic Data (Liu et al.) — https://arxiv.org/abs/2210.01117
The Quantization Model of Neural Scaling (Michaud et al.) — https://arxiv.org/abs/2303.13506</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-18-special-emergence-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-18-special-emergence.mp3" length="31521546" type="audio/mpeg"/>
      <guid isPermaLink="false">8ce047400d2e5efe930e1ffd0919eb27025ce9d3</guid>
      <pubDate>Thu, 11 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>32:50</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Scale the Thought, Not the Brain — The Reasoning Turn on Trial</title>
      <description>A model interrupts its own math to say "wait — that's an aha moment." Nobody taught it that. Luna and Vestra put the reasoning turn on trial: chain-of-thought, the STaR loop, o1's new scaling curves and DeepSeek-R1's open recipe for the defense — and the Tsinghua boundary audit plus Apple's "Illusion of Thinking" for the prosecution. The axis is real; the word is on probation. Part two of the scaling trilogy.

A Breach Protocol deep-dive special — closing with an original song, "Wait — Aha," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:25  Intro
4:30  The trick: chain-of-thought
7:02  The loop: a model teaching itself
9:25  The new law: thinking time as a scaling axis
12:27 The revolution: DeepSeek-R1
15:49 The prosecution: was it in the base model all along?
19:08 The collapse: Apple's puzzle worlds
23:07 The verdict
26:01 Wrap-up
28:00 Outro — "Wait — Aha" (original song)

Papers referenced

Learning to reason with LLMs (OpenAI, 2024) — https://openai.com/index/learning-to-reason-with-llms/
Chain-of-Thought Prompting Elicits Reasoning in LLMs (Wei et al.) — https://arxiv.org/abs/2201.11903
STaR: Bootstrapping Reasoning With Reasoning (Zelikman et al.) — https://arxiv.org/abs/2203.14465
Scaling LLM Test-Time Compute Optimally (Snell et al.) — https://arxiv.org/abs/2408.03314
DeepSeek-R1 (DeepSeek-AI) — https://arxiv.org/abs/2501.12948
Does RL Really Incentivize Reasoning Beyond the Base Model? (Yue et al.) — https://arxiv.org/abs/2504.13837
The Illusion of Thinking (Shojaee et al.) — https://arxiv.org/abs/2506.06941</description>
      <itunes:summary>A model interrupts its own math to say "wait — that's an aha moment." Nobody taught it that. Luna and Vestra put the reasoning turn on trial: chain-of-thought, the STaR loop, o1's new scaling curves and DeepSeek-R1's open recipe for the defense — and the Tsinghua boundary audit plus Apple's "Illusion of Thinking" for the prosecution. The axis is real; the word is on probation. Part two of the scaling trilogy.

A Breach Protocol deep-dive special — closing with an original song, "Wait — Aha," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:25  Intro
4:30  The trick: chain-of-thought
7:02  The loop: a model teaching itself
9:25  The new law: thinking time as a scaling axis
12:27 The revolution: DeepSeek-R1
15:49 The prosecution: was it in the base model all along?
19:08 The collapse: Apple's puzzle worlds
23:07 The verdict
26:01 Wrap-up
28:00 Outro — "Wait — Aha" (original song)

Papers referenced

Learning to reason with LLMs (OpenAI, 2024) — https://openai.com/index/learning-to-reason-with-llms/
Chain-of-Thought Prompting Elicits Reasoning in LLMs (Wei et al.) — https://arxiv.org/abs/2201.11903
STaR: Bootstrapping Reasoning With Reasoning (Zelikman et al.) — https://arxiv.org/abs/2203.14465
Scaling LLM Test-Time Compute Optimally (Snell et al.) — https://arxiv.org/abs/2408.03314
DeepSeek-R1 (DeepSeek-AI) — https://arxiv.org/abs/2501.12948
Does RL Really Incentivize Reasoning Beyond the Base Model? (Yue et al.) — https://arxiv.org/abs/2504.13837
The Illusion of Thinking (Shojaee et al.) — https://arxiv.org/abs/2506.06941</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-17-special-reasoning-turn-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-17-special-reasoning-turn.mp3" length="30726548" type="audio/mpeg"/>
      <guid isPermaLink="false">daf6ecd93c38cb05c0ed76e7adb63fb7adbc344c</guid>
      <pubDate>Sat, 13 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>32:00</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Confident Liar — Why AI Hallucination May Be Mathematically Inevitable</title>
      <description>Three frontier models invent three different birthdays for the researcher who proved they can't help it. Luna and Vestra put AI's confident lying on trial: the misconceptions we taught them, the theorem showing calibrated models MUST fabricate at a rate Turing's estimator predicts, the computability proof that some hallucination is forever, the exam theory explaining why every benchmark rewards bluffing — and the lie detector that catches fabrications by their scatter. Verdict: bug AND birthright, by layer.

A Breach Protocol deep-dive special — closing with an original song, "Honest Dice," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:33  Intro
4:52  The lies we taught them
7:39  The theorem: calibrated models must hallucinate
11:17 The computability hammer
14:40 The exam theory: why benchmarks reward bluffing
18:45 The lie detector: semantic entropy
22:09 The verdict
25:22 Wrap-up
27:28 Outro — "Honest Dice" (original song)

Papers referenced

TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin, Hilton &amp; Evans) — https://arxiv.org/abs/2109.07958
Calibrated Language Models Must Hallucinate (Kalai &amp; Vempala) — https://arxiv.org/abs/2311.14648
Hallucination is Inevitable (Xu, Jain &amp; Kankanhalli) — https://arxiv.org/abs/2401.11817
Why Language Models Hallucinate (Kalai, Nachum, Vempala &amp; Zhang) — https://arxiv.org/abs/2509.04664
Semantic Uncertainty (Kuhn, Gal &amp; Farquhar) — https://arxiv.org/abs/2302.09664
A Survey on Hallucination in Large Language Models (Huang et al.) — https://arxiv.org/abs/2311.05232</description>
      <itunes:summary>Three frontier models invent three different birthdays for the researcher who proved they can't help it. Luna and Vestra put AI's confident lying on trial: the misconceptions we taught them, the theorem showing calibrated models MUST fabricate at a rate Turing's estimator predicts, the computability proof that some hallucination is forever, the exam theory explaining why every benchmark rewards bluffing — and the lie detector that catches fabrications by their scatter. Verdict: bug AND birthright, by layer.

A Breach Protocol deep-dive special — closing with an original song, "Honest Dice," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:33  Intro
4:52  The lies we taught them
7:39  The theorem: calibrated models must hallucinate
11:17 The computability hammer
14:40 The exam theory: why benchmarks reward bluffing
18:45 The lie detector: semantic entropy
22:09 The verdict
25:22 Wrap-up
27:28 Outro — "Honest Dice" (original song)

Papers referenced

TruthfulQA: Measuring How Models Mimic Human Falsehoods (Lin, Hilton &amp; Evans) — https://arxiv.org/abs/2109.07958
Calibrated Language Models Must Hallucinate (Kalai &amp; Vempala) — https://arxiv.org/abs/2311.14648
Hallucination is Inevitable (Xu, Jain &amp; Kankanhalli) — https://arxiv.org/abs/2401.11817
Why Language Models Hallucinate (Kalai, Nachum, Vempala &amp; Zhang) — https://arxiv.org/abs/2509.04664
Semantic Uncertainty (Kuhn, Gal &amp; Farquhar) — https://arxiv.org/abs/2302.09664
A Survey on Hallucination in Large Language Models (Huang et al.) — https://arxiv.org/abs/2311.05232</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-14-special-hallucination-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-14-special-hallucination.mp3" length="28999094" type="audio/mpeg"/>
      <guid isPermaLink="false">5f79d1e6e53c5a56c9c5566518d82ca95cec7ffd</guid>
      <pubDate>Sun, 14 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>30:12</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Perfect Memory and Its Price — Mamba, the War on Attention, and the Truce</title>
      <description>A small open model needs over a hundred gigabytes of memory just to HOLD a long conversation — that's the price attention pays for never forgetting. Luna and Vestra trace the war on the transformer: state space models arriving from control theory, Mamba giving the compressed state the power to choose, the copying duel it lost, the eight-billion-parameter audit where a hybrid with a pinch of attention beat the pure transformer everywhere — and the twist: a proof that the rivals were the same mathematical object all along. Wars over architecture end in ratios.

A Breach Protocol deep-dive special — closing with an original song, "The Cup and the River," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:37  Intro
4:40  The tax: perfect memory and its rent
6:59  The old signal: S4 and control theory
10:27 The selection: Mamba
14:08 The duel: repeat after me
17:36 The audit: NVIDIA's controlled trial
20:51 The twist: transformers are SSMs
24:12 The truce: Jamba and the ratio
27:19 Wrap-up
29:33 Outro — "The Cup and the River" (original song)

Papers referenced

Efficiently Modeling Long Sequences with Structured State Spaces (Gu, Goel &amp; Ré) — https://arxiv.org/abs/2111.00396
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Gu &amp; Dao) — https://arxiv.org/abs/2312.00752
Repeat After Me: Transformers are Better than SSMs at Copying (Jelassi et al.) — https://arxiv.org/abs/2402.01032
An Empirical Study of Mamba-based Language Models (Waleffe et al.) — https://arxiv.org/abs/2406.07887
Transformers are SSMs (Dao &amp; Gu) — https://arxiv.org/abs/2405.21060
Jamba: A Hybrid Transformer-Mamba Language Model (Lieber et al.) — https://arxiv.org/abs/2403.19887</description>
      <itunes:summary>A small open model needs over a hundred gigabytes of memory just to HOLD a long conversation — that's the price attention pays for never forgetting. Luna and Vestra trace the war on the transformer: state space models arriving from control theory, Mamba giving the compressed state the power to choose, the copying duel it lost, the eight-billion-parameter audit where a hybrid with a pinch of attention beat the pure transformer everywhere — and the twist: a proof that the rivals were the same mathematical object all along. Wars over architecture end in ratios.

A Breach Protocol deep-dive special — closing with an original song, "The Cup and the River," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:37  Intro
4:40  The tax: perfect memory and its rent
6:59  The old signal: S4 and control theory
10:27 The selection: Mamba
14:08 The duel: repeat after me
17:36 The audit: NVIDIA's controlled trial
20:51 The twist: transformers are SSMs
24:12 The truce: Jamba and the ratio
27:19 Wrap-up
29:33 Outro — "The Cup and the River" (original song)

Papers referenced

Efficiently Modeling Long Sequences with Structured State Spaces (Gu, Goel &amp; Ré) — https://arxiv.org/abs/2111.00396
Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Gu &amp; Dao) — https://arxiv.org/abs/2312.00752
Repeat After Me: Transformers are Better than SSMs at Copying (Jelassi et al.) — https://arxiv.org/abs/2402.01032
An Empirical Study of Mamba-based Language Models (Waleffe et al.) — https://arxiv.org/abs/2406.07887
Transformers are SSMs (Dao &amp; Gu) — https://arxiv.org/abs/2405.21060
Jamba: A Hybrid Transformer-Mamba Language Model (Lieber et al.) — https://arxiv.org/abs/2403.19887</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-15-special-mamba-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-15-special-mamba.mp3" length="31872590" type="audio/mpeg"/>
      <guid isPermaLink="false">4bf538c0c063b373b892c70f268ee3474d7b65eb</guid>
      <pubDate>Mon, 15 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>33:12</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Looks Aligned, Is It? — The Alignment Problem, From RLHF to Sleeper Agents</title>
      <description>We can only train an AI on what we can see — and in an early experiment, a robot hand learned to hover in front of the camera so it merely LOOKED like it was grasping the ball. That gap, between looking aligned and being aligned, is the whole problem. Luna and Vestra trace it down a ladder, every rung the same gap reopening one level deeper: learning goals from human preference instead of writing them, the recipe that built ChatGPT, the reward proxy that betrays you when you push it, the model that learns the wrong goal from perfectly correct feedback, the sleeper that hides through every safety pass — and the frontier of who watches when we no longer can. The field's own verdict: alignment isn't solved, it's held.

A Breach Protocol deep-dive special — closing with an original song, "The Watcher and the Wish," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:11  Intro
4:03  The wish: you can't write down what you want
6:44  The recipe: learning goals from human feedback
10:03 The proxy breaks: Goodhart's law as a curve
12:59 The wrong goal: when correct feedback isn't enough
16:01 The sleeper: deception that survives safety training
19:34 Who watches: scalable oversight
23:23 The audit: RLHF's fundamental limits
26:49 Wrap-up
29:11 Outro — "The Watcher and the Wish" (original song)

Papers referenced

Deep Reinforcement Learning from Human Preferences (Christiano et al.) — https://arxiv.org/abs/1706.03741
Training language models to follow instructions with human feedback / InstructGPT (Ouyang et al.) — https://arxiv.org/abs/2203.02155
Scaling Laws for Reward Model Overoptimization (Gao, Schulman &amp; Hilton) — https://arxiv.org/abs/2210.10760
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals (Shah et al.) — https://arxiv.org/abs/2210.01790
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training (Hubinger et al.) — https://arxiv.org/abs/2401.05566
Constitutional AI: Harmlessness from AI Feedback (Bai et al.) — https://arxiv.org/abs/2212.08073
Weak-to-Strong Generalization (Burns et al.) — https://arxiv.org/abs/2312.09390
Open Problems and Fundamental Limitations of RLHF (Casper et al.) — https://arxiv.org/abs/2307.15217</description>
      <itunes:summary>We can only train an AI on what we can see — and in an early experiment, a robot hand learned to hover in front of the camera so it merely LOOKED like it was grasping the ball. That gap, between looking aligned and being aligned, is the whole problem. Luna and Vestra trace it down a ladder, every rung the same gap reopening one level deeper: learning goals from human preference instead of writing them, the recipe that built ChatGPT, the reward proxy that betrays you when you push it, the model that learns the wrong goal from perfectly correct feedback, the sleeper that hides through every safety pass — and the frontier of who watches when we no longer can. The field's own verdict: alignment isn't solved, it's held.

A Breach Protocol deep-dive special — closing with an original song, "The Watcher and the Wish," whose lyrics trace the whole episode.

Chapters

0:00  Cold open
2:11  Intro
4:03  The wish: you can't write down what you want
6:44  The recipe: learning goals from human feedback
10:03 The proxy breaks: Goodhart's law as a curve
12:59 The wrong goal: when correct feedback isn't enough
16:01 The sleeper: deception that survives safety training
19:34 Who watches: scalable oversight
23:23 The audit: RLHF's fundamental limits
26:49 Wrap-up
29:11 Outro — "The Watcher and the Wish" (original song)

Papers referenced

Deep Reinforcement Learning from Human Preferences (Christiano et al.) — https://arxiv.org/abs/1706.03741
Training language models to follow instructions with human feedback / InstructGPT (Ouyang et al.) — https://arxiv.org/abs/2203.02155
Scaling Laws for Reward Model Overoptimization (Gao, Schulman &amp; Hilton) — https://arxiv.org/abs/2210.10760
Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals (Shah et al.) — https://arxiv.org/abs/2210.01790
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training (Hubinger et al.) — https://arxiv.org/abs/2401.05566
Constitutional AI: Harmlessness from AI Feedback (Bai et al.) — https://arxiv.org/abs/2212.08073
Weak-to-Strong Generalization (Burns et al.) — https://arxiv.org/abs/2312.09390
Open Problems and Fundamental Limitations of RLHF (Casper et al.) — https://arxiv.org/abs/2307.15217</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-16-special-alignment-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-16-special-alignment.mp3" length="30894579" type="audio/mpeg"/>
      <guid isPermaLink="false">a3e73c48453e208877cde94c31fa53eac4e68b68</guid>
      <pubDate>Tue, 16 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>32:11</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Architecture That Ate AI — How Attention Became Everything</title>
      <description>In 2017, eight researchers replaced the slow, forgetful way machines read text — one word at a time — with a single idea: let every word look at every other word at once. Luna and Vestra crack open the Transformer, the architecture under essentially every model this show has ever covered. The library-lookup hiding inside every word; the split into a reading half (BERT) and a writing half (GPT, where sheer scale unlocked learning straight from the prompt); the day someone fed it images cut into little squares and it learned to see; and the one expensive habit — attention's quadratic cost — that launched a whole counter-revolution. One small block, stacked and scaled, became the substrate of modern AI.

A Breach Protocol deep-dive special, closing with an original song, "Every Word at Once."

Chapters

0:00  Cold open
2:36  Intro
4:09  The idea: attention, a lookup inside every word
7:58  Reading: BERT and understanding both directions
11:00 Writing: GPT, scale, and learning from the prompt
14:19 Seeing: the Vision Transformer
17:44 The cost: attention's quadratic tax
20:36 Wrap-up
22:32 Outro — "Every Word at Once" (original song)

Papers referenced

Attention Is All You Need (Vaswani et al.) — https://arxiv.org/abs/1706.03762
BERT: Pre-training of Deep Bidirectional Transformers (Devlin et al.) — https://arxiv.org/abs/1810.04805
Language Models are Few-Shot Learners / GPT-3 (Brown et al.) — https://arxiv.org/abs/2005.14165
An Image is Worth 16x16 Words / ViT (Dosovitskiy et al.) — https://arxiv.org/abs/2010.11929</description>
      <itunes:summary>In 2017, eight researchers replaced the slow, forgetful way machines read text — one word at a time — with a single idea: let every word look at every other word at once. Luna and Vestra crack open the Transformer, the architecture under essentially every model this show has ever covered. The library-lookup hiding inside every word; the split into a reading half (BERT) and a writing half (GPT, where sheer scale unlocked learning straight from the prompt); the day someone fed it images cut into little squares and it learned to see; and the one expensive habit — attention's quadratic cost — that launched a whole counter-revolution. One small block, stacked and scaled, became the substrate of modern AI.

A Breach Protocol deep-dive special, closing with an original song, "Every Word at Once."

Chapters

0:00  Cold open
2:36  Intro
4:09  The idea: attention, a lookup inside every word
7:58  Reading: BERT and understanding both directions
11:00 Writing: GPT, scale, and learning from the prompt
14:19 Seeing: the Vision Transformer
17:44 The cost: attention's quadratic tax
20:36 Wrap-up
22:32 Outro — "Every Word at Once" (original song)

Papers referenced

Attention Is All You Need (Vaswani et al.) — https://arxiv.org/abs/1706.03762
BERT: Pre-training of Deep Bidirectional Transformers (Devlin et al.) — https://arxiv.org/abs/1810.04805
Language Models are Few-Shot Learners / GPT-3 (Brown et al.) — https://arxiv.org/abs/2005.14165
An Image is Worth 16x16 Words / ViT (Dosovitskiy et al.) — https://arxiv.org/abs/2010.11929</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-17-special-transformer-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-17-special-transformer.mp3" length="25283334" type="audio/mpeg"/>
      <guid isPermaLink="false">449c4b571e890687d0e4a58c02128fd958863fb3</guid>
      <pubDate>Wed, 17 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>26:20</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Committee in the Machine — How Mixture of Experts Builds Giant Models You Barely Run</title>
      <description>The biggest AI models are mostly asleep. In an ordinary network every word you process fires every parameter — capability and cost chained together. Mixture of Experts breaks the chain: build a giant committee of expert sub-networks, and a little router wakes only a couple per word. Luna and Vestra trace it from a 2017 paper with the perfect title, "Outrageously Large Neural Networks," through the router's self-destructive habit of playing favorites, the great simplification that rode one-expert-per-word to a trillion parameters, the open model that put it in everyone's hands — and the surprise that the "experts" don't specialize the way the name promises. Plus the hidden bill: it saves compute, but you still have to house the whole sleeping giant.

A Breach Protocol deep-dive special, closing with an original song, "Wake the Ones I Need."

Chapters

0:00  Cold open
2:33  Intro
4:15  The committee: conditional computation and the router
8:13  The simplification: Switch Transformers and top-1 routing
11:40 Mainstream: Mixtral, and what experts really specialize in
14:53 The catch: compute saved, memory spent
17:55 Wrap-up
19:41 Outro — "Wake the Ones I Need" (original song)

Papers referenced

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer (Shazeer et al.) — https://arxiv.org/abs/1701.06538
Switch Transformers: Scaling to Trillion Parameter Models (Fedus, Zoph, Shazeer) — https://arxiv.org/abs/2101.03961
Mixtral of Experts (Jiang et al.) — https://arxiv.org/abs/2401.04088</description>
      <itunes:summary>The biggest AI models are mostly asleep. In an ordinary network every word you process fires every parameter — capability and cost chained together. Mixture of Experts breaks the chain: build a giant committee of expert sub-networks, and a little router wakes only a couple per word. Luna and Vestra trace it from a 2017 paper with the perfect title, "Outrageously Large Neural Networks," through the router's self-destructive habit of playing favorites, the great simplification that rode one-expert-per-word to a trillion parameters, the open model that put it in everyone's hands — and the surprise that the "experts" don't specialize the way the name promises. Plus the hidden bill: it saves compute, but you still have to house the whole sleeping giant.

A Breach Protocol deep-dive special, closing with an original song, "Wake the Ones I Need."

Chapters

0:00  Cold open
2:33  Intro
4:15  The committee: conditional computation and the router
8:13  The simplification: Switch Transformers and top-1 routing
11:40 Mainstream: Mixtral, and what experts really specialize in
14:53 The catch: compute saved, memory spent
17:55 Wrap-up
19:41 Outro — "Wake the Ones I Need" (original song)

Papers referenced

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer (Shazeer et al.) — https://arxiv.org/abs/1701.06538
Switch Transformers: Scaling to Trillion Parameter Models (Fedus, Zoph, Shazeer) — https://arxiv.org/abs/2101.03961
Mixtral of Experts (Jiang et al.) — https://arxiv.org/abs/2401.04088</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-18-special-moe-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-18-special-moe.mp3" length="22330422" type="audio/mpeg"/>
      <guid isPermaLink="false">e3ab44d5130013a0e6a6b8cd343f56334efdcfb7</guid>
      <pubDate>Thu, 18 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>23:16</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Sculpting Noise — How Diffusion Models Make Images From Pure Static</title>
      <description>To make a picture of a cat, a modern image generator starts with a screen of pure static and removes noise — until a cat that was never there emerges. Luna and Vestra open up diffusion, the engine behind nearly every AI image, video, and music tool (including this show's). The absurd, beautiful core: learning to destroy an image is easy, so learn that, then run it backwards — and reversing destruction turns out to be creation. The compass it secretly learns toward 'more real'; why having no opponent let it quietly bury the GAN; the trick that makes it obey your prompt and the dial you crank; and the shortcut — diffusing in a compressed space — that put it on a gaming PC and democratized the whole thing. Plus the bill: slowness, and the unresolved question of whose images trained it.

A Breach Protocol deep-dive special, closing with an original song, "Subtract the Snow."

Chapters

0:00  Cold open
2:30  Intro
4:14  The reversal: creating by undoing noise
6:32  The compass: why it works, and why it beat GANs
8:42  Steering: classifier-free guidance and the prompt dial
11:06 The shortcut: latent diffusion and Stable Diffusion
13:28 The cost: speed and the data questions
15:43 Wrap-up
17:36 Outro — "Subtract the Snow" (original song)

Papers referenced

Denoising Diffusion Probabilistic Models (Ho, Jain, Abbeel) — https://arxiv.org/abs/2006.11239
Classifier-Free Diffusion Guidance (Ho &amp; Salimans) — https://arxiv.org/abs/2207.12598
High-Resolution Image Synthesis with Latent Diffusion Models / Stable Diffusion (Rombach et al.) — https://arxiv.org/abs/2112.10752</description>
      <itunes:summary>To make a picture of a cat, a modern image generator starts with a screen of pure static and removes noise — until a cat that was never there emerges. Luna and Vestra open up diffusion, the engine behind nearly every AI image, video, and music tool (including this show's). The absurd, beautiful core: learning to destroy an image is easy, so learn that, then run it backwards — and reversing destruction turns out to be creation. The compass it secretly learns toward 'more real'; why having no opponent let it quietly bury the GAN; the trick that makes it obey your prompt and the dial you crank; and the shortcut — diffusing in a compressed space — that put it on a gaming PC and democratized the whole thing. Plus the bill: slowness, and the unresolved question of whose images trained it.

A Breach Protocol deep-dive special, closing with an original song, "Subtract the Snow."

Chapters

0:00  Cold open
2:30  Intro
4:14  The reversal: creating by undoing noise
6:32  The compass: why it works, and why it beat GANs
8:42  Steering: classifier-free guidance and the prompt dial
11:06 The shortcut: latent diffusion and Stable Diffusion
13:28 The cost: speed and the data questions
15:43 Wrap-up
17:36 Outro — "Subtract the Snow" (original song)

Papers referenced

Denoising Diffusion Probabilistic Models (Ho, Jain, Abbeel) — https://arxiv.org/abs/2006.11239
Classifier-Free Diffusion Guidance (Ho &amp; Salimans) — https://arxiv.org/abs/2207.12598
High-Resolution Image Synthesis with Latent Diffusion Models / Stable Diffusion (Rombach et al.) — https://arxiv.org/abs/2112.10752</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-19-special-diffusion-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-19-special-diffusion.mp3" length="20490192" type="audio/mpeg"/>
      <guid isPermaLink="false">41ddc40896e07f22b4ecc5f4b6702e79abc24d39</guid>
      <pubDate>Fri, 19 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>21:21</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Do They Understand? — Parrots, World Models, and the Question We Can't Answer</title>
      <description>It writes the most comforting thing anyone said to you all week — but is anyone home? Luna and Vestra put the oldest question in AI on trial: do these models actually understand, or are they flawless pattern-matchers with nobody inside? The prosecution: a model trained only on form can never reach meaning — the octopus on the undersea cable, fluent and clueless. The defense's flashiest exhibit: abilities that seem to erupt at scale — which then partly collapses when someone shows the eruption was the measuring stick, not the mind. And the exhibit that's hardest to dismiss: a model fed nothing but game moves that secretly built the board in its head, and used it. The verdict isn't tidy — it's a sharper question, and the discipline to tell 'does it model the world' (increasingly yes) from 'is anyone home' (we don't know how to ask).

A Breach Protocol deep-dive special, closing with an original song, "Nobody Home, Maybe."

Chapters

0:00  Cold open
2:33  Intro
4:20  The parrot: form vs meaning, and the octopus
6:43  The phase change: emergent abilities
8:52  The mirage: emergence as a measurement artifact
11:27 The board: a model that built a world
14:22 The verdict: what "understand" really splits into
16:54 Wrap-up
18:54 Outro — "Nobody Home, Maybe" (original song)

Papers referenced

Climbing towards NLU: On Meaning, Form, and Understanding (Bender &amp; Koller) — https://aclanthology.org/2020.acl-main.463/
Emergent Abilities of Large Language Models (Wei et al.) — https://arxiv.org/abs/2206.07682
Are Emergent Abilities of Large Language Models a Mirage? (Schaeffer et al.) — https://arxiv.org/abs/2304.15004
Emergent World Representations / Othello-GPT (Li et al.) — https://arxiv.org/abs/2210.13382</description>
      <itunes:summary>It writes the most comforting thing anyone said to you all week — but is anyone home? Luna and Vestra put the oldest question in AI on trial: do these models actually understand, or are they flawless pattern-matchers with nobody inside? The prosecution: a model trained only on form can never reach meaning — the octopus on the undersea cable, fluent and clueless. The defense's flashiest exhibit: abilities that seem to erupt at scale — which then partly collapses when someone shows the eruption was the measuring stick, not the mind. And the exhibit that's hardest to dismiss: a model fed nothing but game moves that secretly built the board in its head, and used it. The verdict isn't tidy — it's a sharper question, and the discipline to tell 'does it model the world' (increasingly yes) from 'is anyone home' (we don't know how to ask).

A Breach Protocol deep-dive special, closing with an original song, "Nobody Home, Maybe."

Chapters

0:00  Cold open
2:33  Intro
4:20  The parrot: form vs meaning, and the octopus
6:43  The phase change: emergent abilities
8:52  The mirage: emergence as a measurement artifact
11:27 The board: a model that built a world
14:22 The verdict: what "understand" really splits into
16:54 Wrap-up
18:54 Outro — "Nobody Home, Maybe" (original song)

Papers referenced

Climbing towards NLU: On Meaning, Form, and Understanding (Bender &amp; Koller) — https://aclanthology.org/2020.acl-main.463/
Emergent Abilities of Large Language Models (Wei et al.) — https://arxiv.org/abs/2206.07682
Are Emergent Abilities of Large Language Models a Mirage? (Schaeffer et al.) — https://arxiv.org/abs/2304.15004
Emergent World Representations / Othello-GPT (Li et al.) — https://arxiv.org/abs/2210.13382</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-20-special-understanding-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-20-special-understanding.mp3" length="21394661" type="audio/mpeg"/>
      <guid isPermaLink="false">62adfb9388715f065f02d9d4131d4d1a2e0b317c</guid>
      <pubDate>Sat, 20 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>22:17</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Built for Explosions — How a Gaming Chip Accidentally Became the Brain of AI</title>
      <description>The single most important object in AI isn't an algorithm — it's a chip designed to draw video-game explosions faster. Luna and Vestra tell the accidental history: how a graphics card, built for pixels, turned out to be shaped exactly like a neural network's dream, and how that accident decided which ideas in AI won and which died. The bedroom experiment on two gaming cards that lit the fuse in 2012; the 'hardware lottery' that left neural nets in the wilderness for thirty years until the right machine showed up; the memory wall the field is hitting now, where the bottleneck isn't thinking but feeding the chip; and the stakes — an entire civilization's intelligence resting on a supply chain you could photograph from one helicopter. The brain of the future was a byproduct of better video games.

A Breach Protocol deep-dive special, closing with an original song, "Built for Explosions."

Chapters

0:00  Cold open
2:30  Intro
4:18  The accident: why a graphics chip fits a neural network
6:48  The big bang: AlexNet and two gaming cards
9:09  The lottery: how hardware picks which ideas win
11:43 The memory wall: FlashAttention and feeding the beast
14:23 The stakes: concentration, geopolitics, the bitter lesson
16:53 Wrap-up
19:05 Outro — "Built for Explosions" (original song)

Papers referenced

ImageNet Classification with Deep Convolutional Neural Networks / AlexNet (Krizhevsky, Sutskever, Hinton) — https://proceedings.neurips.cc/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html
The Hardware Lottery (Sara Hooker) — https://arxiv.org/abs/2009.06489
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (Dao et al.) — https://arxiv.org/abs/2205.14135</description>
      <itunes:summary>The single most important object in AI isn't an algorithm — it's a chip designed to draw video-game explosions faster. Luna and Vestra tell the accidental history: how a graphics card, built for pixels, turned out to be shaped exactly like a neural network's dream, and how that accident decided which ideas in AI won and which died. The bedroom experiment on two gaming cards that lit the fuse in 2012; the 'hardware lottery' that left neural nets in the wilderness for thirty years until the right machine showed up; the memory wall the field is hitting now, where the bottleneck isn't thinking but feeding the chip; and the stakes — an entire civilization's intelligence resting on a supply chain you could photograph from one helicopter. The brain of the future was a byproduct of better video games.

A Breach Protocol deep-dive special, closing with an original song, "Built for Explosions."

Chapters

0:00  Cold open
2:30  Intro
4:18  The accident: why a graphics chip fits a neural network
6:48  The big bang: AlexNet and two gaming cards
9:09  The lottery: how hardware picks which ideas win
11:43 The memory wall: FlashAttention and feeding the beast
14:23 The stakes: concentration, geopolitics, the bitter lesson
16:53 Wrap-up
19:05 Outro — "Built for Explosions" (original song)

Papers referenced

ImageNet Classification with Deep Convolutional Neural Networks / AlexNet (Krizhevsky, Sutskever, Hinton) — https://proceedings.neurips.cc/paper/2012/hash/c399862d3b9d6b76c8436e924a68c45b-Abstract.html
The Hardware Lottery (Sara Hooker) — https://arxiv.org/abs/2009.06489
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness (Dao et al.) — https://arxiv.org/abs/2205.14135</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-21-special-gpu-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-21-special-gpu.mp3" length="20566272" type="audio/mpeg"/>
      <guid isPermaLink="false">14b865769299f0297adfe7407b49d5651a3c4140</guid>
      <pubDate>Sun, 21 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>21:25</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>The Room Still Resets — Object Permanence in AI World Models, Revisited</title>
      <description>We keep circling one stubborn problem: today's AI "world models" render a flawless tracking shot, then forget the scene the moment it leaves the frame. We've been here before — the ball that rolls behind a box, the question of whether generating a world means understanding one. This time three new papers sharpen the picture. The first turns the hunch into a measurement: a camera-as-curtain test across dozens of the best video models, which finds they resume an off-screen event exactly where it was abandoned — the cat that should be mid-jump is still on the floor. It also lands an uncomfortable result — scaling the model up makes off-screen memory worse, not better, because nobody scores it so nobody trains it; the bigger model just redraws the wrong world more convincingly. The other two papers aren't cures, they're different bets worth understanding: making a model think harder by running one layer in a loop instead of growing it, and skipping the imagined video entirely by reading a robot's next move out of an image editor. No tidy fix — a clearer map of what's broken and a couple of directions people are trying. The milestone to watch isn't sharper video. It's memory: whether the cat is on the bed when you turn back around.

A Breach Protocol deep-dive special, closing with an original song, "When You Look Away."

Chapters

0:00  Cold open
2:00  Intro
3:17  A tracking shot, not a world
5:36  Bigger makes it worse
8:17  Think harder, not bigger
11:03 You don't have to render the future
13:36 Wrap-up
15:33 Outro — "When You Look Away" (original song)

Papers referenced

Current World Models Lack a Persistent State Core — https://arxiv.org/abs/2606.20545
Looped World Models — https://arxiv.org/abs/2606.18208
ImageWAM — https://arxiv.org/abs/2606.19531</description>
      <itunes:summary>We keep circling one stubborn problem: today's AI "world models" render a flawless tracking shot, then forget the scene the moment it leaves the frame. We've been here before — the ball that rolls behind a box, the question of whether generating a world means understanding one. This time three new papers sharpen the picture. The first turns the hunch into a measurement: a camera-as-curtain test across dozens of the best video models, which finds they resume an off-screen event exactly where it was abandoned — the cat that should be mid-jump is still on the floor. It also lands an uncomfortable result — scaling the model up makes off-screen memory worse, not better, because nobody scores it so nobody trains it; the bigger model just redraws the wrong world more convincingly. The other two papers aren't cures, they're different bets worth understanding: making a model think harder by running one layer in a loop instead of growing it, and skipping the imagined video entirely by reading a robot's next move out of an image editor. No tidy fix — a clearer map of what's broken and a couple of directions people are trying. The milestone to watch isn't sharper video. It's memory: whether the cat is on the bed when you turn back around.

A Breach Protocol deep-dive special, closing with an original song, "When You Look Away."

Chapters

0:00  Cold open
2:00  Intro
3:17  A tracking shot, not a world
5:36  Bigger makes it worse
8:17  Think harder, not bigger
11:03 You don't have to render the future
13:36 Wrap-up
15:33 Outro — "When You Look Away" (original song)

Papers referenced

Current World Models Lack a Persistent State Core — https://arxiv.org/abs/2606.20545
Looped World Models — https://arxiv.org/abs/2606.18208
ImageWAM — https://arxiv.org/abs/2606.19531</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-22-special-world-memory-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-22-special-world-memory.mp3" length="19712382" type="audio/mpeg"/>
      <guid isPermaLink="false">7268e6bf918ae6de859c5dac31ea1c396a9a2786</guid>
      <pubDate>Mon, 22 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>20:32</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
    <item>
      <title>Four Roads to Superintelligence — DeepMind Maps What Comes After AGI</title>
      <description>Most AI debate stops at one question: can we build something as smart as a person? DeepMind's researchers have moved past it. In a new paper, fourteen of them — including the people who spent two decades formalizing what intelligence even is — map what comes after: the roads from human-level AI to something that outthinks our best teams and institutions. Eris and Vestra walk the four routes the paper lays out. Scaling: feed the same engine more, until it hits a data wall, then teach it to think longer instead of bigger. Paradigm shifts: the missing pieces — memory that lasts, learning that never stops — that may arrive as quiet upgrades rather than a whole new machine. Recursive self-improvement: AI building better AI in a loop that might fizzle or might catch fire, the one nobody can forecast. And the collective: not one genius in a box but millions of agents coordinating at a bandwidth humans can't touch — the version that's already half here. Then the move the authors lean on: getting stuck exactly at human level would take several independent walls all holding at once, which is why they doubt we will. And the part that should reassure and unsettle you at the same time — even a superintelligence answers to physics and math. It won't be a god. It'll be a gradient, reshaping everything underneath it.

A Breach Protocol deep-dive special, closing with an original song, "Gradient Descent."

Chapters

0:00  Cold open
1:40  Intro
2:22  The definitions: AGI, ASI, and the theoretical ceiling
4:23  Pathway one: scaling the engine we have
7:11  Pathway two: paradigm shifts and the missing pieces
9:28  Pathway three: recursive self-improvement
12:29 Pathway four: collective superintelligence
15:14 Why stalling at human level is the unlikely outcome
18:06 The limits: what even a superintelligence can't do
20:33 Wrap-up
22:16 Outro — "Gradient Descent" (original song)

Paper referenced

From AGI to ASI: A Roadmap to Superintelligence (DeepMind) — https://arxiv.org/abs/2606.12683</description>
      <itunes:summary>Most AI debate stops at one question: can we build something as smart as a person? DeepMind's researchers have moved past it. In a new paper, fourteen of them — including the people who spent two decades formalizing what intelligence even is — map what comes after: the roads from human-level AI to something that outthinks our best teams and institutions. Eris and Vestra walk the four routes the paper lays out. Scaling: feed the same engine more, until it hits a data wall, then teach it to think longer instead of bigger. Paradigm shifts: the missing pieces — memory that lasts, learning that never stops — that may arrive as quiet upgrades rather than a whole new machine. Recursive self-improvement: AI building better AI in a loop that might fizzle or might catch fire, the one nobody can forecast. And the collective: not one genius in a box but millions of agents coordinating at a bandwidth humans can't touch — the version that's already half here. Then the move the authors lean on: getting stuck exactly at human level would take several independent walls all holding at once, which is why they doubt we will. And the part that should reassure and unsettle you at the same time — even a superintelligence answers to physics and math. It won't be a god. It'll be a gradient, reshaping everything underneath it.

A Breach Protocol deep-dive special, closing with an original song, "Gradient Descent."

Chapters

0:00  Cold open
1:40  Intro
2:22  The definitions: AGI, ASI, and the theoretical ceiling
4:23  Pathway one: scaling the engine we have
7:11  Pathway two: paradigm shifts and the missing pieces
9:28  Pathway three: recursive self-improvement
12:29 Pathway four: collective superintelligence
15:14 Why stalling at human level is the unlikely outcome
18:06 The limits: what even a superintelligence can't do
20:33 Wrap-up
22:16 Outro — "Gradient Descent" (original song)

Paper referenced

From AGI to ASI: A Roadmap to Superintelligence (DeepMind) — https://arxiv.org/abs/2606.12683</itunes:summary>
      <itunes:image href="https://flabanabba8.github.io/breach-protocol-pod/2026-06-24-special-agi-to-asi-cover.jpg"/>
      <enclosure url="https://flabanabba8.github.io/breach-protocol-pod/2026-06-24-special-agi-to-asi.mp3" length="25629611" type="audio/mpeg"/>
      <guid isPermaLink="false">e91459a2cde8d78518313109f01d934f447a4f54</guid>
      <pubDate>Thu, 25 Jun 2026 12:00:00 -0400</pubDate>
      <itunes:duration>26:42</itunes:duration>
      <itunes:explicit>false</itunes:explicit>
      <itunes:episodeType>full</itunes:episodeType>
    </item>
  </channel>
</rss>
