<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[TheSequence]]></title><description><![CDATA[The best source to stay up-to-date with the developments in the machine learning, artificial intelligence, and data science world. Trusted by 165,000 professionals from the main AI labs, universities, and enterprises ]]></description><link>https://thesequence.substack.com</link><image><url>https://substackcdn.com/image/fetch/$s_!t4FH!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fbucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com%2Fpublic%2Fimages%2F7c763928-9762-43a0-a55f-9ee9040fa6e1_210x210.png</url><title>TheSequence</title><link>https://thesequence.substack.com</link></image><generator>Substack</generator><lastBuildDate>Wed, 17 Jun 2026 20:00:00 GMT</lastBuildDate><atom:link href="https://thesequence.substack.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Jesus Rodriguez]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[thesequence@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[thesequence@substack.com]]></itunes:email><itunes:name><![CDATA[Jesus Rodriguez]]></itunes:name></itunes:owner><itunes:author><![CDATA[Jesus Rodriguez]]></itunes:author><googleplay:owner><![CDATA[thesequence@substack.com]]></googleplay:owner><googleplay:email><![CDATA[thesequence@substack.com]]></googleplay:email><googleplay:author><![CDATA[Jesus Rodriguez]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[The Sequence AI of the Week #878: Inside Google Deepmind's First Real Crack in Next-Token Generation]]></title><description><![CDATA[DiffusionGemma is one of the most serious non-transformer models in the market.]]></description><link>https://thesequence.substack.com/p/the-sequence-ai-of-the-week-878-inside</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-ai-of-the-week-878-inside</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Wed, 17 Jun 2026 10:56:10 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!QJIy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QJIy!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QJIy!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!QJIy!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!QJIy!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!QJIy!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QJIy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2460340,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/202413988?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QJIy!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!QJIy!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!QJIy!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!QJIy!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2f7e0ca0-9b3d-49d2-a2ad-95a78357fee8_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>As we wrap up our series about alternatives to transformer architectures, Google DeepMind just released one of the most impressive models in this category. DiffusionGemma is a text-diffusion model that challenges the conventional transfromer models. Today, we would like to deep dive into the specifics of this model. </p><p>Most language models write like a typewriter. They place one token after another, left to right, never revisiting the characters already stamped onto the page. This architecture has carried the entire modern LLM era: GPT-style chatbots, coding copilots, reasoning models, agent frameworks, enterprise assistants. The model predicts the next token, appends it, updates its state, and repeats.</p><p>Google&#8217;s new <strong>DiffusionGemma</strong> asks a deceptively simple question: what if text generation did not have to work that way?</p><p>Let&#8217;s dive in.</p>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-ai-of-the-week-878-inside">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Knowledge #878: Beyond Transformer: What We Learned]]></title><description><![CDATA[And a new series about Distillation]]></description><link>https://thesequence.substack.com/p/the-sequence-knowledge-878-beyond</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-knowledge-878-beyond</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Tue, 16 Jun 2026 11:03:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ddNg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ddNg!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ddNg!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!ddNg!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!ddNg!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!ddNg!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ddNg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2685567,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/202207747?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ddNg!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!ddNg!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!ddNg!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!ddNg!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4a0b5f09-0ec0-426c-b71a-28b03688edf3_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Today, we bring you a summary of our series about transformer alternatives. </p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://thesequence.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">TheSequence is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>For the better part of a decade, the entire field has been a giant, spectacularly funded wrapper around a single operation: self-attention. The Transformer didn&#8217;t win because it was the most elegant or the most brain-like design. It won because it had the best scaling story and it won the hardware lottery. Every token looks at every other token, the whole thing maps cleanly onto a GPU grid, and you train it all at once. Add data, parameters, compute, context &#8212; and the loss curve cooperates. That smoothness is rare. Most clever ideas in deep learning never become industrial. This one did.</p><p>But the tax was always there in plain sight. Self-attention buys you something genuinely valuable &#8212; perfect, lossless recall over the entire context, with every token able to address every other token directly, and a training pass that parallelizes across the whole sequence at once. That&#8217;s the benefit, and it&#8217;s a real one. The cost is that attention scales quadratically with sequence length, and autoregressive decoding drags around a KV-cache that grows linearly with every token you&#8217;ve already seen. When you&#8217;re pushing past a million tokens, or watching a 70B model&#8217;s cache eat 40GB of VRAM, O(n&#178;) compute and O(n) memory stop being footnotes and become the actual bill. So the interesting question was never &#8220;are Transformers good?&#8221; They&#8217;re spectacular. The question is whether they&#8217;re the <em>final</em> architecture or just the first truly scalable one &#8212; soon to be absorbed into something richer.</p><p>That was the thesis we set out to test, and the cleanest way to read the eight issues is as four families, each making a different bet against attention.</p><p>The first family is <strong>recurrent and linear-recurrent models</strong> &#8212; the RNN comeback and xLSTM. Their pitch is constant memory: instead of a cache that grows forever, they carry a fixed-size hidden state and pay O(n) compute over a sequence rather than O(n&#178;). The classic objection was that RNNs train serially and can&#8217;t saturate a GPU, but the modern variants reformulate the recurrence so it parallelizes during training while staying cheap at inference. The benefit is brutally efficient generation; the open challenge is whether a fixed-size state can hold enough to match attention&#8217;s exact recall on long-range, retrieval-heavy tasks.</p><p>The second family is <strong>state space models</strong> &#8212; the SSM/Mamba line, the most serious challenger of the bunch. SSMs treat a sequence as a continuous linear dynamical system, which gives them a near-magical dual form: a parallelizable convolution for training and a recurrent scan for inference. They get linear scaling and long-context handling almost for free. The trade-off is expressivity &#8212; pure SSMs can struggle with precise in-context copying and lookup, which is exactly why the strongest results today are <em>hybrids</em> that interleave a few attention layers among many SSM layers.</p><p>The third family is <strong>text diffusion</strong> &#8212; generation that abandons left-to-right decoding entirely, refining a whole sequence in parallel over a handful of denoising steps. The benefit is non-autoregressive speed and bidirectional context at generation time; the challenge is matching the raw quality and controllability of autoregressive models, which LLaDA, Gemini Diffusion, and Mercury are now pushing on hard.</p><p>The fourth family is <strong>liquid and continuous-time models</strong>, which throw out the parallel-lookup mental model altogether in favor of dynamics that evolve continuously in time, aiming for far smaller, more adaptive networks. The benefit is parameter efficiency and a different inductive bias; the challenge is scaling that story to frontier sizes.</p><p>None of these has dethroned attention. But the monoculture is over, and the most likely future is explicitly hybrid: attention where exact recall earns its quadratic cost, something linear-time everywhere else.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://thesequence.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">TheSequence is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><p>Here is the full series, in order:</p><ul><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-846-beyond">#846 &#8212; Beyond Transformer: A New Series</a></strong> &#8212; The kickoff, framing the palpable vibe shift on arXiv toward post-attention architectures and the decade we&#8217;ve spent as a wrapper around self-attention. It lays out the plan to map every major viable alternative to the Transformer.</p></li><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-850-the-unexpected">#850 &#8212; The Unexpected Comeback of RNNs</a></strong> &#8212; The case for recurrent networks as the alternative most people overlooked, revisiting why linear-time recurrence is attractive again. It positions modern RNN variants as a serious challenger rather than a relic.</p></li><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-854-return">#854 &#8212; Return of the King: Unrolling the xLSTM Architecture</a></strong> &#8212; Traces the lineage from the 1990s LSTM through the 2017 Transformer pivot into xLSTM, the modernized revival of Hochreiter and Schmidhuber&#8217;s design. It explains how reworked gating and scaling let xLSTM compete with attention-based models.</p></li><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-858-how-state">#858 &#8212; How State Space Models Went from Curiosity to Serious Transformer Competitor</a></strong> &#8212; Charts the rise of SSMs as the O(n&#178;) attention bottleneck becomes a real constraint at million-token contexts and large KV-caches. It argues state space models have quietly matured into a genuine rival to the dominant paradigm.</p></li><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-862-learning">#862 &#8212; Learning About Text Diffusion Models</a></strong> &#8212; Introduces text diffusion as one of the most credible non-autoregressive alternatives to transformers. It covers how diffusion-style generation breaks from strict left-to-right next-token prediction.</p></li><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-866-three">#866 &#8212; Three Text Diffusion Models You Need To Know About</a></strong> &#8212; A practical follow-up profiling the leading players in the space: LLaDA, Gemini Diffusion, and Mercury. It compares how each implements diffusion-based text generation.</p></li><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-870-liquid">#870 &#8212; Liquid Models and the Search for a Post-Transformer Architecture</a></strong> &#8212; Dives into liquid neural networks as one of the more promising non-Transformer architectures, contrasting their continuous-time dynamics with attention&#8217;s parallel lookup-table approach. It frames them within the broader hunt for a successor.</p></li><li><p><strong><a href="https://thesequence.substack.com/p/the-sequence-knowledge-874-transformers">#874 &#8212; Transformers or Not?</a></strong> &#8212; The capstone, asking whether the Transformer is the final architecture or merely the first truly scalable one, soon absorbed into something richer. It leans toward the latter and surveys the full landscape the series has covered.</p></li></ul><h2>What&#8217;s next: a new series on distillation</h2><p>If the last series was about <em>changing</em> the architecture, the next one is about <em>compressing</em> it. We&#8217;re starting a deep dive into knowledge distillation &#8212; the set of techniques for taking a large, expensive teacher model and pressing its capabilities into a smaller, faster student. It&#8217;s one of the least glamorous and most economically important ideas in modern AI: it&#8217;s how frontier capability actually reaches production. We&#8217;ll cover the classics (logit matching, the original Hinton formulation), the modern variants (sequence-level, on-policy, and self-distillation), what actually transfers and what doesn&#8217;t, and why nearly every model you can afford to run is, in some sense, a distilled one. See you in the first issue.</p>]]></content:encoded></item><item><title><![CDATA[The Sequence Radar #877: Last Week in AI: Anthropic Ships, Apple Borrows, Musk Lists, Bezos Builds]]></title><description><![CDATA[AI just got way bigger.]]></description><link>https://thesequence.substack.com/p/the-sequence-radar-877-last-week</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-radar-877-last-week</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Sun, 14 Jun 2026 11:03:27 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!W1GY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!W1GY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!W1GY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!W1GY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!W1GY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!W1GY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!W1GY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2729277,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/201667677?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!W1GY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!W1GY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!W1GY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!W1GY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe8858e09-64b4-43ff-ac51-f22c23b2891b_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Next Week in The Sequence:</strong></h2><ol><li><p>We continue our series about alternative to transformers. </p></li><li><p>The AI of the week will dive into Fable. </p></li><li><p>In the opinion section, we are going to discuss AI tokens as a units of economics.  </p></li><li><p>We might introduce a new fun section. Playing with a new idea. </p></li></ol><h2><strong>Subscribe and don&#8217;t miss out:</strong></h2><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://thesequence.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">TheSequence is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>&#128221; Editorial: </strong>Last Week in AI: Anthropic Ships, Apple Borrows, Musk Lists, Bezos Builds</h2><p>Some weeks in AI feel like incremental patch releases. This one felt like a major version bump for the entire industry. Four events &#8212; a frontier model launch, a consumer assistant reboot, the largest IPO in history, and a $12 billion bet on physical engineering &#8212; and if you squint, they&#8217;re all chapters of the same story: AI escaping the chat window.</p><p>Start with Anthropic. On Tuesday the company released Claude Fable 5 and Claude Mythos 5, and the architecture of the launch is as interesting as the model itself. Both share the same base model; the difference is policy, not weights. Fable 5 ships with conservative safety classifiers that intercept queries in high-risk domains &#8212; cybersecurity, biology, chemistry &#8212; and fall back to Opus 4.8, while Mythos 5 runs unrestricted for a vetted group of cyber defenders under Project Glasswing. Think of it as the same kernel with different syscall permissions. The benchmarks justify the caution: 80.3% on SWE-Bench Pro, more than ten points clear of Opus 4.8 and over twenty ahead of GPT-5.5. We&#8217;ve entered the era where capability and access are explicitly decoupled &#8212; the model you can use is a sandboxed view of the model that exists.</p><p>Then Apple, finally, showed up. At Tim Cook&#8217;s farewell WWDC, the company unveiled Siri AI &#8212; a conversational assistant with personal context, onscreen awareness, and a standalone app, reportedly powered by a custom 1.2-trillion-parameter Gemini model under the hood. There&#8217;s something deliciously ironic about Apple, the original vertical integrator, outsourcing the brain. But strategically it&#8217;s the right call: Apple&#8217;s moat was never the model; it&#8217;s the distribution and the personal context graph. A billion devices with intimate access to your messages, photos, and calendar is a dataset no lab can replicate. Apple isn&#8217;t competing on intelligence; it&#8217;s competing on intimacy.</p><p>The week&#8217;s most audacious move came from Elon Musk. SpaceX went public at roughly $1.77 trillion, raising about $75 billion in the largest IPO ever &#8212; and the prospectus reads less like a rocket company and more like an AI infrastructure thesis. Having merged xAI into SpaceX in February, Musk is pitching orbital data centers: up to a million GPU-packed satellites moving training and inference off-planet, where energy is abundant and regulation is thin. xAI lost $6.4 billion on $3.2 billion in revenue last year, so the IPO is effectively the public market underwriting the most capital-intensive scaling hypothesis ever proposed. Compute, it turns out, has an escape velocity.</p><p>Finally, Jeff Bezos broke his silence on Prometheus, which raised $12 billion at a $41 billion valuation to build an &#8220;artificial general engineer&#8221; &#8212; AI that designs and manufactures physical systems, from jet engines to drug compounds. Not robotics, Bezos insists. Something closer to CAD with a frontier brain.</p><p>Let&#8217;s dive in. </p><h2><strong>&#128270; AI Research</strong></h2><h3><a href="https://arxiv.org/html/2601.19755v1">Regularized f-Divergence Kernel Tests</a></h3><p><strong>AI Lab:</strong> Google Research &amp; Google DeepMind</p><p><strong>Summary:</strong> This paper introduces a unified framework for constructing practical, kernel-based two-sample tests derived from the family of f-divergences. The authors demonstrate that these adaptive tests, particularly the Hockey-Stick divergence, effectively capture diverse localized differences and are highly applicable to tasks like differential privacy auditing and machine unlearning evaluation.</p><h3><a href="https://arxiv.org/abs/2606.12373">Verifiable Environments Are LEGO Bricks: Recursive Composition for Reasoning Generalization</a></h3><p><strong>AI Lab:</strong> Qwen Team, Alibaba Group</p><p><strong>Summary:</strong> The authors propose RACES, a framework that scales up reinforcement learning for language models by recursively assembling verifiable environments like building blocks when their input and output types match. By utilizing composition operators such as SEQUENTIAL and PARALLEL, this approach generates structurally diverse training tasks that significantly improve the reasoning generalization of models on unseen benchmarks.</p><h3><a href="https://arxiv.org/html/2605.11212v3">REVISION: Scaling Computer-Use Agents via Temporal Visual Redundancy Reduction</a></h3><p><strong>AI Lab:</strong> Microsoft Research</p><p><strong>Summary:</strong> To address the high token cost associated with visual observations in computer-use agents, this paper introduces REVISION, a framework that trains multimodal models to filter out redundant visual patches across consecutive screenshots. By maintaining essential spatial structure while significantly reducing token accumulation, the method allows agents to process longer interaction histories and achieve higher success rates on complex tasks.</p><h3><a href="https://arxiv.org/html/2605.30861v1">Distilling LLM Feedback for Lean Theorem Proving</a></h3><p><strong>AI Lab:</strong> FAIR at Meta</p><p><strong>Summary:</strong> This research explores Feedback Distillation, an on-policy post-training method where a model learns to match its own token-level distribution conditioned on privileged feedback from a stronger language model. Evaluated on Lean 4 theorem proving, the technique preserves greater trajectory diversity and achieves better pass@k scaling than standard GRPO, proving especially powerful when used as an initialization for subsequent reinforcement learning.</p><h3><a href="https://arxiv.org/html/2606.10662v1">Decentralized Multi-Agent Systems with Shared Context</a></h3><p><strong>AI Lab:</strong> Stanford University</p><p><strong>Summary:</strong> DELM is a novel multi-agent framework that eliminates the bottleneck of centralized orchestration by relying on a shared, verified context and an asynchronous task queue. Agents independently claim subtasks and contribute compact, verified updates to the global state, leading to superior performance and cost efficiency in both software-engineering testing and long-context reasoning workflows.</p><h2><strong>&#129302; AI Tech Releases</strong></h2><h3><strong>Claude Fable 5 and Mythos 5</strong></h3><p>Anthropic <a href="https://www.anthropic.com/news/claude-fable-5-mythos-5">released its highly anticipated Fable 5 model</a>, a limited Mythos-based models. Also released a version of Mythos 5 for a selected group of cyersecurity and infrastructure companies. </p><h3>Kimi Work</h3><p>Moonshot AI <a href="https://www.kimi.com/products/kimi-work">released Kimi Work</a>, a new agent specialized in work automation. </p><h2><strong>&#128225;10 AI News You Need to Know About</strong></h2><ol><li><p><strong><a href="https://www.cnbc.com/2026/06/12/spacex-ipo-spcx-live-updates.html">SpaceX (SPCX) made its Nasdaq debut June 12, 2026</a>,</strong> after pricing at $135 per share and raising roughly $75 billion &#8212; the largest IPO in stock market history, valuing the company near $1.75 trillion. Shares opened sharply higher and were trading around $161 intraday, with the valuation anchored by Starlink and now bundling in xAI following an all-stock merger earlier this year.</p></li><li><p><strong><a href="https://www.cnbc.com/2026/06/11/project-prometheus-bezos-bajaj-live-updates.html">Bezos&#8217;s Prometheus raises $12B</a></strong><a href="https://www.cnbc.com/2026/06/11/project-prometheus-bezos-bajaj-live-updates.html"> </a>&#8212; Jeff Bezos and Vik Bajaj&#8217;s physical-AI startup Prometheus raised $12 billion at a $41 billion valuation to build an &#8220;artificial general engineer&#8221; that automates the design and manufacturing of complex physical systems from jet engines to drugs. </p></li><li><p><strong><a href="https://www.bloomberg.com/news/articles/2026-06-12/france-s-mistral-in-funding-talks-at-about-20-billion-valuation?srnd=phx-technology">Mistral AI is in early talks to raise about &#8364;3 billion (~$3.5 billion) at a valuation near &#8364;20 billion (~$23 billion)</a></strong>&#8212; nearly double the &#8364;11.7 billion valuation from its Series C last September. The new round would bring the three-year-old company's total financing to roughly &#8364;6.5 billion across debt and equity, fueling its compute buildout as Europe's leading AI lab competes against larger US and Chinese rivals.</p></li><li><p><strong><a href="https://techcrunch.com/2026/06/11/theker-just-raised-85m-to-build-the-factory-robot-that-doesnt-specialize-in-anything/">Theker raises $85M</a></strong> &#8212; Barcelona-based Theker raised $85 million in what it bills as Europe&#8217;s largest-ever robotics Series A to build reconfigurable factory robots whose arms and hands swap out for different tasks rather than specializing in one. </p></li><li><p><strong><a href="https://www.globenewswire.com/news-release/2026/06/10/3309625/0/en/jedify-raises-24-million-in-series-a-funding-to-build-context-graphs-for-enterprise-ai-agents.html">Jedify raises $24M</a></strong> &#8212; New York&#8217;s Jedify raised a $24 million Series A led by Norwest, with Snowflake as a strategic investor, to build &#8220;context graphs&#8221; that give enterprise AI agents the business knowledge they need to run in production. </p></li><li><p><strong><a href="https://www.artificiallawyer.com/2026/06/09/sandstone-raises-30m-for-ai-native-inhouse-teams/">Sandstone raises $30M</a></strong><a href="https://www.artificiallawyer.com/2026/06/09/sandstone-raises-30m-for-ai-native-inhouse-teams/"> </a>&#8212; Sandstone raised a $30 million Series A led by Lightspeed to bring AI-powered workflow automation (intake, routing, triage, task execution) to in-house corporate legal teams rather than law firms. </p></li><li><p><strong><a href="https://openai.com/index/openai-to-acquire-ona/">OpenAI to acquire Ona</a></strong><a href="https://openai.com/index/openai-to-acquire-ona/"> </a>&#8212; OpenAI agreed to acquire cloud-platform startup Ona, folding its secure, persistent execution environments into the Codex team so AI agents can run long, multi-step tasks for enterprises. </p></li><li><p><strong><a href="https://www.bloomberg.com/news/articles/2026-06-10/xai-co-founder-babuschkin-unveils-new-startup-for-personalized-ai">xAI co-founder unveils River AI</a></strong> &#8212; xAI co-founder Igor Babuschkin announced River AI, a startup (staffed partly by former xAI and Tesla employees) building personalized AI agents that learn from and remain owned/controlled by individual users rather than large corporations. </p></li><li><p><strong><a href="https://neura-robotics.com/record-series-c/">Tether backs Neura in $1.4B round</a></strong> &#8212; German firm Neura Robotics raised up to $1.4 billion in a Tether-led Series C &#8212; also backed by Nvidia, Amazon, Qualcomm and Bosch &#8212; to scale humanoid/cognitive-robot production toward millions of units by 2030. </p></li><li><p><strong><a href="https://www.apple.com/newsroom/2026/06/apple-introduces-siri-ai-a-profoundly-more-capable-and-personal-assistant/">Apple introduces Siri AI</a></strong> &#8212; Apple unveiled &#8220;Siri AI,&#8221; a rebuilt Apple Intelligence&#8211;powered assistant with personal-context understanding, onscreen awareness, world knowledge, a dedicated app, and expanded Visual Intelligence, available for developer testing now and as a user beta later this year. </p></li></ol><p></p>]]></content:encoded></item><item><title><![CDATA[The Sequence Opinion #876: Systems of Record vs. Systems of Action]]></title><description><![CDATA[A new business software paradigm for the agentic era.]]></description><link>https://thesequence.substack.com/p/the-sequence-opinion-systems-of-record</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-opinion-systems-of-record</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Thu, 11 Jun 2026 11:03:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!pPMC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!pPMC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!pPMC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!pPMC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!pPMC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!pPMC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!pPMC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2893643,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/201436804?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!pPMC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!pPMC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!pPMC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!pPMC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F548bd7a3-3a5e-4b40-91a6-aed884aca825_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><strong>Thesis: agentic AI does not kill SaaS. It changes what enterprise software is fundamentally for. The old winning layer was the system that held canonical state. The new winning layer is the system that can take action against that state safely, reliably, and observably.</strong></p><p>For the last twenty years, the enterprise stack has been built around one hidden constant: the human is the actor.</p><p>A person logs in. A person reads a dashboard. A person fills out a form. A person updates the opportunity stage, approves the invoice, closes the ticket, moves the candidate, changes the forecast, escalates the account, or checks the compliance box.</p><p>The software is basically a database wrapped in forms, permissions, workflows, and a pricing page. This is not an insult. It is an extremely powerful pattern. It gave companies shared memory. It made business state durable. It turned messy organizational reality into tables, fields, roles, reports, and audit logs.</p><p>But now the actor is changing.</p>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-opinion-systems-of-record">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence AI of the Week #875: Why Your Language Model Needs a Nap]]></title><description><![CDATA[On &#8220;Language Models Need Sleep,&#8221; and the slow death of the train/test split]]></description><link>https://thesequence.substack.com/p/the-sequence-ai-of-the-week-875-why</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-ai-of-the-week-875-why</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Wed, 10 Jun 2026 10:39:45 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Qz-S!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Qz-S!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Qz-S!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!Qz-S!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!Qz-S!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!Qz-S!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Qz-S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2374867,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/201433074?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Qz-S!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!Qz-S!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!Qz-S!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!Qz-S!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2979cd29-e824-418e-8c7c-50158163a22c_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h3></h3><p>For today&#8217;s essay, I would like to cover an incredible paper with a provocative thesis and an even better title that I found myself reading multiple times last week: <a href="https://openreview.net/forum?id=iiZy6xyVVE">Language Models Need Sleep&#8230;.</a> </p><p>There&#8217;s an awkward fact about the models we all use every day: they don&#8217;t learn anything anymore. Whatever a frontier model knows, it learned once, during training, and then somebody hit save. After that it&#8217;s a brilliant fossil. It can reason circles around you about events up to its cutoff and then go completely blank about last Tuesday. You can stuff new facts into the context window, sure, but the moment the session ends, that knowledge evaporates like a dream you forgot to write down.</p><p>Behrouz, Hashemi, and Mirrokni (Google + Cornell) <a href="https://openreview.net/forum?id=iiZy6xyVVE">have a name for this in their new paper</a>, and it&#8217;s a good one: it&#8217;s <strong>anterograde amnesia</strong>. The patient with anterograde amnesia keeps every memory from before the injury and can hold a conversation in the moment, but nothing new ever makes the jump into long-term storage. Each day is experienced as if it were the first. Swap &#8220;injury&#8221; for &#8220;end of pre-training&#8221; and that is exactly the shape of a Transformer&#8217;s memory. It has the deep past (the MLP weights) and the immediate present (the attention cache), and almost nothing connecting them.</p><p>The paper&#8217;s pitch is that we&#8217;ve been missing a step that biology figured out a long time ago. We sleep.</p><h2>There is no test time</h2>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-ai-of-the-week-875-why">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Knowledge #874: Transformers or Not?]]></title><description><![CDATA[One of the biggest debates in modern AI.]]></description><link>https://thesequence.substack.com/p/the-sequence-knowledge-874-transformers</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-knowledge-874-transformers</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Tue, 09 Jun 2026 11:03:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!gPFe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!gPFe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!gPFe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!gPFe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!gPFe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!gPFe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!gPFe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3083910,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/201219450?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!gPFe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!gPFe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!gPFe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!gPFe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde04377c-57c8-4ff2-9a2e-9936a2106228_1536x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>&#128161; AI Concept of the Day: Transformers or Not?</strong></h2><p>The Transformer is currently the reference architecture for serious AI. Not because it is obviously the most brain-like, elegant, or efficient design, but because it has the best scaling story. You add data, parameters, compute, context length, better training recipes, better post-training, and the model gets better in a surprisingly smooth way. That is rare. In deep learning, many ideas are clever. Few are industrial.</p><p>The Transformer&#8217;s superpower is attention. Every token can look at every other token and decide what matters. This is an incredibly general operation. It works for language, code, images, audio, video, protein sequences, robotics tokens, and tool traces. The architecture is simple enough to scale, parallel enough to train efficiently, and expressive enough to absorb huge datasets.</p><p>But it has an obvious tax: attention is expensive. Full self-attention scales badly with sequence length. In autoregressive generation, the model accumulates a key-value cache, which grows with context. A Transformer remembers by keeping a large, explicit, token-indexed memory. That is powerful, but it is not how you would design every intelligent system from first principles.</p><p>So the question is not &#8220;are Transformers good?&#8221; They are spectacular. The question is: are they the final architecture? Or are they the first truly scalable architecture, soon to be absorbed into something richer?</p><p>I think the second view is more likely.</p><h2><strong>The Landscape of Alternatives</strong></h2>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-knowledge-874-transformers">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Radar #873: Last Week in AI: Soccer, S-1s, and Supermodels]]></title><description><![CDATA[A new AI soccer tournament, major model releases, fundraises and Antropic's S-1.]]></description><link>https://thesequence.substack.com/p/the-sequence-radar-873-last-week</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-radar-873-last-week</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Sun, 07 Jun 2026 11:00:59 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!u-ox!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!u-ox!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!u-ox!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!u-ox!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!u-ox!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!u-ox!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!u-ox!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2880163,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/200806076?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!u-ox!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!u-ox!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!u-ox!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!u-ox!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65d8bbea-b47b-4269-b734-96e3ecef41a3_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Next Week in The Sequence:</strong></h2><ol><li><p>We continue our series about alternatives to transformers. </p></li><li><p>Our opinion section discusses an intriguing thesis: systems of action vs. systems of record. </p></li><li><p>The AI of the week dives into a groundbreaking paper that I&#8217;ve read three times this week: models need sleep. </p></li></ol><h2><strong>Subscribe and don&#8217;t miss out:</strong></h2><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://thesequence.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">TheSequence is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>&#128221; Editorial: Last Week in AI: Soccer, S-1s, and Supermodels</strong></h2><p>This week I want to start close to home. At <a href="https://app.layerlens.ai/">LayerLens</a>, <a href="https://layerlens.ai/stratix-cup/season-1">we announced the Stratix Cup</a>, a live tournament in which frontier AI models play soccer in a simulated environment. Season 1 brings together 16 models organized into four groups, with each model writing code to control a full team of players. Matches unfold in two halves, and models can adapt their strategy at halftime based on what happened on the field.</p><p>It is, admittedly, ridiculous in the best possible way: models chasing space, collapsing under pressure, inventing strange formations, and occasionally self-sabotaging in public. But the playfulness hides an important point. Evaluations need more arenas.</p><p>Most AI evals still behave like school exams: static, individual, decontextualized. They ask models to answer questions, solve coding problems, summarize documents, or reason through puzzles. These are useful, but they are incomplete. Soccer imposes a different discipline. It tests multi-agent planning, tactical adaptation, long-horizon credit assignment, robustness under adversarial pressure, and the ability to recover from mistakes. It also makes model behavior legible. You do not need a PhD to see when a model loses shape in midfield. That makes the failure modes both entertaining and intellectually honest, a rare combination in AI benchmarks.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;d8e42e7c-9a0f-4cf1-a40e-e8d3d7b0a092&quot;,&quot;duration&quot;:null}"></div><p>The rest of the week echoed the same shift from models as artifacts to models as operating systems.</p><p>At Build, Microsoft introduced a new generation of MAI models across reasoning, coding, image, voice, and transcription. The headline is not just that Microsoft is building more of its own models. The strategic point is that the company wants a tighter loop between models, developer tools, agents, and devices. GitHub Copilot, agent security primitives, new model releases, and AI-native workflows all point to a world where AI is no longer a chat box bolted onto software. It is becoming the substrate running through work itself.</p><p>Anthropic&#8217;s confidential S-1 filing adds a different kind of gravity. The proposed IPO is not yet a public-market event, but it signals that frontier AI is moving from private-market mythology into public-market accountability. Revenue quality, compute commitments, margins, governance, and safety claims will eventually have to survive a much harsher evaluation suite: investors, regulators, and quarterly reporting.</p><p>NVIDIA, meanwhile, pushed the frontier in two complementary directions. Cosmos advances the idea of world foundation models for physical AI: systems that can reason about video, simulation, robotics, and embodied environments. Nemotron 3 Ultra expands NVIDIA&#8217;s enterprise model stack for demanding reasoning and agentic workflows. The implication is clear: NVIDIA is not merely selling the shovels for the AI gold rush. It wants to define the terrain where robots, agents, simulations, and enterprises are built.</p><p>Finally, DeepSeek&#8217;s reported new financing round is another reminder that the open-model race is becoming geopolitical infrastructure. Capital, energy, chips, talent, and industrial policy are converging around frontier labs. Open models are no longer just an engineering philosophy. They are becoming a strategic asset.</p><p>Put together, this week&#8217;s story is AI leaving the demo page. It is playing games, staffing workflows, filing S-1s, simulating the physical world, and attracting national-scale capital.</p><p>The question is no longer simply who has the best chatbot. It is which systems can act, adapt, and be trusted in environments that push back. Benchmarks told us how models answer. Arenas will tell us how they behave.</p><h2><strong>&#128270; AI Research</strong></h2><h3><a href="https://arxiv.org/html/2606.03979v1">Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories </a></h3><p><strong>AI Lab:</strong> Google Research and Cornell University </p><p><strong>Summary:</strong> This paper introduces a bio-inspired &#8220;Sleep&#8221; paradigm for large language models that enables continual learning and mitigates catastrophic forgetting. The approach features a memory consolidation stage that distills fragile short-term memories into stable long-term parameters, alongside a &#8220;Dreaming&#8221; phase where the model recursively self-improves using synthetically generated data.</p><h3><a href="https://arxiv.org/html/2606.02859v1">Economy of Minds: Emerging Multi-Agent Intelligence with Economic Interactions </a></h3><p><strong>AI Lab:</strong> Harvard, MIT, GitHub, 2077 AI, and Kempner Institute </p><p><strong>Summary:</strong> The Economy of Minds (EOM) framework enables a population of language agents to self-organize and evolve through decentralized economic interactions, such as bidding for the right to act and exchanging peer-to-peer payments. By relying on market selection to mutate wealthy, successful agents and eliminate bankrupt, ineffective ones, this system fosters emergent multi-step reasoning and outperforms stronger monolithic baselines across diverse tasks.</p><h3><a href="https://arxiv.org/abs/2606.02800">Cosmos 3: Omnimodal World Models for Physical AI </a></h3><p><strong>AI Lab:</strong> NVIDIA </p><p><strong>Summary:</strong> Cosmos 3 introduces a unified Mixture-of-Transformers architecture that seamlessly integrates language, image, video, audio, and action sequences to serve as a versatile foundation for Physical AI. By consolidating multiple specialized functions&#8212;such as vision-language models, video generators, and world-action models&#8212;into a single framework, it achieves state-of-the-art performance across diverse understanding and generation tasks.</p><h3><a href="https://arxiv.org/html/2606.03746v2">Qwen-Image-Flash: Beyond Objective Design </a></h3><p><strong>AI Lab:</strong> Qwen (Alibaba) </p><p><strong>Summary: </strong>This paper explores the critical training components of few-step distillation&#8212;specifically data composition, step-wise multi-teacher guidance, and task mixture&#8212;demonstrating that effective distillation requires holistic pipeline optimization beyond just the objective function. Applying these insights, the authors developed Qwen-Image-Flash, an efficient unified model capable of executing both text-to-image generation and instruction-guided image editing with only four function evaluations.</p><h3><a href="https://openai.com/index/chatgpt-memory-dreaming/">Dreaming: Better memory for a more helpful ChatGPT</a></h3><p><strong>AI Lab:</strong> OpenAI</p><p><strong>Summary:</strong> OpenAI has introduced a more scalable and compute-efficient memory architecture for ChatGPT that relies on a background process called &#8220;dreaming&#8221; to continuously curate and synthesize past conversations. By optimizing for freshness, continuity, and relevance, this new system significantly improves the model&#8217;s ability to recall factual context, follow user preferences, and adapt to changing information over time.</p><h2><strong>&#129302; AI Tech Releases</strong></h2><h3><strong>Nemotron 3 - Ultra</strong></h3><p>NVIDIA <a href="https://developer.nvidia.com/blog/nvidia-nemotron-3-ultra-powers-faster-more-efficient-reasoning-for-long-running-agents/">released Nemotron 3 Ultra</a>, optimized for long running agentic workflows. </p><h3>MAI Models</h3><p>Microsoft <a href="https://microsoft.ai/news/building-a-hillclimbing-machine-launching-seven-new-mai-models/">unveiled 7 new AI models </a>including a flagship MAI-Thinking-1. </p><h3>Gemma 4 12B</h3><p>DeepMind <a href="https://blog.google/innovation-and-ai/technology/developers-tools/introducing-gemma-4-12b/">released Gemma 4 12B</a>, a multimodal intelligence model that can run locally on a laptop. </p><h2><strong>&#128225;AI News You Need to Know About</strong></h2><ul><li><p><strong><a href="https://www.bloomberg.com/news/articles/2026-06-04/airbnb-ceo-brian-chesky-plans-to-start-a-new-ai-company">Airbnb&#8217;s Brian Chesky plans a new AI lab</a></strong><a href="https://www.bloomberg.com/news/articles/2026-06-04/airbnb-ceo-brian-chesky-plans-to-start-a-new-ai-company"> </a>&#8212; Airbnb CEO Brian Chesky intends to bankroll a new AI lab (reportedly focused on user interaction and design) while staying on as Airbnb&#8217;s CEO rather than running it himself. <em>Original source:</em> the Bloomberg scoop that broke it, since Airbnb declined to comment and issued nothing of its own &#8594; </p></li><li><p><strong><a href="https://s206.q4cdn.com/479360582/files/doc_news/2026/Jun/01/attachments/2026-June-Alphabet-Equity-Capital-Raise-Press-Release-PDF.pdf">Alphabet plans to raise $80B for AI buildout</a></strong><a href="https://s206.q4cdn.com/479360582/files/doc_news/2026/Jun/01/attachments/2026-June-Alphabet-Equity-Capital-Raise-Press-Release-PDF.pdf"> </a>&#8212; Alphabet said it will sell $80 billion in stock (including $10 billion to Berkshire Hathaway) to fund AI infrastructure and compute amid demand outstripping supply. <em>Original source:</em> Alphabet&#8217;s own press release &#8594;<a href="https://coralogix.com/coralogix-raises-200m-to-scale-the-observability-backbone-for-the-age-of-ai/"> </a></p></li><li><p><strong><a href="https://coralogix.com/coralogix-raises-200m-to-scale-the-observability-backbone-for-the-age-of-ai/">Coralogix raises $200M</a></strong><a href="https://coralogix.com/coralogix-raises-200m-to-scale-the-observability-backbone-for-the-age-of-ai/"> </a>&#8212; The observability startup raised a $200M Series F (co-led by Advent, CPPIB, and Greenfield) at a $1.6B valuation, betting that monitoring autonomous AI agents will become core production infrastructure. </p></li><li><p><strong><a href="https://www.globenewswire.com/news-release/2026/06/02/3305264/0/en/zerodrift-raises-10m-seed-round-to-build-the-compliance-firewall-for-ai.html">ZeroDrift raises $10M</a></strong><a href="https://www.globenewswire.com/news-release/2026/06/02/3305264/0/en/zerodrift-raises-10m-seed-round-to-build-the-compliance-firewall-for-ai.html"> </a>&#8212; The startup closed an oversubscribed $10M seed round (backed by a16z Speedrun and others) for its &#8220;compliance firewall,&#8221; which sits inline between AI systems and end users to catch and rewrite non-compliant AI-generated messages. </p></li><li><p><strong><a href="https://www.anthropic.com/news/confidential-draft-s1-sec">Anthropic files to go public</a></strong><a href="https://www.anthropic.com/news/confidential-draft-s1-sec"> </a>&#8212; Anthropic confidentially submitted a draft S-1 to the SEC for a proposed IPO, days after a $65B Series H pushed its valuation near $1 trillion. </p></li><li><p><strong><a href="https://about.fb.com/news/2026/06/meta-business-agent/">Meta sells an AI agent to businesses</a></strong> &#8212; Meta began charging for &#8220;Meta Business Agent,&#8221; an AI that handles customer conversations across WhatsApp, Messenger, and Instagram, as part of its push to monetize AI and diversify beyond ads. </p></li><li><p><strong><a href="https://about.fb.com/news/2026/06/meta-business-agent/">DeepSeek close to a ~$7B funding round</a></strong><a href="https://about.fb.com/news/2026/06/meta-business-agent/"> </a>&#8212; DeepSeek is reportedly near finalizing roughly 50 billion yuan (~$7.4B) in its first external raise, led by Tencent, CATL, and founder Liang Wenfeng, at a ~$52&#8211;59B valuation. <em>Note:</em> there&#8217;s no primary source here &#8212; the deal is unconfirmed and the parties declined to comment &#8212; and the scoop originated with Reuters before Bloomberg corroborated it, so the </p></li><li><p><strong><a href="https://www.cnbc.com/2026/06/05/google-to-pay-spacex-920-million-a-month-for-xai-compute-capacity.html">Google to pay SpaceX $920M/month for compute</a></strong> &#8212; Google will pay SpaceX ~$920M monthly from October 2026 through June 2029 for access to roughly 110,000 NVIDIA GPUs and related hardware as &#8220;bridge capacity&#8221; for Gemini Enterprise demand, announced a week before SpaceX&#8217;s IPO. </p></li></ul>]]></content:encoded></item><item><title><![CDATA[The Sequence Opinion #872: The Cake Is a Battlefield: Who Really Controls the AI Stack ]]></title><description><![CDATA[Full stacks vs layer specialists. That's the AI race.]]></description><link>https://thesequence.substack.com/p/the-sequence-opinion-872-the-cake</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-opinion-872-the-cake</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Thu, 04 Jun 2026 10:58:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!aoLI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!aoLI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!aoLI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!aoLI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!aoLI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!aoLI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!aoLI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2629687,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/200595018?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!aoLI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!aoLI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!aoLI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!aoLI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc1490fb2-205a-49ca-a01e-ec5113042fb7_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>When Jensen Huang draws AI as a five-layer cake &#8212; energy, chips, infrastructure, models, applications &#8212; he describes it as harmony. Every layer reinforces the others. Every successful application pulls demand down through models, infrastructure, and chips, all the way to the power plant that keeps it alive. It is a beautiful picture, and as a statement of physics it is correct.</p><p>But Jensen is selling the bottom of the cake, so of course he wants you to see harmony. If you are a strategist instead of a chip vendor, you look at the same diagram and see something else entirely: five margin pools stacked on top of each other, and a war over which of them you can fuse together before the layer beneath you turns into a commodity. The cake is not a structure of mutual reinforcement. It is a battlefield with a vertical axis.</p><p>So the right question is never &#8220;how many layers do you own.&#8221; It is: do you own the <em>scarce</em> layer, and the seam right next to it?</p>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-opinion-872-the-cake">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence AI of the Week #871: Inside the Loop with Claude Opus 4.8]]></title><description><![CDATA[Might seem like a minor release. But it really isn't.]]></description><link>https://thesequence.substack.com/p/the-sequence-ai-of-the-week-871-inside</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-ai-of-the-week-871-inside</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Wed, 03 Jun 2026 11:01:13 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wRky!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wRky!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wRky!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!wRky!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!wRky!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!wRky!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wRky!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2242971,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/200377784?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wRky!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!wRky!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!wRky!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!wRky!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb6db283c-26a1-40d7-b4ea-cb2fca4234a3_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I am sure you guys are surprised that we are going to cover Claude Opus 4.8 today ;) but I have been playing with it so much that merits the post. </p><p>Opus 4.8 shipped on May 28, 2026. The headline contributions, in the order I&#8217;d rank them for anyone building agents: a roughly <strong>4x reduction in how often the model leaves a flaw in its own code unremarked</strong> &#8212; the calibration/honesty story that defines this release; a fix for <strong>silently skipped tool calls</strong>, the bug class that quietly poisons long trajectories; <strong>better compaction recovery</strong> so long-horizon runs stop derailing after the history gets squeezed; <strong>dynamic workflows</strong> that let the model plan and fan out hundreds of parallel subagents for codebase-scale work; <strong>adaptive thinking</strong> that decides per-turn whether to reason at all; and a <strong>fast mode</strong> that runs ~2.5x faster at a tier that&#8217;s now ~3x cheaper than 4.7&#8217;s. Alignment results land near the (still-restricted) Mythos Preview. Same regular-mode pricing as its predecessor.</p><p>It&#8217;s tempting to file this under &#8220;minor release.&#8221; The version bump is a tenth of a point, the benchmark deltas are mostly incremental, and the cadence makes it easy to lose track &#8212; Opus 4.6 landed February 5, 4.7 on April 16, and 4.8 just six weeks later. That&#8217;s a compression from a roughly quarterly rhythm to something closer to monthly, and when point releases arrive that fast the instinct is to treat each one as a patch and skip the changelog.</p><p>That instinct is wrong here, because Opus 4.8 isn&#8217;t competing on the axis the version number implies. The benchmark table moved a little. What moved a lot is the <em>reliability</em> axis &#8212; the silent-failure rate, the tool discipline, the ability to hold a thread across a long run unattended. Those are the properties that gate whether you can actually leave an agent running, and they don&#8217;t show up on a capability leaderboard. The short cadence is also the tell: when you can ship calibration and reliability fixes every six weeks, the model stops being a thing you upgrade quarterly and becomes infrastructure you keep current. So let me give you the version I&#8217;d want if I were wiring this into a production agent loop at 2am.</p><h2>The benchmark story is the boring story</h2>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-ai-of-the-week-871-inside">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Knowledge #870: Liquid Models and the Search for a Post-Transformer Architecture]]></title><description><![CDATA[Inside one of the msot promising non-transformer architectures.]]></description><link>https://thesequence.substack.com/p/the-sequence-knowledge-870-liquid</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-knowledge-870-liquid</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Tue, 02 Jun 2026 11:03:31 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!o3Wa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!o3Wa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!o3Wa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!o3Wa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!o3Wa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!o3Wa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!o3Wa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2755696,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/200201347?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!o3Wa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!o3Wa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!o3Wa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!o3Wa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc02384d0-28a1-46ae-8ae1-f8641e9e3b0d_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>&#128161; AI Concept of the Day: Liquid Models and the Search for a Post-Transformer Architecture</strong></h2><p>The Transformer did not merely become the dominant neural architecture. It became the default mental model for intelligence in modern AI.</p><p>Its central idea is deceptively simple: when processing a sequence, let every element look at every other element. A word can attend to previous words. A code token can attend to distant variables. An image patch can attend to another patch. A tool call can attend to an instruction buried thousands of tokens earlier. Attention turns sequence modeling into a giant differentiable lookup table over context.</p><p>This was a profound break from the recurrent era. Earlier models processed sequences like a reader moving left to right, updating a hidden state at each step. Transformers flattened that temporal process into a massively parallel computation. Instead of compressing the past into a single state, they exposed the entire past to the model. That made training easier, scaling more predictable, and long-range relationships easier to represent.</p><p>But every architecture has a physics. Transformers have the physics of global interaction. That physics is powerful, but expensive.</p><p>Self-attention wants to compare tokens against other tokens. During inference, the model accumulates a key-value cache so that each new token can attend to the past. As context grows, memory grows. As model size grows, serving complexity grows. As agents become longer-running, more tool-using, and more local, the cost of remembering everything explicitly becomes harder to ignore.</p><p>The Transformer is a brilliant architecture for cloud-scale intelligence. It is less obviously the final architecture for always-on, low-latency, private, embodied, on-device intelligence.</p><p>That is where liquid models enter the story.</p><h3><strong>From Attention to Dynamics</strong></h3>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-knowledge-870-liquid">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Radar #869: Last Week in AI: The Token Becomes the Unit of Account — Opus 4.8, OpenRouter, Cognition, Snowflake, and a papal warning]]></title><description><![CDATA[Opus 4.8 and remarkable fundraising events.]]></description><link>https://thesequence.substack.com/p/the-sequence-radar-869-last-week</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-radar-869-last-week</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Sun, 31 May 2026 11:02:53 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wXCM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wXCM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wXCM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!wXCM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!wXCM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!wXCM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wXCM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c5810add-29a7-450e-9204-f1fd11267430_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2789738,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/199725661?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wXCM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!wXCM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!wXCM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!wXCM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc5810add-29a7-450e-9204-f1fd11267430_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Next Week in The Sequence:</strong></h2><ul><li><p>We continue our series about transformer alternatives. </p></li><li><p>In the AI of the Week section, we discuss Opus 4.8. </p></li><li><p>The opinion of the week discusses companies the strategic differences between companies like Google, NVIDIA, Microsoft, OpenAI and Anthropic when comes to their ownership of different areas of the AI stack. </p></li></ul><h2><strong>Subscribe and don&#8217;t miss out:</strong></h2><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://thesequence.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">TheSequence is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>&#128221; Editorial: </strong><em>Last Week in AI: The Token Becomes the Unit of Account</em> &#8212; Opus 4.8, OpenRouter, Cognition, Snowflake, and a papal warning</h2><p>For two years the AI boom was an argument about the future, told in benchmarks and term sheets. This week it became an argument about the present, told in revenue.</p><p>Start with the substrate. Anthropic shipped Claude Opus 4.8 and, in the same breath, disclosed it&#8217;s tracking toward its first operating profit &#8212; roughly $10.9B in projected Q2 revenue, up about 130% quarter over quarter &#8212; while closing a $65B round. Sit with that. A lab still doing frontier training runs is approaching operational profitability. The &#8220;labs are structurally unprofitable&#8221; assumption that anchored every bear case just lost its load-bearing wall.</p><p>The model itself Anthropic described, refreshingly, as &#8220;a modest but tangible improvement&#8221; &#8212; agentic coding nudges from ~64% to ~69%, reasoning-with-tools from ~55% to ~58%, at the same price as 4.7. The interesting stuff is underneath the benchmarks. Three changes matter. First, an effort control that lets you dial how hard the model thinks per task &#8212; explicit governance over the compute-versus-quality tradeoff that every agent builder has been hacking around with prompt tricks. Second, <em>dynamic workflows</em>: a Claude Code capability where the model plans a large task, spins up parallel sub-agents to attack the pieces, verifies their outputs, and reports back &#8212; paired with a Messages API that now accepts live edits to the message array mid-run without breaking the prompt cache, so you can steer a long job without tearing it down and restarting. Third, honesty as a measured capability: 4.8 is roughly 4x less likely than 4.7 to let a flaw in its own code slip through unflagged, and surfaces its own uncertainty more readily. Stack those and you get the thing that actually matters once a model runs unattended for hours: it plans, it parallelizes, it checks its own work, and &#8212; because you can&#8217;t read every diff &#8212; it&#8217;s trained to distrust itself. It also burns tokens by the fistful doing all of it.</p><p>Then watch where the money flows, because it tells you the unit of account is now the token. OpenRouter raised $113M at $1.3B doing something almost embarrassingly simple: routing across 400+ models and taking ~5% of the inference spend that passes through. Its weekly throughput went from 5T to 25T tokens in six months &#8212; 5x. That&#8217;s not a forecast; that&#8217;s a meter. Cognition raised $1B at $26B, and buried in the announcement was the line that should reorganize your priors: 89% of code committed inside Cognition is now written by Devin, up from 13% in December. Run-rate revenue went from $37M to $492M in a year. Autonomous software engineering stopped being a demo and became the default committer.</p><p>Snowflake closes the loop on the public side. Product revenue up 34%, guidance raised, stock up ~36% in a single session &#8212; and the two tells are a $6B AWS compute deal and the acquisition of Natoma, an MCP platform for governing agent access. The data layer is repricing itself around agents that <em>consume</em>, not analysts that query. The whole stack &#8212; model, router, agent, substrate &#8212; is converging on one business model: charge by the token, because the token is the work.</p><p>Which is exactly the moment Pope Leo XIV chose to publish <em>Magnifica Humanitas</em>, his first encyclical, presented alongside Anthropic&#8217;s Chris Olah. Stripped of its theology, the argument is an engineering critique the field should take seriously: technology is never neutral, because it inherits the incentives of whoever builds and funds it &#8212; and the danger isn&#8217;t malice but quiet disintermediation, decisions sliding out of human hands one delegated commit at a time.</p><p>Hold those two facts together. Cognition&#8217;s 89% is the encyclical&#8217;s thesis restated as a metric. The meter that makes the economics work is the same meter measuring how much judgment we&#8217;ve handed off. The bull case and the moral case are reading the same number.</p><p>The flywheel is no longer a slide. It&#8217;s on the income statement. The open question is what it&#8217;s optimizing for.</p><h2><strong>&#128270; AI Research</strong></h2><h3><a href="https://arxiv.org/html/2605.26457v1">Verus-SpecGym: An Agentic Environment for Evaluating Specification Autoformalization </a></h3><p><strong>AI Lab</strong>: CMU &amp; Amazon </p><p><strong>Summary</strong>: To address the challenge of evaluating whether AI agents can accurately translate informal programming intent into formal specifications, the researchers introduce the VERUS-SPECBENCH benchmark and the VERUS-SPECGYM agentic environment. By extending an execution mechanism to test generated specifications against both official tests and adversarial &#8220;hacks,&#8221; the study reveals that specification autoformalization remains highly brittle even for models capable of generating correct code.</p><h3><a href="https://arxiv.org/abs/2605.28816">Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players</a></h3><p><strong>AI Lab</strong>: Tsinghua University, NVIDIA, University of Toronto, &amp; Vector Institute</p><p><strong>Summary</strong>: Gamma-World presents a scalable, generative multi-agent world model that moves beyond traditional single-agent simulations by utilizing Simplex Rotary Agent Encoding for permutation-symmetric identities and Sparse Hub Attention for efficient cross-agent communication. Through conditional teacher-student distillation and KV-cached streaming, the framework achieves real-time, action-responsive rollouts at 24 FPS that maintain strong consistency across virtual gaming and physical robotic environments.</p><h3><a href="https://arxiv.org/html/2605.28814v1">Self-Improving Language Models with Bidirectional Evolutionary Search</a></h3><p><strong>AI Lab</strong>: Harvard University &amp; MIT </p><p><strong>Summary</strong>: Bidirectional Evolutionary Search (BES) overcomes the limitations of sparse verification signals and narrow autoregressive expansion by coupling forward candidate evolution with backward goal decomposition. By recombining trajectory segments to escape narrow probability distributions and scoring them against fine-grained sub-goals, BES significantly outperforms existing open-source frameworks on complex logical reasoning and open problem-solving tasks.</p><h3><a href="https://arxiv.org/html/2605.27358v1">MobileMoE: Scaling On-Device Mixture of Experts </a></h3><p><strong>AI Lab</strong>: Meta AI </p><p><strong>Summary</strong>: MobileMoE introduces a family of sub-billion active-parameter Mixture-of-Experts (MoE) language models specifically optimized for efficient deployment on edge devices like smartphones. Guided by a novel on-device scaling law and supported by a custom fused MoE kernel, these models achieve state-of-the-art performance while delivering substantially faster prefill and decode speeds compared to dense baselines at a similar memory footprint.</p><h3><a href="https://arxiv.org/html/2605.27295v1">Gemini Embedding 2: A Native Multimodal Embedding Model from Gemini </a></h3><p><strong>AI Lab</strong>: Google DeepMind </p><p><strong>Summary</strong>: Gemini Embedding 2 is a native multimodal embedding model that seamlessly maps text, image, audio, and video inputs into a single, unified representation space without relying on intermediate transcriptions. Trained via large-scale contrastive learning in a multi-task setup, the model establishes new state-of-the-art performance across unimodal, cross-modal, and multimodal retrieval benchmarks while demonstrating robust zero-shot generalization across diverse enterprise and specialized domains.</p><h3><a href="https://arxiv.org/html/2605.26494v1">The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence </a></h3><h3><strong>AI Lab</strong>: MiniMax </h3><p><strong>Summary</strong>: The MiniMax-M2 series introduces a highly efficient 229.9B parameter Mixture-of-Experts model that activates only 9.8B parameters per token, specifically engineered for complex, long-horizon agentic workflows. By leveraging agent-driven data pipelines, a specialized reinforcement learning system called Forge, and autonomous self-evolution capabilities, the model achieves frontier-level performance across coding, deep search, and reasoning benchmarks while maintaining a minimal computational footprint.</p><h2><strong>&#129302; AI Tech Releases</strong></h2><h3><strong>Claude Opus 4.8</strong></h3><p>Anthropic <a href="https://www.anthropic.com/news/claude-opus-4-8">released the new version of its marquee model</a>, with strong agentic and coding capabilities. </p><h2><strong>&#128225;10 AI News You Need to Know About</strong></h2><ol><li><p><strong><a href="https://www.anthropic.com/news/series-h">Anthropic raises $65B in Series H at $965B post-money valuation</a></strong><a href="https://www.anthropic.com/news/series-h"> </a>&#8212; Anthropic raised $65 billion in Series H funding (co-led by Altimeter, Dragoneer, Greenoaks, and Sequoia) at a $965 billion post-money valuation, disclosing that run-rate revenue crossed $47 billion earlier this month, and bringing on Micron, Samsung, and SK hynix as strategic memory/storage partners alongside $15B in previously committed hyperscaler investment (including $5B from Amazon).</p></li><li><p><strong><a href="https://techcrunch.com/2026/05/27/ai-coding-startup-cognition-raises-1b-at-25b-pre-money-valuation/">Cognition raises $1B at $25B pre-money valuation</a></strong><a href="https://techcrunch.com/2026/05/27/ai-coding-startup-cognition-raises-1b-at-25b-pre-money-valuation/"> </a>&#8212; Cognition, maker of the AI software engineer Devin, raised more than $1 billion (led by Lux Capital, General Catalyst, and 8VC) at a ~$26B post-money valuation, more than doubling in eight months as it hit a $492M annualized revenue run-rate. </p></li><li><p><strong><a href="https://robinhood.com/us/en/newsroom/robinhood-is-now-open-to-agents/">Robinhood lets AI agents trade stocks</a></strong><a href="https://robinhood.com/us/en/newsroom/robinhood-is-now-open-to-agents/"> </a>&#8212; Robinhood launched Agentic Trading and an Agentic Credit Card in beta, letting customers connect third-party AI agents (via MCP) to a separate, funded account to autonomously trade equities and make purchases. <em>Original source: </em></p></li><li><p><strong><a href="https://menlovc.com/perspective/openrouter-now-processes-more-than-a-quadrillion-tokens-a-year/ (DealBook also broke it: https://www.nytimes.com/2026/05/26/business/dealbook/openrouter-ai-models-fundraising.html">OpenRouter doubles valuation to $1.3B</a></strong><a href="https://menlovc.com/perspective/openrouter-now-processes-more-than-a-quadrillion-tokens-a-year/ (DealBook also broke it: https://www.nytimes.com/2026/05/26/business/dealbook/openrouter-ai-models-fundraising.html"> </a>&#8212; The multi-model AI inference-routing startup raised a $113M Series B led by Alphabet&#8217;s CapitalG at a ~$1.3B valuation, more than double its level a year ago, as weekly volume grew from 5T to 25T tokens. </p></li><li><p><strong><a href="https://www.businesswire.com/news/home/20260521171628/en/Hark-Raises-%24700M-Series-A-at-a-%246B-Valuation">Hark raises $700M Series A</a></strong> &#8212; Brett Adcock&#8217;s secretive AI startup raised $700M at a $6B post-money valuation (led by Parkway Venture Capital) to build a &#8220;universal&#8221; agentic AI assistant with proprietary multimodal models and custom hardware, with first models due summer 2026. </p></li><li><p><strong><a href="https://mistral.ai/news/ai-now-summit-2026/">Mistral signs Airbus and BMW</a></strong> &#8212; Mistral AI expanded into &#8220;physical AI&#8221; for manufacturing, announcing partnerships to apply its models to Airbus (aircraft design, flight safety, defense/space) and BMW&#8217;s &#8220;Large Industry Model&#8221; crash-simulation initiative, plus a new French data center. </p></li><li><p><strong><a href="https://www.bloomberg.com/news/articles/2026-05-28/china-ai-upstart-minimax-doubles-sales-ahead-of-new-model-launch">MiniMax doubles sales ahead of new model</a></strong><a href="https://www.bloomberg.com/news/articles/2026-05-28/china-ai-upstart-minimax-doubles-sales-ahead-of-new-model-launch"> </a>&#8212; The Chinese AI developer&#8217;s annualized revenue more than doubled over two months to at least $300M, driven by its M2.7 model and a fivefold jump in enterprise users, ahead of its next flagship launch. <em>No primary source to substitute: the figures come from a Bloomberg Television interview with co-founder Yun Yeyi, so Bloomberg is the original source.</em></p><p></p></li><li><p><strong><a href="https://money.usnews.com/investing/news/articles/2026-05-26/sk-hynix-joins-1-trillion-club-after-samsung-micron-on-ai-chip-boom">SK Hynix joins the $1 trillion club</a></strong><a href="https://money.usnews.com/investing/news/articles/2026-05-26/sk-hynix-joins-1-trillion-club-after-samsung-micron-on-ai-chip-boom"> </a>&#8212; Shares of the South Korean memory maker surged ~9&#8211;15% to push its market value above $1 trillion for the first time, driven by HBM demand for AI, joining rivals Samsung and Micron. <em>No company announcement exists for a stock-price milestone; the event was originated on the wires by Reuters: https://money.usnews.com/investing/news/articles/2026-05-26/sk-hynix-joins-1-trillion-club-after-samsung-micron-on-ai-chip-boom</em></p></li><li><p><strong><a href="https://www.vaticannews.va/en/pope/news/2026-05/pope-leo-xiv-encyclical-magnifica-humanitas-ai.html">Pope Leo warns AI shouldn&#8217;t dominate humanity</a></strong><a href="https://www.vaticannews.va/en/pope/news/2026-05/pope-leo-xiv-encyclical-magnifica-humanitas-ai.html"> </a>&#8212; In his first encyclical, <em>Magnifica Humanitas</em>, Pope Leo XIV called for &#8220;disarming&#8221; AI to keep it human-friendly and free of monopolistic control, warning it risks deepening inequality and eroding human agency. <em>Original source: the encyclical itself, published by the Vatican (vatican.va) &#8212; that&#8217;s the underlying document the coverage is based on.</em></p></li><li><p><strong><a href="https://www.snowflake.com/en/news/press-releases/snowflake-expands-aws-collaboration-with-6b-commitment-to-accelerate-enterprise-agentic-ai-adoption/">Snowflake signs $6B AWS deal for Graviton chips</a></strong><a href="https://www.snowflake.com/en/news/press-releases/snowflake-expands-aws-collaboration-with-6b-commitment-to-accelerate-enterprise-agentic-ai-adoption/"> </a>&#8212; Snowflake committed $6B over five years to AWS &#8212; its largest infrastructure commitment ever &#8212; expanding use of Amazon&#8217;s ARM-based Graviton CPUs and GPUs to power agentic AI workloads. </p></li></ol><p></p>]]></content:encoded></item><item><title><![CDATA[The Sequence Opinion #868: Recursion Is the New Scaling Law]]></title><description><![CDATA[The technique pushing breakthroughs in agentic computing.]]></description><link>https://thesequence.substack.com/p/the-sequence-opinion-868-recursion</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-opinion-868-recursion</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Thu, 28 May 2026 11:02:47 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!ABvC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ABvC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ABvC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!ABvC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!ABvC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!ABvC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ABvC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2450040,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/199543141?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ABvC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!ABvC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!ABvC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!ABvC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff0efac32-526e-44e5-af19-e037c3f4c8ee_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>For most of the modern AI era, progress has had a deceptively simple recipe: make the model bigger, train it on more data, and spend more compute. This formula produced the Transformer era, the foundation model era, and the current wave of large language models. Scaling laws gave the field an almost industrial rhythm. Loss curves became roadmaps. Compute budgets became strategy. The frontier could often be described by a single question: how much larger can we go?</p><p>But the most interesting recent progress in AI is beginning to feel less linear. It is no longer just about building a larger model that gives a better answer in one pass. Increasingly, the frontier is about models and systems that can revisit, revise, search, simulate, critique, and improve. The important unit of computation is shifting from the forward pass to the loop.</p><p><strong>That shift suggests a provocative idea: recursion may be the next scaling law.</strong></p>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-opinion-868-recursion">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence AI of the Week #867: Thinking in Latents: Why Sapient's HRM-Text Is a Quiet Rebuke to Chain-of-Thought]]></title><description><![CDATA[One of the most impressive small models recently released.]]></description><link>https://thesequence.substack.com/p/the-sequence-ai-of-the-week-867-thinking</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-ai-of-the-week-867-thinking</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Wed, 27 May 2026 11:01:32 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!wXmp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!wXmp!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!wXmp!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!wXmp!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!wXmp!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!wXmp!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!wXmp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2482901,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/199177138?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!wXmp!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!wXmp!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!wXmp!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!wXmp!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0d21b189-9208-4d9e-a680-f8655c7417e8_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>There is a particular sleight-of-hand at the heart of modern LLM reasoning that, the more I look at it, the more it bothers me. The argument goes like this: Transformers are shallow. A 70-layer stack is <em>fixed depth</em> &#8212; it sits in complexity classes like AC&#8304; or TC&#8304;, which is a polite way of saying it cannot, in a single forward pass, solve problems that fundamentally require sequential computation. So we paper over this by making the model think out loud. We give it a scratchpad. We call it Chain-of-Thought. We celebrate.</p><p>But CoT is not reasoning. CoT is the model <em>renting depth from its own output tokens</em>. Every reasoning step has to leave the residual stream, become a discrete token in a vocabulary built for human communication, and come back in through the embedding layer for the next step. It is, mechanically, an absurd way to do internal computation &#8212; like a CPU that must spill every intermediate register to disk in plaintext English.</p><p>Sapient Intelligence&#8217;s bet, made first with the original Hierarchical Reasoning Model paper last summer and now extended into the language domain with HRM-Text, is that this is fixable. Not by making the model bigger, not by training on more CoT traces, but by giving the architecture the one thing it doesn&#8217;t have: variable, <em>internal</em>, depth. Reasoning that happens in the latent space, not in the token stream.</p><p>It&#8217;s worth thinking carefully about what they did and what it does and doesn&#8217;t yet prove.</p><h2>Sapient, briefly</h2>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-ai-of-the-week-867-thinking">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Knowledge #866: Three Text Diffusion Models You Need To Know About]]></title><description><![CDATA[LlaDa, Gemini Diffusion and Mercury rule the space.]]></description><link>https://thesequence.substack.com/p/the-sequence-knowledge-866-three</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-knowledge-866-three</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Tue, 26 May 2026 10:49:45 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!q6xl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!q6xl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!q6xl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!q6xl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!q6xl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!q6xl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!q6xl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2515656,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/199307760?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!q6xl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!q6xl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!q6xl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!q6xl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1da4daa8-6538-425e-b4d0-301bdd60adf4_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>&#128161; AI Concept of the Day: Three Text Diffusion Models You Need To Know About</strong></h2><p>For most of the LLM era, language generation has been built around a single assumption: text should be produced like a typewriter, one token at a time, left to right, each new symbol conditioned on a frozen history. Text diffusion models challenge that assumption at its root. They treat generation less like typing and more like editing: start from noise or masks, look at the whole canvas, and iteratively refine it into coherent language.</p><p>That sounds like a stylistic tweak. It is actually a different computational worldview. Instead of factorizing language as &#8220;the next token given all previous tokens,&#8221; diffusion models define a corruption process and then learn how to reverse it. In language, that usually means masking tokens or pushing text into noisier latent states, then training a model to recover the original sequence over several denoising steps. The result is a system that can update many positions at once, use bidirectional context during generation, and revisit its own outputs rather than committing irreversibly at every step.</p><p>If you look at the field today, three systems define the conversation more than any others: <strong>LLaDA</strong>, which proved that diffusion can scale into a real large language model; <strong>Mercury</strong>, which turned diffusion into a genuine commercial speed advantage; and <strong>Gemini Diffusion</strong>, which signaled that frontier labs see this paradigm as strategically important. Together, they outline the three phases of a new architecture class: scientific proof, industrial deployment, and frontier validation.</p><h3><strong>LLaDA: The Scientific Proof That Diffusion Can Scale</strong></h3>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-knowledge-866-three">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Radar #865: Last Week in AI: Last Week in AI: Karpathy, Google, Colossus, and the Coming IPO Wave]]></title><description><![CDATA[The next AI frontier might be capital structuring.]]></description><link>https://thesequence.substack.com/p/the-sequence-radar-865-last-week</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-radar-865-last-week</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Sun, 24 May 2026 11:00:52 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vzZ3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vzZ3!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vzZ3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vzZ3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vzZ3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vzZ3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vzZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg" width="1456" height="794" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:794,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2940784,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/jpeg&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/198830879?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vzZ3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vzZ3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vzZ3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vzZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab3266ad-c505-43b4-a3d4-79b0792e7131_2816x1536.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Next Week in The Sequence:</strong></h2><ol><li><p>We continue our series about alternatives to transformers. </p></li><li><p>In the AI of the week, we are going to deep dive into the new HRM small models, you need to know about those. </p></li><li><p>The opinion of the week we dive into the idea of recursive models as a new AI scaling law. </p></li></ol><h2><strong>Subscribe and don&#8217;t miss out:</strong></h2><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://thesequence.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">TheSequence is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>&#128221; Editorial: Last Week in AI: Last Week in AI: Karpathy, Google, Colossus, and the Coming IPO Wave</strong></h2><p>The last three weeks felt like a phase transition. Not the kind you measure in benchmark deltas &#8212; the kind where the substrate underneath frontier AI quietly rearranges itself, and the org charts and cap tables that fall out look nothing like what we had in April.</p><p>Google set the tone at I/O. Sundar called it the &#8220;agentic Gemini era,&#8221; and for once the marketing copy matched what shipped. Gemini Omni landed as the headline &#8212; an any-to-any generative model anchored on video, a real step forward in multimodal editing and world understanding &#8212; but the more consequential release was Gemini 3.5 Flash paired with Google Antigravity, their agent-first development platform. The pitch flipped from &#8220;AI that helps you write&#8221; to &#8220;agents that help you act.&#8221; Stack the new TPU 8i underneath, and Google now owns a vertically integrated agentic pipeline from silicon up through the IDE. It is the most coherent agent story any frontier lab has shipped this year.</p><p>Then, on May 19, Andrej Karpathy joined Anthropic. He&#8217;ll work on pretraining under Nick Joseph, and the official framing is that he&#8217;ll build a team &#8220;using Claude to accelerate pretraining research.&#8221; Read it carefully. An OpenAI co-founder is now constructing the loop that improves the next Claude using the current Claude. The self-improvement flywheel every lab has been sketching on whiteboards for two years just got the best person in the world to actually compile it. The signal-to-noise on this hire is unusually high &#8212; and Anthropic&#8217;s broader pattern of pulling CTOs of billion-dollar companies into individual-contributor research seats is the under-noticed sub-story.</p><p>That hire makes more sense when you read the compute side of the ledger. Two weeks earlier, on May 6, Anthropic announced full access to xAI&#8217;s Colossus 1 &#8212; more than 300 MW, roughly 220,000 H100, H200, and GB200 GPUs &#8212; to relieve the pressure that had been forcing Pro and Max rate caps. The price tag surfaced a fortnight later in SpaceX&#8217;s S-1: $1.25B per month through May 2029, roughly $45B total, with planned expansion to Colossus 2 and a stated interest in <em>orbital</em> compute. The competitor-as-supplier topology is new, and the unit economics of an inference workload now include &#8220;what megawatts can I lease from a launch company?&#8221;</p><p>The public markets were already pricing that shift. On May 14, Cerebras went public on Nasdaq, pricing at $185, opening at $350, and closing day one near a $95B market cap &#8212; the largest tech IPO since Uber, with an order book reportedly 20x oversubscribed. The headline is the pop; the interesting line items are inside the S-1. A $24.6B backlog. A multi-year contract with OpenAI worth more than $20B for 750 MW of inference capacity. A binding term sheet from AWS for CS-3 systems. Two UAE-linked customers account for roughly 86% of 2025 revenue, so the concentration risk is real, but the signal from public markets is unambiguous: specialized inference silicon, sold as long-dated capacity, now prices like infrastructure, not like a chip company. Cerebras was the dress rehearsal.</p><p>The main event is the IPO triple-stack behind it. SpaceX&#8217;s S-1 points toward a $1.75&#8211;2T listing. OpenAI is reportedly filing confidentially in days at $850B&#8211;$1T. Anthropic is targeting October at roughly $900B. Three of the largest tech IPOs in history, all gated on the same underlying compute substrate, all hitting public markets inside a six-month window. We are about to get a real-time, mark-to-market valuation function for the frontier itself.</p><p>The takeaway, for me, is that the frontier is no longer just a research artifact &#8212; it is a vertically integrated capital structure. Compute is a tradeable supply contract. Talent is liquid across labs. The moats are no longer &#8220;who has the best loss curve&#8221; but &#8220;who can fund $45B compute leases, recruit the people who can compress those leases, and keep a public-equity story coherent under quarterly scrutiny.&#8221;</p><p>The next eighteen months won&#8217;t be decided on a benchmark. They&#8217;ll be decided on a balance sheet.</p><h2><strong>&#128270; AI Research</strong></h2><h3><a href="https://arxiv.org/abs/2605.20613">HRM-Text: Efficient Pretraining Beyond Scaling </a></h3><p><strong>AI Lab</strong>: Sapient Intelligence &amp; MIT </p><p><strong>Summary</strong>: This paper introduces HRM-Text, an efficient pretraining paradigm that replaces standard Transformers with a dual-timescale Hierarchical Recurrent Model (HRM) to drastically reduce the compute and data required for training large language models. By combining this architecture with a task-completion objective trained exclusively on instruction-response pairs, the model achieves competitive benchmark performance using up to 900x fewer tokens than contemporary foundation models.</p><h3><a href="https://arxiv.org/html/2605.20668v1">On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists</a></h3><p> <strong>AI Lab</strong>: Carnegie Mellon University </p><p><strong>Summary</strong>: This study evaluates the practical effectiveness of AI-generated peer reviews by having 45 domain scientists manually grade 2,960 individual criticisms from 82 Nature-family papers. The findings reveal that while frontier AI models can surface highly significant and well-evidenced critiques, they often lack subfield-specific context and overlap heavily with one another, suggesting they are currently best utilized to augment rather than replace human reviewers.</p><h3><a href="https://research.nvidia.com/publication/2026-05_nemotron-labs-diffusion-tri-mode-language-model-unifying-autoregressive">Nemotron-Labs-Diffusion: A Tri-Mode Language Model Unifying Autoregressive, Diffusion, and Self-Speculation Decoding </a></h3><p><strong>AI Lab</strong>: NVIDIA </p><p><strong>Summary</strong>: This technical report presents Nemotron-Labs-Diffusion, a language model trained with a joint objective that seamlessly unifies autoregressive (AR), diffusion, and self-speculation decoding capabilities within a single architecture. The authors demonstrate that AR and diffusion training are complementary, allowing the model to utilize self-speculation&#8212;where diffusion drafts and AR verifies&#8212;to achieve superior throughput and efficiency across various deployment scenarios without relying on multi-token prediction (MTP) methods.</p><h3><a href="https://arxiv.org/pdf/2605.19804">Stitched Value Model for Diffusion Alignment</a></h3><p><strong>AI Lab</strong>: Google &amp; ETH Z&#252;rich</p><p><strong>Summary</strong>: This paper introduces StitchVM, a lightweight model stitching framework that attaches a frozen diffusion backbone to a pretrained pixel-space reward model to efficiently create strong value models for noisy latents. By directly evaluating noisy latents rather than relying on costly Tweedie or Monte Carlo approximations, StitchVM significantly accelerates both inference-time and training-time diffusion alignment methods while maintaining or improving generation quality.</p><h3><a href="https://arxiv.org/pdf/2605.22642">Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning</a></h3><p><strong>AI Lab</strong>: University of Illinois Urbana-Champaign &amp; Meta</p><p><strong>Summary</strong>: This paper introduces Spreadsheet-RL, an on-policy reinforcement learning framework designed to train specialized AI agents for complex spreadsheet workflows within a realistic Microsoft Excel environment. By combining an automated data collection pipeline with a structured tool harness, the framework significantly improves the ability of open-source models to execute multi-step spreadsheet tasks across general and domain-specific benchmarks.</p><h2><strong>&#129302; AI Tech Releases</strong></h2><h3>Google AI Announcements</h3><p>Google <a href="https://blog.google/innovation-and-ai/sundar-pichai-io-2026/#agents">has plenty of AI announcements at it I/O conferences</a> including the amazing Gemini Omni. </p><h3><strong>Qwen Max 3.7</strong></h3><p>Qwen <a href="https://qwen.ai/blog?id=qwen3.7">open sourced its latest marquee model</a>. </p><h3><strong>MagenticLite, MagenticBrain, Fara1.5</strong></h3><p>Microsoft <a href="https://www.microsoft.com/en-us/research/blog/magenticlite-magenticbrain-fara1-5-an-agentic-experience-optimized-for-small-models/">released MagenticLite, an agentic app for browser and file system use together with MagenticBrain and Fara, which are small models optimized for computer use tasks</a>. </p><h2><strong>&#128225;10 AI News You Need to Know About</strong></h2><ol><li><p><strong><a href="https://www.businesswire.com/news/home/20260521171628/en/Hark-Raises-700M-Series-A-at-a-6B-Valuation">Hark raises $700M Series A at $6B valuation</a></strong> &#8212; Brett Adcock&#8217;s stealthy AI lab Hark closed a $700M Series A at a $6B post-money valuation led by Parkway Venture Capital, with Nvidia, AMD Ventures, Qualcomm Ventures, Salesforce Ventures and others, to build multimodal personal-AI models (this summer) and bespoke &#8220;universal interface&#8221; hardware to follow. &#8594; </p></li><li><p><strong><a href="https://www.calcalistech.com/ctechnews/article/r1rtwrjyme">NanoCo (NanoClaw) raises $12M seed, declines ~$20M buyout</a></strong><a href="https://www.calcalistech.com/ctechnews/article/r1rtwrjyme"> </a>&#8212; The Cohen brothers&#8217; security-focused, sandboxed OpenClaw alternative NanoClaw closed an oversubscribed $12M seed led by Valley Capital Partners (with Docker, Vercel, monday.com, Slow Ventures, and Hugging Face&#8217;s Clem Delangue) six weeks after launching the open-source project that went viral via Karpathy and Singapore&#8217;s foreign minister. &#8594; No clean primary source &#8212; TC has the exclusive interview; original TC link is the canonical version. </p></li><li><p><strong><a href="https://x.com/karpathy/status/2056753169888334312">Andrej Karpathy joins Anthropic pre-training</a></strong> &#8212; Karpathy joined Anthropic to start a team using Claude to accelerate pre-training research under Nick Joseph, returning to frontier LLM R&amp;D after Tesla, OpenAI, and Eureka Labs. </p></li><li><p><strong><a href="https://www.anthropic.com/news/anthropic-acquires-stainless">Anthropic acquires Stainless</a></strong><a href="https://www.anthropic.com/news/anthropic-acquires-stainless"> </a>&#8212; Anthropic acquired Stainless, the SDK-generation startup whose tooling has powered every official Anthropic SDK and is also used by OpenAI, Google, Replicate, Runway and Cloudflare; Anthropic will wind down all hosted Stainless products, taking the SDK tooling exclusive (deal terms undisclosed; The Information had pegged the price north of $300M). </p></li><li><p><strong><a href="https://ocean.security/resources/blog/ocean-launch-out-of-stealth">Ocean emerges from stealth with $28M for agentic email security</a></strong> &#8212; Israeli founders Shay Shwartz and Oran Moyal launched Ocean out of stealth with $28M total funding led by Lightspeed (with Picture, Cerca, and angels Assaf Rappaport, Yevgeny Dibrov, Nadir Izrael), positioning its multi-agent email investigation platform against AI-generated phishing.</p></li><li><p><strong><a href="https://techfundingnews.com/china-said-no-to-metas-2b-deal-now-manus-ai-needs-1b-to-reclaim-what-it-built/">Manus weighs $1B raise to unwind Meta takeover</a></strong><a href="https://techfundingnews.com/china-said-no-to-metas-2b-deal-now-manus-ai-needs-1b-to-reclaim-what-it-built/"> </a>&#8212; Manus&#8217;s three Chinese co-founders (Xiao Hong, Ji Yichao, Zhang Tao) are exploring raising ~$1B from external investors at a valuation matching the &gt;$2B Meta paid, possibly with personal capital, to comply with Beijing&#8217;s order to unwind the December deal &#8212; with a Chinese JV structure and Hong Kong IPO as the likely next steps. </p></li><li><p><strong><a href="https://www.cnbc.com/2026/05/20/spacex-ipo-live-updates.html">SpaceX files publicly for Nasdaq IPO under SPCX</a></strong><a href="https://www.cnbc.com/2026/05/20/spacex-ipo-live-updates.html"> </a>&#8212; SpaceX publicly filed its S-1 on May 20 for what would be the largest IPO ever (~$75B target raise at ~$1.75T valuation), listing on Nasdaq as SPCX, with a super-voting structure to keep Musk in control despite multi-billion-dollar losses and a $41.3B accumulated deficit. </p></li><li><p><strong><a href="https://www.reuters.com/business/openai-preparing-file-ipo-soon-wsj-reports-2026-05-20/">OpenAI preparing IPO filing in days or weeks</a></strong> &#8212; Per WSJ (Bloomberg confirming via its own source), OpenAI is working with Goldman Sachs and Morgan Stanley on a confidential S-1 draft, potentially filed as soon as this Friday and targeting a September public debut at a valuation north of $850B &#8212; moving fast after the Musk lawsuit was dismissed on statute-of-limitations grounds. &#8594; No clean primary source &#8212; both the WSJ original and Bloomberg&#8217;s confirmation rely on anonymous sources. Bloomberg link stands; the WSJ original is paywalled and not easily linkable.</p></li><li><p><strong><a href="https://exa.ai/blog/announcing-series-c">Exa raises $250M Series C at $2.2B</a></strong> &#8212; Exa Labs raised a $250M Series C led by Andreessen Horowitz at a $2.2B valuation (more than triple its $700M valuation from last fall) to scale its agent-optimized web search API, train its next-gen retrieval models, and handle hundreds of thousands of searches per second across its 500B+ URL index. </p><p></p></li></ol><p></p>]]></content:encoded></item><item><title><![CDATA[The Sequence Opinion #864: Every AI Agent Needs a Computer]]></title><description><![CDATA[The raise of agentic sandboxes.]]></description><link>https://thesequence.substack.com/p/the-sequence-opinion-864-every-ai</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-opinion-864-every-ai</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Thu, 21 May 2026 10:45:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!jKcQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jKcQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jKcQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!jKcQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!jKcQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!jKcQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jKcQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2599521,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/198682128?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jKcQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!jKcQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!jKcQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!jKcQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F27c9e6a4-e994-4d50-8979-28564e2a82ed_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The next phase of AI agents will not be defined only by better models, longer context windows, or more elegant tool-calling APIs, but by something much more primitive: access to a computer. An agent that can only emit tokens is a brilliant brain in a jar; an agent with a filesystem, terminal, browser, network, package manager, credentials, memory, and guardrails becomes a worker inside a real execution environment. This is the core thesis: every serious AI agent needs a computer, not metaphorically, but architecturally. It needs a safe, isolated, programmable space where it can write code, run commands, inspect outputs, manipulate files, browse the web, recover from errors, and iterate through the same feedback loops that make software useful. The emerging market for micro-containers, sandboxes, browser runtimes, and agent workspaces is really the market for giving intelligence a body.</p><h2>The Brain in the Jar Problem</h2>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-opinion-864-every-ai">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence AI of the Week #863: The Model is the Interface: Inside Thinking Machines' Interactive Models ]]></title><description><![CDATA[Thinking Machines&#8217; interactive models turn real-time conversation, vision, audio, and tool use into one continuous learned system.]]></description><link>https://thesequence.substack.com/p/the-sequence-ai-of-the-week-863-the</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-ai-of-the-week-863-the</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Wed, 20 May 2026 11:03:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!o3NJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!o3NJ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!o3NJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!o3NJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!o3NJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!o3NJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!o3NJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2001234,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/198484735?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!o3NJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!o3NJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!o3NJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!o3NJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd4284d6b-0dd5-48b6-8993-dacf8fefaec0_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>For this week in AI&#8217;s essay, I would like to discuss Thinking Machines&#8217; work on interactive models which takes multi-modality to a new level. I&#8217;ve been diving as much as possible on their ideas and wanted to share some thoughts. The work is early but so impressive. Check it out to get started: </p><div id="youtube2-Ys6i_MGnjUA" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;Ys6i_MGnjUA&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/Ys6i_MGnjUA?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>For the last few years, the default mental model for large language models has been embarrassingly simple: concatenate tokens, predict the next token, repeat. The human writes a message, the model replies, the human writes again. This works surprisingly well for many tasks because text is forgiving. Text can wait. It can be buffered, edited, compressed, and serialized into one neat causal stream.</p><p>But collaboration is not text. Collaboration is temporal.</p>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-ai-of-the-week-863-the">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Knowledge #862: Learning About Text Diffusion Models]]></title><description><![CDATA[One of the most credible alternatives to transformers.]]></description><link>https://thesequence.substack.com/p/the-sequence-knowledge-862-learning</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-knowledge-862-learning</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Tue, 19 May 2026 11:03:39 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-Pat!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-Pat!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-Pat!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!-Pat!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!-Pat!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!-Pat!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-Pat!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2557690,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/198337440?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-Pat!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!-Pat!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!-Pat!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!-Pat!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d8ee9e8-a1d2-4508-93d8-72915d1ed84c_1536x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>&#128161; AI Concept of the Day: What are Text Diffusion Models</strong></h2><p>If you look at the architecture of the modern AI boom, it is heavily bifurcated by modality. In the visual domain, we are entirely ruled by diffusion models. From Midjourney to Stable Diffusion to OpenAI&#8217;s Sora, the paradigm of starting with pure noise and iteratively denoising it into a high-fidelity image or video has proven to be unreasonably effective.</p><p>But in the realm of text, diffusion has historically been an afterthought. Large Language Models (LLMs) like GPT-4, Claude, and LLaMA are staunchly autoregressive (AR). They are sequence predictors. They look at the context, predict the next token, append it to the context, and repeat. It is a strictly left-to-right, causal process.</p><p>For years, the consensus was simple: autoregression is just the native physics of language. But this sequential paradigm has glaring pathologies. Because AR models generate blindly from left to right, they cannot easily engage in global planning. If they make a slight logical error early in a sequence, that error is committed to the context window permanently, leading to cascading failures&#8212;a phenomenon often critiqued as &#8220;generation drift.&#8221; Furthermore, AR models suffer from the &#8220;reversal curse&#8221;; they can easily recite a poem forward, but if you ask them to recite it backward, their causal attention mechanisms break down entirely.</p>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-knowledge-862-learning">
              Read more
          </a>
      </p>
   ]]></content:encoded></item><item><title><![CDATA[The Sequence Radar #861: Last Week in AI: IPOs, Interactive Models, and Recursive Dreams]]></title><description><![CDATA[Cerebras monster IPO, three new innovative frontier AI labs]]></description><link>https://thesequence.substack.com/p/the-sequence-radar-861-last-week</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-radar-861-last-week</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Sun, 17 May 2026 11:02:52 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dRlC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dRlC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dRlC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!dRlC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!dRlC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!dRlC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dRlC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2863267,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/197841120?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dRlC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!dRlC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!dRlC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!dRlC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1d9a7039-7459-45a1-854a-0f62aa6b8c28_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>Next Week in The Sequence:</strong></h2><ol><li><p>We continue our series about trasnformer alternatives. </p></li><li><p>In our opinion section, we discuss the thesis that &#8220;every agent needs a computer&#8221; </p></li><li><p>In the AI of the Week section, we dive into Thinking Machines&#8217; interactive models which can think and listen and same time. </p></li></ol><h2><strong>Subscribe and don&#8217;t miss out:</strong></h2><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://thesequence.substack.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">TheSequence is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h2><strong>&#128221; Editorial: Last Week in AI: IPOs, Interactive Models, and Recursive Dreams</strong></h2><p>This week in AI felt less like a product cycle and more like a philosophical provocation wrapped in a market event, a demo, and a few delightfully ambitious lab announcements. The common thread was not bigger models, larger context windows, or yet another benchmark victory. It was agency. Who gets to shape intelligence? Who gets to improve it? And, slightly more ominously, what happens when the tools begin improving the tools?</p><p>Start with Cerebras. The IPO was not just a financing milestone for an AI chip company; it was a reminder that the AI race is still brutally physical. Behind every magical chatbot sits an industrial stack of silicon, power, networking, cooling, capital markets, and geopolitical anxiety. Cerebras has always been the wonderfully weird character in the AI hardware opera: instead of making chips modestly larger, it went all-in on the wafer-scale computer, basically asking, &#8220;What if the chip were the data center?&#8221; That sounds like something invented after too much espresso at a semiconductor conference, but it captures an important truth. If intelligence is becoming a commodity, compute remains the refinery. Public markets are now voting on which refineries matter.</p><p>Then came Thinking Machines with its interaction models, and the shift in mood was immediate. While much of the industry keeps celebrating autonomous agents that disappear into the cloud and return with a spreadsheet, Thinking Machines is betting on the opposite: AI that stays present. Not &#8220;prompt, wait, receive,&#8221; but continuous, multimodal collaboration. The model listens, watches, interrupts, yields, and reacts in real time. In other words, it treats interaction not as a UI layer bolted onto intelligence, but as part of intelligence itself. This is a subtle but profound idea. Human cognition is social before it is computational. We think through gestures, pauses, corrections, shared context, and awkward silences. Making AI more useful may require making it less like an oracle and more like a very fast, very attentive collaborator who knows when not to talk.</p><p>The most intellectually spicy developments, however, came from the new &#8220;AI scientist&#8221; movement. Recursive emerged with the grand ambition of building systems that improve themselves through automated experimentation. Adaption&#8217;s AutoScientist pointed in a similarly important direction, automating the loop behind training, alignment, and model adaptation. The dream here is intoxicating: research no longer as a linear human process, but as a compounding machine of hypotheses, experiments, evaluations, and refinements.</p><p>That dream also deserves a raised eyebrow. Recursive self-improvement has long lived somewhere between computer science, science fiction, and existential risk seminar. Turning it into a venture-backed product category is both thrilling and slightly unhinged &#8212; which, frankly, is how many important technologies begin. The question is not whether AI can help with research. It already does. The question is whether we can build systems that improve capabilities without eroding grounding, safety, diversity, or human judgment.</p><p>Another signal came from China, where Junyang Lin, the former lead researcher behind Alibaba&#8217;s Qwen models, is reportedly raising several hundred million dollars for a new AI lab at a valuation that could land around $2 billion. That number is almost rude for a brand-new lab &#8212; especially in China, where AI startup valuations tend to be more earthbound than in the U.S. &#8212; but Lin has the rare credential that matters in frontier AI: he has already helped turn a model family into an ecosystem. His departure also exposes a delicious tension inside Chinese AI: as Alibaba shifts more energy toward proprietary models and monetization, the open-source talent orbit may be ready to spin out into independent labs. But this is not simply a &#8220;raise money, buy GPUs, train model, repeat&#8221; story. New Chinese AI labs must navigate U.S. chip export controls, constrained compute access, and the tricky problem of building research agendas that do not merely overlap with giants like Alibaba and ByteDance. In that sense, Lin&#8217;s new lab is less another shiny founder myth than a test case for whether China&#8217;s open-model momentum can become a venture-scale company.</p><p>The machine is no longer just answering. It is listening, adapting, experimenting &#8212; and, perhaps, beginning to ask what it should become next.</p><h2><strong>&#128270; AI Research</strong></h2><h3><strong><a href="https://arxiv.org/html/2605.15128v1">MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory</a></strong><a href="https://arxiv.org/html/2605.15128v1"> </a></h3><p>AI Lab: Rutgers, Notre Dame, Princeton, UMN, SAMD </p><p>Summary: MemEye is a novel evaluation framework and benchmark designed to assess whether multimodal AI agents can effectively preserve and reason over fine-grained visual evidence in long-term interactions. By evaluating 13 memory methods, the authors reveal a critical trade-off where text-based memory loses visual details while native image memory struggles with tracking temporal state changes.</p><h3><strong><a href="https://arxiv.org/html/2605.13941v1">EVOLVEMEM: Self-Evolving Memory Architecture via AutoResearch for LLM Agents</a></strong><a href="https://arxiv.org/html/2605.13941v1"> </a></h3><p><strong>AI Lab: </strong>UNC-Chapel Hill, UC Berkeley, UCSC </p><p><strong>Summary: </strong>EVOLVEMEM introduces a self-evolving memory architecture that uses an LLM-powered diagnosis module to autonomously optimize its own retrieval configuration based on failure logs. This closed-loop process allows the system to discover and refine effective retrieval strategies dynamically, significantly outperforming static baseline models on long-term memory benchmarks.</p><h3><strong><a href="https://arxiv.org/html/2605.15040v1">Orchard: An Open-Source Agentic Modeling Framework</a></strong><a href="https://arxiv.org/html/2605.15040v1"> </a></h3><p>AI Lab: Microsoft Research, Columbia University, UIUC </p><p><strong>Summary: </strong>Orchard is an open-source framework built around a lightweight, Kubernetes-native environment service that decouples sandbox management from agent harnesses to enable scalable, cost-effective agentic training. The framework includes specialized training recipes for software engineering, web navigation, and personal assistant tasks, demonstrating state-of-the-art performance and strong generalization across diverse agent domains.</p><h3><strong><a href="https://arxiv.org/html/2605.14389v1">NEXUS: An Agentic Framework for Time Series Forecasting</a></strong><a href="https://arxiv.org/html/2605.14389v1"> </a></h3><p>AI Lab: Google, Pennsylvania State University </p><p>Summary: NEXUS is a multi-agent framework that improves time series forecasting by explicitly decomposing the task into macro-level trend projection, micro-level granular analysis, and dynamic synthesis of numerical and textual contexts. By separating these reasoning stages and incorporating a calibration loop, the system matches or exceeds the accuracy of dedicated numerical foundation models while providing highly interpretable forecasting rationales.</p><h3><strong><a href="https://arxiv.org/abs/2605.07982">GLiGuard: Schema-Conditioned Classification for LLM Safeguard</a></strong><a href="https://arxiv.org/abs/2605.07982"> </a></h3><p><strong>AI Lab: </strong>Fastino Labs </p><p><strong>Summary:</strong> GLiGuard is a highly efficient, 0.3B-parameter bidirectional encoder designed for real-time LLM content moderation by encoding task definitions and label semantics directly into the input schema. This non-autoregressive architecture evaluates prompt safety, response safety, harm categories, and jailbreak strategies simultaneously in a single forward pass, achieving competitive accuracy with models up to 90 times larger while drastically reducing inference latency.</p><h2><strong>&#129302; AI Tech Releases</strong></h2><h3><strong>Interaction Models</strong></h3><p>Thinking Machines <a href="https://thinkingmachines.ai/blog/interaction-models/">unveiled a research preview of Interaction Models</a>: frontier models in which interactions is built in the model and not in the harness. </p><h2><strong>&#128225;10 AI News You Need to Know About</strong></h2><ol><li><p><strong><a href="https://www.cerebras.ai/press-release/cerebras-systems-announces-pricing-of-initial-public-offering">Cerebras IPO debut</a></strong><a href="https://www.cerebras.ai/press-release/cerebras-systems-announces-pricing-of-initial-public-offering"> </a>: Cerebras priced its IPO at $185/share to raise $5.55B, and shares then soared 68% in their Nasdaq debut on May 14, pushing the AI chipmaker&#8217;s market cap to roughly $95B.</p></li><li><p><strong><a href="https://www.recursive.com/">Recursive Superintelligence emerges from stealth</a></strong>: Former Salesforce chief scientist Richard Socher launched Recursive Superintelligence with $650M at a $4.65B valuation &#8212; led by GV and Greycroft with Nvidia and AMD Ventures participating &#8212; to build AI systems that recursively improve themselves through open-ended algorithms, starting with automating AI research itself.</p></li><li><p><strong><a href="https://techcrunch.com/2026/05/14/openai-is-reportedly-preparing-legal-action-against-apple-it-wouldnt-be-the-first-partner-to-feel-burned/">OpenAI weighs suing Apple</a></strong>: OpenAI has hired an outside law firm and is considering legal action against Apple over the ChatGPT-Siri integration, which it claims has been buried in the UI and has fallen far short of projected subscription revenue.</p></li><li><p><strong><a href="https://blogs.cisco.com/news/our-path-forward">Cisco cuts ~4,000 jobs while posting record revenue</a></strong>: Cisco is cutting roughly 5% of its workforce to redirect spending toward AI and cybersecurity, even as it touts record quarterly revenue and double-digit growth.</p></li><li><p><strong><a href="https://www.adaptionlabs.ai/blog/autoscientist">Adaption launches AutoScientist</a></strong>: Sara Hooker&#8217;s Adaption Labs unveiled AutoScientist, an automated fine-tuning system that co-optimizes data and model recipes and which the company says outperformed its in-house researchers by 35% (lifting win-rates from 48% to 64%), free for 30 days.</p></li><li><p><strong><a href="https://www.bloomberg.com/news/articles/2026-05-14/musk-altman-make-final-pitches-to-jury-in-battle-over-openai">Musk v. Altman closing arguments</a></strong>: Lawyers for Musk and OpenAI delivered closing arguments in Oakland federal court on May 14, with the nine-person jury set to begin deliberating Monday on whether Altman and Brockman breached commitments to keep OpenAI a nonprofit.</p></li><li><p><strong><a href="https://x.ai/news/grok-build-cli">xAI launches Grok Build</a></strong>: xAI rolled out an early beta of Grok Build, a new terminal-based coding agent and CLI for SuperGrok Heavy subscribers ($300/month), as Musk tries to close the coding-quality gap with Anthropic&#8217;s Claude.</p></li><li><p><strong><a href="https://nebius.com/newsroom/nebius-reports-first-quarter-2026-financial-results">Nebius Q1 sales up 684%</a></strong>: Nebius reported Q1 2026 revenue of $399M (up 684% YoY) and adjusted EBITDA of $129.5M, and announced it has secured up to 1.2 GW of power and land for a new owned AI factory in Pennsylvania.</p></li><li><p><strong><a href="https://www.bloomberg.com/news/articles/2026-05-12/ai-dictation-startup-wispr-in-funding-talks-at-2-billion-value">Wispr in funding talks at $2B</a></strong>: Wispr AI, maker of the Wispr Flow voice dictation tool, is in talks for a ~$260M round led by Menlo Ventures that would value the company near $2B &#8212; nearly triple its $700M valuation from late 2025.</p></li><li><p><strong><a href="https://x.com/theinformation/status/2054645118758433089">Former Alibaba Qwen lead launches new AI lab at ~$2B valuation</a></strong><a href="https://x.com/theinformation/status/2054645118758433089"> </a>: Junyang Lin, the former lead researcher behind Alibaba&#8217;s open-source Qwen models, is in talks with Chinese VCs Gaorong Ventures and HongShan to raise several hundred million dollars for a new AI lab at a roughly $2B post-money valuation &#8212; a level almost unheard of for a brand-new Chinese AI startup. </p></li></ol>]]></content:encoded></item><item><title><![CDATA[The Sequence Opinion #860: Every Company’s Last eXam: Some Reflection About Practical AI Evals]]></title><description><![CDATA[Some ideas about how companies should think about evaluations.]]></description><link>https://thesequence.substack.com/p/the-sequence-opinion-860-every-companys</link><guid isPermaLink="false">https://thesequence.substack.com/p/the-sequence-opinion-860-every-companys</guid><dc:creator><![CDATA[Jesus Rodriguez]]></dc:creator><pubDate>Thu, 14 May 2026 11:03:14 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!kLq6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!kLq6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!kLq6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!kLq6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!kLq6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!kLq6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!kLq6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png" width="1456" height="819" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2833340,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://thesequence.substack.com/i/197617235?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!kLq6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 424w, https://substackcdn.com/image/fetch/$s_!kLq6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 848w, https://substackcdn.com/image/fetch/$s_!kLq6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 1272w, https://substackcdn.com/image/fetch/$s_!kLq6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8c31bbd1-3d2b-48bf-aa62-997259942f27_1672x941.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>For today&#8217;s essay, I want to explore an idea that has become central to how we think about AI evaluations at <a href="https://stratix.layerlens.ai/">LayerLens</a>. This is not an essay about LayerLens, but about a simple and increasingly unavoidable thesis: evals are becoming the fourth pillar of modern AI, alongside compute, data, and models. As AI systems move from chatbots to agents, from demonstrations to production workflows, every meaningful task performed by every agent inside every company will need its own evaluation layer. Not generic benchmarks. Not leaderboard theater. Practical, dynamic, company-specific exams that measure whether an AI system can actually survive contact with real work. I call this idea <strong>Every Company&#8217;s Last eXam</strong>.</p><p>Humanity&#8217;s Last Exam is a very specific kind of artifact. It is what a field builds when the old report card stops working. The core observation behind it was simple: familiar benchmarks such as MMLU had become too easy for frontier systems to cleanly separate the leaders, so researchers assembled a harder, broader, multimodal test at the frontier of human knowledge, finalized at 2,500 questions after removing errors and questions that were too easily answerable with search. And then, almost immediately, the benchmark itself taught a second lesson: even &#8220;the last exam&#8221; needs maintenance. HLE-Verified later showed that noisy items and flawed answers could materially distort comparisons, and that systematic verification could shift measured accuracy by 7 to 10 percentage points on average. In other words, the benchmark was not a stone tablet. It was infrastructure.</p><p>That is the right analogy for where enterprise AI is going. Every company now needs its own last exam: a private, living evaluation suite that captures the highest-value, highest-risk, most context-heavy work its agents are supposed to perform. Not a generic IQ test for models. Not another public leaderboard. More like a company-specific CI system for cognition. The public benchmarks still matter, just as SPEC mattered for CPUs and ImageNet mattered for vision, but production truth has moved downstream into proprietary workflows, private documents, internal policies, odd exceptions, and all the sharp edges that never make it into a paper appendix. That is why top frontier labs now emphasize task-specific evals, production-derived datasets, continuous maintenance, and explicit definitions of success rather than vibe-based model selection.</p><h3><strong>The fourth pillar</strong></h3>
      <p>
          <a href="https://thesequence.substack.com/p/the-sequence-opinion-860-every-companys">
              Read more
          </a>
      </p>
   ]]></content:encoded></item></channel></rss>