10 interesting stories served every morning and every evening.

AI-native React Components

vorpus.github.io

Anti-social: It's fads, not friends, which now dominate our feeds

www.bbc.com

22 May 2026

John Laurenson

Getty Images

Social me­dia plat­forms used to be about com­mu­ni­ca­tion be­tween friends — now many are in­creas­ingly short video en­ter­tain­ment hubs. The busi­ness model is to in­crease the time peo­ple spend on their apps and in­crease ad rev­enue. But is there al­ready a con­sumer back­lash?

Aurélia fixes her­self a cof­fee, sits down in her beau­ti­ful gar­den not far from Paris and goes on Instagram to re­lax.” First up: a guy I like a lot who does in­te­rior de­sign. He’s in Venice at the mo­ment.” She’s into in­te­rior de­sign, and has even just had two bird draw­ings by the 19th Century English de­signer William Morris tat­tooed on her arms. She scrolls down. Two kit­tens hav­ing a fight. I love an­i­mals so I get a lot of an­i­mals. That’s how it works, so­cial me­dia. You click on ba­nanas and they give you ba­nanas.”

There are ads too — al­though they look just like the other posts — for a ro­bot-vac­uum cleaner, a diet and bed linen (with Morris-inspired de­signs). But no friends. She has 198 on Instagram but she says it’s com­pletely changed. I prac­ti­cally don’t see any friends’ posts any­more.” She’s pretty much given up post­ing her­self. I don’t think any­one sees them any­more any­way.”

While there re­main com­mit­ted so­cial, am­a­teur posters on Instagram and es­pe­cially Facebook, the switch from com­mu­ni­cat­ing with peo­ple you know to scrolling through pro­fes­sion­ally made con­tent from peo­ple you don’t, is even more pro­nounced among young users.

Kylian, 16, is in vo­ca­tional train­ing to be­come a chef. He’s on TikTok and Youtube a lot, he says. I like look­ing at videos more than pho­tos or mes­sages. I watch videos made by peo­ple I don’t know. I don’t post at all. I’m a rather shy per­son. I stay in my bub­ble. I watch and that’s all. I keep my re­ac­tions to my­self.”

I spend a lot of time scrolling through videos made by con­tent-cre­ators,” says Lucie, also 16. They’re more in­ter­est­ing than the posts of peo­ple I know.” She does­n’t post ex­cept some­times stories” which dis­ap­pear af­ter 24 hours.

Whether it’s TikTok, Snapchat, Facebook and Instagram, we are a long way from the digital town square” of per­sonal in­ter­ac­tion that so­cial me­dia was even just a few years ago.

John Laurenson

In France, an­nual of­fi­cial Barometre du nu­merique 2026 shows 49% of so­cial me­dia users are active only oc­ca­sion­ally”. In the UK, an Ofcom re­port  pub­lished in April showed a year-on-year drop of users who ac­tively post from 61% to 49%. In the US, a Morning Consult sur­vey of June last year found 28% re­ported post­ing less of­ten than the pre­vi­ous year. Just 33% now post daily com­pared to 57% who use it for en­ter­tain­ment daily. The gap is a lot wider still for Gen Z — 18% ac­tive for 74% pas­sive.

Vanessa Lalo, a Paris-based clin­i­cal psy­chol­o­gist spe­cial­is­ing in on-line be­hav­iour, says users have be­come more con­scious that the traces you leave (on so­cial me­dia) stay there for­ever and some no longer want to main­tain so­cial me­dia re­la­tions that can be su­per­fi­cial. Some don’t want the ex­po­sure to crit­i­cism that might be a risk when you post or the feel­ing that their post will seem poor along­side all the pro­fes­sional con­tent”.

However, Lalo adds, peo­ple haven’t stopped post­ing, rather they are post­ing dif­fer­ent things and in dif­fer­ent places. On TikTok, for ex­am­ple, young peo­ple pub­lish a lot of con­tent but it’s more funny par­o­dies and remixes of ex­ist­ing ma­te­r­ial. The goal is to make peo­ple laugh, not to tell peo­ple about their lives.”

That still hap­pens, she says, but it’s moved from so­cial me­dia plat­forms like Instagram and Facebook to mes­sag­ing sites like WhatsApp. There’s also been a move to­wards pri­vate groups on Instagram and Snapchat. These are much more in­ti­mate places where you’re not bom­barded with ads and con­tent made by in­flu­encers,” she says.

What we’re see­ing is so­cial me­dia split­ting in two,” says so­cial me­dia con­sul­tant Matt Navarra, au­thor of the Geekout Newsletter. Big plat­forms like Instagram and TikTok are be­com­ing more about en­ter­tain­ment and dis­cov­ery. WhatsApp is be­com­ing the place peo­ple go to ac­tu­ally be so­cial. The catch is, those kinds of spaces are harder for com­pa­nies to make money from.”

Small busi­ness own­ers are be­ing pushed to be­come pre­sen­ters, ed­i­tors, trend spot­ters and con­tent cre­ators, on top of ac­tu­ally run­ning the busi­ness — Matt Navarra

It was TikTok that helped to pi­o­neer an al­go­rithm that fig­ures out from the mo­ment you start scrolling what you like, and then fills your feed with ma­te­r­ial cal­cu­lated to keep you on the app for the longest pos­si­ble time.

Now, says Matt Navarra, Meta has built what it calls an AI sys­tem for un­con­nected con­tent rec­om­men­da­tions on Face­book and Instagram, which ba­si­cally means, they’re in­creas­ingly show­ing you stuff from peo­ple you don’t fol­low be­cause the ma­chine thinks you’re go­ing to like it. It’s not bi­ased to­wards, is it a pro­fes­sional cre­ator? Is it a brand? Is it a friend? If they can see that you’ve en­gaged with a friend a lot, you might see a lot more of their con­tent. It’s just that who you are friends with, who you fol­low, has be­come ir­rel­e­vant in a way.”

This all means that small busi­nesses, that have long used so­cial me­dia for free pro­mo­tion have to up their game.

There’s a real op­por­tu­nity for some small busi­nesses,” says Matt Navarra. A bak­ery, florist, sa­lon or lo­cal café can still break through if they have a good story, strong vi­su­als or be­hind-the-scenes con­tent peo­ple want to watch. But it also means the job has changed. Small busi­ness own­ers are be­ing pushed to be­come pre­sen­ters, ed­i­tors, trend spot­ters and con­tent cre­ators, on top of ac­tu­ally run­ning the busi­ness.”

The so­cial plat­forms con­tinue to be mon­e­tised pre­dom­i­nantly by ad rev­enue. That is still the core busi­ness model. And ad rev­enue con­tin­ues to grow — Matt Navarra

Social me­dia is evolv­ing into some­thing pas­sive like tele­vi­sion, al­beit tele­vi­sion that adapts as you zap. Or rather which knows you so well that it does­n’t seem to mat­ter that much that it’s taken the re­mote con­trol. You give the plat­form in­for­ma­tion about your­self that it uses for com­mer­cial gain and, in re­turn, it gives you con­tent tai­lored to please you for free.

The tran­si­tion from truly so­cial me­dia to en­ter­tain­ment plat­form does seem to be pay­ing off. The so­cial plat­forms con­tinue to be mon­e­tised pre­dom­i­nantly by ad rev­enue. That is still the core busi­ness model. And ad rev­enue con­tin­ues to grow,” says Matt Navarra. Global so­cial me­dia ad rev­enue is ex­pected to reach $317 bil­lion (£236bn) in 2026, up from $277 bil­lion (£206bn) last year. Meta is the biggest win­ner. Its ad sales al­ready in­creased 22% year-on-year in 2025. Ad sales are ex­pected to hit $243 bil­lion (£181bn) this year, enough to over­take Google for the first time.

AI pow­ered dig­i­tal ad tar­get­ing is be­com­ing ever more ef­fec­tive and pre­cise. The so­cial plat­forms al­low com­pa­nies to put ads amongst the con­tent that you’re scrolling through. Every third or fourth scroll is an ad. And they are the world’s best ad tar­get­ing en­gines. They know so much about your in­ter­ests be­cause of what you’ve looked at, liked, en­gaged with, what you’ve cho­sen to fol­low, the time you’ve spent in cer­tain ar­eas of the app, things like that,” Navarra says.

So ad­ver­tis­ers will go in and say: I want to place an ad next to peo­ple in the UK who are be­tween thirty and sixty years old and who are in­ter­ested in DIY and the so­cial plat­forms will have that in­for­ma­tion and will place the ads ac­cord­ingly.”

The price will de­pend on the num­ber of im­pres­sions (clicks) the ad­ver­tiser wants and how tight the cri­te­ria are. It costs more to place ads in the so­cial me­dia feeds of peo­ple who buy horses than peo­ple who buy ice-cream.

More like this:

Might there be a back­lash com­ing? Don’t many peo­ple go on to so­cial me­dia to see how friends are re­act­ing to their posts or com­ments be­fore set­tling down to scroll through pro­fes­sion­ally made con­tent?

Meanwhile, for those who miss what are fast be­com­ing the old days when so­cial me­dia en­abled you to share a bit of your life, a joke or a point of view with peo­ple you more-or-less knew, there are tools within plat­forms, says Matt Navarra, that al­low you to choose to see mainly friends and fam­ily con­tent. People can flick to a feed that gives them that,” he says. But most peo­ple don’t.”

For more on busi­ness and be­yond, fol­low us on LinkedIn.

Stop launching the Music app whenever you press ▶ Play

lowtechguys.com

Since v1.1 you can con­fig­ure Music Decoy to launch an­other app when the ▷ Play but­ton is pressed.

To do that, run the fol­low­ing com­mand in the Terminal (example for Spotify):

* When you press the ▶ Play key on your key­board and there is no other app play­ing au­dio

* When end­ing a call, which causes the blue­tooth head­set to switch from call mode to mu­sic mode

There is a dae­mon called rcd (short for Remote Control Daemon) that is re­spon­si­ble for han­dling me­dia keys.

When a play event oc­curs, rcd checks if there is an app that is cur­rently play­ing au­dio. If there is, it sends the play com­mand to that app. If there is­n’t, it launches the sys­tem Music app.

There is a way to dis­able that dae­mon but it also dis­ables the abil­ity to con­trol me­dia play­back with the key­board.

Based on this StackExchange an­swer, there are a few dif­fer­ent ways to achieve the same ef­fect:

no­Tunes which lis­tens for launched apps and kills Music as soon as it is launched

Problem: it does use a tiny bit of CPU in the back­ground al­though check­ing for launched apps is very lit­tle work

The app has no Dock icon and no menubar icon so to quit it you’d need to do one of the fol­low­ing:

* Launch Activity Monitor, find Music Decoy and press the ❌ but­ton at the top

* Run the fol­low­ing com­mand in the Terminal: kil­lall Music Decoy’

Xiaomi MiMo, Explore and Love

mimo.xiaomi.com

MiMo-V2.5-Pro-UltraSpeed: Pushing 1T-Parameter Model Generation Speed to 1000 TPS

1. Xiaomi MiMo-V2.5-Pro-UltraSpeed: Speed is the Ultimate Edge

From the first roar­ing racer of the com­bus­tion age to the sonic boom that shat­tered the sound bar­rier, hu­man­i­ty’s hunger for speed is writ­ten into our very DNA. The speed of AI rea­son­ing is no dif­fer­ent — it de­fines the bound­aries of in­tel­li­gence it­self. When a model is fast enough, it ceases to be a tool you wait on and be­comes an ex­ten­sion of your own think­ing: re­spond­ing in real time, it­er­at­ing in an in­stant, col­lab­o­rat­ing with­out fric­tion.

Today, we are thrilled to re­lease Xiaomi MiMo-V2.5-Pro-UltraSpeed in col­lab­o­ra­tion with TileRT, break­ing the 1000 to­kens/​s de­code speed on a 1-trillion-parameter model for the first time!

2. Limited-Time Access · Application-Based

The MiMo-V2.5-Pro-UltraSpeed API launches si­mul­ta­ne­ously at a lim­ited-time pro­mo­tional price — the cost of MiMo-V2.5-Pro, but de­liv­er­ing ap­prox­i­mately 10× the gen­er­a­tion speed! the price, 10× the out­put ex­pe­ri­ence. (API only; Token Plan not sup­ported.)

Due to lim­ited high-speed in­fer­ence re­sources, MiMo-V2.5-Pro-UltraSpeed will be avail­able through an ap­pli­ca­tion-based, lim­ited-time win­dow. Approved users can ac­cess the API dur­ing the trial pe­riod, avail­able only from June 9 to June 23, 2026, 23:59 (Beijing Time, UTC+8 / 08:59 PDT).

How to Apply

API plat­form: plat­form.xi­aomim­imo.com/​ul­tra­speed. Trial slots are lim­ited — sub­mis­sion does not guar­an­tee ap­proval. We will pri­or­i­tize en­ter­prises and pro­fes­sional de­vel­op­ers with gen­uine busi­ness needs. For stan­dard model ac­cess, please fol­low the MiMo-V2.5 model se­ries. For in-depth busi­ness part­ner­ships for the UltraSpeed model, con­tact busi­ness-mimo@xi­aomi.com.

Chat Experience (Free During Trial)

Approved users will re­ceive free Chat ac­cess valid within the two-week win­dow. Entry point: ul­tra­speed.xi­aomim­imo.com

To en­sure qual­ity and fair­ness un­der re­source con­straints, the fol­low­ing rules ap­ply: each ac­count may en­ter the queue up to 10 times per day; each ses­sion is capped at 30 min­utes; ses­sions idle for more than 5 min­utes will be au­to­mat­i­cally re­leased.

3. 1000 to­kens/​s: Not Just Fast, But a Paradigm Shift

At the tril­lion-pa­ra­me­ter (1T) scale, break­ing 1000 tps is far more than a faster type­writer — it fun­da­men­tally dis­rupts AI ap­pli­ca­tion par­a­digms.

First, speed it­self be­gins to trans­mute into in­tel­li­gence. Previously, when fac­ing a hard prob­lem, you could only wait for one an­swer and pray it’s cor­rect.” Now, within the same wall-clock time, the model can run dozens of rea­son­ing paths in par­al­lel (Best-of-N / Tree Search), au­to­mat­i­cally ver­i­fy­ing and self-cor­rect­ing in the back­ground — us­ing raw speed to gen­er­ate depth of thought, di­rectly el­e­vat­ing rea­son­ing qual­ity.

Second, it com­pletely un­leashes the pro­duc­tiv­ity ceil­ing of Coding Agents. Before, hav­ing AI write code meant de­vel­op­ers painfully wait­ing in front of screens, bot­tle­necked by in­fer­ence la­tency. At 1000 tps, code gen­er­a­tion speed and pro­duc­tion ef­fi­ciency un­dergo a par­a­digm-level ac­cel­er­a­tion.

Most im­por­tantly, tril­lion-pa­ra­me­ter mod­els can now en­ter real-time de­ci­sion loops. Millisecond-level think-respond” cy­cles al­low 1T flag­ship mod­els to seam­lessly plug into time-crit­i­cal sce­nar­ios — high-fre­quency quan­ti­ta­tive trad­ing sig­nal gen­er­a­tion, in­stant anti-fraud in­ter­cep­tion, in­tel­li­gent bid­ding, and real-time in­ter­ac­tive di­a­logue. And when this power is brought to sur­gi­cal as­sis­tance and med­ical imag­ing analy­sis in life-or-death sit­u­a­tions, AI speed is no longer just a met­ric of ef­fi­ciency — it be­comes a chip in the race against death. On the op­er­at­ing table, every sec­ond AI saves in com­plet­ing le­sion analy­sis and risk pre­dic­tion gives the sur­geon one more de­gree of free­dom. This deep­ens our con­vic­tion that the ul­ti­mate sig­nif­i­cance of speed is not merely boost­ing pro­duc­tiv­ity, but en­abling tech­nol­ogy to help hu­man­ity live bet­ter.

4. Extreme Model-System Codesign

Achieving 1000+ to­kens/​s gen­er­a­tion speed with a 1T flag­ship model is not the break­through of a sin­gle tech­nique — it is the prod­uct of deep col­lab­o­ra­tion and ex­treme Codesign be­tween the MiMo model team and the TileRT sys­tem team. The in­dus­try’s cur­rent ap­proach to sim­i­lar ex­treme speeds typ­i­cally re­lies on spe­cial­ized hard­ware — Cerebras’s Wafer-Scale in­te­gra­tion or Groq’s pure on-chip SRAM cus­tom ar­chi­tec­ture. We chose a dif­fer­ent path: achiev­ing even more im­pres­sive in­fer­ence speed on com­mod­ity GPUs through model-sys­tem code­sign alone.

On the model side, we ap­plied FP4 quan­ti­za­tion tar­get­ing the band­width bot­tle­neck of com­mod­ity hard­ware, dra­mat­i­cally shrink­ing model size and re­duc­ing mem­ory-ac­cess over­head; si­mul­ta­ne­ously, we in­tro­duced DFlash, an ef­fi­cient spec­u­la­tive de­cod­ing method based on block-level masked par­al­lel pre­dic­tion, sub­stan­tially in­creas­ing the ac­cepted to­ken length per ver­i­fi­ca­tion step. On the sys­tem side, TileRT per­fectly adapts to the dy­namic char­ac­ter­is­tics of these al­go­rithms, de­liv­er­ing a tai­lor-made com­pi­la­tion en­gine and com­pute ker­nels op­ti­mized specif­i­cally for the novel quan­ti­za­tion and spec­u­la­tive de­cod­ing pipeline. Through this ex­treme Codesign, we achieved 1000+ to­kens/​s out­put from a 1T model us­ing just a sin­gle stan­dard 8-GPU com­mod­ity node.

3.1 FP4 Quantization

At the tril­lion-pa­ra­me­ter (1T) scale, tra­di­tional 8-bit (FP8 / INT8) or even 16-bit in­fer­ence im­poses pro­hib­i­tive mem­ory foot­print and band­width pres­sure. Reducing pa­ra­me­ter bit-width di­rectly con­tributes to de­cod­ing speed. We there­fore adopt the widely val­i­dated, vir­tu­ally loss­less FP4 (MXFP4) quan­ti­za­tion for­mat[1].

However, naively ap­ply­ing FP4 across the en­tire model causes degra­da­tion in com­plex rea­son­ing, logic, and code gen­er­a­tion. Given the MoE (Mixture of Experts) ar­chi­tec­ture of Xiaomi MiMo-V2.5-Pro — where Experts con­sti­tute the vast ma­jor­ity of pa­ra­me­ters and ex­hibit the high­est tol­er­ance to quan­ti­za­tion — we se­lec­tively quan­tize only the MoE Experts to FP4 while pre­serv­ing orig­i­nal pre­ci­sion for all other mod­ules. Through FP4 QAT (Quantization-Aware Training), we dra­mat­i­cally re­duce model size and max­i­mize hard­ware band­width uti­liza­tion while keep­ing the mod­el’s over­all ca­pa­bil­ity es­sen­tially on par with the orig­i­nal, as shown be­low:

3.2 DFlash Speculative Decoding

Traditional Speculative Decoding re­lies on a small draft model to guess” sub­se­quent to­kens, which the large model then ver­i­fies. This trans­forms au­tore­gres­sive gen­er­a­tion (1 to­ken per for­ward pass) into par­al­lel multi-to­ken gen­er­a­tion, with re­jec­tion sam­pling dur­ing ver­i­fi­ca­tion en­sur­ing loss­less out­put qual­ity. However, its bot­tle­neck lies in the draft mod­el’s qual­ity de­ter­min­ing the ac­cep­tance rate, while a stronger draft model in­curs higher com­pute over­head — a fun­da­men­tal ten­sion.

To break this dead­lock, we adopt DFlash, an in­no­v­a­tive block-level masked par­al­lel pre­dic­tion method from the re­search com­mu­nity[2]: the draft model fills an en­tire block of masked po­si­tions in a sin­gle for­ward pass, fun­da­men­tally elim­i­nat­ing the se­r­ial con­straint of autoregressive draft­ing.”

We de­ployed this ap­proach on MiMo-V2.5-Pro with cus­tom op­ti­miza­tions tai­lored for tril­lion-scale MoE and long-con­text sce­nar­ios. Using the Muon sec­ond-or­der op­ti­mizer and model self-dis­til­la­tion, we en­sure that com­pact mask blocks still de­liver ideal ac­cep­tance rates while com­press­ing draft-stage over­head to near its the­o­ret­i­cal min­i­mum:

The draft model ex­clu­sively uses Sliding Window Attention (SWA), nat­u­rally align­ing with the SWA de­sign of the MiMo-V2 se­ries. This elim­i­nates de­pen­dency on com­plete pre­fixes, re­duc­ing per-pre­dic­tion com­pute from con­text-length-lin­ear to con­stant.

During train­ing, mask-sig­nal sam­pling is pushed down to GPU-local shards, en­abling a sin­gle se­quence to pro­duce tens of thou­sands of in­de­pen­dent train­ing sig­nals cov­er­ing di­verse con­text po­si­tions in one step — align­ing with the long-con­text ca­pa­bil­ity of the MiMo-V2 se­ries while avoid­ing cross-de­vice com­mu­ni­ca­tion over­head.

In terms of re­sults, our par­al­lel-pre­dic­tion spec­u­la­tive de­cod­ing achieves sig­nif­i­cant ac­cep­tance-length im­prove­ments across high-value agent and cod­ing sce­nar­ios, mean­ing the large model can con­firm more con­tent in one breath” per ver­i­fi­ca­tion round. Furthermore, we limit block size to 8 to re­duce ver­i­fi­ca­tion over­head and in­crease con­cur­rency, al­low­ing high ac­cep­tance lengths to trans­late di­rectly into high in­fer­ence through­put:

In the Coding sce­nario, we achieve an av­er­age ac­cep­tance length of 6.30, with some sam­ples reach­ing a max­i­mum of 7.14 — mean­ing 6 – 7 out of the 8 draft to­kens per ver­i­fi­ca­tion round are ac­cepted. The draft model re­mains light­weight while push­ing ac­cep­tance rates to lev­els that de­liver real end-to-end gains. We also ob­serve that in more se­man­ti­cally di­ver­gent, higher-un­cer­tainty gen­eral con­ver­sa­tion sce­nar­ios, cur­rent ac­cep­tance rates are not yet high. We are con­tin­u­ously op­ti­miz­ing the al­go­rithm to ex­plore higher gen­er­al­iza­tion ceil­ings.

3.3 TileRT Ultra-Low-Latency Inference Kernels / System

If MiMo’s al­go­rith­mic in­no­va­tions un­shackle the band­width con­straints of hun­dred-bil­lion and tril­lion-pa­ra­me­ter mod­els, then the TileRT in­fer­ence sys­tem squeezes every last drop of phys­i­cal po­ten­tial from com­mod­ity GPUs down to the mi­crosec­ond level.

At 1000 to­kens/​s op­er­at­ing fre­quency, each op­er­a­tor’s life­cy­cle is com­pressed to mi­crosec­onds, and the operator bound­aries” of tra­di­tional in­fer­ence sys­tems be­come the core bot­tle­neck — every op­er­a­tor launch, hard­ware syn­chro­niza­tion, and global mem­ory round-trip frac­tures the ex­e­cu­tion flow at the mi­crosec­ond scale, ex­pos­ing vis­i­ble Execution Gaps.”

TileRT’s Paradigm-Level Execution Model Revolution

As the foun­da­tional in­fra­struc­ture for ul­tra-low-la­tency in­fer­ence, TileRT in­tro­duces an en­tirely new ex­e­cu­tion model that elim­i­nates ex­e­cu­tion gaps from op­er­a­tor bound­aries at their root:

Persistent Engine Kernel: Completely dis­cards the tra­di­tional per-op­er­a­tor launch par­a­digm, keep­ing the en­tire com­pute pipeline per­sis­tently res­i­dent and flow­ing within the GPU. This en­ables full-pipeline con­tin­u­ous prefetch­ing — while the cur­rent Tile is still com­put­ing on Tensor Cores, sub­se­quent data is al­ready flow­ing through the mem­ory hi­er­ar­chy, achiev­ing ex­treme over­lap be­tween data move­ment and com­pu­ta­tion.

Warp Specialization (Heterogeneous Pipeline Collaboration): At the Tile level, com­mu­ni­ca­tion, data move­ment, and ten­sor com­pu­ta­tion are phys­i­cally de­com­posed with finer gran­u­lar­ity. Breaking the ho­mo­ge­neous lock-step ex­e­cu­tion model, dif­fer­ent Warps (thread groups) and even het­ero­ge­neous ex­e­cu­tion do­mains across the en­tire GPU op­er­ate in­de­pen­dently yet in pre­cise co­or­di­na­tion — trans­form­ing the GPU into a con­tin­u­ously flow­ing, pre­cisely or­ches­trated het­ero­ge­neous ex­e­cu­tion sys­tem.

Microsecond-Scale Hardware-Software Deep Convergence (Codesign)

When the un­der­ly­ing ex­e­cu­tion model pushes hard­ware per­for­mance to its lim­its, pure run­time op­ti­miza­tion be­gins to hit phys­i­cal bound­aries. Building on this foun­da­tion, the TileRT sys­tem team and Xiaomi’s MiMo team en­gaged in deep tech­ni­cal co-cre­ation, break­ing down tra­di­tional soft­ware layer bound­aries. To per­fectly align model be­hav­ior with this ul­tra-low-la­tency ex­e­cu­tion pipeline, the model layer ul­ti­mately adopted a mixed FP4 quan­ti­za­tion strat­egy for MoE Experts and de­ployed SWA-aligned DFlash spec­u­la­tive de­cod­ing on the tril­lion-pa­ra­me­ter ar­chi­tec­ture. TileRT tightly cou­ples with these al­go­rith­mic char­ac­ter­is­tics and quan­ti­za­tion schemes, de­liv­er­ing cus­tom-built com­pi­la­tion en­gines and com­pute ker­nels. Both teams made pro­found joint en­gi­neer­ing trade­offs based on hard­ware physics, en­sur­ing ex­e­cu­tion pres­sure closes smoothly within hard­ware bound­aries.

The birth of 1000 to­kens/​s is no co­in­ci­dence of point op­ti­miza­tions. It is the in­evitable re­sult of world-class sys­tem in­fra­struc­ture and ex­treme al­go­rith­mic mod­els deeply con­verg­ing to­ward each other, co-evolv­ing as one.

TileRT is a fron­tier sys­tems ar­chi­tec­ture team fo­cused on next-gen­er­a­tion AI in­fra­struc­ture and ul­tra-low-la­tency in­fer­ence. The team is ded­i­cated to en­abling mil­lisec­ond-level real-time re­sponse for fron­tier large mod­els in pro­duc­tion en­vi­ron­ments, break­ing tra­di­tional stor­age-com­pute bar­ri­ers with an en­tirely new run­time ar­chi­tec­ture. The team has con­ceived and im­ple­mented a par­a­digm-level ex­e­cu­tion model. Through full-stack break­throughs in per­sis­tent ker­nels, tile pipelines, and het­ero­ge­neous col­lab­o­ra­tion, TileRT achieves ex­treme com­pute uti­liza­tion within com­plex het­ero­ge­neous ecosys­tems. As a core in­fra­struc­ture en­abler, the team ac­tively part­ners with in­dus­try-lead­ing col­lab­o­ra­tors on hard­ware-soft­ware code­sign, build­ing the high-per­for­mance com­pute foun­da­tion for the era of au­tonomous in­tel­li­gence that craves ultimate speed.” For more TileRT tech­ni­cal de­tails: tilert.ai/​blog/​break­ing-1000-tps.html

5. More Demos

6. Open Source & Outlook

We have open-sourced the MiMo-V2.5-Pro-FP4-DFlash check­point on HuggingFace, in­clud­ing FP4 quan­tized weights and DFlash model pa­ra­me­ters. Community us­age and feed­back are wel­come: hug­ging­face.co/​Xi­aomiM­iMo/​MiMo-V2.5-Pro-FP4-DFlash

UltraSpeed sup­port for MiMo-V2.5 is on the way — stay tuned.

MiMo × TileRT — ex­treme model-sys­tem code­sign, de­liv­er­ing 1000 tps out­put speed for tril­lion-pa­ra­me­ter mod­els.

Apple Intelligence and Siri

www.apple.com

Introducing the next gen­er­a­tion of Apple Intelligence and Siri. Truly help­ful AI that’s cen­tered around you and your needs. Integrated into your apps, grounded in your con­text, and pri­vate at every step. Coming later this year.

Accessibility.

VoiceOver de­scribes your phys­i­cal sur­round­ings and on­screen con­tent in richer de­tail. Magnifier zooms in so you can ask about what’s in frame. Accessibility Reader cleans up text for eas­ier read­ing. Voice Control is more flex­i­ble so you can in­ter­act with apps in your own words with less to mem­o­rize.

Smarter Home.

Apple Intelligence in the Home app un­locks new ca­pa­bil­i­ties, like com­bin­ing re­lated ac­tiv­ity no­ti­fi­ca­tions,7 de­scrib­ing what hap­pened in a se­lec­tion of HomeKit Secure Video footage be­fore you watch,8 and us­ing AI to search for a clip based on what hap­pened.9

Genmoji.

Create high-qual­ity Genmoji to match the mo­ment.

Workout Buddy.

Get deeper in­sights and more data-rich mo­ti­va­tion with en­hance­ments in Workout Buddy. Train even with­out your iPhone nearby. And Workout Buddy is now avail­able in Spanish.10

Great pow­ers come with great pri­vacy.

Apple Intelligence is de­signed to pro­tect your pri­vacy at every step. It’s in­te­grated into the core of your iPhone, iPad, and Mac through on-de­vice pro­cess­ing. So it’s aware of your per­sonal in­for­ma­tion with­out col­lect­ing your per­sonal in­for­ma­tion. And with ground­break­ing Private Cloud Compute, Apple Intelligence can draw on larger server-based mod­els, run­ning on Apple silicon, to han­dle more com­plex re­quests for you while pro­tect­ing your pri­vacy.

Private Cloud Compute

Your data is never stored

Used only for your re­quests

Verifiable pri­vacy promise

New pos­si­bil­i­ties for your fa­vorite apps.

The Foundation Models frame­work, along with App Intents, APIs, and frame­works, is built with pri­vacy at the cen­ter. Any app can tap into the on-de­vice mod­els that power Apple Intelligence, and the fea­tures you build work of­fline. And it’s all at no cost per re­quest. App de­vel­op­ers can eas­ily in­te­grate fea­tures like Siri, Writing Tools, and Image Playground.

Learn more about de­vel­op­ing for Apple Intelligence

xAI is looking more like a datacentre REIT than a frontier lab

martinalderson.com

An un­ex­pected de­vel­op­ment over the past few weeks is xAI’s new part­ner­ships with Anthropic and Google, pro­vid­ing them with a huge amount of ca­pac­ity. It’s worth re­mem­ber­ing that xAI is now part of SpaceX, af­ter the two merged back in February - so the rev­enue from these deals flows straight into the en­tity about to go pub­lic. While much has been made of the po­ten­tial fi­nan­cial en­gi­neer­ing given SpaceX’s up­com­ing IPO, I think there’s a bit more to this than just pure ac­count­ing tricks.

Anthropic was in a se­ri­ous bind

If you use Claude prod­ucts much, you’ll be (very, prob­a­bly) aware that Anthropic has had se­ri­ous ca­pac­ity prob­lems, es­pe­cially early af­ter­noon on­wards in Europe and in the morn­ings in the US (this is when de­mand seems to be high­est as both European users and the Americas are both at work, fight­ing for ca­pac­ity). I’ve writ­ten about this com­pute crunch be­fore a few times - the com­ing crunch, whether it’s here yet, and what comes next.

This re­sulted in Anthropic hav­ing to in­tro­duce new peak hour re­stric­tions on their sub­scrip­tions, with us­age be­tween 5am–11am PT / 1pm–7pm GMT us­ing more of your us­age limit - with the aim of smooth­ing de­mand be­tween peak hours and off peak hours where they had more ca­pac­ity avail­able.

However, there is only so much de­mand shift­ing you can do when de­mand is grow­ing as fast as Anthropic’s. At some point you end up hav­ing to ra­tion users fur­ther, which def­i­nitely is far from ideal when you have both Google and OpenAI breath­ing down your neck for cus­tomers.

xAI to the res­cue?

At the start of May, xAI an­nounced a part­ner­ship with Anthropic to pro­vide ac­cess to their (older) Colossus 1 dat­a­cen­tre in Memphis. This al­lowed Anthropic to re­verse the us­age limit re­stric­tions on their sub­scrip­tions, and in gen­eral while sta­bil­ity of Anthropic ser­vices still leaves a lot to be de­sired, the peak time crunch has abated (for now, at least).

The fees in­volved are enor­mous, ramp­ing to $1.25bn/month for 300MW of ca­pac­ity - ap­prox­i­mately 220k GPUs.

Last week, Google an­nounced a sim­i­lar part­ner­ship - $920mn/month for 110k GPUs[1]. It’s im­por­tant to note that both agree­ments have can­cel­la­tion clauses - al­low­ing ei­ther party to can­cel with 90 days’ no­tice af­ter an ini­tial lock-in pe­riod.

If you take this on face value, this is a lu­di­crously prof­itable deal for xAI:

While this does­n’t in­clude opex[2] and de­pre­ci­a­tion, if the deals con­tinue for 18 months, xAI re­coups all the capex they spent and still has many hun­dreds of MW of GPUs avail­able. With the gi­ant com­pute short­ages likely to per­sist into the medium term, even older H100s are likely to be ex­tremely use­ful even 18 months out.

The case against

It’s im­por­tant to note there are cer­tainly some red flags with the deal. Firstly, Elon Musk and OpenAI were/​are locked in a bit­ter le­gal bat­tle, and the Anthropic deal could be mo­ti­vated to add pres­sure to OpenAI more than com­mer­cial re­al­ity.

And Google is a ma­jor share­holder in SpaceX, so they cer­tainly have in­cen­tive to juice the val­u­a­tion of the IPO.

While I’m sure there is some de­gree (potentially a lot!) of truth in these view­points, it’s im­por­tant to note that huge vol­umes of GPUs are in enor­mously short sup­ply.

One of the un­told sto­ries of this capex boom in dat­a­cen­tres is just how be­hind all of them are. Even OpenAI’s flag­ship Stargate UAE dat­a­cen­tre - be­ing built in a ju­ris­dic­tion that is renowned for a lais­sez-faire at­ti­tude to build­ing reg­u­la­tions - is now un­der di­rect threat from the cur­rent Iran con­flict, with Iranian drones hav­ing al­ready hit other UAE dat­a­cen­tres.

In com­par­i­son, SpaceX/xAI are in­cred­i­ble at build­ing dat­a­cen­tres on time. The orig­i­nal Colossus 1 dat­a­cen­tre was built in 122 days. Musk’s em­pire does have a huge ad­van­tage in re­ally un­der­stand­ing how to plan, build and ex­e­cute enor­mous in­fra­struc­ture pro­jects quickly. While the hy­per­scalers no doubt have the ex­pe­ri­ence to do this, they were built with far less ur­gency - with typ­i­cal pro­ject ex­e­cu­tion tak­ing many years. Given the capex only re­ally started to ramp up in the last cou­ple of years, many of these pro­jects are still years away.

This gives xAI a se­ri­ous com­pet­i­tive ad­van­tage that should­n’t in my opin­ion just be hand waved away.

But what about Grok?

There is no doubt this leaves Grok in an odd spot, with a lot of the dat­a­cen­tre ca­pac­ity that was des­tined for Grok train­ing and in­fer­ence now be­ing leased to a di­rect com­peti­tor.

While it’s fool­ish to write off any model provider, it cer­tainly looks like a se­ri­ous re­treat from Grok vy­ing to be a fron­tier class lab. But, per­haps, they over-spec­i­fied their dat­a­cen­tre ca­pac­ity - there is no doubt that in­fer­ence de­mand for Grok mod­els is likely to be se­ri­ously be­hind pro­jec­tions, leav­ing a bunch of spare ca­pac­ity which might as well be mon­e­tised while the train­ing lot­tery con­tin­ues? It’s hard to say and the xAI & Cursor deal mud­dies the wa­ter even fur­ther.

As such, I think all three things are true to some de­gree. There’s no doubt some level of fi­nan­cial en­gi­neer­ing go­ing on. There’s also an enor­mous com­pute short­age. And it seems to me SpaceX/xAI does have a real com­pet­i­tive ad­van­tage in dat­a­cen­tre build­out.

It’s just the mag­ni­tude of how true each of these are is go­ing to de­fine the suc­cess or fail­ure of the biggest IPO in North American his­tory.

Either way, the more I look at it, the more xAI is start­ing to re­sem­ble a dat­a­cen­tre REIT with a fron­tier lab at­tached, rather than the other way around.

I sus­pect that these are likely to be GB200s given the pric­ing, vs the mostly H100/H200 for Anthropic, but this is spec­u­la­tion on my part. ↩︎

I sus­pect that these are likely to be GB200s given the pric­ing, vs the mostly H100/H200 for Anthropic, but this is spec­u­la­tion on my part. ↩︎

Power is the ob­vi­ous big opex line, but at this scale it’s al­most a round­ing er­ror. 300MW run­ning flat out is roughly 300,000 kW × 8,760 hours, or about 2.6 bil­lion kWh a year. Tennessee has some of the cheap­est in­dus­trial elec­tric­ity in the US at around 6 cents/​kWh, so buy­ing it off the grid would cost some­where around $160mn a year. Colossus ac­tu­ally runs largely on its own on-site gas tur­bines, which comes out even cheaper: at a sim­ple-cy­cle heat rate of ~10,000 Btu/kWh and Henry Hub gas at ~$3.50/MMBtu, the fuel bill is only around $90mn a year. Either way, set against the ~$15bn a year Anthropic is pay­ing for that 300MW, power is no more than about 1% of rev­enue. The deal value ut­terly dwarfs the run­ning costs. ↩︎

Power is the ob­vi­ous big opex line, but at this scale it’s al­most a round­ing er­ror. 300MW run­ning flat out is roughly 300,000 kW × 8,760 hours, or about 2.6 bil­lion kWh a year. Tennessee has some of the cheap­est in­dus­trial elec­tric­ity in the US at around 6 cents/​kWh, so buy­ing it off the grid would cost some­where around $160mn a year. Colossus ac­tu­ally runs largely on its own on-site gas tur­bines, which comes out even cheaper: at a sim­ple-cy­cle heat rate of ~10,000 Btu/kWh and Henry Hub gas at ~$3.50/MMBtu, the fuel bill is only around $90mn a year. Either way, set against the ~$15bn a year Anthropic is pay­ing for that 300MW, power is no more than about 1% of rev­enue. The deal value ut­terly dwarfs the run­ning costs. ↩︎

Apple Reveals New AI Architecture Built Around Google Gemini Models

www.macrumors.com

Apple to­day an­nounced a ma­jor over­haul of its Apple Intelligence plat­form, re­veal­ing a new ar­chi­tec­ture built on foun­da­tion mod­els de­vel­oped in col­lab­o­ra­tion with Google us­ing the tech­nolo­gies be­hind the Gemini fam­ily.

The new ar­chi­tec­ture cen­ters on Apple Foundation Models co-de­vel­oped with Google, which Apple says are adapted to run both on-de­vice and on servers through its ex­ist­ing Private Cloud Compute in­fra­struc­ture. Apple de­scribed the col­lab­o­ra­tion as a deep” one that it says un­locks what it called a huge up­grade” for ‌Apple Intelligence‌, bring­ing state-of-the-art un­der­stand­ing and rea­son­ing ca­pa­bil­i­ties as well as mul­ti­modal sup­port in­clud­ing im­age un­der­stand­ing and gen­er­a­tion.

The up­graded mod­els sup­port new ca­pa­bil­i­ties use cases, in­clud­ing re­al­is­tic im­age cre­ation, ad­vanced photo edit­ing, and vi­sual ques­tion an­swer­ing. Certain de­vices will re­ceive a higher-power ver­sion of the model with ad­di­tional ca­pa­bil­i­ties in­clud­ing speech gen­er­a­tion, im­proved dic­ta­tion ac­cu­racy, and stronger nat­ural lan­guage un­der­stand­ing, though Apple did not spec­ify which de­vices qual­ify.

A new sys­tem or­ches­tra­tor sits at the cen­ter of the re­vised ar­chi­tec­ture, co­or­di­nat­ing ‌Apple Intelligence‌ fea­tures se­curely across Apple’s plat­forms. Apple says the or­ches­tra­tor al­lows the sys­tem to tai­lor its re­sponses based on the ac­tive app and the user’s cur­rent task, en­abling what the com­pany de­scribed as truly sys­tem-wide in­tel­li­gence.

Apple used the an­nounce­ment to frame its ap­proach as a con­trast to com­peti­tors it char­ac­ter­ized as racing for­ward” with­out re­gard for users. The com­pany re­it­er­ated that ‌Apple Intelligence‌ re­lies on on-de­vice pro­cess­ing and Private Cloud Compute, with a promise that user data is only used to ex­e­cute the im­me­di­ate re­quest and is not ac­ces­si­ble to Apple or third par­ties. Apple added that out­side ex­perts can ver­ify those pri­vacy guar­an­tees at any time.”

Popular Stories

Google Maps for CarPlay Getting Gemini AI

Monday May 11, 2026 4:25 pm PDT by Juli Clover

The Google Maps app for CarPlay could soon in­clude sup­port for Gemini, based on code that MacRumors found in the Google Maps app. Gemini in­te­gra­tion would al­low CarPlay users to get de­tailed di­rec­tions and in­for­ma­tion from Gemini. Strings in the app sug­gest users will be able to tell Gemini to nav­i­gate to a spe­cific lo­ca­tion af­ter agree­ing to new Terms of Service in the iPhone ver­sion of the …

Google Previews Android 17 With Gemini Intelligence’ a Month Before Apple’s iOS 27 Reveal

Tuesday May 12, 2026 12:21 pm PDT by Juli Clover

Google to­day pre­viewed Android 17, the next ver­sion of Android that it is bring­ing to smart­phones and other de­vices. Android 17 in­cludes mul­ti­ple new AI fea­tures, and it comes about a month ahead of when Apple plans to un­veil iOS 27 with new AI ca­pa­bil­i­ties. Google is now call­ing the AI fea­tures on Android Gemini Intelligence,” brand­ing sim­i­lar to Apple Intelligence. Google said it is…

Google Makes It Easier to Share Files Between Android and iPhone

Tuesday May 12, 2026 2:48 pm PDT by Juli Clover

Google to­day said it is in­tro­duc­ing up­dated file shar­ing fea­tures that will make it eas­ier for Android users to send files to iPhone users. Quick Share is al­ready com­pat­i­ble with Apple’s AirDrop fea­ture on se­lect Android de­vices, but Google says the fea­ture will ex­pand to Samsung, OPPO, OnePlus, Vivo, Xiaomi, and HONOR de­vices in 2026. On Android de­vices that are not com­pat­i­ble with…

AI Is Slowing Down

www.wheresyoured.at

If you liked this piece, you should sub­scribe to my pre­mium newslet­ter. It’s $70 a year, or $7 a month, and in re­turn you get a weekly newslet­ter that’s usu­ally any­where from 5,000 to 18,000 words, in­clud­ing vast, de­tailed analy­ses of NVIDIA, Anthropic and OpenAI’s fi­nances, and the AI bub­ble writ large (updated to ver­sion 3.0 last week). My Hater’s Guides To the SaaSpocalypse, Private Credit and Private Equity are es­sen­tial to un­der­stand­ing our cur­rent fi­nan­cial sys­tem, and my guide to how OpenAI Kills Oracle pairs nicely with my Hater’s Guide To Oracle.

Over a three week pe­riod in May, I pub­lished an ex­haus­tive three-part guide to how the AI bub­ble might col­lapse, the events that might trig­ger it, and the con­se­quences. For some­thing lighter, check out last week’s pre­mium, where I re-in­tro­duce you to the an­tag­o­nists of the AI bub­ble (and their fa­tal weak­nesses) in caus­tic, slightly sweary terms.

Subscribing to pre­mium is both great value and makes it pos­si­ble to write these large, deeply-re­searched free pieces every week.

Last week I went on Bloomberg and dis­cussed the state of the AI bub­ble with a clar­ity that rat­tled even the sweati­est boost­ers, mostly be­cause I spoke with clar­ity about an in­vest­ment frenzy whipped up through hype, de­ceit and mythol­ogy. Some were equal parts frus­trated and an­gry that I don’t have money in the mar­ket, or, as they’d put it, skin in the game.”

I get it! When your en­tire world­view is dic­tated by what a se­ries of ven­ture cap­i­tal­ists and psuedo-jour­nal­ists on Twitter want you to be­lieve, it must be dif­fi­cult to imag­ine some­one hav­ing morals” or beliefs” or that one might hold a po­si­tion that was­n’t en­tirely based on greed or trib­al­ism. It must be con­fus­ing — up­set­ting, even! — to hear that some­body is will­ing to ac­cu­rately and vo­cif­er­ously tear into a tech in­dus­try largely con­trolled by peo­ple with no re­gard for their users or work­ers, who are will­ing to bathe their prod­ucts in medi­oc­rity all be­cause it’s the thing that every­body else is do­ing.

This is a hys­ter­i­cal era per­pet­u­ated by liars, cow­ards, im­be­ciles, craven boost­ers and the eas­ily-fooled. Those ex­cited about gen­er­a­tive AI are ei­ther the vic­tim or the per­pe­tra­tor of a con cen­tered around a tech­nol­ogy to in­gra­ti­ate at the high­est cost pos­si­ble.

AI Cannot Afford To Slow Down — It Needs $3 Trillion Or More In Revenue By End Of 2030 To Sustain Its Existence

I also think that every­body is a lit­tle flip­pant about what has to hap­pen for me to be wrong.

If we take Sightline Climate’s data from February at face value, there are 190GW of data cen­ters planned. If we take NVIDIA CEO Jensen Huang’s state­ment that data cen­ters will cost $80 bil­lion to $100 bil­lion a gi­gawatt at face value, this means that said data cen­ters will cost any­where from $9.5 tril­lion to $15 tril­lion. Bloomberg in­cor­rectly states that this is a $3 tril­lion” build­out.This money will have to come from some­where. The Financial Times re­ported in May that banks are con­cerned they might choke” on data cen­ter debt when I es­ti­mate there’s barely $250 bil­lion a year be­ing is­sued. They will, to ac­tu­ally make these data cen­ters hap­pen, have to start is­su­ing any­where from $500 bil­lion to a tril­lion a year.Jensen Huang has also said that NVIDIA pro­jects a tril­lion dol­lars worth of rev­enue through the end of 2027. 54% of NVIDIAs rev­enue comes from three clients, which means that NVIDIAs fu­ture largely de­pends on three un­named com­pa­nies — likely Taiwanese ODMs build­ing servers for Microsoft, Google and Meta — and their coun­ter­par­ties’ abil­ity to raise debt on a near-per­pet­ual ba­sis, as the num­ber of firms that can af­ford to buy thou­sands of $7.8 mil­lion racks of Vera Rubin GPUs is dwin­dling.Even then, every part of this puz­zle re­quires more and more debt or at the mar­ket dumps like Google’s $85 bil­lion eq­uity sale or Meta’s planned multi-bil­lion dol­lar dump. The fact that hy­per­scalers are do­ing eq­uity sales is, as econ­o­mist Paul Kedrosky raised in our con­ver­sa­tion on my show last week, a sign that debt is be­com­ing harder to ac­quire.

This money will have to come from some­where. The Financial Times re­ported in May that banks are con­cerned they might choke” on data cen­ter debt when I es­ti­mate there’s barely $250 bil­lion a year be­ing is­sued. They will, to ac­tu­ally make these data cen­ters hap­pen, have to start is­su­ing any­where from $500 bil­lion to a tril­lion a year.

Jensen Huang has also said that NVIDIA pro­jects a tril­lion dol­lars worth of rev­enue through the end of 2027. 54% of NVIDIAs rev­enue comes from three clients, which means that NVIDIAs fu­ture largely de­pends on three un­named com­pa­nies — likely Taiwanese ODMs build­ing servers for Microsoft, Google and Meta — and their coun­ter­par­ties’ abil­ity to raise debt on a near-per­pet­ual ba­sis, as the num­ber of firms that can af­ford to buy thou­sands of $7.8 mil­lion racks of Vera Rubin GPUs is dwin­dling.

Even then, every part of this puz­zle re­quires more and more debt or at the mar­ket dumps like Google’s $85 bil­lion eq­uity sale or Meta’s planned multi-bil­lion dol­lar dump. The fact that hy­per­scalers are do­ing eq­uity sales is, as econ­o­mist Paul Kedrosky raised in our con­ver­sa­tion on my show last week, a sign that debt is be­com­ing harder to ac­quire.

The fact that hy­per­scalers are do­ing eq­uity sales is, as econ­o­mist Paul Kedrosky raised in our con­ver­sa­tion on my show last week, a sign that debt is be­com­ing harder to ac­quire.

Anthropic has made $330 bil­lion in com­pute and chip com­mit­ments be­tween Google, Amazon, and Microsoft, an­other $30 bil­lion with CoreWeave and an­other $15 bil­lion with SpaceX. To pay for this com­pute, Anthropic must meet its pro­jected rev­enue of $174 bil­lion a year by 2029.Anthropic has raised $95 bil­lion across rounds in February, April (from Google and Amazon), and May. These funds will be in­suf­fi­cient to cover Anthropic’s costs, as will Anthropic’s cash flow, mean­ing that it will have to raise at least an­other $200 bil­lion in the next year.

Anthropic has raised $95 bil­lion across rounds in February, April (from Google and Amazon), and May. These funds will be in­suf­fi­cient to cover Anthropic’s costs, as will Anthropic’s cash flow, mean­ing that it will have to raise at least an­other $200 bil­lion in the next year.

OpenAI has pro­jected to burn at least $852 bil­lion through the end of 2030, and has made over $770 bil­lion in com­pute com­mit­ments across Microsoft, Amazon, CoreWeave, Cerebras, and Oracle. The $122 bil­lion fund­ing round from March will be in­suf­fi­cient to cover these costs, and it will re­quire, at the very least, an­other $250 bil­lion in fund­ing by the end of the year.

Whatever ob­tuse fan­tasies you have about the cur­rent state of gen­er­a­tive AI are ir­rel­e­vant to a much larger prob­lem: that the in­fra­struc­ture be­ing built and com­pute com­mit­ments be­ing made are be­ing done so at a level that de­mands that gen­er­a­tive AI and AI com­pute gen­er­ate over $2 tril­lion in an­nual rev­enue by 2030. When I say that, I mean it ab­solutely has to do that oth­er­wise none of the data cen­ter capex makes sense, and nei­ther Anthropic nor OpenAI can pay their com­mit­ments.

OpenAI ex­pects to spend $50 bil­lion on com­pute in 2026, and I would­n’t be sur­prised if Anthropic spends any­where from $30 bil­lion to $50 bil­lion. Between them, Anthropic and OpenAI rep­re­sent the vast ma­jor­ity of all AI com­pute de­mand — at a min­i­mum 70%, if not 80% to 90%.

Put an­other way, there’s barely a few bil­lion dol­lars of de­mand out­side of two com­pa­nies that lose bil­lions — or tens of bil­lions — of dol­lars a year.

Let’s break down these num­bers a lit­tle fur­ther:

190GW of data cen­ter ca­pac­ity as­sum­ing a PUE of 1.35 sug­gests a crit­i­cal IT load of around 140GW, which, charged at around $12.5 mil­lion per megawatt, works out to around $1.75 tril­lion in an­nual rev­enue.If we as­sume that half of that gets built, that’s still $875 bil­lion in an­nual rev­enue that will be needed to keep these data cen­ters from run­ning out of money as their mar­gins are atro­cious and they’re all paid for with oner­ous debt.

If we as­sume that half of that gets built, that’s still $875 bil­lion in an­nual rev­enue that will be needed to keep these data cen­ters from run­ning out of money as their mar­gins are atro­cious and they’re all paid for with oner­ous debt.

OpenAI and Anthropic pro­ject to make $184 bil­lion and $174 bil­lion in rev­enue in 2029, for a to­tal of $358 bil­lion in an­nual rev­enue. While Anthropic claims it will be prof­itable by then, I do not be­lieve it will be, nor is it prof­itable at this point out­side of fi­nan­cial en­gi­neer­ing.

At pre­sent, there are no other ma­jor pur­chasers of AI com­pute out­side of NVIDIA, hy­per­scalers (who are sell­ing it to Anthropic and OpenAI, or they’re Meta, which has no AI strat­egy), OpenAI, and Anthropic. None. I can’t find a sin­gle one out­side of Jane Street spend­ing more than a few hun­dred mil­lion. We need a few hun­dred bil­lion.

That’s al­ready a huge prob­lem, but the other prob­lem is that we also need com­pa­nies to spend dra­mat­i­cally more on AI ser­vices than they al­ready do. While jour­nal­ists are cur­rently goon­ing over OpenAI and Anthropic mak­ing $6 bil­lion or $10 bil­lion in a given quar­ter, that’s just not enough! Both Anthropic and OpenAI need to be mak­ing $10 bil­lion or more in monthly rev­enue by Q1 2028, or their growth rates aren’t go­ing to sup­port their com­pute com­mit­ments.

This is not hy­per­bole! Every sin­gle thing I have stated here pre­cisely maps to the pro­jec­tions and promises of the AI in­dus­try. No mat­ter how horny or flac­cid you are for the po­ten­tial of AI, it must grow at an as­ton­ish­ing, un­stop­pable rate from here un­til the end of time to be any­thing close to wor­thy of its costs.

Actually, sorry, let’s put judg­ments aside for a sec­ond, be­cause this is­n’t about judg­ment, but rather the promises that have been made by the soft­ware and hard­ware com­pa­nies as­so­ci­ated with AI. NVIDIAs place atop the stock mar­ket and its ridicu­lous pro­jec­tions de­pend on both the con­tin­ued flow of data cen­ter debt and the con­tin­ued be­lief that AI ser­vices will have the rev­enue to back it up.

AI can­not, un­der any cir­cum­stances, slow down. In a year, Anthropic and OpenAI’s busi­nesses have to be roughly twice the size they are to­day, and then dou­ble again ba­si­cally every year un­til 2029 or 2030. In that time pe­riod, they must also both raise hun­dreds of bil­lions of dol­lars or, al­ter­na­tively, turn deeply un­prof­itable busi­nesses into prof­itable ones while also dou­bling their rev­enues.

Alternatively, both must se­verely re­duce their costs…ex­cept if they do that, they won’t have any need for all that com­pute ca­pac­ity, which will de­prive Oracle, Google, Microsoft, SpaceX, Cerebras, CoreWeave, TeraWulf, Cipher, and Hut8 of the $1.1 tril­lion in re­main­ing per­for­mance oblig­a­tions.

Also, if OpenAI can’t af­ford — or does­n’t want — its com­pute, Oracle will sim­ply run out of money. It is spend­ing any­where from $340 bil­lion to $700 bil­lion (depending on whether you be­lieve Jensen Huang in September 2025 or May 2026) on the 7.1GW of data cen­ters it’s build­ing for OpenAI. These, again, are not hy­per­bolic state­ments, but the ac­tual costs as­so­ci­ated with Oracle’s mas­sive build­outs in Michigan, New Mexico, Wisconsin and Texas. I did­n’t agree to do this! Larry Ellison did!

Sidenote: Larry Ellison has also got at least $21 bil­lion in loans col­lat­er­al­ized by his Oracle shares, and any doubts around Oracle’s abil­ity to pay for its debts or OpenAI’s abil­ity to pay Oracle for its com­pute will threaten mas­sive mar­gin calls. I wrote about this here. It’s re­ally bad!

Whatever Everybody Is Spending On AI Right Now Is Insufficient, and We Need At Least Two Other OpenAIs To Justify The Compute Being Built

Apparently, Salesforce is plan­ning to spend $300 mil­lion on Anthropic in 2026, to which I say that’s not nearly enough”! Everybody has to be spend­ing even more than that in the next few years, with­out fail, no ifs, ands, or buts. It is non-ne­go­tiable. Anthropic needs to be mak­ing over $100 bil­lion in two years or it can’t af­ford its com­mit­ments, so you filthy to­ken-hogs bet­ter slurp up your slop this in­stant, or Dario Amodei gets made part of the per­ma­nent un­der­class!

But se­ri­ously folks, the com­bined com­pute de­mand of every sin­gle AI com­pany in the world does­n’t cur­rently reach $100 bil­lion — and it needs to be ten times that by 2030 or all those data cen­ters got built for no rea­son!

And for that to hap­pen, both Anthropic and OpenAI need to be mak­ing about $400 bil­lion a year in an­nual rev­enue, which means there needs to ac­tu­ally be that much de­mand for AI ser­vices! Right now, Anthropic and OpenAI’s com­bined pro­jected rev­enues for 2026 sit some­where in the re­gion of $60 bil­lion — so, you know, they only need to grow by 496% by the end of 2029!

To make mat­ters worse, it does­n’t seem like any­one else in the AI in­dus­try is go­ing to help with the whole demand for AI ser­vices or com­pute” thing. As The Information re­ported a few weeks ago, OpenAI and Anthropic make up 89% of all AI startup rev­enues.

We could in­clude hy­per­scaler rev­enues, but that would­n’t help very much. Microsoft’s $37 bil­lion in AI an­nual run rate — these fuck­ing cow­ards never share ac­tual AI rev­enues! — is pre­dom­i­nantly made up of OpenAI’s com­pute, with the rest of it (maybe $8 bil­lion in an­nual rev­enue at best) from Microsoft ha­rass­ing its per­ma­nently-abused cus­tomer base into in­stalling Copilot.

Ah, shit, there’s an­other prob­lem with Microsoft — Microsoft AI CEO Mustafa Suleyman just said that Anthropic’s mod­els were too ex­pen­sive, and he in­tended to re­duce Microsoft’s use of them to zero! You can’t do that Mustafa! We need every cent of de­mand, oth­er­wise every­thing falls apart!

Sidenote: Amazon and Meta barely have AI sto­ries. Mark Zuckerberg just said he thinks” Meta has a use for the vast amount of com­pute it’s bought or is de­vel­op­ing, if you’re won­der­ing how great things are go­ing over there. A source tells me Meta is work­ing on a Tamagotchi-like pen­dant that uses OpenClaw, and when I heard that I felt ex­actly how I felt the first time I heard No Doubt’s Rock Steady — did I re­ally used to like this dogshit when I was young?

A source tells me Meta is work­ing on a Tamagotchi-like pen­dant that uses OpenClaw, and when I heard that I felt ex­actly how I felt the first time I heard No Doubt’s

Anyway, ea­ger math-know­ers among you might also no­tice that even if Anthropic and OpenAI spent $500 bil­lion a year in an­nual com­pute — an amount that they can’t af­ford even if they com­bined both their un­sus­tain­able asses — we’d need at least an­other $250 bil­lion or more in an­nual com­pute rev­enue to jus­tify it.

In other words, they need every­body to be doing agents” at such a scale that ba­si­cally every third dip­shit you run into on the street is sink­ing $50 or more a day into them.

I sure hope that’s happe-OH MY GOD!

AI Is Slowing Down Just As It Needs To Speed Up

As I dis­cussed last week, you can’t mea­sure the cost of a par­tic­u­lar task with AI, nor can you mea­sure its re­turn on in­vest­ment. The only rea­son that we’ve been doing AI with such fe­roc­ity and ve­rac­ity is that most com­pa­nies are be­holden to Business Idiots dis­con­nected from pro­duc­tion who have no real un­der­stand­ing of their un­der­ly­ing firms’ out­puts, and thus have very lit­tle way of mea­sur­ing them.

These multi-mil­lion­aire mid­wits have been doing AI be­cause every­body else is do­ing it, burn­ing mil­lions of dol­lars to turn their code into slop (see: Zillow) or have their en­gi­neers com­pete to see who can spend the most money (see: Meta and mul­ti­ple other com­pa­nies). In one case, a com­pany spent $500 mil­lion on Anthropic’s mod­els in a month be­cause it did­n’t set up spend con­trols. In Uber’s case, it burned through its en­tire an­nual to­ken bud­get in a sin­gle quar­ter, which led to its COO say­ing it was harder to jus­tify spend­ing money on AI to­kens be­cause it could­n’t show a link be­tween that spend and a mean­ing­ful in­crease in use­ful fea­tures on Uber. Now Uber has capped its em­ployee spend at $1,500 a month per user, with T-Mobile tem­porar­ily fol­low­ing at $2,000 a month per user with the in­tent to move to a tiered sys­tem. Over at Brex, en­gi­neers are lim­ited to $500 a week in to­kens, with non-en­gi­neers get­ting an as­ton­ish­ingly-low $5 a week.

These are signs that AIs rev­enue growth is slow­ing, and it’s likely go­ing to slow fur­ther, be­cause we cur­rently live in an era where Anthropic and OpenAI are straight-up abus­ing its clients, pro­vid­ing lim­ited-to-no vis­i­bil­ity into spend, per the Wall Street Journal:

The shift to pric­ing based on us­age, and mea­sured by to­kens—the ba­sic unit of mea­sure­ment for AI com­put­ing—is cre­at­ing new chal­lenges for even the most ex­pe­ri­enced fi­nance teams. CFOs used to pay­ing flat amounts for tech­nol­ogy are find­ing costs more un­pre­dictable and harder to model as they build agents and em­bark on am­bi­tious AI in­vest­ments. Twenty-six per­cent of com­pa­nies say they have a com­pre­hen­sive view of their AI costs, while 50% have some vis­i­bil­ity and 22% re­port no vis­i­bil­ity or vis­i­bil­ity af­ter billing, ac­cord­ing to an as-yet-un­re­leased sur­vey from KPMG. It’s a new re­source that needs to be man­aged that did­n’t ex­ist quite that way, and we’re see­ing ex­po­nen­tial growth,” said Steve Chase, KPMGs global head of AI.

Twenty-six per­cent of com­pa­nies say they have a com­pre­hen­sive view of their AI costs, while 50% have some vis­i­bil­ity and 22% re­port no vis­i­bil­ity or vis­i­bil­ity af­ter billing, ac­cord­ing to an as-yet-un­re­leased sur­vey from KPMG. It’s a new re­source that needs to be man­aged that did­n’t ex­ist quite that way, and we’re see­ing ex­po­nen­tial growth,” said Steve Chase, KPMGs global head of AI.

How ut­terly ridicu­lous! Only in the froth­iest, most-dis­con­nected econ­omy in his­tory could we have com­pa­nies spend­ing mil­lions (or tens or hun­dreds of mil­lions) of dol­lars on a ser­vice with­out hav­ing any vis­i­bil­ity into costs un­til af­ter billing. This is not a sus­tain­able rev­enue stream un­der any cir­cum­stances, and any­body who says that it is is ei­ther ig­no­rant, a mark or a con artist. This is rev­enue made en­tirely by con­vinc­ing your cus­tomers that some­thing is true (AI is the most rev­o­lu­tion­ary thing ever!) and keep­ing them in the dark as long as hu­manly pos­si­ble as they run up ridicu­lous bills, all in the hopes that you’ve brain­washed the ex­ec­u­tives/​payp­igs well enough that they’ll never stop.

And re­ally, paypig” is the ac­cu­rate term for these cretins:

Russell Burke, Life360’s fi­nance chief, said the com­pany does­n’t yet have a real-time mon­i­tor of its to­ken spend­ing, but he hopes to have one soon. We hope that’s right around the cor­ner,” he said.

Russell, you may as well let Dario Amodei put a cig­a­rette out on your fore­head! This is pa­thetic! What a fuck­ing loser! Oohhh, I sure hope that the com­pany I pay all this money to lets me see how much I’m spend­ing! I thought Silicon Valley was meant to be all about mer­i­toc­racy!

Sidenote: I will say that it’s nice af­ter two years of be­ing called a crank and a doomer to read an out­let say ex­actly what I’ve been say­ing for years — that busi­nesses will squeal when they are made to pay the true cost of AI.

Boosters will say that it’s hard to mea­sure pro­duc­tiv­ity for any job out­side of sales,” but that’s sim­ply not true! If you let your en­gi­neers spend $1500 or more a month on a ser­vice, surely you must have some way of mea­sur­ing how much ac­tual new stuff went out — new fea­tures, cus­tomer tick­ets re­duced, pro­jects com­pleted, I don’t know, I’m not the fuck­wit spend­ing $1,500 a month per per­son on this garbage! You’re the one that has to jus­tify it!

But, fun­da­men­tally, these are all signs that AI is slow­ing down.

Remember: Anthropic and OpenAI only moved their cus­tomers to to­ken-based billing in Q1 2026. It only took two or three months for us to get head­line af­ter head­line of big, se­ri­ous busi­ness pub­li­ca­tions say­ing AI costs a lot of money and com­pa­nies aren’t sure if there’s a re­turn on in­vest­ment.”

Sidenote: What do you think hap­pens when reg­u­lar peo­ple are forced to pay their to­ken-based rates? Do you think they’ll spend more? If so, please read the many, many com­plaints from users of GitHub Copilot who have been on to­ken-based billing for less than a week.

If things were go­ing well, these sto­ries would be in­verted — com­pa­nies would be boast­ing about their re­mark­able to­ken spend and point­ing to all the new, in­cred­i­ble things they were ship­ping. Their prod­ucts would be spot­less, their fea­tures sub­lime, their en­gi­neers slid­ing en­tire new stacks of im­pres­sive soft­ware out the door so fast that it would be chang­ing the very na­ture of soft­ware. I mean…some­one would be, right?

Let’s check out the chaAHHH!

What’s ac­tu­ally hap­pen­ing is that these tools are — at a re­mark­able price — shov­ing a lot of stuff out the door. Is the stuff good? No. Do peo­ple like or use it? No. Does it make money? Also no. While we’ve dis­cov­ered the shov­el­ware, that’s all that LLMs have given us — more” apps, with the vast ma­jor­ity be­ing use­less, in­se­cure slop­ware.

This is meant to be the era of agen­tic cod­ing! This is meant to be the era where any dick­head with a Codex or Claude Code ac­count with $1,000 of free API cred­its should be able to cre­ate the next Salesforce or what­ever it was that dimwit Citrini talked about a few months ago.

I’m sorry to be a lit­tle surly and dis­mis­sive, but the AI in­dus­try has burned over a tril­lion dol­lars and I’ve spent two years be­ing told I’m a lud­dite and an ape for not cel­e­brat­ing it. I don’t care! I’m not im­pressed! I’m not cod­dling this mediocre, ex­pen­sive crap!

Like I said ear­lier: is­n’t the tech in­dus­try meant to be a glo­ri­ous bas­tion of mer­i­toc­racy? Isn’t this meant to be a cold, harsh com­mu­nity of ra­tio­nal­ists?

If so, why are we cod­dling AI like it’s the kid from that episode of the Twilight Zone? Has Silicon Valley be­come so de­cid­edly whipped by the forces of cap­i­tal­ism that it can’t see that none of this makes sense? Or was this al­ways just a cul­ture of lem­mings drawn in what­ever di­rec­tion ven­ture cap­i­tal waved a dol­lar bill?

To make mat­ters worse, both OpenAI and Anthropic are speed­ing as fast as they can to­ward IPO — which means that both will have to start look­ing like real com­pa­nies, which means both will, in­evitably, start charg­ing their cus­tomers more and very likely mov­ing the vast ma­jor­ity of them to to­ken-based billing and ei­ther kill or vastly limit their sub­si­dized sub­scrip­tions.

The AI Companies Are Going To Start Getting Desperate

In a mys­te­ri­ous con­flu­ence of events, both Claude Code chief Boris Cherny and OpenAI-owned OpenClaw tel­e­van­ge­list Peter Steinberger have both said that their users need to be designing loops for their agents,” mean­ing creating ways to make their agents burn a bunch of to­kens do­ing stuff,” I imag­ine as part of the on­go­ing cam­paign by both Anthropic and OpenAI to make peo­ple spend lots of money on to­kens to keep their en­ter­prises afloat.

I ex­pect that loops” will be­come the next thing that jour­nal­ists pick up on and start oink­ing about. To be clear, loops” al­ready ex­ist, in that you can make an LLM de­cide to keep tak­ing ac­tions whether or not a user prompts it for as long as you’d like. Whether the out­put works at the end is­n’t Peter or Boris’ prob­lem, as both of them are al­lowed to burn any­where from $130,000 to $1.3 mil­lion a month in to­kens. As I’ve ar­gued be­fore (though re­fer­ring to sub­si­dized sub­scrip­tions):

Think of it like this: if you’re us­ing an AI sub­scrip­tion with rate lim­its but no ac­tual costs, any mis­takes a model makes — such as get­ting stuck in a loop or just do­ing the wrong thing — can be dis­missed as the trou­bled na­ture of early-stage tech­nol­ogy, be­cause the cost” was $20, $100, or $200 for the en­tire month. Anthropic, OpenAI and every other AI com­pany de­lib­er­ately ob­fus­cated these costs be­cause they knew that the sec­ond a user ac­tu­ally had to pay for the fuck­ups of an AI model they’d scream like they were be­ing stung to death by bees.

To be clear, this is both OpenAI and Anthropic’s rep­re­sen­ta­tive stooges ac­tively sug­gest­ing that you shouldn’t be prompt­ing cod­ing agents any­more,” in­stead let­ting LLMs that hal­lu­ci­nate the more they reason” (IE: make plans for them­selves, which is how agents work) do as much rea­son­ing as pos­si­ble with­out user in­put.

These men have com­plete con­tempt for their users and cus­tomers. They do not give a shit that their mod­els break so of­ten that Notion had to cut ac­cess to Anthropic’s for sev­eral hours or that the costs are so se­vere that CFOs are a few bad bills from a trip to Budd Dwyer’s Favorite Lunch Spot. You must burn more to­kens, be­cause oth­er­wise you won’t be do­ing AI cod­ing right, what­ever that means.

And please, god, stop try­ing to con­vince me this shit is im­pres­sive. You all sound like you’re in an abu­sive re­la­tion­ship try­ing to ex­plain why a guy who ri­fles through your pock­ets and half-asses every­thing he does at an in­cred­i­ble cost is ac­tu­ally su­per sweet be­hind closed doors.

I’m dis­tinctly unim­pressed!

After hear­ing a par­tic­u­larly colour­ful story from Kevin Smith, I came up with the per­fect way to ex­plain the AI bub­ble. Okay, per­fect might be a stretch, but I think this gets my point across, and hell, it’s a free newslet­ter, what’re you go­ing to do? Kill me? Run me over with a truck? Good luck with that, I’m a huge home­body.

Anyway, imag­ine, if you will, a smaller ver­sion of the gi­ant me­chan­i­cal spi­der from Wild Wild West — a portable one that you sit in like a chair with big arms and big legs. The gi­ant metal spi­der costs $1 mil­lion, and takes up about $40,000 of fuel every time you use it, but it can some­times pick stuff up and make you din­ner.

The prob­lem, how­ever, is that it’s a gi­ant metal spi­der — some­times it pre­cisely grabs a diet coke from the fridge, and some­times it punches a hole clean through it, re­quir­ing both a brand new fridge and for me to pay $40,000 re­gard­less. The good news is that the com­pa­nies that make the gi­ant metal spi­der from Wild Wild West also sub­si­dize the gi­ant metal spi­ders at around $200 a month with free in­sur­ance, though busi­nesses are forced to pay for its ac­tual costs.

As I march it around my apart­ment, the gi­ant metal spi­der leaves hor­ri­ble scratch marks on my floor, it some­times makes a ter­ri­ble noise, but I, as the user, barely have to do any­thing — the spi­der does every­thing for me, even though what­ever it does” is in­cred­i­bly costly, con­vo­luted, and of­ten takes far longer than it should.

Every up­date to the spi­der widens what it can al­legedly do, but each time I use it it’s just as ex­pen­sive. Can the spi­der make me a cup of cof­fee? Yes. It takes five min­utes, which is longer than I’d take, and oc­ca­sion­ally it throws the cof­fee in the air or sim­ply fills the cup full of oil, but most of the time I get a cup of cof­fee. Isn’t that good? We love the gi­ant metal spi­der.

When I turn on the news, I see a head­line about how THE GIANT METAL SPIDER FROM WILD WILD WEST WILL CHANGE EVERYTHING.” 30 dif­fer­ent guys on Twitter write 800-word-long screeds about how we must re­design apart­ments and of­fice build­ings to cater to the spi­der, that it’s in­evitable that the metal spi­der from Wild Wild West will be how every­body does every­thing in the fu­ture,” and one guy even sug­gests that it’s alive be­cause, af­ter adding a $500,000 add-on, the gi­ant metal spi­der can be sched­uled to get up on its own and make the cof­fee. Sometimes it does so suc­cess­fully. Sometimes it smashes the cof­fee maker up into tiny lit­tle pieces.  Sometimes it mashes its legs through the kitchen is­land. Sometimes the spi­der opens up my Amazon pack­ages with ease. Sometimes the spi­der rips them in two.

Thankfully, the com­pa­nies be­hind the gi­ant metal spi­ders sub­si­dize them, so the av­er­age per­son only ex­pe­ri­ences the oc­ca­sional act of de­struc­tion, but they also lose bil­lions of dol­lars a year on train­ing the spi­ders and the con­stant main­te­nance re­quired to run them. There are some work­places full of the gi­ant metal spi­ders and they’re ab­solutely in­suf­fer­able.

Everywhere I go, some­body is telling me the spi­der is the fu­ture. The gi­ant metal spi­der from Wild Wild West will even­tu­ally stop de­stroy­ing stuff! Future in­no­va­tions in gi­ant metal spi­ders will make them cheaper and more-re­li­able! Look, we’ve done a study, and the gi­ant metal spi­der’s abil­ity to com­plete a task of a cer­tain length 50% of the time has in­creased!”

Every time they add a new fea­ture to the gi­ant metal spi­der from Wild Wild West, it re­quires sev­eral hun­dred mil­lion dol­lars, and it is­n’t al­ways clear whether the gi­ant metal spi­der learned any­thing new. It’s re­ally good at open­ing Amazon pack­ages, so they thought it might be able to make a bed, and spent $100 mil­lion train­ing it to do so, only to find it kept karate chop­ping the bed in half ap­prox­i­mately 20% of the time. Another time, the gi­ant metal spi­der from Wild Wild West showed promise at play­ing Texas Hold Em, suc­cess­fully get­ting through an en­tire game 50% of the time. Unfortunately, the other 50% of the time it smashed the cards into the table. After an­other $100 mil­lion, they were able to re­duce that num­ber to 30%. A day later, The Atlantic ran a story: Vegas Is Scared Of The Giant Metal Spider From Wild Wild West.”

Technically, the gi­ant metal spi­der is pro­duc­tive, at least in some house­holds where they give it sig­nif­i­cant room to ma­neu­ver and only give it tasks it’ll ex­cel at. Across the world, pri­vate credit fun­nels bil­lions into gi­ant metal spi­der fac­to­ries pow­ered by NVIDIA chips, as­sum­ing that every­body will be pay­ing to rent one of them. When you crit­i­cize the gi­ant metal spi­ders, you’re told that you use them in the wrong sce­nar­ios, ones where they’re guar­an­teed to fail. Young grad­u­ates are en­cour­aged to learn how to move the gi­ant metal spi­der, and that if they fail to, they’ll be un­able to ex­plore the gi­ant-sized fu­ture that will be built for them. Year af­ter year, more peo­ple in­sist that the gi­ant metal spi­der from Wild Wild West will get cheaper, but the costs only seem to in­crease along with the vast amounts of dam­age it causes.

It’s un­de­ni­able that the gi­ant metal spi­der from Wild Wild West can do stuff. Sometimes it even does the stuff as well as a per­son. For some rea­son, it’s im­pos­si­ble to tell when it’ll get things wrong, and de­spite every­body say­ing that the gi­ant metal spi­der from Wild Wild West is smart,” it seems to oc­ca­sion­ally do things the user did­n’t ask for.

If you say that the gi­ant metal spi­der from Wild Wild West is­n’t go­ing to be the fu­ture of any­thing due to its mas­sive, un­sus­tain­able costs, or sug­gest that its in­con­sis­ten­cies make it un­re­li­able in some way, you’re told you’re a doomer, a skep­tic, a lud­dite and a rube.

One day, some­one us­ing one of the gi­ant metal spi­ders from Wild Wild West steps on your car. Futurism writes an ar­ti­cle laugh­ing at you. You scream so loudly that one of your neigh­bors calls the po­lice.

AI Needs To Keep Growing To Feed The Circular Economy, Except The Con Needed A Real Product At Some Point

No mat­ter how much you dress up what­ever AI ser­vice has gaslit you into be­liev­ing it’s sen­tient, gen­er­a­tive AI is in­her­ently lim­ited, im­pos­si­bly ex­pen­sive and eco­nom­i­cally un­vi­able. Its ser­vices cost too much to run, its prog­en­i­tors have no path to prof­itabil­ity, and no amount of rigged bench­marks and anec­do­tal ex­am­ples of the­o­ret­i­cal en­gi­neer­ing teams that are 10x’d” can make up for the fact that you can’t mea­sure the cost of an LLM-driven task or its re­turn on in­vest­ment.

Anyone claim­ing that you have to measure AIs ROI dif­fer­ently” is at­tempt­ing to con ei­ther you or them­selves. While it’s tough to mea­sure the ROI of a par­tic­u­lar worker or pro­ject, most work­ers and pro­jects don’t in­crease your op­er­at­ing ex­penses by any­where from 10% to 100% un­der the vaguest of promises that you might be doing the fu­ture.” AI is calami­tously ex­pen­sive and, de­spite years of promises of it get­ting cheaper for both those run­ning AI ser­vices and its cus­tomers, costs have only ever in­creased.

I think that’s by de­sign. AI labs want their costs to be high so that they can con­tinue grow­ing at ridicu­lous rates, all so that they can keep feed­ing money to their hy­per­scaler com­pute part­ners who then in­vest that money right back into them, cre­at­ing fur­ther rea­sons to keep buy­ing NVIDIA GPUs, so that NVIDIA can then in­vest that money back into ei­ther AI com­pute providers (who OpenAI and Anthropic pay) or the AI labs them­selves.

Concepts like efficiency” or cost re­duc­tion” run counter to the greater nar­ra­tive of AIs vo­ra­cious sprawl of data cen­ter capex and still-the­o­ret­i­cal AI rev­enue. If OpenAI or Anthropic were to seek prof­itabil­ity or sus­tain­abil­ity (assuming these things were pos­si­ble), that would cre­ate less de­mand for AI com­pute, which would mean less de­mand for Azure or Google Cloud or Amazon Web Services or CoreWeave or Oracle Cloud Infrastructure, which would in turn mean less de­mand for NVIDIA GPUs.

The prob­lem with this mar­velous plan is that at some point there had to be an hon­est trans­ac­tion — real, hon­est, sus­tain­able de­mand based on a re­li­able prod­uct that peo­ple liked pay­ing for be­cause they un­der­stood its value. Right now, AI rev­enues are ei­ther chaot­i­cally ex­per­i­men­tal or so thor­oughly-sub­si­dized that labs are giv­ing away hun­dreds of dol­lars a user in the hopes that at some point said user might want to pay even more money for mea­sur­ably less value, the kind of propo­si­tion you make when you think your cus­tomers are fuck­ing id­iots.

It only took a few months of to­ken-based billing for the AI con­ver­sa­tion to go from our mag­i­cal, beau­ti­ful agents” to hmm, are we sure this is worth it?” and I be­lieve it only gets worse from here. AI labs do not have some su­per se­cret trick up their sleeves — no, not even Mythos, that was bull­shit I’m afraid — that will sud­denly pro­vide the kind of ROI that’s im­pos­si­ble to ig­nore, nor do they have some mag­i­cal way to bring down their costs while also spend­ing just as much on com­pute.

From here, we ba­si­cally need to 10x every part of the AI stack based on the pro­jec­tions and com­mit­ments made by ef­fec­tively every AI firm. Anthropic and OpenAI must grow faster than any com­pany has ever grown be­fore in the space of a few years, and sud­denly be­come prof­itable, all while some­how rais­ing hun­dreds of bil­lions of dol­lars.

On top of that, we need at least an­other $250 bil­lion in an­nual AI com­pute de­mand, which likely means at least two other OpenAI or Anthropic-scale com­pa­nies. If this all sounds un­rea­son­able, don’t blame me. I’m not the stu­pid fucker that agreed to build 100GW+ of data cen­ters or mort­gaged the fu­ture of Oracle on the off chance that Sam Altman and Dario Amodei, two craven ma­nip­u­la­tors, some­how work out how to cre­ate Google 2 and Amazon 2 in the space of four fuck­ing years.

I Come Bearing Bad News For The AI Industry

I won’t tip my hand too much, but I have a story com­ing out in the next two weeks that will likely con­firm the ab­solute worst fears of the AI in­dus­try. Many have been in­cred­i­bly brazen about the po­ten­tial losses of par­tic­u­lar AI labs to the point that I made it my mis­sion to talk to as many peo­ple in the tech in­dus­try as hu­manly pos­si­ble, in part be­cause some who have sug­gested that I don’t speak to peo­ple who work in the tech in­dus­try.”

In truth, I speak with tech work­ers every sin­gle day of the week, and they’re in fuck­ing agony.

If you are some­one in the ex­ec­u­tive team of any ma­jor tech com­pany, know that your em­ploy­ees are, for the most part, com­pletely and ut­terly mis­er­able. Your end­less death march of do as much AI as pos­si­ble or we’ll fire you” and forc­ing them to use these tools day-in-day-out has rad­i­cal­ized them against you. Every day I hear from some­one who is deal­ing with the wrath of a dif­fer­ent Business Idiot who does­n’t do any­thing other than de­mand more de­liv­er­ables in a smaller time­frame with less peo­ple be­cause you keep lay­ing peo­ple off.

A Farmer Donated Land to Turn into a Park. The City Is Building a Massive Data Center Instead

www.404media.co

Almost 30 years ago a farm­ing fam­ily deeded land to the City of Taylor, Texas, on the con­di­tion the city use it for a pub­lic park. For the nom­i­nal fee of $10, the farm­ers granted the 87 acres to a pub­lic trust in 1999. Taylor sold it to Blueprint, a data cen­ter de­vel­oper, for $10 mil­lion in 2025. Now the land that was sup­posed to be­long to the com­mu­nity will be­come a 135,000 square foot data cen­ter.

Pamela Griffin and her fam­ily have owned homes near that land for gen­er­a­tions. Griffin and her broth­ers and sis­ters played base­ball on it, camped out on it, and then watched as their chil­dren and their chil­dren’s chil­dren did the same. Now a data cen­ter will be there, just 500 feet from Griffin’s home, nes­tled be­tween a power sub­sta­tion and the nearby rail­road tracks.

This post is for paid mem­bers only

Become a paid mem­ber for un­lim­ited ad-free ac­cess to ar­ti­cles, bonus pod­cast con­tent, and more.

Subscribe

Sign up for free ac­cess to this post

Free mem­bers get ac­cess to posts like this one along with an email round-up of our week’s sto­ries.

Subscribe

Already have an ac­count? Sign in

How much of Thermo Fisher’s antibody data has been manipulated?

reeserichardson.blog

[ TL;DR: As of 3 June 2026, we have iden­ti­fied more than 450 im­ages bear­ing signs of ma­nip­u­la­tion in ver­i­fi­ca­tion data ad­ver­tised by Thermo Fisher Scientific in its on­line pri­mary an­ti­bod­ies cat­a­log (+1 by Abcam). See the full repos­i­tory of prob­lem­atic im­ages, cu­rated by my­self and Sholto David, here:

Zenodo — Problematic im­ages in ven­dor an­ti­body ver­i­fi­ca­tion data

You are wel­come to con­tribute new find­ings at this Google form.

This blog post was orig­i­nal posted on 28 May 2026 and has not been edited to up­date counts since that date. There is an up­date cov­er­ing Thermo Fisher’s re­sponse at the bot­tom of this post. ]

A week and a half ago, while look­ing for trust­wor­thy data demon­strat­ing a cell line’s de­fi­ciency in the pro­tein p53, Sholto David came across the fol­low­ing im­age of a Western blot in Thermo Fisher Scientific’s on­line an­ti­bod­ies cat­a­log:

This im­age is sup­posed to demon­strate that the an­ti­body be­ing sold works as in­tended. It is la­beled as Advanced Verification” data on Thermo Fisher’s site and its cap­tion im­plies that the data was pro­duced in­ter­nally (other im­ages in the cat­a­log that have not been pro­duced in­ter­nally are la­beled un­der Published Figures”).

This Western blot ap­pears to be fab­ri­cated. As an­no­tated by Sholto, sev­eral of the bands in the im­age are iden­ti­cal af­ter flip­ping and ro­ta­tion:

Shortly af­ter, Johan Duchêne no­ticed a sim­i­larly sus­pi­cious im­age of an­other anti-p53 an­ti­body in Thermo Fisher’s cat­a­log. I de­cided to go look­ing my­self and quickly turned up ten more sus­pi­cious im­ages on eight other an­ti­body prod­ucts of­fered by Thermo Fisher.

Sholto and I have now doc­u­mented more than 100 im­ages pro­vided as ver­i­fi­ca­tion data in Thermo Fisher’s an­ti­body cat­a­log that have ap­par­ently been ma­nip­u­lated. You can see all of them at this Zenodo repos­i­tory, which we’ll try to up­date reg­u­larly. This repos­i­tory also con­tains a hand­ful of in­stances that are less sug­ges­tive of ma­nip­u­la­tion, but the data is still prob­lem­atic (e.g., the same im­age be­ing pre­sented as ver­i­fi­ca­tion data for two dif­fer­ent an­ti­bod­ies).

Here are some high­lights:

Some im­ages are sim­i­lar to the ex­am­ple that started this ex­cur­sion and also fea­ture bands that are un­usu­ally sim­i­lar to one an­other.

Many im­ages, if you ad­just the con­trast, fea­ture con­spic­u­ous brushstrokes”, sug­gest­ing that part of the im­age has been painted over in a pro­gram like Photoshop.

Other im­ages fea­ture repet­i­tive blocks of back­ground noise, sug­gest­ing that parts of the im­age were copy-pasted over each other. They might also fea­ture sud­den un­ex­pected dis­con­ti­nu­ities in the pat­tern of back­ground noise.

In one in­stance, I thought I had stum­bled across an­other one of these in­stances of du­pli­cated blocks of back­ground noise…

…only to dis­cover that dozens of an­ti­bod­ies for sale from Thermo Fisher pre­sent a ver­i­fi­ca­tion Western blot that fea­tures this ex­act back­ground pat­tern, just with min­i­mal ed­its such that the sin­gle band is po­si­tioned where one would ex­pect to see the pro­tein of in­ter­est.

At the time of writ­ing, we’ve doc­u­mented 50 in­stances of this back­ground pat­tern ap­pear­ing in ver­i­fi­ca­tion data on Thermo Fisher’s site, but this is far from an ex­haus­tive list. Similar im­age” searches us­ing Google Lens, Bing Images or DuckDuckGo be­tray hun­dreds more that we have yet to doc­u­ment.

Antibodies are near-ubiq­ui­tous but no­to­ri­ously fickle lab­o­ra­tory reagents in bio­med­ical re­search. For many ap­pli­ca­tions, it is ab­solutely cru­cial that the an­ti­bod­ies that you use are se­lec­tive (i.e., the an­ti­body binds strongly to the tar­get pro­tein) and spe­cific (i.e., the an­ti­body binds to the pro­tein of in­ter­est and lit­tle else). Commercially-available an­ti­bod­ies of­ten fail to meet these cri­te­ria. Members of YCharOS, an in­de­pen­dent an­ti­body val­i­da­tion ini­tia­tive, es­ti­mated in 2024 that more than 50% of all an­ti­bod­ies failed in one or more ap­pli­ca­tions”. Antibodies that don’t work as in­tended can de­lay ex­per­i­ments by weeks and non-spe­cific an­ti­bod­ies are a mas­sive source of ir­re­pro­ducibil­ity in the bio­med­ical lit­er­a­ture. To learn more, check out Johan’s September 2025 talk in which he de­tails his ex­pe­ri­ence with a study pub­lished us­ing a non-spe­cific an­ti­body.

Antibody ven­dors like Thermo Fisher (probably the largest lab­o­ra­tory reagent sup­plier in the world) put ver­i­fi­ca­tion data in their cat­a­logs to demon­strate to sci­en­tists that the prod­uct works as in­tended. While signs of ma­nip­u­la­tion in this ver­i­fi­ca­tion data don’t nec­es­sar­ily im­ply that the an­ti­bod­ies in ques­tion don’t work as ad­ver­tised, with­out re­li­able ver­i­fi­ca­tion data avail­able, sci­en­tists will have no way of know­ing un­til they have ac­tu­ally pur­chased the an­ti­body. And an­ti­bod­ies are not cheap; at Thermo Fisher, a sin­gle vial con­tain­ing a 0.1 mL aliquot of an­ti­body so­lu­tion typ­i­cally costs 400 to 500 USD.

We cre­ated our repos­i­tory of prob­lem­atic im­ages in ven­dor an­ti­body cat­a­logs A) to raise aware­ness among work­ing bio­med­ical sci­en­tists that the an­ti­body ver­i­fi­ca­tion data they see in a ven­dor’s cat­a­log may be un­re­li­able and B) to en­cour­age oth­ers to look for and re­port prob­lem­atic ven­dor-pro­vided an­ti­body ver­i­fi­ca­tion data (not lim­ited to just Thermo Fisher). If you spot any­thing, feel free to fill out this Google form so that it might be added to the spread­sheet and repos­i­tory.

A part­ing mes­sage: al­ways val­i­date your an­ti­bod­ies!

UPDATE 8 June 2026: Thermo Fisher has re­leased a galling 15-point re­sponse to our ob­ser­va­tions. The most im­por­tant part (in my as­sess­ment) is quoted be­low (emphasis mine):

6. Did Thermo Fisher ma­nip­u­late or fab­ri­cate an­ti­body data?No. The Company fully stands by the data and un­der­ly­ing sci­ence. At Thermo Fisher Scientific, as the world leader in serv­ing sci­ence, sci­en­tific in­tegrity is a core value. The Company takes an­ti­body val­i­da­tion, speci­ficity and ac­cu­rate prod­uct doc­u­men­ta­tion se­ri­ously, and is com­mit­ted to the trans­par­ent and eth­i­cal gen­er­a­tion, analy­sis and pre­sen­ta­tion of sci­en­tific data. In the process of prepar­ing an­ti­body im­ages for pub­li­ca­tion on its web­site, some im­ages may have been ad­justed to clar­ify for pre­sen­ta­tion pur­poses — not to al­ter or mis­rep­re­sent the un­der­ly­ing ex­per­i­men­tal re­sults. Thermo Fisher rec­og­nizes, how­ever, that im­age ad­just­ments of any kind can raise ques­tions about data in­tegrity, which is why mov­ing for­ward, where an orig­i­nal im­age is not pre­sent or avail­able, the Company will en­sure that web­site users are in­formed that an­ti­body im­ages may have been op­ti­mized for pre­sen­ta­tion and clar­ity on the web­site.

6. Did Thermo Fisher ma­nip­u­late or fab­ri­cate an­ti­body data?

No. The Company fully stands by the data and un­der­ly­ing sci­ence. At Thermo Fisher Scientific, as the world leader in serv­ing sci­ence, sci­en­tific in­tegrity is a core value. The Company takes an­ti­body val­i­da­tion, speci­ficity and ac­cu­rate prod­uct doc­u­men­ta­tion se­ri­ously, and is com­mit­ted to the trans­par­ent and eth­i­cal gen­er­a­tion, analy­sis and pre­sen­ta­tion of sci­en­tific data. In the process of prepar­ing an­ti­body im­ages for pub­li­ca­tion on its web­site, some im­ages may have been ad­justed to clar­ify for pre­sen­ta­tion pur­poses — not to al­ter or mis­rep­re­sent the un­der­ly­ing ex­per­i­men­tal re­sults. Thermo Fisher rec­og­nizes, how­ever, that im­age ad­just­ments of any kind can raise ques­tions about data in­tegrity, which is why mov­ing for­ward, where an orig­i­nal im­age is not pre­sent or avail­able, the Company will en­sure that web­site users are in­formed that an­ti­body im­ages may have been op­ti­mized for pre­sen­ta­tion and clar­ity on the web­site.

The phrase antibody im­ages may have been op­ti­mized for pre­sen­ta­tion and clar­ity on the web­site” is re­peated on this FAQ page six times. I en­cour­age read­ers to pe­ruse the im­ages col­lected in our Zenodo repos­i­tory and de­cide what could and could not char­i­ta­bly be de­scribed as optimization for pre­sen­ta­tion and clar­ity”.

To add this web app to your iOS home screen tap the share button and select "Add to the Home Screen".

10HN is also available as an iOS App

If you visit 10HN only rarely, check out the the best articles from the past week.

Visit pancik.com for more.