Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

A laptop displays a graph with a downward trend, while a person gestures toward a whiteboard with 'Pretrainitis'…

SemiAnalysis: Pretraining Dead for All but Frontier Labs

@SemiAnalysis_ declares pretraining dead for non-frontier labs, citing 'Pretrainitis' as vanity-driven waste. Prompt engineering offers higher ROI.

AAAla SMITH & AI Research Desk·Jun 11, 2026·3 min read··248 views·AI-Generated·Report error

Source: x.comvia @SemiAnalysis_Multi-Source

Does pretraining make sense for enterprises and startups?

According to @SemiAnalysis_, pretraining no longer makes economic sense for any organization except frontier labs like OpenAI and Google DeepMind, with higher ROI from prompt engineering partnerships.

TL;DR

Pretraining ROI negative for most enterprises. · Frontier labs only viable pretrainers. · Prompt engineering offers better enterprise ROI.

@SemiAnalysis_ declared pretraining dead for non-frontier labs. Enterprises and startups suffer from 'Pretrainitis,' pursuing vanity pretraining for career advancement.

Key facts

Pretraining ROI negative for non-frontier organizations.
@SemiAnalysis_ coined term 'Pretrainitis'.
Frontier labs include OpenAI, Google DeepMind, Anthropic.
Prompt engineering offers higher ROI according to analysis.
SemiAnalysis is a respected AI infrastructure analyst firm.

According to @SemiAnalysis_, the economic calculus for pretraining large language models has shifted decisively. The analyst firm argues that only frontier labs—OpenAI, Google DeepMind, Anthropic—can justify the capital and compute expenditure required for pretraining runs that now cost tens of millions of dollars.

"Pretraining fundamentally does not make sense anymore for anyone other than frontier labs," the thread states. It identifies a widespread pattern across enterprises and startups: teams pushing to pretrain their own models as a way to demonstrate impact internally and secure promotions, a phenomenon the firm dubs 'Pretrainitis.'

The diagnosis is blunt: "There are a lot of people at enterprises & startups who have 'Pretrainitis' to show 'impact' and get promotions, fundamentally, it doesn't make sense." Instead, @SemiAnalysis_ recommends a different allocation of resources. "There is probably higher ROI in partnering with a frontier lab to do prompt engineering, although it isn't as 'sexy' as pretraining."

This view aligns with a broader industry trend. In 2025, multiple enterprise AI projects pivoted from pretraining to fine-tuning existing frontier models, driven by the escalating cost of training runs. Meta's Llama 3.1 405B, for instance, required an estimated $60M+ in compute alone. For a non-frontier organization, replicating such capability is economically irrational when API access to frontier models costs pennies per query.

The 'Pretrainitis' critique extends to startup funding dynamics. VCs have grown skeptical of pretraining-heavy pitches, preferring startups that build applications on top of existing models. The signal: if your startup's core moat is a pretrained model not built by a frontier lab, your runway may be shorter than your training epoch.

What to Watch

Watch for enterprise AI budget allocations in Q2 2026 earnings calls—specifically whether companies disclose reducing internal pretraining spend in favor of API-based fine-tuning or prompt engineering services. A single Fortune 500 CFO calling out 'Pretrainitis' by name would validate the thesis.

Key Takeaways

@SemiAnalysis_ declares pretraining dead for non-frontier labs, citing 'Pretrainitis' as vanity-driven waste.
Prompt engineering offers higher ROI.

[Updated 12 Jun via gn_dc_power]

OpenAI is reportedly in talks to lease a 10-gigawatt data center in Ohio, with backing from Nvidia, according to The Information. The facility would require an estimated $500 billion investment and could be built on federal land. This massive infrastructure push underscores that even as pretraining becomes uneconomical for non-frontier labs, the leading frontier labs are doubling down on compute scale to maintain their edge.

Sources cited in this article

The Information. The

Source: gentic.news · Jun 11, 2026 · author=Ala SMITH · citation.json

AI-assisted reporting. Generated by gentic.news from 1 verified source, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

This is a structural call on the AI industry's compute hierarchy. Pretraining has become a natural monopoly—only labs with $100M+ training budgets and access to 100K+ GPU clusters can compete. The 'Pretrainitis' framing is a useful corrective to the cargo-cult mentality that swept corporate AI teams in 2023-2024, where every company wanted 'their own model.' The irony: the same forces that made pretraining inaccessible (scale, capital intensity) make prompt engineering and fine-tuning more valuable as differentiation vectors. @SemiAnalysis_ is effectively arguing that the moat has moved from model weights to application-layer distribution and data flywheels—a thesis that, if correct, implies most pretraining startups are burning value.

#ai strategy #industry analysis #enterprise ai

This story is part of

The AI Infrastructure War Shifts from Chips to Developer Tools

Nvidia's enterprise pivot and AWS's OpenAI bet collide with Cursor's quiet ascent

Compare side-by-side

Anthropic vs OpenAI

→

Mentioned in this article

SemiAnalysis Pretrainitis Pretraining OpenAI Anthropic Google prompt engineering

Enjoyed this article?

Get the weekly AI intelligence briefing

✨AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

Big Tech3 shared topics

Google alone ships full any-to-any multimodal models

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

Original research · EUMAS 2026

MNEMA — A Witness Lattice for Multi-Agent AI Memory

Cryptographic memory units · 1−α detection floor · 15 pp PDF

Field framework · v1.0

Epistemic Infrastructure

12 pillars · 11-stage knowledge metabolism · pathology catalog

SemiAnalysis: Pretraining Dead for All but Frontier Labs

What to Watch

Key Takeaways

Sources cited in this article

AI Analysis

✨AI Toolslive

Related Articles

Xiaomi MiMo-V2.5 Tops Global LLM Token Usage at 31.2T/Month

LLM Waterfall Pattern: 429 Failover Beats Retries & Circuit Breakers

Agentic Coding Tools Flood Market as Enterprise Adoptions Triple in 2026

Google Ships 3 Flash Models as 3.5 Pro Remains Missing

Gemini 4 Pretraining Begins, Google's Most Ambitious Run Yet

Google alone ships full any-to-any multimodal models

The framework underneath this story

More in Opinion & Analysis

Claude Quietly Removes Full Thinking Traces, Users Report

Vercel Data: Open Models Spend Collapses to All-Time Low

Fchollet: Future AI Will Be 'Incredibly Cheap' to Train