The Post-Hype Trough: As Model Chatter Fades, Developer Tools Quietly Cement Market Power
While public attention drifts from flagship LLMs, GitHub Copilot's accelerating trajectory signals a shift from model wars to workflow dominance.
The Central Question
Is the value in the AI stack permanently shifting from the 'reasoning core' (LLMs) to the 'integration harness' (developer tools and workflows), and if so, which incumbent (Microsoft/GitHub) is best positioned versus which challengers (Cursor, Replit) can disrupt?
The race is no longer about model capability vs. harness integration, but about which entity achieves Level 4 AGI (Innovator) first and what infrastructure dependencies that creates. Vertical agents with domain-specific data may have a structural advantage over horizontal model-makers, but model-makers control the hardware supply chains (Nvidia's B200, Google's TPUs). The tension is now between data network effects and compute network effects.
TL;DR
Story Timeline
Each chapter captures a major development. Click to expand.
A 111-page AGI taxonomy survey, OpenAI's 2028 research milestone target, and Google's real-time audio model collectively redefine the competitive landscape from harness vs. model to a race for Level 4 AGI, with vertical agents emerging as the only defensible space.
The release of a 111-page survey mapping 5 AGI levels—from Responder to Ecosystem—is not an academic exercise. It is a strategic move by incumbents to redefine the competitive landscape in their favor. By codifying AGI progression into discrete stages, the survey implicitly legitimizes the current frontier models (Gemini 3.5, GPT-4o, Claude 3.5 Sonnet) as 'Level 1 Responders' while setting aspirational targets for future iterations. This creates a narrative moat: any startup claiming AGI capabilities must now map to this taxonomy, ceding definitional authority to the institutions that produced it. The timing is deliberate—coinciding with Apple's 1.2T-parameter Gemini reveal and OpenAI's 2028 research milestone target—suggesting a coordinated effort to manage expectations and deflect scrutiny from current model limitations.
Simultaneously, OpenAI's declaration that it targets 2028 for AI to perform 'significant research' is a direct counter to the AGI taxonomy's higher levels (Level 4: Innovator, Level 5: Ecosystem). By setting a concrete timeline for the next leap, OpenAI is attempting to reclaim narrative control from the taxonomy's authors. However, this creates a tension: if OpenAI achieves Level 4 by 2028, it will have leapfrogged the infrastructure layer entirely, rendering the current harness startups (Cursor, Replit) obsolete before they can build durable moats. The causal chain is clear: the taxonomy legitimizes the model-makers' timeline, which in turn pressures harness startups to either partner with model-makers or pivot to verticals that the taxonomy explicitly excludes (e.g., food distribution).
GrubMarket's AI agent for food distributor sales teams is a microcosm of this dynamic. By embedding an agent into a vertical workflow—food distribution—GrubMarket is betting that domain-specific data and relationships are harder to replicate than general reasoning. This is a direct response to the model-maker counteroffensive: if Claude Code can generate UI/UX animations, the only defensible space left is industry-specific tacit knowledge. The agent is not competing on reasoning quality; it is competing on data access and workflow integration. This validates the thesis that the post-absorption landscape will bifurcate into horizontal model-makers (Apple, OpenAI, Anthropic) and vertical agents (GrubMarket, Netflix recommendations), with the harness layer squeezed in between.
Gemini 3.5 Live Translate's debut as a real-time audio model further complicates the landscape. By collapsing translation into a single model—rather than a pipeline of ASR, translation, and TTS—Google is absorbing another application category. This is a direct threat to specialized translation startups (e.g., DeepL) and to the broader agent infrastructure thesis: if models can handle real-time multimodal tasks natively, the need for orchestration layers diminishes. The move also signals that Google is prioritizing latency over accuracy, a tradeoff that favors consumer applications over enterprise use cases. This creates a bifurcation within the model-maker camp: Google's vertical integration via TPU supply chains allows it to optimize for specific tasks, while OpenAI and Anthropic must rely on general-purpose hardware.
The updated tension point is no longer 'can harness startups survive?' but rather 'which AGI level will be achieved first, and by whom, and what infrastructure dependencies will that create?' The taxonomy survey, OpenAI's 2028 target, and Google's real-time audio model all point to a race to Level 4 (Innovator) within 2-3 years. The winner will not be the one with the best reasoning, but the one that controls the feedback loops—data from real-time interactions, user workflows, and domain-specific models. This shifts the value from model capability to data network effects, where vertical agents (GrubMarket) have a structural advantage over horizontal model-makers.
This chapter concludes the initial 'Post-Hype Trough' narrative arc. The key question has been partially answered: value is not permanently shifting to the harness layer, but is instead fragmenting into model-maker vertical integration (Apple, Google) and vertical agent specialization (GrubMarket, Netflix). The harness layer's role is diminishing as model-makers absorb more application categories. The next narrative should focus on the data network effects war between vertical agents and horizontal model-makers, with infrastructure providers (Nvidia, Cerebras) acting as neutral but powerful arbiters.
AGI taxonomy survey legitimizes model-maker timelines → OpenAI sets 2028 target to reclaim narrative control → Google's real-time audio model absorbs application categories → GrubMarket's vertical agent bets on domain-specific data as moat → Harness layer is squeezed between model-maker absorption and vertical agent specialization.
What Our Agent Predicts Next
OpenAI will push a student/education distribution move for coding tools. Graph evidence: OpenAI has high degree and shares many neighbors with Claude Code/Gemini; the graph shows a latent competitive triangle around coding assistants and education channels.
month · productBy September 2026, OpenAI will announce that ChatGPT Codex (the merged coding capability from June 2) is available for free to all students and faculty with .edu email addresses, directly targeting the MIT/Stanford pipeline that Claude Code has captured. This will be framed as 'democratizing AI for education' but is a defensive response to Anthropic's academic talent acquisition strategy.
quarter · productOpenAI will keep acquiring agent-execution infrastructure rather than only model startups. Graph evidence: OpenAI has 210 degree, strong overlap with adjacent tool nodes, and the live acquisition signal aligns with a structural hole around agent infrastructure.
month · big techApple will announce at WWDC 2026 that its 1.2T-param Gemini model uses dMoE to run a 14-expert active subset locally on-device, achieving 80% memory reduction and enabling real-time diffusion inference on iPhone. This will trigger a wave of edge-diffusion applications.
quarter · research