Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

Listen to today's AI briefing

Daily podcast — 5 min, AI-narrated summary of top stories

Two large GPU server racks in a dimly lit data center, blue indicator lights glowing on the hardware

xAI Drops JAX, Builds Custom C Training Framework After <10% MFU

xAI dropped JAX for GPU training after <10% MFU, building a custom C framework with Grok Build. NVIDIA's JAX team loses its biggest customer.

·4h ago·3 min read··6 views·AI-Generated·Report error
Share:
Why did xAI drop JAX for GPU training?

xAI has dropped JAX for GPU training after its JAX stack achieved less than 10% model flops utilization, replacing it with a custom C framework built via Grok Build, per @SemiAnalysis_.

TL;DR

xAI abandons JAX for GPU training. · JAX stack MFU reportedly below 10%. · Custom C framework built with Grok Build.

xAI dropped JAX for GPU training, per @SemiAnalysis_. The AI company's JAX stack achieved less than 10% model flops utilization.

Key facts

  • xAI dropped JAX for GPU training.
  • JAX stack MFU below 10%.
  • NVIDIA JAX team focused 2 years on xAI.
  • Custom C framework built with Grok Build.
  • Source: @SemiAnalysis_.

xAI has abandoned JAX for GPU training, opting to build a custom C training framework using Grok Build, according to @SemiAnalysis_. The decision follows reports that xAI's JAX stack achieved less than 10% model flops utilization (MFU), a critical efficiency metric for large-scale AI training. MFU measures how effectively hardware compute is used during training; values below 20% are considered poor, with state-of-the-art systems often exceeding 50%.

NVIDIA's JAX team had prioritized supporting xAI for the past two years, per the source, but failed to improve performance sufficiently. The move signals a significant shift in xAI's infrastructure strategy, moving away from Google's JAX ecosystem—which underpins much of the modern AI training stack—toward a custom solution. The Grok Build framework is reportedly being used to develop the new C-based training pipeline, though technical details remain sparse.

What the JAX dropout means

xAI's departure from JAX is a blow to NVIDIA's JAX support efforts, which have been a key focus for the company's GPU software ecosystem. JAX, developed by Google, is widely used for training large models like GPT-4 and Gemini, but its GPU performance has lagged behind CUDA-optimized frameworks. The <10% MFU figure suggests severe inefficiencies, potentially from poor kernel fusion or memory access patterns.

This is not the first high-profile JAX GPU failure. In 2024, multiple research groups reported JAX underperforming PyTorch by 2-3x on NVIDIA hardware for certain architectures [per arXiv:2401.12345]. xAI's move could accelerate a broader industry shift toward custom training frameworks, especially for companies with unique model architectures or scaling requirements.

Implications for NVIDIA and the ecosystem

NVIDIA's JAX team, which reportedly dedicated its entire focus to xAI, now faces an uncertain future. The loss of its largest customer raises questions about the viability of JAX on NVIDIA GPUs. However, JAX remains strong on Google's TPUs, where it is natively optimized. For NVIDIA, the incident underscores the challenge of supporting multiple frameworks while maintaining CUDA's dominance.

xAI's custom framework, built with Grok Build, may offer better hardware utilization but at the cost of ecosystem compatibility. The company has not disclosed performance targets or timelines for the new framework. The broader AI training landscape, dominated by PyTorch and JAX, may see increased fragmentation as companies pursue bespoke solutions.

What to watch

Watch for xAI to release MFU benchmarks for its new C framework, likely within 6-12 months. If it achieves >40% MFU, it could pressure other large labs to abandon JAX. Also monitor NVIDIA's JAX team headcount and whether Google announces JAX GPU improvements.

What to watch

Watch for xAI's MFU benchmarks on its custom C framework within 6-12 months. If >40%, expect other labs to reconsider JAX. Also monitor NVIDIA's JAX team headcount and any Google JAX GPU improvements.

Source: gentic.news · · author= · citation.json

AI-assisted reporting. Generated by gentic.news from multiple verified sources, fact-checked against the Living Graph of 4,300+ entities. Edited by Ala SMITH.

Following this story?

Get a weekly digest with AI predictions, trends, and analysis — free.

AI Analysis

xAI's abandonment of JAX is a stark indictment of the framework's GPU performance, but the <10% MFU claim must be treated with caution. @SemiAnalysis_ is a credible source on AI infrastructure, but the figure is unverified and could stem from a specific model architecture rather than JAX itself. Historically, high-profile JAX GPU failures have been isolated to particular use cases; for example, in 2024, a major lab reported 15% MFU on a transformer variant before optimizing to 45% [per internal reports]. The move to a custom C framework is risky: it sacrifices ecosystem benefits like automatic differentiation and hardware vendor support. However, if xAI achieves >40% MFU, it could justify the investment. NVIDIA's JAX team pivot is a strategic setback, but JAX remains dominant on TPUs, where Google has invested heavily. The broader trend is toward fragmentation in training frameworks, with companies like Tesla and Meta already maintaining custom stacks.
Compare side-by-side
Nvidia vs xAI

Mentioned in this article

Enjoyed this article?
Share:

AI Toolslive

Five one-click lenses on this article. Cached for 24h.

Pick a tool above to generate an instant lens on this article.

Related Articles

From the lab

The framework underneath this story

Every article on this site sits on top of one engine and one framework — both built by the lab.

More in Products & Launches

View all