Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

wafer scale

30 articles about wafer scale in AI news

Cerebras Reengineers Mechanical Playbook for Wafer-Scale Chip Cooling

Cerebras disclosed three mechanical innovations—vertical power delivery, flexible interposers, and direct-impingement cooling—to prevent wafer-scale chips from cracking, rewriting engineering fundamentals.

88% relevant

Cerebras Challenges Nvidia Inference Monopoly with Wafer-Scale Edge

Cerebras is challenging Nvidia's inference dominance with wafer-scale chips, as inference workloads surpass training in AI compute spend.

70% relevant

Cerebra's Tokenomics Bet: AWS, OpenAI Deals and Wafer-Scale Edge

Cerebra's tokenomics pricing and AWS/OpenAI partnerships challenge NVIDIA's inference dominance, offering a 5x cost reduction per token via its wafer-scale architecture.

89% relevant

Cerebras WSE-3 Claims 10x Training Speed Over Nvidia H100 on GPT-Scale Model

Cerebras claims 10x training speed over Nvidia H100 for GPT-3-scale models using WSE-3. Benchmark lacks power and cost data, limiting independent verification.

64% relevant

Cerebras IPO Challenges GPU Scaling Orthodoxy

Cerebras filed for IPO on April 21, betting wafer-scale chips can disrupt Nvidia's GPU cluster model for AI workloads.

98% relevant

Terafab's 1GW AI Compute Goal Requires Massive Fab Capacity

Analysis of Terafab's stated goals shows that achieving 1GW of AI compute would require approximately 190,000 wafer starts per month across logic and memory. This underscores the unprecedented scale of semiconductor manufacturing needed for future AI infrastructure.

85% relevant

Cerebras' Strategic Partnership Yields Breakthrough AI Training Results

Cerebras Systems' partnership with Abu Dhabi's G42 has produced remarkable AI training benchmarks, achieving results 100x faster than traditional GPU clusters. The collaboration demonstrates the viability of wafer-scale computing for large language model development.

85% relevant

Nvidia B200 Costs $6,400 to Produce, Gross Margin Hits 82%

Epoch AI estimates Nvidia's B200 GPU costs $5,700–$7,300 to produce, with HBM memory and advanced packaging accounting for two-thirds of the cost. At a $30k–$40k sale price, chip-level gross margins reach ~82%, though rack-scale margins may be lower.

100% relevant

Jensen Huang's 30-Year TSMC Battle: From 3D Graphics to AI GPUs

A 30-year-old comic shows Jensen Huang convincing TSMC to supply wafers for 3D graphics chips. Today, he's still fighting for wafer supply, but now for AI GPUs, alongside Broadcom, AMD, MediaTek, and Amazon.

75% relevant

Google's Virgo Network Links 134,000 TPU v8 Chips with 47 Pbps Fabric

Google unveiled its Virgo networking stack for TPU v8, capable of linking 134,000 chips in a single fabric with 47 petabits/sec of bi-sectional bandwidth. This represents a massive scale-up in interconnect technology for large-scale AI model training.

100% relevant

Nvidia's Silicon Photonics Roadmap Targets AI Data Center Bottlenecks

Nvidia is developing its own silicon photonics-based interconnects to address the growing data transfer bottleneck within AI data centers and supercomputers. This move is critical as AI model size and cluster scale continue to grow exponentially.

86% relevant

Aehr Test Systems Lands $41M AI Chip Order; H2 Bookings Top $92M

Aehr Test Systems received a record $41 million production order from a key hyperscale AI customer. Total bookings for the second half of its fiscal year exceeded $92 million, highlighting surging demand for semiconductor test and burn-in equipment.

74% relevant

TSMC's $56B 2026 CapEx Fuels AI Chip Race with 22 New Fabs

TSMC is constructing up to 22 advanced semiconductor fabs simultaneously, backed by a $52–56 billion capital expenditure plan for 2026. This unprecedented manufacturing scale is critical for producing the 2nm-and-below chips required by next-generation AI models.

85% relevant

Nvidia's Groq Ramps Up AI Chip Production with Samsung in Major Partnership Expansion

Nvidia's recent acquisition Groq has significantly expanded its partnership with Samsung, increasing chip orders from 9,000 to 30,000 wafers. This massive production boost signals accelerated development of Groq's specialized AI inference processors amid growing market demand.

85% relevant

SemiAnalysis: N3 chip demand far outstrips current consensus estimates

SemiAnalysis argues N3 chip demand far exceeds consensus accelerator models, implying a structural silicon shortage not priced by markets.

89% relevant

Cerebras CS4 Stays on 5nm as SRAM Scaling Flattens

Cerebras CS4 stays on 5nm due to SRAM scaling flattening, per @SemiAnalysis_. 3nm offers no density gain, so the chip prioritizes yield and cost.

85% relevant

Cerebras Hits 981 Tokens/sec on 1T-Parameter Kimi K2.6, Claims 6.7× GPU Cloud Speedup

Cerebras reported 981 tokens/sec on the 1T-parameter Kimi K2.6 model, a 6.7× speedup over the next GPU cloud, validated by an independent third party.

93% relevant

Cerebras Shares Open at $385, 108% Above $185 IPO Price

Cerebras opened at $385, 108% above the $185 IPO price, raising $5.5B. The $68B market cap prices in aggressive growth against Nvidia's dominance.

100% relevant

Cerebras Understates On-Chip SRAM by 8x, SemiAnalysis Notes

Cerebras understates on-chip SRAM by 8x per SemiAnalysis, a rare under-specification in chip marketing.

75% relevant

Inference shift opens door for AI chip startups to challenge Nvidia

Inference shift from training to serving creates opportunities for AI chip startups. Nvidia's $20B Groq acquihire validates disaggregated compute strategies.

96% relevant

The $500B AI Chip Bottleneck: One Material, One Supplier

A single Japanese chemical company supplies 98% of the thin-film material used in every AI chip on earth. NVIDIA is paying half the capex to expand supplier fabs as lead times stretch past 6 months.

90% relevant

AI Chip Capacity Crisis: 10GW Left Through 2030, Prices Up Double Digits

The AI accelerator market has only 10 gigawatts of capacity left for contract through 2030, with 100GW already under contract. Prices are rising double digits as one competitor has stopped taking orders entirely.

97% relevant

Gur Singh Claims 7 M4 MacBooks Match A100, Calls Cloud GPU Training a 'Scam'

Developer Gur Singh posted that seven M4 MacBooks (2.9 TFLOPS each) match an NVIDIA A100's performance, calling cloud GPU training a 'scam' and advocating for distributed, consumer-hardware approaches.

77% relevant

Jensen Huang: Nvidia is a 'Computing Company,' Not a Car

Nvidia CEO Jensen Huang, in a new interview, argued that Nvidia is a 'computing company' and not a car—a product that can be easily interchanged. This distinction underscores Nvidia's strategy to be the indispensable platform for AI infrastructure.

85% relevant

Houthi Threat to Bab el-Mandeb Strains AI Chip Supply Chain

Escalating Middle East conflict threatens two key maritime chokepoints, Bab el-Mandeb and Hormuz, jeopardizing the helium and energy supplies that underpin global advanced AI chip manufacturing at TSMC and SK Hynix.

95% relevant

Neuromorphic Computing Patents Surge 401% in 2025, Hits 596 by 2026

Patent filings for neuromorphic computing—hardware that mimics the brain's architecture—surged 401% in 2025, reaching 596 by early 2026. This indicates the technology is transitioning from lab prototypes to commercial products.

87% relevant

TSMC 2nm Capacity Constraints Create Opening for Samsung in AI Chip Foundry Race

TSMC has reportedly hit a 'hard capacity wall' at its 2nm node, creating a strategic opportunity for Samsung Foundry to capture AI accelerator business from major clients like Nvidia and OpenAI. This bottleneck could reshape the competitive landscape for advanced semiconductor manufacturing.

85% relevant

Kyushu University AI Model Achieves 44.4% Solar Cell Efficiency, Surpassing Theoretical SQ Limit

Researchers at Kyushu University used an AI-driven inverse design method to create a photonic crystal solar cell with 44.4% efficiency, exceeding the 33.7% Shockley-Queisser limit for single-junction cells.

85% relevant

Elon Musk Announces $20 Billion Austin Chip Fab, Calls It Most Ambitious Manufacturing Project Since Manhattan Project

Elon Musk announced a $20 billion semiconductor fabrication plant in Austin, Texas, describing it as the most ambitious manufacturing project since the Manhattan Project. The announcement was made via a retweet but lacks specific technical details on node size, capacity, or timeline.

85% relevant

Elon Musk Says Global Chip Fabs Supply Only 2% of Tesla's AI Compute Needs, Driving Terafab Build

Elon Musk stated current global chip fabrication capacity can supply only about 2% of Tesla's AI compute requirements, necessitating the construction of a 'terafab' even if suppliers expand.

87% relevant