broadcom
30 articles about broadcom in AI news
OpenAI-Broadcom Chip Hints at Token Price Collapse
OpenAI and Broadcom are co-developing a custom AI inference chip that could cut token prices by an order of magnitude, per @mweinbach. The chip targets inference workloads, not training, and aims to reduce dependency on Nvidia.
OpenAI, Broadcom Unveil Jalapeño ASIC for LLM Inference
OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, targeting volume deployment by late 2026. No performance metrics were disclosed.
Meta Expands Broadcom Partnership for Next-Gen AI Infrastructure
Meta is expanding its partnership with semiconductor giant Broadcom to co-develop its next-generation AI infrastructure. This move signals a continued, long-term commitment to custom silicon for AI training and inference.
Broadcom to Manufacture Google TPU Chips in Foundry Partnership
Google has licensed its Tensor Processing Unit (TPU) intellectual property to Broadcom for chip fabrication. This allows Google to earn from its IP while Broadcom manages the complex hardware build and networking integration.
Broadcom's $100 Billion AI Chip Forecast Signals Industry Transformation
Broadcom CEO Hock Tan projects the company's AI chip sales will exceed $100 billion in 2027, challenging Nvidia's dominance. This forecast comes alongside Anthropic resuming Pentagon talks about military AI applications.
Jensen Huang's 30-Year TSMC Battle: From 3D Graphics to AI GPUs
A 30-year-old comic shows Jensen Huang convincing TSMC to supply wafers for 3D graphics chips. Today, he's still fighting for wafer supply, but now for AI GPUs, alongside Broadcom, AMD, MediaTek, and Amazon.
Anthropic Secures Multi-Gigawatt Google TPU Deal for Frontier Claude Models
Anthropic announced a multi-gigawatt agreement with Google and Broadcom for next-generation TPU capacity, coming online in 2027, to train and serve frontier Claude models.
Wiwynn Shows First SCADA Server: 2.9PB, No CPU for I/O
Wiwynn showed first Nvidia SCADA server at Computex 2026: 2.9 PB storage, 528M IOPS, GPUs bypass CPU for I/O. Marks shift in AI storage architecture.
Google Books Intel for 3M+ TPUs in 2028 as TSMC CoWoS Hits Capacity Wall
Google booked Intel to package 3M+ TPUs in 2028 as TSMC CoWoS capacity caps out. SK hynix tests HBM on Intel EMIB, potentially unlocking Nvidia's Feynman architecture.
Ethernet AI Switch Sales Double, InfiniBand Rebounds 22% in Q2
Ethernet AI switch sales doubled YoY as InfiniBand rebounded 22%. Ethernet now captures 41% of AI interconnect revenue, up from 28%.
WiFi routers can identify individuals with near-perfect accuracy, KIT shows
KIT researchers show WiFi routers can identify individuals with near-perfect accuracy via beamforming feedback, tested on 197 subjects.
CNAS Report: AI Hits Silicon Wall as Chip Supply Trails $700B CapEx
CNAS report warns semiconductor manufacturing cannot keep pace with AI demand as hyperscalers plan $700B+ CapEx in 2026. Silicon replaces power as the near-term constraint.
Anthropic Targets $900B Valuation in $50B Funding Round
Anthropic plans $50B raise at $900B valuation, with $45B annualized revenue — up 5x from 2024. IPO under consideration.
Anthropic Doubles Claude Code Rate Limits, Leases All of SpaceX's Colossus 1
Anthropic doubled Claude Code's 5-hour rate limits and removed peak-hour throttling for Pro, Max, Team, and seat-based Enterprise plans, then disclosed the source of the new capacity: a lease on the entire Colossus 1 data center — 300 MW and ~220,000 NVIDIA GPUs in Memphis — that SpaceX absorbed when it took over xAI.
OpenAI's MRC Protocol Sprays Packets Across 100+ Paths to Fix GPU Stragglers
OpenAI open-sourced MRC, a networking protocol that sprays packets across hundreds of paths to reduce GPU idle time from congestion and failures, contributed to OCP.
NVIDIA Open-Sources MRC, the RDMA Protocol Powering OpenAI's Blackwell Clusters
NVIDIA open-sourced MRC, a multi-path RDMA protocol used by OpenAI on Blackwell clusters, enabling microsecond rerouting across 64 paths.
Astera Labs Scorpio X-Series Switch Targets 49% Collective IO Cut for Idle GPUs
Astera Labs introduced Scorpio X-Series 320-lane switch targeting 49% collective IO reduction for fragmented AI workloads. Shipments to hyperscalers began, with broad ramp in H2 2026.
Qualcomm Ships Hyperscaler Custom Silicon by December 2026
Qualcomm is developing custom silicon for an unnamed hyperscaler, with shipments expected December 2026, marking its most concrete data-center comeback move.
Google Inks Pentagon AI Deal, Reverses 2018 Stance
Google signed a deal allowing the Pentagon to use its AI models for classified work and 'any lawful government purpose,' reversing its 2018 exit from Project Maven. The contract includes non-binding language on surveillance and autonomous weapons, and requires Google to adjust AI safety filters at government request.
Google Splits TPU Line: 8t for Training, 8i for Inference
At Cloud Next 2026, Google introduced two new AI chips — TPU 8t for training and TPU 8i for inference — splitting its custom silicon for the first time. OpenAI, Anthropic, and Meta are buying multi-gigawatt TPU capacity, signaling a crack in NVIDIA's 81% market share.
Utah Hyperscale Data Center to Exceed State Power Use
A hyperscale data center in Box Elder County, Utah, developed by Kevin O'Leary's O'Leary Digital, is set to generate and consume more power than the state itself, moving toward final approval.
Google to Invest Up to $40 Billion in Anthropic
Google will invest up to $40 billion in Anthropic: $10B immediate, $30B tied to performance milestones, plus 5GW of TPU compute capacity by 2027. The deal mirrors Amazon's earlier $25B commitment and reinforces the circular compute-for-equity pattern dominating AI infrastructure spending.
Meta Deploys Millions of Amazon Graviton CPUs for AI Agents
Meta will deploy tens of millions of AWS Graviton5 CPU cores for AI agent workloads, signaling that agentic inference favors CPUs over GPUs. The deal deepens Meta's $200B+ infrastructure push amid layoffs and cloud rivalry.
Arista Doubles 2026 AI Revenue Target to $3B+ on Open Ethernet
Arista Networks doubled its 2026 AI networking revenue target to over $3 billion, citing expanded roles for open Ethernet in AI data centers. This signals a major shift toward disaggregated, standards-based networking for AI clusters.
Cisco Reveals Scale-Across GPU Networking Needs 14x DCI Bandwidth
Cisco's chief architect detailed the massive bandwidth requirements for connecting AI clusters via 'scale-across' GPU networking, which needs 14x the capacity of traditional data center interconnects. This shift is creating a multi-billion dollar market for 800G coherent pluggables and deep-buffered switches.
Foxconn to Mass-Produce 10,000+ CPO Optical Switches for AI in Q3 2026
Foxconn's manufacturing arm will begin volume production of advanced co-packaged optics (CPO) switches in Q3 2026, targeting over 10,000 units. This move directly addresses the critical bandwidth and power bottlenecks in next-generation AI data center infrastructure.
Nvidia's Silicon Photonics Roadmap Targets AI Data Center Bottlenecks
Nvidia is developing its own silicon photonics-based interconnects to address the growing data transfer bottleneck within AI data centers and supercomputers. This move is critical as AI model size and cluster scale continue to grow exponentially.
AMD Backs UALink Open Interconnect to Challenge NVIDIA NVLink in AI
AMD is supporting the newly formed UALink Consortium, which aims to create an open standard for connecting AI accelerators. This move challenges NVIDIA's control over the critical NVLink technology that underpins its AI data center systems.
IOWN Forum Pushes All-Photonic WAN for AI Neocloud Interconnects
The IOWN Global Forum is focusing its optical networking tech on datacenter interconnects, aiming to let GPU 'neoclouds' and financial firms use cheaper, remote facilities without latency penalties for AI workloads.
Anthropic's Claude Mythos Scores 83.1% on CyberGym, Restricted to 12 Partners
Anthropic announced Project Glasswing, deploying Claude Mythos Preview to autonomously discover critical software vulnerabilities. Scoring 83.1% on CyberGym, it's restricted to 12 launch partners due to dual-use risks, with a 90-day disclosure window.