broadcom

30 articles about Broadcom in AI news

OpenAI-Broadcom Chip Hints at Token Price Collapse

OpenAI and Broadcom are co-developing a custom AI inference chip that could cut token prices by an order of magnitude, per @mweinbach. The chip targets inference workloads, not training, and aims to reduce dependency on Nvidia.

Jun 24, 2026 · 75% relevant

OpenAI, Broadcom Unveil Jalapeño ASIC for LLM Inference

OpenAI and Broadcom unveiled Jalapeño, a custom ASIC for LLM inference, targeting volume deployment by late 2026. No performance metrics were disclosed.

Jun 24, 2026 · 98% relevant

Meta Expands Broadcom Partnership for Next-Gen AI Infrastructure

Meta is expanding its partnership with semiconductor giant Broadcom to co-develop its next-generation AI infrastructure. This move signals a continued, long-term commitment to custom silicon for AI training and inference.

Apr 14, 2026 · 85% relevant

Broadcom to Manufacture Google TPU Chips in Foundry Partnership

Google has licensed its Tensor Processing Unit (TPU) intellectual property to Broadcom for chip fabrication. This allows Google to earn from its IP while Broadcom manages the complex hardware build and networking integration.

Apr 8, 2026 · 85% relevant

Broadcom's $100 Billion AI Chip Forecast Signals Industry Transformation

Broadcom CEO Hock Tan projects the company's AI chip sales will exceed $100 billion in 2027, challenging Nvidia's dominance. This forecast comes alongside Anthropic resuming Pentagon talks about military AI applications.

Mar 5, 2026 · 75% relevant

Jensen Huang's 30-Year TSMC Battle: From 3D Graphics to AI GPUs

A 30-year-old comic shows Jensen Huang convincing TSMC to supply wafers for 3D graphics chips. Today, he's still fighting for wafer supply, but now for AI GPUs, alongside Broadcom, AMD, MediaTek, and Amazon.

Apr 23, 2026 · 75% relevant

Anthropic Secures Multi-Gigawatt Google TPU Deal for Frontier Claude Models

Anthropic announced a multi-gigawatt agreement with Google and Broadcom for next-generation TPU capacity, coming online in 2027, to train and serve frontier Claude models.

Apr 6, 2026 · 95% relevant

Wiwynn Shows First SCADA Server: 2.9PB, No CPU for I/O

Wiwynn showed the first Nvidia SCADA server at Computex 2026, with 2.9 PB of storage and 528M IOPS; its GPUs bypass the CPU for I/O. The design marks a shift in AI storage architecture.

Jun 12, 2026 · 89% relevant

Google Books Intel for 3M+ TPUs in 2028 as TSMC CoWoS Hits Capacity Wall

Google booked Intel to package 3M+ TPUs in 2028 as TSMC CoWoS capacity caps out. SK hynix tests HBM on Intel EMIB, potentially unlocking Nvidia's Feynman architecture.

Jun 10, 2026 · 98% relevant

Ethernet AI Switch Sales Double, InfiniBand Rebounds 22% in Q2

Ethernet AI switch sales doubled YoY as InfiniBand rebounded 22%. Ethernet now captures 41% of AI interconnect revenue, up from 28%.

Jun 3, 2026 · 75% relevant

WiFi routers can identify individuals with near-perfect accuracy, KIT shows

KIT researchers show WiFi routers can identify individuals with near-perfect accuracy via beamforming feedback, tested on 197 subjects.

May 24, 2026 · 75% relevant

CNAS Report: AI Hits Silicon Wall as Chip Supply Trails $700B CapEx

A CNAS report warns that semiconductor manufacturing cannot keep pace with AI demand as hyperscalers plan $700B+ in CapEx for 2026. Silicon replaces power as the near-term constraint.

May 11, 2026 · 90% relevant

Anthropic Targets $900B Valuation in $50B Funding Round

Anthropic plans a $50B raise at a $900B valuation, with $45B in annualized revenue, up 5x from 2024. An IPO is under consideration.

May 8, 2026 · 100% relevant

Anthropic Doubles Claude Code Rate Limits, Leases All of SpaceX's Colossus 1

Anthropic doubled Claude Code's 5-hour rate limits and removed peak-hour throttling for Pro, Max, Team, and seat-based Enterprise plans. It then disclosed the source of the new capacity: a lease on the entire Colossus 1 data center in Memphis (300 MW and ~220,000 NVIDIA GPUs), which SpaceX absorbed when it took over xAI.

May 6, 2026 · 100% relevant

OpenAI's MRC Protocol Sprays Packets Across 100+ Paths to Fix GPU Stragglers

OpenAI open-sourced MRC, a networking protocol that sprays packets across hundreds of paths to reduce GPU idle time from congestion and failures, contributed to OCP.

May 6, 2026 · 88% relevant

NVIDIA Open-Sources MRC, the RDMA Protocol Powering OpenAI's Blackwell Clusters

NVIDIA open-sourced MRC, a multi-path RDMA protocol used by OpenAI on Blackwell clusters, enabling microsecond rerouting across 64 paths.

May 6, 2026 · 100% relevant

Astera Labs Scorpio X-Series Switch Targets 49% Collective IO Cut for Idle GPUs

Astera Labs introduced the Scorpio X-Series 320-lane switch, targeting a 49% collective IO reduction for fragmented AI workloads. Shipments to hyperscalers have begun, with a broad ramp in H2 2026.

May 5, 2026 · 92% relevant

Qualcomm Ships Hyperscaler Custom Silicon by December 2026

Qualcomm is developing custom silicon for an unnamed hyperscaler, with shipments expected December 2026, marking its most concrete data-center comeback move.

May 1, 2026 · 76% relevant

Google Inks Pentagon AI Deal, Reverses 2018 Stance

Google signed a deal allowing the Pentagon to use its AI models for classified work and 'any lawful government purpose,' reversing its 2018 exit from Project Maven. The contract includes non-binding language on surveillance and autonomous weapons, and requires Google to adjust AI safety filters at government request.

Apr 28, 2026 · 95% relevant

Google Splits TPU Line: 8t for Training, 8i for Inference

At Cloud Next 2026, Google introduced two new AI chips — TPU 8t for training and TPU 8i for inference — splitting its custom silicon for the first time. OpenAI, Anthropic, and Meta are buying multi-gigawatt TPU capacity, signaling a crack in NVIDIA's 81% market share.

Apr 27, 2026 · 100% relevant

Utah Hyperscale Data Center to Exceed State Power Use

A hyperscale data center in Box Elder County, Utah, developed by Kevin O'Leary's O'Leary Digital, is moving toward final approval. It is set to generate and consume more power than the state itself.

Apr 26, 2026 · 100% relevant

Google to Invest Up to $40 Billion in Anthropic

Google will invest up to $40 billion in Anthropic: $10B immediate, $30B tied to performance milestones, plus 5GW of TPU compute capacity by 2027. The deal mirrors Amazon's earlier $25B commitment and reinforces the circular compute-for-equity pattern dominating AI infrastructure spending.

Apr 24, 2026 · 100% relevant

Meta Deploys Millions of Amazon Graviton CPUs for AI Agents

Meta will deploy tens of millions of AWS Graviton5 CPU cores for AI agent workloads, signaling that agentic inference favors CPUs over GPUs. The deal deepens Meta's $200B+ infrastructure push amid layoffs and cloud rivalry.

Apr 24, 2026 · 96% relevant

Arista Doubles 2026 AI Revenue Target to $3B+ on Open Ethernet

Arista Networks doubled its 2026 AI networking revenue target to over $3 billion, citing expanded roles for open Ethernet in AI data centers. This signals a major shift toward disaggregated, standards-based networking for AI clusters.

Apr 22, 2026 · 100% relevant

Cisco Reveals Scale-Across GPU Networking Needs 14x DCI Bandwidth

Cisco's chief architect detailed the massive bandwidth requirements for connecting AI clusters via 'scale-across' GPU networking, which needs 14x the capacity of traditional data center interconnects. This shift is creating a multi-billion dollar market for 800G coherent pluggables and deep-buffered switches.

Apr 21, 2026 · 85% relevant

Foxconn to Mass-Produce 10,000+ CPO Optical Switches for AI in Q3 2026

Foxconn's manufacturing arm will begin volume production of advanced co-packaged optics (CPO) switches in Q3 2026, targeting over 10,000 units. This move directly addresses the critical bandwidth and power bottlenecks in next-generation AI data center infrastructure.

Apr 20, 2026 · 85% relevant

Nvidia's Silicon Photonics Roadmap Targets AI Data Center Bottlenecks

Nvidia is developing its own silicon photonics-based interconnects to address the growing data transfer bottleneck within AI data centers and supercomputers. This move is critical as AI model size and cluster scale continue to grow exponentially.

Apr 20, 2026 · 86% relevant

AMD Backs UALink Open Interconnect to Challenge NVIDIA NVLink in AI

AMD is supporting the newly formed UALink Consortium, which aims to create an open standard for connecting AI accelerators. This move challenges NVIDIA's control over the critical NVLink technology that underpins its AI data center systems.

Apr 17, 2026 · 84% relevant

IOWN Forum Pushes All-Photonic WAN for AI Neocloud Interconnects

The IOWN Global Forum is focusing its optical networking tech on datacenter interconnects, aiming to let GPU 'neoclouds' and financial firms use cheaper, remote facilities without latency penalties for AI workloads.

Apr 17, 2026 · 78% relevant

Anthropic's Claude Mythos Scores 83.1% on CyberGym, Restricted to 12 Partners

Anthropic announced Project Glasswing, deploying Claude Mythos Preview to autonomously discover critical software vulnerabilities. Scoring 83.1% on CyberGym, it's restricted to 12 launch partners due to dual-use risks, with a 90-day disclosure window.

Apr 12, 2026 · 86% relevant
