amd

30 articles about AMD in AI news

AMD Gives OSS Maintainers $3.6M MI355X Cluster Access

AMD gives vLLM/SGLang maintainers $3.6M MI355X cluster access, ending NVIDIA's monopoly on OSS inference hardware access.

May 13, 2026 · 75% relevant

AMD ROCm Performance Jumps 75x in 14 Days Post-DeepSeek v4

AMD's ROCm stack improved 75x in the 14 days after DeepSeek v4, driven by fused operations. It still needs another 5x to match B200 performance.

May 10, 2026 · 100% relevant

AMD Launches PCIe GPU for AI Workloads, Targets Existing Server Install Base

AMD launched a PCIe-based GPU for AI workloads, targeting existing servers. The card provides an immediate boost without new data center buildouts.

May 8, 2026 · 90% relevant

AMD MI350P PCIe Card Claims 40% FP8 Lead Over Nvidia H200 NVL

AMD launched the MI350P PCIe AI card with 144GB of HBM3E, claiming a 39% FP8 lead over Nvidia's H200 NVL. It targets drop-in, air-cooled server upgrades.

May 7, 2026 · 98% relevant

AMD Backs UALink Open Interconnect to Challenge NVIDIA NVLink in AI

AMD is supporting the newly formed UALink Consortium, which aims to create an open standard for connecting AI accelerators. This move challenges NVIDIA's control over the critical NVLink technology that underpins its AI data center systems.

Apr 17, 2026 · 84% relevant

AMD AI Director Reports Claude Code Quality Decline, Cites 234k Tool Calls

An AMD AI executive presented data from over 6,800 sessions showing Claude Code's performance has declined since early March, with rising instances of shallow reasoning and incomplete tasks. This raises significant trust issues for engineers using the model in complex development workflows.

Apr 11, 2026 · 89% relevant

MLPerf 6.0: NVIDIA Sweeps New Benchmarks, AMD MI355X Within 30% on Select Tests

MLPerf 6.0 results show NVIDIA winning every new benchmark, with its GB300 NVL72 system achieving nearly 3x more throughput than six months ago. AMD's MI355X showed progress, coming within 10-30% on select single-node tests but skipping most new benchmarks.

Apr 7, 2026 · 85% relevant

Developer Ranks NPU Model Compilation Ease: Apple 1st, AMD Last

Developer @mweinbach ranked the ease of using AI coding agents to compile ML models for NPUs. Apple's ecosystem was rated easiest, while AMD's tooling was ranked most difficult.

Apr 5, 2026 · 75% relevant

Meta's $100B AMD Gamble: The AI Chip War Enters Its Most Strategic Phase

Meta has secured a landmark deal to purchase up to $100 billion worth of AMD AI chips, receiving a massive stock warrant in return. This unprecedented agreement signals Meta's aggressive push to diversify its AI infrastructure beyond Nvidia while pursuing ambitious 'personal superintelligence' goals.

Feb 24, 2026 · 90% relevant

Meta's $100 Billion AMD Bet: The AI Infrastructure Arms Race Reaches New Heights

Meta has reportedly signed a staggering $100 billion agreement with AMD to secure 6GW of data center capacity, signaling an unprecedented commitment to AI infrastructure. The timing—just before NVIDIA's quarterly results—highlights intensifying competition for computing resources essential for next-generation AI models.

Feb 24, 2026 · 95% relevant

NVIDIA's DreamDojo: Teaching Robots to 'Dream' in Pixels with 44,000 Hours of Human Experience

NVIDIA has open-sourced DreamDojo, a revolutionary robot world model trained on 44,711 hours of real-world human video. Instead of relying on physics engines, it predicts action outcomes directly in pixel space, potentially accelerating robotics development by orders of magnitude.

Feb 20, 2026 · 85% relevant

NVIDIA Vera CPU Benchmarks: 1.55x Faster Than Intel Xeon in Phoronix Tests

NVIDIA Vera CPU benchmarks show 1.55x performance over Intel Xeon 6980P and 10% over AMD EPYC 9575F, with 1.2 TB/s memory bandwidth.

May 27, 2026 · 100% relevant

Jensen Huang's 30-Year TSMC Battle: From 3D Graphics to AI GPUs

A 30-year-old comic shows Jensen Huang convincing TSMC to supply wafers for 3D graphics chips. Today, he's still fighting for wafer supply, but now for AI GPUs, alongside Broadcom, AMD, MediaTek, and Amazon.

Apr 23, 2026 · 75% relevant

UALink 2.0 Spec Finalized, Aims to Challenge NVLink for AI Clusters

The UALink 2.0 interconnect specification has been finalized, providing a standardized way to link AI accelerators from AMD, Intel, and others. However, it lags behind NVIDIA's established NVLink technology in real-world deployment.

Apr 20, 2026 · 96% relevant

DeepSeek V4 Launch Signals China's Strategic Shift in AI Chip Independence

DeepSeek's upcoming V4 multimodal model prioritizes domestic chip partners Huawei and Cambricon over NVIDIA and AMD, marking a significant move toward Chinese AI self-sufficiency amid ongoing U.S. export restrictions.

Feb 28, 2026 · 78% relevant

Nvidia N1X Arm Laptop Chip Nears Reveal at Computex

Nvidia, Microsoft, Arm tease N1X Arm laptop chip debut at Computex. Nvidia enters Windows-on-Arm without owning the architecture it tried to buy.

May 30, 2026 · 85% relevant

Blackwell NVLink Breaks Confidential Compute, 61% Regression Reported

NVIDIA Blackwell confidential computing disables NVLink multicast, causing 61% regression on SGLang Qwen3.5 397B. Hopper had unencrypted NVLink, compounding the issue.

May 30, 2026 · 99% relevant

Cerebras CS4 Stays on 5nm as SRAM Scaling Flattens

Cerebras CS4 stays on 5nm due to SRAM scaling flattening, per @SemiAnalysis_. 3nm offers no density gain, so the chip prioritizes yield and cost.

May 27, 2026 · 85% relevant

Karpathy: Neural nets will become the host, CPUs the co-processor

Karpathy predicts neural networks will become the host OS, with CPUs as co-processors, rendering most classical app interfaces obsolete.

May 23, 2026 · 85% relevant

NVIDIA Vera Rubin NVL72 Cuts Agentic AI Cost 10x vs Blackwell

NVIDIA Vera Rubin NVL72 cuts agentic AI inference cost 10x vs Blackwell, per Huang at Dell event. 5,000 enterprises already on Dell factories.

May 18, 2026 · 95% relevant

vLLM Optimizations Cut Voice AI Latency by 40% on 6-GPU Cluster

vLLM optimizations on a 6-GPU cluster reduced voice AI latency by 40% for a Qwen-based system, enabling 500 concurrent sessions per node without hardware upgrades.

May 16, 2026 · 82% relevant

Anthropic Leases xAI's Colossus 1 After Mixed-Architecture Flaw Blocked Grok Training

Anthropic leased xAI's 220K-GPU Colossus 1 after the cluster's mixed architecture failed to train Grok. Musk is building a Blackwell-only Colossus 2 for training and IPO.

May 15, 2026 · 100% relevant

Qwen 3.6 27B Hits 34 tok/s on M5 Max MacBook Pro

Qwen 3.6 27B hits 34 tok/s on M5 Max MacBook Pro with 90% acceptance rate, per @rohanpaul_ai. Shows viable local LLM inference on Apple Silicon.

May 14, 2026 · 75% relevant

US Clears Nvidia H200 Sales to 10 China Firms, Reversing Ban

US cleared Nvidia H200 sales to 10 China firms on May 14, 2026, reversing a ban that had reduced Nvidia's China share to zero.

May 14, 2026 · 100% relevant

Cerebras Shares Open at $385, 108% Above $185 IPO Price

Cerebras opened at $385, 108% above the $185 IPO price, raising $5.5B. The $68B market cap prices in aggressive growth against Nvidia's dominance.

May 14, 2026 · 100% relevant

Cerebras' Tokenomics Bet: AWS, OpenAI Deals and Wafer-Scale Edge

Cerebras' tokenomics pricing and AWS/OpenAI partnerships challenge NVIDIA's inference dominance, offering a 5x cost reduction per token via its wafer-scale architecture.

May 13, 2026 · 89% relevant

Hermes Agent Hits 140K GitHub Stars, Nvidia RTX as Local Inference Bedrock

Hermes Agent hit 140K GitHub stars, most-used on OpenRouter. Runs locally on Nvidia RTX with self-evolving skills and Qwen 3.6 models that beat prior 120B-parameter models.

May 13, 2026 · 100% relevant

Nvidia Blackwell CLC Boosts GEMM Tile Scheduling by 15% Over Static Persistence

Nvidia Blackwell CLC delivers up to 15% higher GEMM throughput via dynamic persistent tile scheduling, fixing load imbalance without startup overhead.

May 11, 2026 · 95% relevant

Cerebras Understates On-Chip SRAM by 8x, SemiAnalysis Notes

Cerebras understates on-chip SRAM by 8x per SemiAnalysis, a rare under-specification in chip marketing.

May 7, 2026 · 75% relevant

Nvidia Ships AI Factory Blueprints: 4-Node to 128-Node Cluster Specs

Nvidia published three validated AI data center blueprints — RTX PRO, HGX, NVL72 — spanning 4-node to 128-node clusters, targeting agentic AI and trillion-parameter models.

May 7, 2026 · 80% relevant
