hpc
23 articles about hpc in AI news
Bull Delivers HPC Infrastructure to Power Mimer AI Factory
Bull, a subsidiary of Atos, has supplied the core HPC infrastructure for Mimer's new AI factory. This facility is dedicated to training and developing large language models for the European market.
FRAGATA: A Hybrid RAG System for Semantic Search Over 20 Years of HPC
A new paper details FRAGATA, a system enabling semantic search over two decades of technical support tickets at a supercomputing center. It uses hybrid retrieval-augmented generation (RAG) to find relevant past incidents despite typos, language, or wording differences, showing a qualitative improvement over the legacy search.
Trillion Labs Builds Industrial World Models on NVIDIA Omnibus
Trillion Labs announced Industrial World Models for AI Factories using NVIDIA Omniverse and Nemotron to optimize data centers and power plants.
JPMorgan, OQC, AMD Build First Quantum AI Data Center for Finance
JPMorgan, OQC, and AMD are building a dedicated quantum AI data center for financial workflows, moving from remote-access demos to enterprise-grade infrastructure. No budget or timeline disclosed.
Liquid Cooling Hits 15kW: CoolIT Coldplate Quadruples Capacity for AI
CoolIT demoed a 15kW single-phase coldplate, quadrupling capacity, while Vertiv, Accelsius, and LiquidStack launched products targeting scalable AI cooling deployment.
Liquid Cooling Crosses 50% by 2027? Rack Densities Force Shift
AI-driven rack densities are pushing liquid cooling adoption past 50% in new hyperscale builds by 2027, though cost and expertise remain barriers.
Ayar Labs Joins NVIDIA NVLink Fusion Ecosystem for Co-Packaged Optics
Ayar Labs joined NVIDIA's NVLink Fusion ecosystem to bring co-packaged optics to AI factories, following its $500M Series E and alongside Lightmatter's similar move.
Google and Blackstone Launch TPU Venture, Challenging Nvidia Dominance
Google and Blackstone launched a TPU venture, financing AI infrastructure outside the hyperscale cloud model. Enterprise buyers get a standalone alternative to Nvidia-dominated GPU clusters.
Cerebras IPO Challenges GPU Scaling Orthodoxy
Cerebras filed for IPO on April 21, betting wafer-scale chips can disrupt Nvidia's GPU cluster model for AI workloads.
Nebius Breaks Ground on 1GW Missouri AI Campus Despite Local Opposition
Nebius broke ground on a 1GW AI data center campus in Missouri despite local opposition. The project is the company's first US gigawatt-scale facility.
AMD Launches PCIe GPU for AI Workloads, Targets Existing Server Install Base
AMD launched a PCIe-based GPU for AI workloads, targeting existing servers. The card provides immediate boost without new data center buildouts.
NVIDIA, DOE Build 100K-GPU Supercomputer for Science
DOE and NVIDIA announced Solstice, a 100K-GPU Vera Rubin supercomputer delivering 5,000 exaflops, and Equinox with 10K Blackwell GPUs.
NVIDIA Vera Rubin VR NVL72: Value Extraction Engine Arrives
NVIDIA's Vera Rubin VR NVL72 shifts from value vendor to value extractor, targeting TCO. SemiAnalysis argues this overturns prior pricing paradigm.
How a Custom Multimodal Transformer Beat a Fine-Tuned LLM for Attribute
LeBonCoin's ML team built a custom late-fusion transformer that uses pre-computed visual embeddings and character n-gram text vectors to predict ad attributes. It outperformed a fine-tuned VLM while running on CPU with sub-200ms latency, offering calibrated probabilities and 15-minute retraining cycles.
Intel's UCIe-S Hits 48 Gb/s on 22nm, Beats 3nm EMIB
Intel demonstrated a UCIe-S die-to-die interconnect on 22nm hitting 48 Gb/s/lane over standard organic substrate, beating a 3nm EMIB design with 3× higher data rate and 2.8× higher bandwidth density. This signals a strategic shift away from EMIB for Intel's own products toward UCIe over substrate.
Applied Digital Lands 300MW Lease with Hyperscaler at Louisiana Site
Applied Digital secured a 300MW lease with an investment-grade hyperscaler at its Delta Forge 1 site in Louisiana, with a total reported value of $7.5 billion, signaling continued demand for AI data center capacity.
Microsoft's Fairwater AI Data Center Launches Early, Boosts Azure Capacity
Microsoft has launched its Fairwater AI data center ahead of schedule. The facility adds significant high-performance computing capacity to Azure's AI infrastructure, crucial for training and running large models.
DOE Seeks Input on AI Infrastructure for Federal Lands
The U.S. Department of Energy has published a Request for Information (RFI) to solicit input on developing AI and high-performance computing infrastructure on DOE-owned lands. This marks a significant step in the federal government's strategy to directly address the national AI compute shortage.
Dual-Enhancement Product Bundling
Researchers propose a dual-enhancement method for product bundling that integrates interactive graph learning with LLM-based semantic understanding. Their graph-to-text paradigm with Dynamic Concept Binding Mechanism addresses cold-start problems and graph comprehension limitations, showing significant performance gains on benchmarks.
Elice Group Expands AI Infrastructure with Modular Data Centers, Plans IPO
Elice Group, a Korean AI and EdTech company, is accelerating its AI infrastructure expansion using modular data centers and preparing for an initial public offering in 2026 to fuel growth.
Alibaba's XuanTie C950 CPU Hits 70+ SPECint2006, Claims RISC-V Record with Native LLM Support
Alibaba's DAMO Academy launched the XuanTie C950, a RISC-V CPU scoring over 70 on SPECint2006—the highest single-core performance for the architecture—with native support for billion-parameter LLMs like Qwen3 and DeepSeek V3.
Silicon Photonics Breakthrough Enters Mass Production, Paving Way for Next-Generation AI Infrastructure
STMicroelectronics has begun mass production of its PIC100 silicon photonics platform, enabling 800G and 1.6T data rates critical for AI data centers. This breakthrough technology replaces copper with light for faster, more efficient data transmission between AI accelerators.
Amazon's $11 Billion AI Power Play: Inside the Indiana Data Center That's Reshaping Tech Infrastructure
Amazon is building an $11 billion AI data center campus in Indiana that will draw 2.2 gigawatts of power—enough for 1.7 million homes. This massive investment highlights the escalating infrastructure demands of artificial intelligence and the growing geographic shift in tech's physical footprint.