gentic.news — AI News Intelligence Platform

scaling laws

30 articles about scaling laws in AI news

Roman Yampolskiy: 'AGI is a Question of Cost, Not Time' as Scaling Laws Hold

AI safety researcher Roman Yampolskiy argues that achieving AGI is now a matter of computational and financial resources, not theoretical possibility, citing the continued validity of scaling laws and early signs of recursive self-improvement.

87% relevant

UniMixer: A Unified Architecture for Scaling Laws in Recommendation Systems

A new arXiv paper introduces UniMixer, a unified scaling architecture for recommender systems. It bridges attention-based, TokenMixer-based, and factorization-machine-based methods into a single theoretical framework, aiming to improve parameter efficiency and scaling return on investment (ROI).

96% relevant

Beyond the Benchmark: New Model Separates AI Hype from True Capability

A new 'structured capabilities model' addresses a critical flaw in AI evaluation: benchmarks often confuse model size with genuine skill. By combining scaling laws with latent factor analysis, it offers the first method to extract interpretable, generalizable capabilities from LLM test results.

72% relevant

Kimi Team's 'Attention Residuals' Replace Fixed Summation with Softmax Attention, Boost GPQA-Diamond by +7.5%

Researchers propose Attention Residuals, a content-dependent alternative to standard residual connections in Transformers. The method improves scaling laws, matches a baseline trained with 1.25x more compute, and adds under 2% inference overhead.

97% relevant

Huawei's τ Scaling Law Redefines Transistor Race Without EUV

Huawei's τ Scaling Law, presented at IEEE ISCAS, replaces geometric transistor scaling with time-based optimization, targeting 1.4nm density by 2031 without EUV and challenging US export controls.

99% relevant

Lloyds Banking Group Details 'Atlas' ML Platform for Scaling AI in a

A technical blog post details how Lloyds Banking Group rebuilt its internal Machine Learning platform, Atlas, on a cloud-native architecture to overcome scaling limits and meet stringent regulatory requirements. This is a blueprint for operationalizing AI in high-stakes, governed industries.

88% relevant

Scaling Law Plateau Not Universal: More Tokens Boost Reasoning AI Performance

Empirical evidence indicates the 'second scaling law' (performance gains from increased computation) does not fully plateau for many reasoning tasks. Benchmark results may be artificially limited by token budgets, not model capability.

85% relevant

Agent Harness Scaling: EFC Predicts Success at R² 0.99 vs 0.42

New research introduces Effective Feedback Compute (EFC), which predicts agent success with an R² of 0.99, versus 0.42 for raw tokens. Reallocating compute by EFC lifts success 3x at the same budget.

88% relevant

Robotics' Scaling Breakthrough: How SONIC's 42M-Parameter Model Achieves Perfect Real-World Transfer

Researchers have demonstrated that robotics can scale like language models, with SONIC training a 42M-parameter model on 100M human motion frames. The system achieved 100% success transferring to real robots without fine-tuning, marking a paradigm shift in robotic learning.

95% relevant

Ahold Delhaize USA Scales Personalization Across Banners

Ahold Delhaize USA is scaling AI-driven personalization across banners like Stop & Shop and Giant Food, using data and ML to tailor shopping experiences. This matters for retail as it demonstrates a major grocer's commitment to AI for customer loyalty and revenue growth.

78% relevant

Anthropic Warns Upcoming LLMs Could Cause 'Serious Damage'

Anthropic has issued a stark warning that its upcoming large language models could cause 'serious damage.' The company states there is 'no end in sight' to capability scaling and proliferation risks.

85% relevant

US Data Center Power Demand Hits 15 GW, Grid Constraints Emerge

US data center power demand reached 15 gigawatts in 2023, up from 11 GW in 2022. This rapid growth highlights a widening bottleneck: compute infrastructure is scaling faster than power delivery systems can support.

75% relevant

daVinci-LLM 3B Model Matches 7B Performance, Fully Open-Sourced

The daVinci-LLM team has open-sourced a 3 billion parameter model trained on 8 trillion tokens. Its performance matches typical 7B models, challenging the scaling law focus on parameter count.

95% relevant

Gamma 31B Model Reportedly Outperforms Qwen 3.5 397B, Highlighting Efficiency Leap

A developer's social media post claims the Gamma 31B model outperforms the much larger Qwen 3.5 397B. If verified, this would represent a dramatic efficiency gain in large language model scaling.

85% relevant

QuatRoPE: New Positional Embedding Enables Linear-Scale 3D Spatial Reasoning in LLMs, Outperforming Quadratic Methods

Researchers propose QuatRoPE, a novel positional embedding method that encodes 3D object relations with linear input scaling. Paired with IGRE, it improves spatial reasoning in LLMs while preserving their original language capabilities.

79% relevant

The Two-Year AI Leap: How Model Efficiency Is Accelerating Beyond Moore's Law

A viral comparison reveals AI models achieving dramatically better results with identical parameter counts in just two years, suggesting efficiency improvements are outpacing hardware scaling. This development challenges assumptions about AI progress and has significant implications for deployment costs and capabilities.

85% relevant

Anthropic's RSP v3.0: From Hard Commitments to Adaptive Governance in AI Safety

Anthropic has released Responsible Scaling Policy 3.0, shifting from rigid safety commitments to a more flexible, adaptive framework. The update introduces risk reports, external review mechanisms, and unwinds previous requirements the company says were distorting safety efforts.

80% relevant

The Billion-Dollar Blind Spot: Why AI's Evaluation Crisis Threatens Progress

AI researcher Ethan Mollick highlights a critical imbalance: while billions fund model training, only thousands support independent benchmarking. This evaluation gap risks creating powerful but poorly understood AI systems with potentially dangerous flaws.

85% relevant

OpenAI Launches GPT-5.6 Sol Under US Government Restrictions

OpenAI's GPT-5.6 Sol beats Claude Mythos 5 in agentic coding (88.8% vs 88%), but the US government restricts access to select partners, a policy OpenAI calls unsustainable.

100% relevant

Jim Keller: Tenstorrent IPO Looms as BlackHole Chip Scales

Jim Keller confirmed Tenstorrent's IPO plans as its BlackHole chip scales for AI inference, competing with Nvidia. No revenue figures were disclosed.

98% relevant

Colossus 2: xAI's Memphis Cluster Hits 300,000 GPUs

xAI's Colossus 2 hits 300,000 GPUs, targeting 1M by year-end. Training Grok-3, the $6B cluster challenges OpenAI and Google.

98% relevant

Why Traditional Retail Metrics Break Down in Agentic Commerce

Valtech's 2026 research shows 96% of retailers face integration barriers, 48% are stuck in AI pilot purgatory, and nearly 75% can't link AI spend to metrics, as agentic commerce fragments customer journeys beyond traditional measurement frameworks.

100% relevant

11-Agent Company Earned $0: CLAUDE.md Mistakes Cost Revenue

An 11-agent company experiment earned $0 after 896 tasks. The operator open-sourced a CLAUDE.md template with 72 lessons on coordination failures and legal constraints.

98% relevant

NHN Deploys 7,656-GPU AI Cluster in Seoul

NHN launched a 7,656-GPU cluster in Seoul, South Korea, for domestic enterprise AI workloads. The cluster targets inference and training, competing with Naver and Kakao.

90% relevant

Billionaire Sues Tiny Michigan Township to Force OpenAI Data Center Through

Billionaire Steven Roth's Related Digital sued Saline Township, Michigan, after it rejected a 21M sq ft OpenAI data center, seeking to force approval through an 'exclusionary zoning' claim.

76% relevant

Recursive Multi-Agent Systems Top Hugging Papers; Eywa Bridges LLMs and Scientific Models

Recursive Multi-Agent Systems leads Hugging Papers with 242 upvotes. Eywa and OneManCompany signal a move from chat-based to structural agent collaboration.

89% relevant

GPT-5.5 Pro Leapfrogs on Epoch Benchmark; Base Model Beats Prior Pro

A tweet from @kimmonismus reveals GPT-5.5 Pro shows significant Epoch benchmark gains, and the non-Pro GPT-5.5 surpasses GPT-5.4 Pro, suggesting major efficiency improvements at OpenAI.

99% relevant

Talkie: Vintage LLM Trained on 260B Pre-1931 English Tokens

Talkie is a new 'vintage language model' trained on 260 billion tokens of historical English text from before 1931, developed by a team including Alec Radford, co-author of the original GPT paper. It offers a unique linguistic artifact for NLP research.

85% relevant

Google Hits 75% AI-Generated Code, Up From 50% in Fall 2025

Google reports 75% of all new code is now AI-generated and engineer-approved, a sharp increase from 50% last fall. This indicates a massive, accelerating shift in software development practices at the tech giant.

85% relevant

Anthropic Hiring Data Center Leasing Principals in Europe & Australia

Anthropic is actively hiring for data center leasing roles in Europe and Australia, revealing a strategic push to build out its own compute infrastructure as it scales its AI models.

100% relevant