foundation models
30 articles about foundation models in AI news
ModelBest Hits $1B+ Valuation for On-Device Foundation Models
ModelBest, a Chinese developer of on-device AI foundation models, raised several hundred million RMB, reaching a valuation exceeding $1 billion. The funding will accelerate its push to deploy efficient models directly on smartphones and IoT devices.
VAST's $50M Funding Signals 3D AI Revolution: From Foundation Models to World Simulation
AI startup VAST has secured $50 million in Series A funding while advancing its 3D foundation models that are setting new industry standards. The company is preparing to launch its first world model, positioning itself at the forefront of spatial AI development.
Beyond General AI: How Liquid Foundation Models Are Revolutionizing Drug Discovery
Researchers have developed MMAI Gym, a specialized training platform that teaches AI the 'language of molecules' to create more efficient drug discovery models. The resulting Liquid Foundation Models outperform larger general-purpose AI while requiring fewer computational resources.
Apple Siri Rebuilt as System-Wide AI Agent in iOS 27, Powered by Apple Foundation Models and Google Gemini
Apple is rebuilding Siri into a conversational system-wide AI agent with deep app integration and personal data access, launching in iOS 27. The overhaul includes a standalone app, web browsing, and writing tools, powered by Apple's models and a Google Gemini partnership.
PRAGMA: Revolut's Foundation Model for Banking Event Sequences
A new research paper introduces PRAGMA, a family of foundation models designed specifically for multi-source banking event sequences. The model uses masked modeling on a large corpus of financial records to create general-purpose embeddings that achieve strong performance on downstream tasks like fraud detection with minimal fine-tuning.
Zuckerberg: Most Businesses Will Run Custom AI Layers, Not Frontier Models
Mark Zuckerberg predicts most businesses will not own frontier AI models but will build customized operational layers on top of shared models to handle support, sales, and operations. This vision positions foundation models as infrastructure, with value captured in the business-specific layer.
Microsoft World-R1: RL Aligns Text-to-Video with 3D Physics
Microsoft's World-R1 framework applies reinforcement learning with feedback from pre-trained 3D foundation models to align text-to-video outputs with physical 3D constraints, improving structural coherence without modifying the underlying video diffusion architecture.
Paper Details Full-Stack MFM Acceleration: Quant, Spec Decode, HW Co-Design
A research paper details a full-stack approach for accelerating multimodal foundation models, combining hierarchy-aware mixed-precision quantization, structural pruning, speculative decoding, model cascading, and a specialized hardware accelerator. Demonstrated on medical and code generation tasks.
Why the Best Generative AI Projects Start With the Most Powerful Model —
The article suggests that while initial AI projects leverage the broad capabilities of large foundation models, the most successful implementations eventually transition to smaller, more targeted systems. This reflects a maturation from experimentation to production optimization.
NVIDIA Advances AI Robotics with Simulation-First Training, Isaac & Jetson
NVIDIA showcased AI robotics advances using foundation models and synthetic environments for training, enabling scalable deployment in real-world sectors like agriculture and solar. Key platforms are the Isaac simulator and Jetson edge AI hardware.
AlphaEarth Embeddings Outperform Prithvi, Clay in Urban Signal Benchmark
Researchers benchmarked three geospatial foundation models—AlphaEarth, Prithvi, and Clay—on predicting 14 neighborhood-level urban indicators from satellite imagery. AlphaEarth's compact 64-dimensional embeddings proved most informative, achieving the highest predictive skill for built-environment-linked outcomes like chronic health burdens.
Mercor Data Breach Exposes Expert Human Annotation Pipeline Used by Frontier AI Labs
Hackers have reportedly accessed Mercor's expert human data collection systems, which are used by leading AI labs to build foundation models. This breach could expose proprietary training methodologies and sensitive model development data.
Geometric Latent Diffusion (GLD) Achieves SOTA Novel View Synthesis, Trains 4.4× Faster Than VAE
GLD repurposes features from geometric foundation models like Depth Anything 3 as a latent space for multi-view diffusion. It trains significantly faster than VAE-based approaches and achieves state-of-the-art novel view synthesis without text-to-image pretraining.
CausalTimePrior: The Missing Link for AI That Understands Time and Cause
Researchers have introduced CausalTimePrior, a new framework to generate synthetic time series data with known interventions. This breakthrough addresses a critical gap in training AI models to understand causality over time, paving the way for foundation models in time series analysis.
NeuroSkill: MIT's Breakthrough AI Agent Reads Your Mind Before You Ask
MIT researchers have developed NeuroSkill, a revolutionary AI system that integrates brain-computer interfaces with foundation models to create proactive agents that respond to implicit human cognitive and emotional states, running fully offline on edge devices.
Bridging Data Worlds: How MultiModalPFN Unifies Tabular, Image, and Text Analysis
Researchers have developed MultiModalPFN, an AI framework that extends TabPFN to handle tabular data alongside images and text. This breakthrough addresses a critical limitation in foundation models for structured data, enabling more comprehensive analysis in healthcare, marketing, and other domains where multiple data types coexist.
Time-Series AI Learns to Adapt on the Fly: New Framework Eliminates Fine-Tuning for Unseen Tasks
Researchers have developed ICTP, a framework that equips time-series foundation models with in-context learning capabilities, allowing them to adapt to completely new tasks without fine-tuning. This breakthrough improves performance on unseen tasks by 11.4% and represents a significant step toward more flexible, efficient AI systems for real-world time-series applications.
Hitachi's Industrial Gambit: Why Domain Expertise May Be the Missing Link in Physical AI
While tech giants focus on foundation models, Hitachi is betting its industrial expertise and operational data will win the physical AI race. The company's partnerships with Daikin and JR East demonstrate how domain knowledge bridges the gap between digital intelligence and real-world machinery.
Fine-Tuning vs RAG: A Foundational Comparison for AI Strategy
The source provides a foundational comparison of fine-tuning and Retrieval-Augmented Generation (RAG) for enhancing AI models. It uses the analogy of teaching during training versus providing a book during an exam, clarifying their distinct roles in AI application development.
CanViT: First Active-Vision Foundation Model Hits 45.9% mIoU on ADE20K with Sequential Glimpses
Researchers introduce CanViT, the first task- and policy-agnostic Active-Vision Foundation Model (AVFM). It achieves 38.5% mIoU on ADE20K segmentation with a single low-resolution glimpse, outperforming prior active models while using 19.5x fewer FLOPs.
AGIBOT Launches GE-Sim 2.0: A Foundation Model for Robot Simulation
AGIBOT has launched GE-Sim 2.0, a foundation model for robot simulation. It allows AI agents to generate and reason within photorealistic simulated environments for planning and training.
Google's TimesFM: 200M-Param Foundation Model for Zero-Shot Time Series
Google released TimesFM, a 200M-parameter foundation model for time series forecasting that works without training on user data. It's now available open-source and as a product inside BigQuery.
Hugging Face Transfers Safetensors to PyTorch Foundation
Hugging Face is transferring ownership of the Safetensors library to the PyTorch Foundation, shepherded by the Linux Foundation. The move aims to establish it as a neutral, community-driven standard for safe AI model serialization.
VMLOPS's 'Basics' Repository Hits 98k Stars as AI Engineers Seek Foundational Systems Knowledge
A viral GitHub repository aggregating foundational resources for distributed systems, latency, and security has reached 98,000 stars. It addresses a widespread gap in formal AI and ML engineering education, where critical production skills are often learned reactively during outages.
Google Open-Sources TimesFM: A 100B-Point Time Series Foundation Model for Zero-Shot Forecasting
Google has open-sourced TimesFM, a foundation model for time series forecasting trained on 100 billion real-world time points. It requires no dataset-specific training and can generate predictions instantly for domains like traffic, weather, and demand.
Seed1.8 Model Card Released: A 1.8B Parameter Foundation Model for Generalized Real-World AI Agents
Researchers have introduced Seed1.8, a 1.8 billion parameter foundation model designed for generalized real-world agency. It maintains strong LLM and vision-language capabilities while adding unified interfaces for search, code execution, and GUI interaction.
Anthropic Donates to Linux Foundation, Citing Critical Need for Open Source AI Security
Anthropic announced a donation to the Linux Foundation to support securing open source software, which it calls the foundation AI runs on. The move highlights growing industry focus on securing the software supply chain for AI systems.
Google's TimesFM Foundation Model: A New Paradigm for Time Series Forecasting
Google Research has open-sourced TimesFM, a 200 million parameter foundation model for time series forecasting. Trained on 100 billion real-world time points, it demonstrates remarkable zero-shot forecasting capabilities across diverse domains without task-specific training.
The Fragile Foundation: How AI Lab Failures Could Trigger a $1.5 Trillion Infrastructure Collapse
A Reuters analysis reveals that the failure of major AI labs like OpenAI or Anthropic could trigger a catastrophic chain reaction, jeopardizing the $650 billion data center boom and $900 billion in financial investments that depend on their insatiable demand for computing power.
Google Launches Gemini Embedding 2: A New Multimodal Foundation for AI
Google has launched Gemini Embedding 2, a second-generation multimodal embedding model. This technical release, alongside the removal of API rate limits, provides developers with a more powerful and accessible tool for building AI applications that understand text, images, and other data types.