data acquisition

30 articles about data acquisition in AI news

Sam Altman Hints at OpenAI Acquisition Targeting 'Mixture' of Product Company and Research Lab

In an interview, OpenAI CEO Sam Altman indicated the company is considering an acquisition that looks like 'a mixture' of both a product company and a research lab. This suggests a strategic move to acquire teams that can both advance AI capabilities and rapidly productize them.

93% relevant

OpenClaw Creator Peter Steinberger Declined OpenAI Acquisition Offer, Citing Vision Alignment

Peter Steinberger, creator of the ClawdBot/OpenClaw robotics project, revealed on the Lex Fridman Podcast that he declined an acquisition offer from OpenAI. He cited a misalignment in vision for the project's future as the primary reason.

85% relevant

China Bars Manus Founders from Leaving Country Amid Meta Acquisition Scrutiny

Chinese authorities have restricted the founders of AI startup Manus from leaving China as they scrutinize Meta's acquisition. The probe focuses on whether the company restructured overseas to sidestep technology transfer and national security rules.

97% relevant

Meta's Strategic Acquisition of Moltbook Signals Major Shift Toward Autonomous AI Agents

Meta has acquired startup Moltbook to accelerate development of autonomous AI agents that could act online for users and businesses. The founders will join Meta's Superintelligence Labs, aiming to build platforms where millions of AI assistants interact across Facebook, WhatsApp, and Instagram.

95% relevant

Anthropic's Strategic Acquisition: How Vercept Will Transform Claude Into a True Digital Assistant

Anthropic has acquired AI startup Vercept to enhance Claude's ability to interpret and interact with computer screens. This move positions Claude to become a more capable AI agent that can perform complex digital tasks autonomously.

75% relevant

Anthropic's Strategic Acquisition of Vercept Signals Major Shift Toward Autonomous AI Agents

Anthropic has acquired Seattle-based AI startup Vercept, known for its computer-use agent Vy that can operate a full desktop environment. The move accelerates Anthropic's push beyond conversational AI toward autonomous task completion, following Meta's recent poaching of a Vercept founder.

70% relevant

Nebius Makes $275M Bet on AI Agent Search with Tavily Acquisition

European cloud provider Nebius acquires AI search startup Tavily for $275 million, integrating agentic search capabilities into its AI cloud platform to challenge major players in the competitive AI infrastructure market.

70% relevant

OpenAI Acquires Developer Tooling Startup Astral, Maker of Ruff and uv

OpenAI has acquired developer tooling startup Astral, known for creating the high-speed Python linter Ruff and package manager uv. The acquisition is positioned as a boost for OpenAI's Codex team, with plans to continue supporting Astral's open-source projects.

97% relevant

The Great GPU Scramble: How Hardware Shortages Are Defining the AI Arms Race

Oracle founder Larry Ellison identifies GPU acquisition as the primary bottleneck in AI development, with companies racing to secure limited hardware for breakthroughs in medicine, video generation, and autonomous systems.

85% relevant

The AI Scare Trade: How Market Fears Are Fueling an Unprecedented M&A Frenzy

A wave of AI-driven disruption is creating an 'AI scare trade' in capital markets, sparking fierce competition between traditional firms and AI startups. This has triggered a surge in mergers and acquisitions as companies race to adapt or acquire the technology reshaping entire industries.

75% relevant

Manus Agents Brings Sophisticated AI Reasoning to Everyday Chat Apps

Manus has launched Manus Agents, bringing multi-step reasoning, task execution, and tool integrations directly into chat applications starting with Telegram. This development makes advanced AI agents accessible to casual users and sheds light on Meta's recent acquisition of the company.

85% relevant

Google, CoreWeave Sell Record $5.7B in Junk Bonds for AI Data Centers

Google and its partner CoreWeave sold a record $5.7 billion in high-yield bonds to fund AI data center expansion. The deal was oversubscribed, showing strong investor appetite for AI infrastructure debt.

88% relevant

Figure CEO: Data Scarcity is the 'Only Thing' Holding Back General Robots

Figure CEO Brett Adcock asserts that solving general robotics is contingent on acquiring a 'pile of data' for training, highlighting the extreme cost and difficulty of collecting real-world robotic interaction data.

85% relevant

New arXiv Study Finds No Saturation Point for Data in Traditional Recommender Systems

A new arXiv preprint systematically tests how recommendation model performance scales with training data size. Using 10 algorithm variants across 11 large datasets, the research finds that normalized performance (NDCG@10) generally keeps improving up to 100 million interactions, with no clear saturation point for typical models.

90% relevant

Analysis: Meta's AI Investment Strategy Questioned as Scale AI Acquihire and Data Center Spend Top $700B

An analysis estimates Meta's total AI investment at ~$700B, including a ~$14.3M Scale AI acquihire and over $600B in data centers. The post questions why this has not yielded a competitive upcoming model against Chinese open-source labs.

85% relevant

FedAgain: Dual-Trust Federated Learning Boosts Kidney Stone ID Accuracy to 94.7% on MyStone Dataset

Researchers propose FedAgain, a trust-based federated learning framework that dynamically weights client contributions using benchmark reliability and model divergence. It achieves 94.7% accuracy on kidney stone identification while maintaining robustness against corrupted data from multiple hospitals.

79% relevant

Survey Benchmarks Four Approaches to Synthetic Brain Signal Generation for BCI Data Scarcity

A comprehensive survey categorizes and benchmarks four methodological approaches to generating synthetic brain signals for BCIs, addressing data scarcity and privacy constraints. The authors provide an open-source codebase for comparing knowledge-based, feature-based, model-based, and translation-based generative algorithms.

84% relevant

From Garbage to Gold: A Theoretical Framework for Robust Tabular ML in Enterprise Data

New research challenges the 'Garbage In, Garbage Out' paradigm, proving that high-dimensional, error-prone tabular data can yield robust predictions through proper data architecture. This has profound implications for enterprise AI deployment.

74% relevant

How to Use Claude Code for Personal Data Analysis: A 14-Year Journal Case Study

A developer processed 5,000 journal files with Claude Code to gain self-development insights. Here's how you can apply this technique to your own data.

95% relevant

AI Agents Gain Financial Autonomy: New Tool Enables AI to Purchase Premium Data

A groundbreaking development allows AI agents to autonomously pay for high-quality data through premium APIs. The system self-determines budget allocation with zero manual setup, currently operational across multiple AI platforms.

85% relevant

Vision AI Trends 2026: Manufacturing, Warehouse Automation, and Luxury Authentication Enter Visual Data Era

A 2026 trends report highlights Vision AI's expansion into manufacturing quality inspection, warehouse automation, and luxury brand authentication, marking a shift toward 3D visual data systems. This reflects the maturation of computer vision beyond basic recognition into operational and trust applications.

95% relevant

Tsinghua Breakthrough: LLMs with Search Freedom Outperform Expensive Fine-Tuning for Temporal Data

Tsinghua University researchers demonstrate that giving standard LLMs autonomous search capabilities for temporal data achieves 88.7% accuracy, surpassing specialized fine-tuned models by 10.7%. This challenges costly training approaches for time-sensitive tasks.

95% relevant

The Silent Data Harvest: Stanford Exposes How AI Giants Use Your Private Conversations

Stanford researchers reveal that all major AI companies—OpenAI, Google, Meta, Anthropic, Microsoft, and Amazon—train their models on user chat data by default, with minimal transparency, unclear opt-out mechanisms, and concerning practices around data retention and child privacy.

95% relevant

Google's TimesFM: 200M-Param Foundation Model for Zero-Shot Time Series

Google released TimesFM, a 200M-parameter foundation model for time series forecasting that works without training on user data. It's now available open-source and as a product inside BigQuery.

97% relevant

OpenCAD Browser Tool Enables Local, Private Text-to-CAD Conversion Without Cloud API

A developer has released an open-source text-to-CAD tool that runs entirely in a user's browser, enabling private, local 3D model generation from natural language descriptions. This approach bypasses cloud API costs and data privacy issues inherent in most current AI CAD solutions.

89% relevant

Italy Apparel Market Report Highlights Luxury Demand and Fast Fashion Shift

A market report on Italy's apparel sector details sustained luxury demand, a consumer shift towards fast fashion, and the overall growth outlook. This provides direct, data-driven context for brands operating in or targeting the Italian market.

72% relevant

Tsinghua & Peking University Researchers Train Humanoid Robot to Play Tennis Using Scattered, Imperfect Human Motion Clips

A team from Tsinghua, Peking University, and other labs taught a humanoid robot to play tennis using short, imperfect human swing clips instead of perfect match data. The system uses a physics simulator to correct errors, lowering the barrier for teaching robots complex physical tasks.

97% relevant

The Fragile Foundation: How AI Lab Failures Could Trigger a $1.5 Trillion Infrastructure Collapse

A Reuters analysis reveals that the failure of major AI labs like OpenAI or Anthropic could trigger a catastrophic chain reaction, jeopardizing the $650 billion data center boom and $900 billion in financial investments that depend on their insatiable demand for computing power.

85% relevant

Palantir and NVIDIA Forge Strategic Alliance to Power Next-Generation AI Platforms

Palantir Technologies and NVIDIA have announced a major collaboration to develop enterprise AI platforms. The partnership aims to integrate Palantir's data analytics with NVIDIA's accelerated computing to deliver powerful AI solutions for government and commercial sectors.

85% relevant

The AI Frontier Narrows: xAI and Meta Lag as Three-Way Race Intensifies

Recent benchmark data suggests xAI's Grok 4.2 and Meta's models are falling behind in the frontier AI race, which now appears to be a tight contest between three leading players. This consolidation signals a pivotal shift in competitive dynamics.

85% relevant