data science
30 articles about data science in AI news
Build Reusable Data Science Workflows with Claude Skills and Subagents
Claude Skills and Subagents let you package prompts into reusable modules, freeing data scientists from repetitive AI adjustments for EDA, modeling, and deployment.
Claude Code Now Integrates with Google Colab via Official MCP Server
Google released an official, open-source MCP server for Google Colab, enabling Claude Code to automate data science workflows directly from your terminal.
How Netflix's Recommendation System Works: A Technical Breakdown
An explainer on the data science behind Netflix's recommendation engine, covering collaborative filtering, content-based filtering, and hybrid approaches. This provides a foundational understanding of personalization systems relevant to retail.
OpenCSF: A 1.5TB Free Computer Science Library Emerges from Unstructured Web Data
A new open-source dataset called OpenCSF has been compiled, containing 1.5TB of computer science materials scraped from public web sources. It provides a massive, free corpus for AI training and research in software engineering and CS education.
Periodic Labs Hits $7.5B Valuation, Raises $500M for AI Science
Periodic Labs raising $500M at $7.5B valuation. Sixfold value jump in 8 months. Talent exodus from big AI labs signals shift to autonomous science.
Anthropic Launches Dedicated Science Blog to Chronicle AI Research and Applications
Anthropic has launched a new Science Blog to publish its research and case studies on using AI to accelerate scientific discovery, aligning with its mission to increase the pace of scientific progress.
Nvidia's Jensen Huang Dismisses Custom AI Chip Threat: 'Science Projects' Versus 'AI Factories'
Nvidia CEO Jensen Huang confidently dismissed concerns about custom AI chips challenging Nvidia's dominance, framing competitors' efforts as 'science projects' while Nvidia builds revenue-generating 'AI factories' with a complete platform approach.
Google's TITANS Architecture: A Neuroscience-Inspired Revolution in AI Memory
Google's TITANS architecture represents a fundamental shift from transformer limitations by implementing cognitive neuroscience principles for adaptive memory. This breakthrough enables test-time learning and addresses the quadratic scaling problem that has constrained AI development.
How a 50-Year-Old Computer Science Concept Just Outperformed Anthropic's Claude Code
A small startup has outperformed Anthropic's flagship Claude Code using a novel architecture based on persistent memory systems. This breakthrough demonstrates how classic computer science principles can solve modern AI limitations in context retention and reasoning.
Columbia Prof: LLMs Can't Generate New Science, Only Map Known Data
Columbia CS Professor Vishal Misra argues LLMs cannot generate new scientific ideas because they learn structured maps of known data and fail outside those boundaries. True discovery requires creating new conceptual maps, a capability current architectures lack.
NVIDIA, DOE Build 100K-GPU Supercomputer for Science
DOE and NVIDIA announced Solstice, a 100K-GPU Vera Rubin supercomputer delivering 5,000 exaflops, and Equinox with 10K Blackwell GPUs.
Researchers Achieve Ultra-Long-Horizon Agentic Science with Cohesive AI Agents
A research team has developed AI agents capable of executing and maintaining coherent, long-horizon scientific research workflows. This addresses a core challenge in creating autonomous systems for complex discovery.
ML-Master 2.0 Hits 56.44% on MLE-Bench in 24-Hour Agentic Science Run
Researchers from Shanghai Jiao Tong University demonstrated ML-Master 2.0, an autonomous research agent that operated continuously for 24 hours on the MLE-Bench, achieving a 56.44% medal rate. The breakthrough centers on Hierarchical Cognitive Caching for state management, not reasoning, enabling long-horizon scientific workflows.
Anthropic's AI Researchers Outperform Humans, Discover Novel Science
Anthropic reports its AI systems for alignment research are surpassing human scientists in performance and generating novel scientific concepts, broadening the exploration space for AI safety.
Neuroscience Visualization: Time-Lapse Video Shows Lab-Cultured Neurons Forming Connections
A researcher shared a time-lapse video of actual neurons in a lab dish forming new connections. This raw visualization provides a direct, non-AI view of biological computation.
Microsoft's Phi-4-Vision: A Compact AI Model That Excels at Math, Science, and Understanding Interfaces
Microsoft has released Phi-4-reasoning-vision-15B, a 15-billion parameter open-weight multimodal model designed for tasks requiring both visual perception and selective reasoning. The compact model excels at scientific, mathematical, and GUI understanding while balancing compute efficiency.
Zatom-1: The First Unified AI Model for 3D Molecular and Materials Science
Researchers have developed Zatom-1, the first foundation model that simultaneously handles generative and predictive tasks for both molecules and materials. This multimodal flow matching approach enables faster sampling and improved accuracy across chemical domains.
BioBridge AI Merges Protein Science with Language Models for Breakthrough Biological Reasoning
Researchers introduce BioBridge, a novel AI framework that combines protein language models with general-purpose LLMs to enable enhanced biological reasoning. The system achieves state-of-the-art performance on protein benchmarks while maintaining general language understanding capabilities.
OpenAI Launches GPT-Rosalind for Drug Discovery, GPT-5.4-Cyber for Security
OpenAI launched GPT-Rosalind, a life sciences model performing above the 95th percentile of human experts on novel biological data, and GPT-5.4-Cyber, a cybersecurity variant. These releases, alongside a major Agents SDK update, signal a pivot from general AI to specialized, high-stakes enterprise domains.
Anthropic Launches STEM Fellows Program to Pair Experts with AI Research
Anthropic announced the Anthropic STEM Fellows Program, a new initiative to bring science and engineering experts into its research teams for collaborative, months-long projects aimed at accelerating progress with AI.
Stanford 2026 AI Index: Models Beat Human Baselines, U.S.-China Gap Narrows
The 423-page Stanford 2026 AI Index Report reveals frontier AI models now match or exceed human baselines on hard coding, science, and math tests. Global AI adoption has hit ~53% in just three years, while the U.S.-China capability gap shrinks.
OpenAI Readies Next-Gen Model Launch, Claims 'Significant Step Forward'
OpenAI is in final preparations to launch its next generation of AI models, which the company claims represents a 'very significant step forward' with revolutionary potential for science and the economy. The launch could happen imminently, possibly within the week.
Boston Consulting Group on 'Speaking Your AI Agent’s Language'
BCG highlights the critical need for effective human-AI agent communication as a cornerstone of digital transformation, particularly in complex, regulated industries like life sciences. This principle is broadly applicable to retail.
AI Accelerates Genomic Discovery, Unlocking '7 Years of Potential in 30 Minutes'
An AI science-research technology is reportedly accelerating discovery in genomics at an unprecedented rate, described as unlocking seven years of potential work in just thirty minutes.
Mirendil: Ex-Anthropic Scientists Launch $1B Venture to Build AI That Thinks Like a Scientist
Former Anthropic researchers are raising $175M at a $1B valuation for Mirendil, a startup aiming to build AI systems for long-term scientific reasoning. The goal is to accelerate breakthroughs in biology and materials science, aligning with a broader industry push toward autonomous AI researchers.
The AI Trap: How Professors Are Fighting Back Against Student Over-Reliance on Language Models
University professors are deploying 'trap words' in digital assignments to catch students who blindly use AI for complex cognitive tasks. While science departments embrace these tools, literature professors report a collapse in students' ability to synthesize information independently.
From Bota to Enhe: The Dawn of Physical AI in Biomanufacturing
Bota Bio has rebranded as Enhe Technology and launched SAION AI, a pioneering Physical AI platform for biomanufacturing. The platform claims state-of-the-art performance across four key life science AI benchmarks, signaling a major shift in how biology is engineered.
Microsoft's EMPO²: A Memory-Augmented RL Framework That Supercharges LLM Agent Exploration
Microsoft has unveiled EMPO², a hybrid reinforcement learning framework that enhances LLM agents with augmented memory for true exploration. The system combines on- and off-policy optimization to discover novel states, achieving 128.6% performance gains over existing methods on ScienceWorld benchmarks.
MIRA Benchmark Tests Cross-Category IR Across 4 Scholarly Data Types
MIRA benchmark tests cross-category retrieval across four scholarly data types using real user queries and LLM-assisted judgments.
DOE's Portsmouth Site to Host World's Largest AI Data Center
A special report details plans for the world's largest AI data center at the DOE's Portsmouth, Ohio site, signaling a massive government-led expansion of compute capacity for AI research and national security applications.