mathematics

30 articles about mathematics in AI news

Terence Tao on AI's Impact: 'The Way We Do Everything, Including Mathematics, Will Change'

Fields Medalist Terence Tao states we are entering an unpredictable era where AI will fundamentally change how we do everything, including mathematics. He expressed a personal preference for a more stable, 'boring' period of continuity.

85% relevant

Terence Tao Demonstrates AI's Growing Role in Formal Mathematics with Claude and Lean

Fields Medalist Terence Tao has released a video showing how Claude Code can be used to formalize mathematical proofs in Lean, highlighting AI's expanding capabilities in high-level mathematics.

85% relevant

OpenAI Internal Model Reportedly Solves Three New Erdős Problems, Marking AI Advance in Pure Mathematics

An internal AI model at OpenAI has reportedly solved three previously unsolved mathematical problems from the Erdős collection. This development signals a potential leap in AI's capacity for abstract reasoning and formal theorem proving.

85% relevant

Terence Tao on AI and Mathematics: Collaboration, Not Replacement, for Problems Like Riemann Hypothesis

Fields Medalist Terence Tao suggests AI may not solve problems like the Riemann Hypothesis alone, but through a 'collaboration we can't yet imagine' blending AI power with human insight.

85% relevant

A Deep Dive into LoRA: The Mathematics, Architecture, and Deployment of Low-Rank Adaptation

A technical guide explores the mathematical foundations, memory architecture, and structural consequences of Low-Rank Adaptation (LoRA) for fine-tuning LLMs. It provides critical insights for practitioners implementing efficient model customization.

100% relevant

Mathematics Enters New Era as AI Generates Novel Proofs, Says Fields Medalist Terence Tao

Fields Medalist Terence Tao reveals AI is now producing unique mathematical proofs, though verification remains a bottleneck. He argues that to fully leverage AI, mathematicians must design problems that are easily checkable by both humans and machines.

85% relevant

AI's 'Cheap Wins' in Mathematics Signal a New Era of Human-Machine Collaboration

Fields Medalist Terence Tao reveals AI is solving easier Erdős problems, but the real breakthrough is AI as a tireless junior co-author accelerating mathematical discovery through tedious work automation.

85% relevant

Mathematics Enters New Era as Terence Tao Declares AI's Research Breakthroughs Are Real

Fields Medalist Terence Tao states AI has moved beyond hype to become a genuine tool for mathematical discovery, marking a paradigm shift in how research is conducted. His endorsement signals AI's maturation from experimental assistant to collaborative partner in solving complex problems.

85% relevant

AI Engineer Publishes Free Open-Source Textbook Compiling Math, CS, and AI Concepts

An AI engineer has compiled a comprehensive, free open-source textbook covering mathematics, computer science, and AI concepts. The resource is built with an intuitive, visual-first approach to aid learning.

89% relevant

AI Engineer Henry Ndubuaku Releases Open-Source 'Maths, CS & AI Compendium' Textbook

AI engineer Henry Ndubuaku has published a free, open-source textbook compiling mathematics, computer science, and AI concepts. The resource emphasizes intuitive understanding over notation and has reportedly helped users land roles at DeepMind, OpenAI, and Nvidia.

85% relevant

Terence Tao: LLM Math is Simple Undergraduate Linear Algebra, But Why They Work Remains a Mystery

Fields Medalist Terence Tao explains that the mathematics to build and run LLMs is straightforward linear algebra. The real puzzle is why they perform unpredictably across tasks, a gap in theory for 'meso-scale' natural data.

85% relevant

ChatGPT-5.2 Proves Mathematical Conjecture in Groundbreaking 'Vibe-Proving' Case Study

Researchers demonstrate ChatGPT-5.2 (Thinking) successfully resolving a mathematical conjecture about spectral regions through iterative 'vibe-proving' workflows. The case study reveals where AI assistance proves most valuable in research mathematics and where human expertise remains irreplaceable.

70% relevant

Sam Altman Outlines 3 AI Futures: Research, Operations, Personal Agents

OpenAI CEO Sam Altman outlined three potential outcomes for AI development: systems that conduct scientific research, accelerate company operations, and serve as trusted personal agents. This vision frames the strategic direction for OpenAI and the broader industry.

85% relevant

AI Forecasters Revise AGI Timeline: Key Milestones Pulled Forward to 2029-2030 After Recent Model Progress

A significant update from AI forecasters indicates key AGI milestones have been pulled forward, with the median prediction for AGI arrival shifting from 2032 to 2029-2030. This revision follows rapid progress in recent model capabilities, particularly in reasoning and tool use.

85% relevant

Alibaba's Qwen3.6-Plus Reportedly Under Half the Size of Kimi K2.5, Nears Claude Opus 4.5 Performance

Alibaba's Tongyi Lab announced Qwen3.6-Plus, a model reportedly under half the size of Moonshot's Kimi K2.5 while approaching Claude Opus 4.5 performance, signaling major efficiency gains in China's LLM race.

100% relevant

VMLOps Launches Free 230+ Lesson AI Engineering Course with Production-Ready Tool Portfolio

VMLOps has launched a free, hands-on AI engineering course spanning 20 phases and 230+ lessons. It uniquely culminates in students building a portfolio of usable tools, agents, and MCP servers, not just theoretical knowledge.

87% relevant

VMLOPS's 'Basics' Repository Hits 98k Stars as AI Engineers Seek Foundational Systems Knowledge

A viral GitHub repository aggregating foundational resources for distributed systems, latency, and security has reached 98,000 stars. It addresses a widespread gap in formal AI and ML engineering education, where critical production skills are often learned reactively during outages.

75% relevant

Qwen 3.6 Plus Preview Launches on OpenRouter with Free 1M Token Context, Disrupting API Pricing

Alibaba's Qwen team has released a preview of Qwen 3.6 Plus on OpenRouter with a 1 million token context window, charging $0 for both input and output tokens. This directly undercuts paid long-context offerings from Anthropic and OpenAI.

97% relevant

ChatGPT GPT-5.4 Pro's 'Thinking' Harness Shows Advanced Scientific Paper Comprehension, Including Figure Analysis

OpenAI's ChatGPT GPT-5.4 Pro, with its 'Thinking' harness, demonstrates advanced multimodal understanding of scientific papers, identifying key figures and extracting visual information beyond text parsing.

85% relevant

Rumor: Anthropic Preparing 'Mythos' and 'Capybara' Model Launches, Potentially Challenging GPT-4o

Unconfirmed reports suggest Anthropic is developing two new AI models: 'Mythos,' a new top-tier model, and 'Capybara,' a smaller, faster variant. This follows a pattern of rapid iteration in the frontier model race.

85% relevant

QuatRoPE: New Positional Embedding Enables Linear-Scale 3D Spatial Reasoning in LLMs, Outperforming Quadratic Methods

Researchers propose QuatRoPE, a novel positional embedding method that encodes 3D object relations with linear input scaling. Paired with IGRE, it improves spatial reasoning in LLMs while preserving their original language capabilities.

79% relevant

Study of 280,000 Samples Shows AI Detectors Fail on Short Coursework and STEM Writing, Flagging Real Student Work

A comprehensive study testing 13 AI detectors on 280,000+ samples found they perform unreliably, especially on short assignments and STEM writing, where real student work is often flagged as AI-generated due to formulaic language.

87% relevant

Nature Report: China's Public R&D Spending Nears US Levels, Shifting Global Science Funding Landscape

A new Nature report indicates China is close to surpassing the US in public R&D spending. This shift in funding could alter which nation sets the global pace for scientific research, though China still lags in fundamental research output.

97% relevant

Alibaba's Qwen Team Announces More Open-Source Models Coming at ModelScope DevCon

Alibaba's Qwen team announced at the ModelScope DevCon in Nanjing that they will release more open-source Qwen models. This signals continued investment in their competitive open-weight LLM series.

85% relevant

Learning to Disprove: LLMs Fine-Tuned for Formal Counterexample Generation in Lean 4

Researchers propose a method to train LLMs for formal counterexample generation, a neglected skill in mathematical AI. Their symbolic mutation strategy and multi-reward framework improve performance on three new benchmarks.

77% relevant

Terence Tao Suggests AI Tools Like Lean Could Lower Barrier to Mathematical Research

Fields Medalist Terence Tao posits that AI tools, including proof assistants like Lean, could enable high school students to contribute to frontier math research, accelerating careers and discovery.

85% relevant

NVIDIA Releases Nemotron-Cascade 2: A 30B MoE Model with 3B Active Parameters

NVIDIA has open-sourced Nemotron-Cascade 2, a 30B parameter Mixture-of-Experts model that activates only 3B parameters per token. It claims 'gold medal performance' on IMO and IOI 2025 benchmarks.

100% relevant

GitHub Repository 'Math Textbooks' Aggregates Hundreds of Free University-Level Math Texts

An unmaintained GitHub repository has compiled links to hundreds of free, legally-hosted math textbooks from universities like MIT, Harvard, and Stanford. The collection spans from undergraduate calculus to graduate-level quantum field theory.

85% relevant

Terence Tao: AI's 'Brute-Test' Approach to Math Research Could Narrow Human Efficiency Gap

Mathematician Terence Tao observes AI can synthesize millions of papers and brute-force test ideas, while humans rely on pattern recognition from few examples. He suggests the gap may narrow as AI systems develop world models, causal reasoning, and active learning.

85% relevant

The Coming Revolution in AI Training: How Distributed Bounty Systems Will Unlock Next-Generation Models

AI development faces a bottleneck: specialized training environments built by small teams can't scale. A shift to distributed bounty systems, crowdsourcing expertise globally, promises to slash costs and accelerate progress across all advanced fields.

85% relevant