operations research

30 articles about operations research in AI news

Sam Altman Outlines 3 AI Futures: Research, Operations, Personal Agents

OpenAI CEO Sam Altman outlined three potential outcomes for AI development: systems that conduct scientific research, accelerate company operations, and serve as trusted personal agents. This vision frames the strategic direction for OpenAI and the broader industry.

Apr 4, 202685% relevant

AI Labs Shift from Pure Engineering to Scaled Human Operations

As frontier AI models advance, the demand for expert human feedback—from annotators to red-teamers—is increasing, creating a labor market that resembles scaled human operations more than traditional software development.

Apr 14, 202685% relevant

ToolTree: A New Planning Paradigm for LLM Agents That Could Transform Complex Retail Operations

Researchers propose ToolTree, a Monte Carlo tree search-inspired method for LLM agent tool planning. It uses dual-stage evaluation and bidirectional pruning to improve foresight and efficiency in multi-step tasks, achieving ~10% gains over state-of-the-art methods.

Mar 16, 202670% relevant

Guardian AI: How Markov Chains, RL, and LLMs Are Revolutionizing Missing-Child Search Operations

Researchers have developed Guardian, an AI system that combines interpretable Markov models, reinforcement learning, and LLM validation to create dynamic search plans for missing children during the critical first 72 hours. The system transforms unstructured case data into actionable geospatial predictions with built-in quality assurance.

Mar 11, 202683% relevant

Palantir's Maven Smart System: The AI-Powered Battlefield Dashboard Revolutionizing Military Operations

Palantir's Maven Smart System represents a paradigm shift in military intelligence, fusing drone, satellite, radar, and signals intelligence into a single AI-powered dashboard that automates target detection and kill-chain management.

Mar 13, 202697% relevant

Microsoft AI CEO Predicts Professional AGI Within 2-3 Years, Redefining Institutional Operations

Microsoft AI CEO Mustafa Suleyman forecasts professional-grade artificial general intelligence arriving within 2-3 years, capable of coordinating teams and running institutions. He distinguishes this practical milestone from the more nebulous concept of superintelligence.

Mar 10, 202685% relevant

Coresight Research Report: Technology and Resilience as Path to Stronger Retail Margins

Coresight Research has published a report titled 'Supply Chain Insights for Food, Drug and Mass Retail: Technology, Resilience and the Path to Stronger Margins.' The research focuses on how strategic tech adoption can fortify operations and profitability in key retail segments.

Apr 8, 202681% relevant

Apple's Neural Engine Jailbroken: Researchers Unlock Full Training Capabilities on M-Series Chips

Security researchers have reverse-engineered Apple's Neural Engine, bypassing private APIs to enable full neural network training directly on ANE hardware. This breakthrough unlocks 15.8 TFLOPS of compute previously restricted to inference-only operations across all M-series devices.

Mar 5, 202695% relevant

flexvec: A New SQL Kernel for Programmable Vector Retrieval

A new research paper introduces flexvec, a retrieval kernel that exposes the embedding matrix and score array as a programmable surface via SQL, enabling complex, real-time query-time operations called Programmatic Embedding Modulation (PEM). This approach allows AI agents to dynamically manipulate retrieval logic and achieves sub-100ms performance on million-scale corpora on a CPU.

Mar 25, 202676% relevant

Accenture's Memex(RL) Revolutionizes AI Agent Memory for Complex Tasks

Accenture researchers have developed Memex(RL), a breakthrough system that gives AI agents structured, searchable memory for long-horizon tasks. This solves the critical problem of agents losing track of past experiences during complex operations like deep research and multi-step planning.

Mar 9, 202685% relevant

Safety Gap: OpenAI's Most Powerful AI Models Released Without Critical Risk Assessments

OpenAI's GPT-5.4 Pro, potentially the world's most capable AI for high-risk tasks like bioweapons research and cyber operations, has been released without published safety evaluations or system cards, continuing a concerning pattern with 'Pro' model releases.

Mar 8, 202685% relevant

Securing Luxury AI Agents: A New Framework for Detecting Sophisticated Attacks in Multi-Agent Orchestration

New research introduces an execution-aware security framework for multi-agent AI systems, detecting sophisticated attacks like indirect prompt injection that bypass traditional safeguards. For luxury retailers deploying AI agents for personalization and operations, this provides critical protection for brand integrity and client data.

Mar 6, 202660% relevant

Preventing AI Team Meltdowns: How to Stop Error Cascades in Multi-Agent Retail Systems

New research reveals how minor errors in AI agent teams can snowball into systemic failures. For luxury retailers deploying multi-agent systems for personalization and operations, this governance layer prevents cascading mistakes without disrupting workflows.

Mar 6, 202670% relevant

Meta to Cut 8,000 Jobs in May, Redirecting Capital to AI Infrastructure

Meta is reportedly planning to lay off 8,000 employees in May, the first round of major cuts this year. The move signals a capital shift from general operations to concentrated investment in AI infrastructure like chips and data centers.

Apr 17, 202699% relevant

AI Sales Agent 'SalesOS' Automates Full Outbound Pipeline

An AI agent called SalesOS has been developed to automate the full outbound sales pipeline, including lead sourcing, personalized outreach, and meeting scheduling. This represents a push toward fully autonomous sales operations.

Apr 11, 202685% relevant

Japanese Team Develops Cardboard Drone Flying at 120 km/h, Assembled in 5 Minutes for Swarm Applications

Researchers in Japan have demonstrated a functional drone constructed entirely from cardboard, capable of 120 km/h flight and 5-minute assembly. The design enables mass production in standard cardboard factories, targeting low-cost, disposable swarm operations.

Mar 30, 202685% relevant

Unitree Robotics Releases UnifoLM-WBT-Dataset: A Large-Scale, Real-World Robotics Dataset for Embodied AI

Chinese robotics firm Unitree Robotics has open-sourced the UnifoLM-WBT-Dataset, a high-quality dataset derived from real-world robot operations. The release aims to accelerate training for embodied AI and large language models applied to physical systems.

Mar 28, 202685% relevant

UiPath Launches AI Agents for Retail Pricing, Promotions, and Stock Management

UiPath has announced new AI agents designed to autonomously handle core retail operations: dynamic pricing, promotional planning, and inventory gap resolution. This represents a significant move by a major automation player into agentic AI for retail.

Mar 25, 202695% relevant

Pentagon to Integrate Palantir's AI Platform as Core Military System, Despite Anthropic Supply Chain Concerns

The Pentagon is moving to adopt Palantir's AI platform as a core system for military operations. This comes despite reported complications involving Anthropic's Claude AI, which was recently flagged as a supply chain risk.

Mar 23, 202685% relevant

Multi-Agent Coding Systems Compared: Claude Code, Codex, and Cursor

A hands-on comparison reveals three fundamentally different approaches to multi-agent coding. Claude Code distinguishes between subagents and agent teams, Codex treats it as an engineering problem, and Cursor implements parallel file-system operations.

Mar 19, 202670% relevant

Mastercard Launches Agent Suite to Power Agentic AI in Digital Commerce

Mastercard has launched Agent Suite, a new service offering combining technical support and customizable AI agents to help businesses integrate agentic AI into operations. This marks a significant move by a major payments network to facilitate the shift from generative to agentic AI in commerce.

Mar 10, 202680% relevant

AI's Exponential Leap: How Task Length Capabilities Are Redefining Intelligence

A new visualization reveals AI's exponential growth in handling complex tasks, moving from simple commands to sophisticated multi-step operations. This development fundamentally changes how we understand artificial intelligence's potential.

Mar 8, 202685% relevant

GuardClaw: The Cryptographic Audit Trail That Could Make AI Agents Accountable

GuardClaw introduces cryptographically verifiable execution logs for AI agents, creating immutable records of autonomous actions. This open-source protocol could revolutionize accountability in AI systems performing financial trades, infrastructure changes, and critical operations.

Mar 4, 202675% relevant

AI as a Double-Edged Sword: How ChatGPT Exposed a Chinese Influence Operation

OpenAI uncovered a Chinese intimidation campaign targeting dissidents abroad after a law enforcement official used ChatGPT to document covert operations. The incident reveals how AI tools can both enable and expose state-sponsored influence activities.

Feb 27, 202685% relevant

Shopify Engineering Teases 'Autoresearch' Beyond Model Training in 2026 Preview

Shopify Engineering has previewed a 2026 perspective suggesting 'autoresearch'—automated research processes—will have applications extending beyond just training AI models. This signals a broader operational automation strategy for the e-commerce giant.

Apr 15, 2026100% relevant

New Research: How Online Marketplaces Can Use Demand Allocation to Control Seller Inventory

Researchers propose a model where a marketplace platform, by controlling the timing and predictability of order allocation to sellers, can influence their safety-stock inventory and their choice to use platform fulfillment services. This identifies demand allocation as a key operational lever for digital marketplaces.

Apr 9, 202678% relevant

Research: Cheaper Reasoning Models Can Cost 3x More Due to Higher Error Rates and Retry Loops

New research indicates that selecting AI models based solely on per-token pricing can be a false economy. Models with lower accuracy often require multiple expensive retries, ultimately increasing total costs by up to 300%.

Mar 29, 202687% relevant

IBM Research Survey Proposes Framework for Optimizing LLM Agent Workflows

IBM researchers published a comprehensive survey categorizing approaches to LLM agent workflow optimization along three dimensions: when structure is determined, which components get optimized, and what signals guide optimization.

Mar 27, 202699% relevant

ColBERT-Att: New Research Enhances Neural Retrieval by Integrating Attention into Late Interaction

Researchers propose ColBERT-Att, a novel neural information retrieval model that integrates attention weights into the late-interaction framework. The method shows improved recall accuracy on standard benchmarks like MS-MARCO, BEIR, and LoTTE.

Mar 27, 202686% relevant

ServiceNow Research Launches EnterpriseOps-Gym: A 512-Tool Benchmark for Testing Agentic Planning in Enterprise Environments

ServiceNow Research and Mila have released EnterpriseOps-Gym, a high-fidelity benchmark with 164 database tables and 512 tools across eight domains to evaluate LLM agents on long-horizon enterprise workflows.

Mar 18, 202695% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety