scheduling
30 articles about scheduling in AI news
Nvidia Blackwell CLC Boosts GEMM Tile Scheduling by 15% Over Static Persistence
Nvidia Blackwell CLC delivers up to 15% higher GEMM throughput via dynamic persistent tile scheduling, fixing load imbalance without startup overhead.
Claude Adds Dynamic Loop Scheduling to AI Agent Workflows
Anthropic has added dynamic loop scheduling to Claude, allowing the AI to intelligently schedule repeated tasks without a fixed interval. This is a foundational capability for creating more autonomous and efficient AI agents.
Boll & Branch Deploys OpenClaw AI Agent 'Tess' Across Operations, From Scheduling to Customer Insights
Bedding brand Boll & Branch created an AI agent named 'Tess' using open-source platform OpenClaw. Initially a scheduling assistant, Tess now integrates with Slack, Shopify, and marketing tools to generate customer reports and analyze social trends, supporting the brand's physical retail expansion.
Florida Homeowner Sells Property for $100K Above Estimate Using AI for Pricing, Staging, and Scheduling
A Florida homeowner bypassed real estate agents, using an unspecified AI tool to manage pricing, staging, and buyer scheduling via text prompts. The property sold for $100,000 above initial estimates, with only a human lawyer involved for final closing documents.
Perplexity Claims 3x Blackwell Inference Throughput for 70B Models
Perplexity AI claims 3x inference throughput for 70B models on Nvidia Blackwell GPUs via FP4 and custom scheduling. The gain exceeds Nvidia's own 2x marketing claim.
Emergent AI Launches Work Stress Copilot, Integrates with Slack & Teams
Emergent AI has launched a new 'Work Stress Copilot' agent that integrates with Slack and Microsoft Teams to autonomously manage calendar scheduling, email triage, and meeting prep. The tool aims to directly reduce cognitive load by automating repetitive administrative work.
Postiz: Open-Source AI Social Suite Challenges Buffer, Hootsuite on Price
Postiz, an open-source AI social media platform, offers scheduling, content creation, and analytics across 25+ platforms. Its self-hosted version is free, challenging paid tools like Buffer ($6/channel) and Hootsuite ($199/month).
AI Sales Agent 'SalesOS' Automates Full Outbound Pipeline
An AI agent called SalesOS has been developed to automate the full outbound sales pipeline, including lead sourcing, personalized outreach, and meeting scheduling. This represents a push toward fully autonomous sales operations.
Anthropic's Claude Code Adds Scheduled, Cloud-Based Task Execution
Anthropic's Claude Code now supports scheduling recurring, cloud-based tasks. Users can set a repository, schedule, and prompt, with Claude executing the task automatically.
Helium: A New Framework for Efficient LLM Serving in Agentic Workflows
Researchers introduce Helium, a workflow-aware LLM serving framework that treats agentic workflows as query plans. It uses proactive caching and cache-aware scheduling to reduce redundancy, achieving up to 1.56x speedup over current systems.
Jinn: Run Claude Code as a Multi-Agent Team with Cron Jobs and Slack Integration
Jinn is an open-source gateway daemon that turns Claude Code CLI into a multi-agent system with scheduling, Slack integration, and a web dashboard.
MetaClaw: Personal AI Agent That Meta-Learns from Conversations Using Cloud LoRA and Skill Synthesis
MetaClaw is a personal AI agent that automatically evolves from every conversation. It meta-learns in the wild using cloud LoRA and skill synthesis, scheduling weight updates during idle time with zero downtime.
Beyond Euclidean Distances: How Asymmetric Routing AI Can Optimize Luxury Logistics and Last-Mile Delivery
RADAR introduces a neural framework that solves real-world asymmetric vehicle routing problems, crucial for optimizing luxury goods delivery, store replenishment, and client appointment scheduling in complex urban environments.
i10X Launches Supera Agent That Executes Full Workflows from Prompt
i10X launched Supera, an AI agent that autonomously executes multi-step workflows from a single prompt with user approval and cost transparency.
Bluezoo Launches AI Agent for In-Store Video Advertising
Bluezoo launched an AI agent for in-store video advertising that uses computer vision to analyze shopper engagement and optimize ad content in real time, promising improved ad effectiveness for retailers.
NVFP4 GEMM on RTX Pro Blackwell: SM12x Breaks from B200 Programming Model
NVIDIA's SM12x architecture drops tcgen05.mma for mma.sync, breaking B200 kernel compatibility. SM8x kernels port easily; developers must maintain separate codebases.
UnitedHealth Bets $3B on AI Agents to Fix the Denial Machine It Built
UnitedHealth Group committed $3 billion to AI agents that call doctors, read charts to nurses, and process claims — a bet that the insurer that drew fury over algorithmic denials can use the same class of technology to restore trust. Under new CEO Stephen Hemsley, the company targets a 30% cut in pr
Cerebras Hits 981 Tokens/sec on 1T-Parameter Kimi K2.6, Claims 6.7× GPU Cloud Speedup
Cerebras reported 981 tokens/sec on the 1T-parameter Kimi K2.6 model, a 6.7× speedup over the next GPU cloud, validated by an independent third party.
vLLM Optimizations Cut Voice AI Latency by 40% on 6-GPU Cluster
vLLM optimizations on a 6-GPU cluster reduced voice AI latency by 40% for a Qwen-based system, enabling 500 concurrent sessions per node without hardware upgrades.
Hims & Hers to Launch AI Weight-Loss Agent as GLP-1 Demand Surges
Hims & Hers to launch AI weight-loss agent for GLP-1 users, announced during Q1 2026 earnings call. Revenue grew 25% to $420M.
8-Agent System Builder: Anthropic's Simpler Approach Beat My 2-Day Build
Engineer built 8-agent system in 2 days; Anthropic's simpler 2-agent approach outperformed it. Lesson: minimal agent architecture beats complex orchestration.
LLMs Fail at Implicit Travel Constraints, New Benchmark Shows
LLMs fail at implicit travel constraints, a new arXiv paper decomposes planning into 5 atomic skills, finding structural biases and ineffective self-correction.
IREN Acquires Mirantis for $625M to Own AI Data Center Stack
IREN acquired Mirantis for $625M to add Kubernetes and OpenStack expertise, aiming to control the full AI infrastructure stack and compete with cloud providers.
Astera Labs Scorpio X-Series Switch Targets 49% Collective IO Cut for Idle GPUs
Astera Labs introduced Scorpio X-Series 320-lane switch targeting 49% collective IO reduction for fragmented AI workloads. Shipments to hyperscalers began, with broad ramp in H2 2026.
RoundPipe: Full Fine-Tune 32B Models on a Single 24GB GPU
RoundPipe fine-tunes 32B models on a single 24GB GPU with 1.5-2.2× speedups via round-robin pipeline dispatch.
Meta Deploys AI Agents to Automate Hyperscale Performance Tuning
Meta deployed unified AI agents to automate hyperscale performance optimization, aiming to reduce manual tuning and costs amid a $145B AI capex push.
GPT-5.5 Launches: The Super App Strategy, Not the Model
OpenAI released GPT-5.5, codenamed Spud, 48 days after GPT-5.4. The model itself is less interesting than the super app strategy, 35x cost reduction on GB200 hardware, and 48-day release cadence that signals a deliberate acceleration.
SSL: Structured Skill Language Boosts Skill Discovery MRR to 0.707
Researchers propose SSL, a three-layer typed JSON representation for AI agent skills, replacing unstructured SKILL.md prose. Using an LLM normalizer, SSL improves Skill Discovery MRR from 0.573 to 0.707 and Risk Assessment macro F1 from 0.744 to 0.787 on a newly released 6,184-skill corpus.
JPMorgan: Agentic AI Could Flip Server Ratio to CPU-Heavy
JPMorgan reports that agentic AI workloads could increase CPU demand, potentially flipping the GPU-to-CPU ratio from 7-8 GPUs per CPU to CPU-heavy deployments, with a $100B TAM for AI CPU infrastructure.
China's OpenClaw Mandate: Subsidies, Quotas, and Firing for Non-Use
In China, OpenClaw ('raising lobsters') is subsidized by Shenzhen and mandated for daily employee tasks, with non-use leading to termination. Meanwhile, using OpenAIClaw elsewhere risks firing. This signals a stark AI adoption divide.