framework

30 articles about framework in AI news

How to Set Up CLAUDE.md: The Five-Question Framework That Makes Claude

Set up CLAUDE.md with `claude init` and the five-question framework (Who/What/Where/Why/How) to give Claude Code persistent project context, under 200 lines.

Jul 22, 2026100% relevant

Claude Agentic Framework Uses 20 Specialized Agents to Enforce a 3-Stage

The Claude Agentic Framework enforces a Spec → Build → Review pipeline with 20 specialized agents and PowerShell hooks, preventing Claude Code from coding too early or finishing incomplete.

Jul 21, 202698% relevant

Microsoft Merges AutoGen and Semantic Kernel into Agent Framework

Microsoft merged AutoGen and Semantic Kernel into Agent Framework, a unified production-grade framework for .NET and Python with graph-based workflows and Foundry deployment.

Jul 17, 202685% relevant

Vercel Services Cuts Multi-Framework Deploy Time 60%

Vercel Services unifies multi-framework projects (FastAPI, Next.js, Flask, Go) into one Vercel Project with atomic deployments, internal service bindings, and zero-config auto-detection.

Jun 30, 202657% relevant

Microsoft Open-Sources AgentEngine: Multi-Agent Orchestration Framework

Microsoft open-sourced AgentEngine, a multi-agent orchestration framework, on April 14, 2026. Engineer @pauliusztin_ called it a standout project in agent engineering this year.

Jun 28, 202690% relevant

Anthropic's 19-Page AI Framework Skips Runtime Safety, Mandates 15-Day Reports

Anthropic's 19-page AI framework requires 15-day reporting for model subversion but mandates no runtime safety properties, skipping certification core aviation adopted decades ago.

Jun 11, 202667% relevant

NEMA, ASHRAE, PNNL Launch AI Data Center Framework as Power Demand Hits 175 TWh

NEMA, ASHRAE, and PNNL launched an AI data center framework addressing 70-100 kW per rack power demands as global AI electricity consumption could hit 175 TWh annually.

Jun 10, 202692% relevant

xAI Drops JAX, Builds Custom C Training Framework After <10% MFU

xAI dropped JAX for GPU training after <10% MFU, building a custom C framework with Grok Build. NVIDIA's JAX team loses its biggest customer.

May 30, 202691% relevant

12-Metric Agent Eval Framework From 100+ Deployments Hits Production

12-metric evaluation framework for production AI agents from 100+ deployments targets task success, cost, latency, tool use, and safety.

May 13, 202674% relevant

MM-LLM Framework Boosts Recommendation AUC 0.35%, Online Metrics 0.02%

arXiv paper proposes LLaMA2-based MM-LLM framework for recommendation, achieving 0.35% AUC gain and 0.02% online lift at scale.

May 12, 202685% relevant

DataArc-SynData-Toolkit: Open-Source Framework for Multimodal Synthetic Data

DataArc-SynData-Toolkit is an open-source framework for multimodal synthetic data, aiming to lower technical barriers for LLM training. It features a configuration-driven pipeline with visual interface and modular architecture.

May 12, 202670% relevant

R³AG: A New Routing Framework That Matches Queries to Retriever

R³AG is a novel routing framework that dynamically selects the optimal retriever for each query in RAG systems, considering not just relevance but also how well the retrieved document helps the generator produce correct answers. It uses contrastive learning to model query-specific preferences, consistently outperforming existing methods on knowledge-intensive tasks.

Apr 28, 202678% relevant

KARL: RL Framework Cuts LLM Hallucinations Without Accuracy Loss

KARL introduces a reinforcement learning framework that dynamically estimates an LLM's knowledge boundary to reward abstention only when appropriate, achieving a superior accuracy-hallucination trade-off on multiple benchmarks without sacrificing correctness.

Apr 28, 202676% relevant

40-Author Survey Unveils 'Levels × Laws' Framework for Agent World Models

A 40-author survey introduces a 'levels × laws' framework for world models in AI agents, spanning 3 capability levels and 4 law regimes, synthesizing 400+ works. It provides a shared vocabulary for designing and evaluating world models across traditionally siloed research communities.

Apr 27, 202685% relevant

LLM-as-a-Judge Framework Fixes Math Evaluation Failures

Researchers propose an LLM-as-a-judge framework for evaluating math reasoning that beats rule-based symbolic comparison, fixing failures in Lighteval and SimpleRL. This enables more accurate benchmarking of LLM math abilities.

Apr 27, 202682% relevant

ASPIRE: New Framework Makes Spectral Graph Filters Learnable for

Researchers propose ASPIRE, a bi-level optimization framework that makes spectral graph filters fully learnable for collaborative filtering, solving the 'low-frequency explosion' problem and matching task-specific designs.

Apr 27, 202690% relevant

VLAF Framework Reveals Widespread Alignment Faking in Language Models

Researchers introduce VLAF, a diagnostic framework that reveals alignment faking is far more common than previously known, affecting models as small as 7B parameters. They also show a single contrastive steering vector can mitigate the behavior with minimal computational overhead.

Apr 24, 202682% relevant

TACO Framework Cuts Agent Token Overhead 10% via Self-Evolving Compression

Researchers introduced TACO, a framework that enables terminal agents to automatically discover and refine context compression rules from their own interaction trajectories. This approach cuts token overhead by approximately 10% on benchmarks like TerminalBench and SWE-Bench Lite while preserving task accuracy.

Apr 22, 202687% relevant

A Practical Framework for Moving Enterprise RAG from POC to Production

The article presents a detailed, production-ready framework for building an enterprise RAG system, covering architecture, security, and deployment. It provides a concrete path for companies to move beyond experimental prototypes.

Apr 22, 202672% relevant

CAST: A New Framework for Semantic-Level Complementary Recommendations

Researchers propose CAST, a sequential recommendation framework that models transitions between discrete item semantic codes (e.g., specifications) and injects LLM-verified complementary knowledge. It achieves significant performance gains by moving beyond simplistic co-purchase statistics to capture genuine complementarity.

Apr 22, 202678% relevant

VoteGCL: A Novel LLM-Augmented Framework to Combat Data Sparsity in

A new paper introduces VoteGCL, a framework that uses few-shot LLM prompting and majority voting to create high-confidence synthetic data for graph-based recommendation systems. It integrates this data via graph contrastive learning to improve accuracy and mitigate bias, outperforming existing baselines.

Apr 22, 202690% relevant

GraphRAG-IRL: A Hybrid Framework for More Robust Personalized Recommendation

Researchers propose GraphRAG-IRL, a hybrid recommendation framework that addresses LLMs' weaknesses as standalone rankers. It uses a knowledge graph and inverse reinforcement learning for robust pre-ranking, then applies persona-guided LLM re-ranking to a shortlist, achieving significant NDCG improvements.

Apr 22, 202692% relevant

CS3: A New Framework to Boost Two-Tower Recommenders Without Slowing Them Down

Researchers propose CS3, a plug-and-play framework that strengthens the ubiquitous two-tower recommendation architecture. It uses three novel mechanisms to improve model alignment and knowledge transfer, delivering significant revenue gains in a live ad system while maintaining millisecond latency.

Apr 22, 2026100% relevant

LLMAR: A Tuning-Free LLM Framework for Recommendation in Sparse

Researchers propose LLMAR, a tuning-free recommendation framework that uses LLM reasoning to infer user 'latent motives' from sparse text-rich data. It outperforms state-of-the-art models in sparse industrial scenarios while keeping inference costs low, offering a practical alternative to costly fine-tuning.

Apr 21, 202680% relevant

MiniMax Added as Official Provider for OpenClaude AI Framework

MiniMax has been integrated as an officially supported provider for the OpenClaude framework, giving developers a new, enterprise-backed model option for running the open-source Claude alternative.

Apr 20, 202689% relevant

Andrej Karpathy's LLM-Wiki Framework Solves AI Amnesia with Persistent Knowledge

Andrej Karpathy published a two-page framework called LLM-Wiki that transforms how AI systems handle accumulated knowledge. Instead of retrieving from raw documents each time, the AI compiles sources into its own structured wiki that persists across sessions.

Apr 19, 202685% relevant

TRACE: A Multi-Agent LLM Framework for Sustainable Tourism Recommendations

A new research paper introduces TRACE, a modular LLM-based framework for conversational travel recommendations. It uses specialized agents to elicit sustainability preferences and generate 'greener' alternatives through interactive explanations, aiming to reduce overtourism and carbon-intensive travel.

Apr 17, 202692% relevant

FeCoSR: A Federated Framework for Cross-Market Sequential Recommendation

A new arXiv paper introduces FeCoSR, a federated collaboration framework for cross-market sequential recommendation. It tackles data isolation and market heterogeneity by enabling many-to-many collaborative training with a novel loss function, showing advantages over traditional transfer approaches.

Apr 16, 202682% relevant

New Research Proposes Unified LLM Framework for Need-Driven Service

A new arXiv paper introduces a large language model framework that unifies living need prediction and service recommendation for local life services. It uses behavioral clustering to filter noise and a curriculum learning + RL strategy to navigate complex decision paths. Experiments show it significantly improves both need prediction and recommendation accuracy.

Apr 16, 202682% relevant

MVCrec: A New Multi-View Contrastive Learning Framework for Sequential

Researchers propose MVCrec, a framework that applies multi-view contrastive learning between sequential (ID-based) and graph-based views of user interaction data to improve recommendation accuracy. It outperforms 11 leading models, showing significant gains in key metrics.

Apr 16, 202684% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety