feature engineering
30 articles about feature engineering in AI news
MedFeat: How AI is Revolutionizing Medical Feature Engineering with Model-Aware Intelligence
Researchers have developed MedFeat, an innovative framework that combines large language models with clinical expertise to create smarter features for medical predictions. Unlike traditional approaches, MedFeat incorporates model awareness and explainability to generate features that improve accuracy and generalization across healthcare settings.
Agentic Harness Engineering Boosts Coding Agents 7% on Terminal-Bench 2
Agentic Harness Engineering introduces a structured approach to evolving coding-agent harnesses, using revertible components, condensed experience, and falsifiable decisions. On Terminal-Bench 2, pass@1 climbs from 69.7% to 77.0% in ten iterations, beating human-designed baselines.
How Intercom Ships AI Features 10x Faster with Claude Code and Rails
Intercom developers share battle-tested workflows for using Claude Code inside a large Rails codebase to build AI-first features rapidly.
Shopify Engineering details 'Flow generation through natural language'
Shopify Engineering describes a 2026 approach to generating complex workflows (flows) from natural language prompts using an agentic modeling framework, enabling non-technical users to create automation.
Claude AI Adds Meal Planning Feature, Aims at Nutritionist Market
Anthropic's Claude AI assistant has been updated to create detailed weekly meal plans tailored to user-defined nutrition targets. This feature expansion moves Claude into the health and wellness productivity space, competing with specialized apps.
Shopify Engineering Teases 'Autoresearch' Beyond Model Training in 2026 Preview
Shopify Engineering has previewed a 2026 perspective suggesting 'autoresearch'—automated research processes—will have applications extending beyond just training AI models. This signals a broader operational automation strategy for the e-commerce giant.
EgoAlpha's 'Prompt Engineering Playbook' Repo Hits 1.7k Stars
Research lab EgoAlpha compiled advanced prompt engineering methods from Stanford, Google, and MIT papers into a public GitHub repository. The 758-commit repo provides free, research-backed techniques for in-context learning, RAG, and agent frameworks.
Axios Supply Chain Attack Highlights AI-Powered Social Engineering Threat to Open Source
The recent Axios npm package supply chain attack was initiated by highly sophisticated social engineering targeting a developer. This incident signals a dangerous escalation in the targeting of open source infrastructure, where AI tools could amplify attacker capabilities.
Open-Source Multi-Agent LLM System for Complex Software Engineering Tasks Released by Academic Consortium
A consortium of researchers from Stony Brook, CMU, Yale, UBC, and Fudan University has open-sourced a multi-agent LLM system specifically architected for complex software engineering. The release aims to provide a collaborative, modular framework for tackling tasks beyond single-agent capabilities.
ChatGPT Launches 'Library' Feature: Persistent Document Storage Across Conversations with 512MB File Limits
OpenAI introduces ChatGPT Library, a persistent storage system that saves uploaded files (PDFs, docs, images) at the account level for reuse across different chats. The feature is rolling out to Plus, Team, and Enterprise users with specific file size and token limits.
Garry Tan's gstack: The 13-Skill Setup That Turns Claude Code Into a Virtual Engineering Team
Install Garry Tan's open-source gstack to get 13 specialized Claude Code skills (/plan-ceo-review, /review, /qa) that act as a full engineering team, shipping production code faster.
Evolving Demonstration Optimization: A New Framework for LLM-Driven Feature Transformation
Researchers propose a novel framework that uses reinforcement learning and an evolving experience library to optimize LLM prompts for feature transformation tasks. The method outperforms classical and static LLM approaches on tabular data benchmarks.
From Agentic Coding to Autonomous Factories: How Cursor Automations Is Redefining Software Engineering
Cursor's new Automations feature transforms AI-assisted coding from a manual, agent-babysitting model to an event-driven system where AI agents trigger automatically based on workflows. This addresses the human attention bottleneck in managing multiple coding agents simultaneously.
Inside Claude Code’s Leaked Source: A 512,000-Line Blueprint for AI Agent Engineering
A misconfigured npm publish exposed ~512,000 lines of Claude Code's TypeScript source, detailing a production-ready AI agent system with background operation, long-horizon planning, and multi-agent orchestration. This leak provides an unprecedented look at how a leading AI company engineers complex agentic systems at scale.
Anthropic Deploys Multi-Agent Harness to Scale Claude's Frontend Design & Autonomous Software Engineering
Anthropic engineers detail a multi-agent system that orchestrates multiple Claude instances to tackle complex, long-running software tasks like frontend design. The approach aims to overcome single-model context and reasoning limits.
Stop Telling Claude What to Do: The Shift to Outcome Engineering
Move from step-by-step prompting to defining the desired outcome. Let Claude figure out the steps, making your CLAUDE.md files more powerful and efficient.
Anthropic's Claude 3.5 Sonnet Used to Build DCF Models and Earnings Reports via Prompt Engineering
A prompt engineer has shared 13 detailed prompts that guide Anthropic's Claude 3.5 Sonnet through complex financial analysis tasks, including building DCF models and generating earnings reports. The prompts demonstrate the model's ability to follow structured, multi-step reasoning for specialized professional work.
The AI Paradox: How Cheaper Code Creation Is Fueling a Software Engineering Boom
Contrary to fears of AI replacing developers, the Jevons Paradox suggests that making software creation cheaper through AI tools actually increases demand for human engineers who can design, review, and integrate complex systems at scale.
The Agency: 147 Open Source AI Agents Hit 50K GitHub Stars in 2 Weeks
The Agency is an open source repository with 147 specialized AI agents across 12 divisions (engineering, design, marketing, etc.) that hit 50K GitHub stars in under two weeks. It provides one-command install for tools like Claude Code and Cursor, with full modding support.
SpaceXAI Partners with Cursor AI to Build 'World's Best' Coding Assistant
SpaceXAI and Cursor AI announced a partnership to integrate SpaceX's engineering data with Cursor's editor, aiming to create a top-tier AI for coding and knowledge work.
Layers on Layers — How You Can Improve Your Recommendation Systems
An IBM article critiques monolithic recommendation engines for trying to do too much with one score. It proposes a layered architecture—candidate generation, ranking, and business logic—to improve performance and adaptability. This is a direct, practical framework for engineering teams.
Chamath: AI Coding Agents Erase the '10x Engineer' Advantage
Chamath Palihapitiya argues AI coding agents are eliminating the '10x engineer' by making the most efficient code paths obvious to all, similar to how AI solved chess. This reduces technical differentiation and shifts the basis of engineering value.
OpenAI Open-Sources Agents SDK, Supports 100+ LLMs
OpenAI has open-sourced its internal Agents SDK, a lightweight framework for building multi-agent systems. It features three core primitives, works with over 100 LLMs, and has gained 18.9k GitHub stars immediately.
Claude Code Reverse-Engineered: 98.4% of Codebase is Operational Harness
A reverse-engineering analysis of Claude Code reveals only 1.6% of its codebase is AI decision logic, with the rest being operational infrastructure. This challenges current agent design paradigms by prioritizing a robust deterministic harness over complex model routing.
A Practical Guide to Building Real-Time Recommendation Systems
This article provides a practical overview of building real-time recommendation systems, covering core components like data ingestion, feature stores, and model serving. It matters because real-time personalization is becoming a baseline expectation in digital commerce.
Pinterest's Request-Level Deduplication
Pinterest's engineering blog details 'request-level deduplication,' a critical efficiency technique for modern recommendation systems. By eliminating redundant processing of massive user sequences, they achieve 10-50x storage compression and significant training speedups, while solving novel training challenges like batch correlation.
LLM-HYPER: A Training-Free Framework for Cold-Start Ad CTR Prediction
A new arXiv paper introduces LLM-HYPER, a framework that treats large language models as hypernetworks to generate parameters for click-through rate estimators in a training-free manner. It uses multimodal ad content and few-shot prompting to infer feature weights, drastically reducing the cold-start period for new promotional ads and has been deployed on a major U.S. e-commerce platform.
American Express Launches Developer Kit and Purchase Protection for
American Express has introduced a new developer toolkit and a purchase protection feature designed for 'agentic commerce'—transactions initiated by AI agents. This move aims to provide infrastructure and consumer confidence for the emerging automated shopping ecosystem.
Pinterest Details 'Request-Level Deduplication' to Scale Massive
Pinterest's engineering team published a detailed technical breakdown of 'request-level deduplication'—a family of techniques that eliminate redundant processing of user data across thousands of candidate items in their recommendation system. This approach was critical to scaling their Foundation Model by 100x while controlling infrastructure costs.
Karpathy's LLM Wiki Hits 5k Stars, Gains Memory Lifecycle Extension
Andrej Karpathy's LLM Wiki repository gained 5,000 GitHub stars in two days. A developer has now extended it with memory lifecycle features, addressing a noted gap.