lean

30 articles about lean in AI news

Agentic Commerce Needs Clean, Structured Data to Deliver ROI at Scale

Retail Dive reports that agentic commerce, with 4,700% YoY traffic growth, demands clean, structured data. Melissa's data quality assessment helps retailers identify weak spots for AI readiness.

Jul 6, 202682% relevant

Mistral's Leanstral 1.5 hits 100% on miniF2F, finds 5 real bugs

Mistral's Leanstral 1.5 scores 100% on miniF2F, solves 587 Putnam problems, and finds 5 real bugs in open-source code.

Jul 4, 2026100% relevant

Google LEAP Scaffold Lifts Lean-IMO-Bench One-Shot Solve Rate from <10% to 70%

Google's LEAP scaffold lifts Lean-IMO-Bench one-shot solve rate from <10% to 70%, solving all 12 Putnam 2025 problems.

Jun 3, 202685% relevant

Glean benchmark: Off-the-shelf MCP costs 30% more tokens than indexed context

Glean benchmark: off-the-shelf MCP in Claude Cowork loses 2.5x more tasks and uses 30% more tokens than indexed context.

May 15, 202688% relevant

Onyx: Open-Source AI Enterprise Search Challenges Glean's $7.2B Valuation

Open-source platform Onyx provides self-hosted AI enterprise search connecting to 40+ tools, offering a free alternative to Glean's $50/user/month SaaS. Backed by YC and $10M seed funding, it's used by Netflix and Ramp.

Apr 22, 202685% relevant

Microsoft's 'Markdownify' Converts PDFs, Audio, Video to Clean LLM Markdown

Microsoft launched 'Markdownify', a Python tool that converts PDFs, Word docs, Excel, PowerPoint, audio, and YouTube URLs into clean Markdown. This addresses a major pain point in AI pipelines where raw file parsing breaks context and structure.

Apr 8, 202685% relevant

Clean Up Messy Claude Code Terminal Pastes in One Click

Use the 'Cleanup Claude Code Paste' web tool to instantly clean copied terminal output, removing the prompt character and fixing line-wrapping issues for clean prompts.

Apr 6, 202687% relevant

Microsoft's Satya Nadella Details Internal 'Lean for Knowledge Work' AI Initiative

Microsoft CEO Satya Nadella described the company's internal application of AI to streamline knowledge work, framing it as a 'Lean' manufacturing-style efficiency push for cognitive tasks. The initiative focuses on using AI to reduce process friction and improve productivity across internal operations.

Mar 29, 202685% relevant

Chinese Startup Pairs Human Cleaners with Autonomous AI Robots for Household Chores

A new home service in China deploys autonomous AI robots alongside human cleaners to perform household chores. This represents an early commercial implementation of mobile manipulation AI in domestic settings.

Mar 22, 202685% relevant

XSquareRobot and 58.com Launch China's First Human-Robot Home Cleaning Service in Shenzhen

A new service in Shenzhen pairs human cleaners with autonomous AI robots running on the WALL-A system. The robot handles repetitive tasks while the human manages complex judgment, with real home deployment providing training data.

Mar 21, 202692% relevant

Terence Tao Demonstrates AI's Growing Role in Formal Mathematics with Claude and Lean

Fields Medalist Terence Tao has released a video showing how Claude Code can be used to formalize mathematical proofs in Lean, highlighting AI's expanding capabilities in high-level mathematics.

Mar 8, 202685% relevant

Big Tech Earnings: Google Has the Cleanest AI Story, Says Analyst

A market analyst argues Alphabet has the strongest fundamental AI story among Big Tech earnings today, driven by Cloud and TPUv8 demand. Microsoft has the easiest beat-and-run setup due to a beaten-down stock, while Meta and Amazon face higher expectations after recent gains.

Apr 29, 202675% relevant

What Cursor's 8GB Storage Bloat Teaches Us About Claude Code's Clean Architecture

A deep dive into Cursor's scattered 8GB local storage reveals why Claude Code's ~/.claude/projects/*.jsonl approach is better for developers.

Mar 28, 202698% relevant

Learning to Disprove: LLMs Fine-Tuned for Formal Counterexample Generation in Lean 4

Researchers propose a method to train LLMs for formal counterexample generation, a neglected skill in mathematical AI. Their symbolic mutation strategy and multi-reward framework improve performance on three new benchmarks.

Mar 23, 202677% relevant

DeMellier grows by leaning into craftsmanship and alternative materials as

DeMellier founder Mireia Llusia-Lindh explains how focusing on craftsmanship, alternative materials, and controlled growth is driving demand, with Lyst searches up 97% YoY. The strategy echoes broader shifts at Kering and Bottega Veneta as the luxury sector loses 70 million customers due to value concerns.

Apr 24, 202673% relevant

Quaise Energy Raises $134M for Superhot Geothermal Drilling

Quaise Energy raised $134M for superhot geothermal drilling, targeting 24/7 clean power for AI data centers by 2030.

Jul 7, 202675% relevant

Mirage Probes Paper Reveals Two Distinct VLM Failure Modes

Mirage Probes paper reveals VLMs have two distinct failure modes—textual biases and spurious images—requiring different mitigations. Text cleaning only fixes one; the other needs representational interventions.

Jun 15, 202690% relevant

Polarization by Default: New Study Audits Recommendation Bias in LLM-Based

A controlled study of 540,000 LLM-based content selections reveals robust biases across providers. All models amplified polarization, showed negative sentiment preferences, and exhibited distinct trade-offs in toxicity handling and demographic representation, with political leaning bias being particularly persistent.

Apr 20, 202684% relevant

JBM-Diff: A New Graph Diffusion Model for Denoising Multimodal Recommendations

A new arXiv paper introduces JBM-Diff, a conditional graph diffusion model designed to clean 'noise' from multimodal item features (like images/text) and user behavior data (like accidental clicks) in recommendation systems. It aims to improve ranking accuracy by ensuring only preference-relevant signals are used.

Apr 7, 202678% relevant

Google Launches AI Edge Eloquent: Free, Offline-First Dictation App on iOS

Google has quietly launched AI Edge Eloquent, a free, subscription-less dictation app for iOS. It uses a Gemma-based speech recognition model to process audio locally, removing filler words and self-corrections to produce cleaner text.

Apr 6, 202697% relevant

A Practical Guide to Fine-Tuning Open-Source LLMs for AI Agents

This Portuguese-language Medium article is Part 2 of a series on LLM engineering for AI agents. It provides a hands-on guide to fine-tuning an open-source model, building on a foundation of clean data and established baselines from Part 1.

Apr 6, 202674% relevant

Claude Code Hooks: How to Auto-Format, Lint, and Test on Every Save

Configure hooks in .claude/settings.json to run prettier, eslint, and tests automatically, ensuring clean code without manual intervention.

Apr 4, 202695% relevant

Axios NPM Package Under Active Supply Chain Attack, Potentially Impacts 100M+ Weekly Installs

The widely-used JavaScript HTTP client library Axios may be compromised via a malicious dependency in its latest release, exhibiting malware-like behavior including shell execution and artifact cleanup. With over 100 million weekly downloads, this represents a critical software supply chain threat.

Mar 31, 202699% relevant

How to Auto-Approve Safe WebFetches While Blocking Suspicious URLs with Hooks

Use Claude Code's PreToolUse hooks to automatically allow clean documentation URLs while forcing manual review for any URL containing query parameters, eliminating repetitive prompts without sacrificing security.

Mar 31, 202681% relevant

The Leaked 'Employee-Grade' CLAUDE.md: How to Use It Today

A leaked CLAUDE.md used by Anthropic employees reveals advanced directives for verification, context management, and anti-laziness. Here's the cleaned-up version you can use.

Mar 30, 202695% relevant

New Research Proposes FilterRAG and ML-FilterRAG to Defend Against Knowledge Poisoning Attacks in RAG Systems

Researchers propose two novel defense methods, FilterRAG and ML-FilterRAG, to mitigate 'PoisonedRAG' attacks where adversaries inject malicious texts into a knowledge source to manipulate an LLM's output. The defenses identify and filter adversarial content, maintaining performance close to clean RAG systems.

Mar 30, 202692% relevant

Arxitect: The Claude Code Plugin That Enforces SOLID Principles Automatically

Install Arxitect to make Claude Code's implementations adhere to API design, OO principles, and Clean Architecture—preventing technical debt accumulation.

Mar 25, 202695% relevant

Claude Code's Deny List Bypass: How to Protect Your Codebase from Compound Commands

Claude Code's deny lists only check the first token of compound commands, allowing dangerous actions like 'git clean' to slip through. Here's how to protect yourself.

Mar 25, 202695% relevant

OpenAI in Advanced Talks to Buy Electricity from Sam Altman-Backed Helion Energy

OpenAI is negotiating to purchase electricity from fusion startup Helion Energy, with a potential deal securing 12.5% of Helion's initial power output. This move signals a strategic push by the AI giant to lock in massive, clean energy for future compute needs.

Mar 23, 202695% relevant

Agent HTTP: Add a Production-Ready HTTP API to Claude Code in 5 Minutes

Agent HTTP is an MCP server that gives Claude Code a clean HTTP API, enabling programmatic control and integration without terminal scraping.

Mar 20, 202687% relevant

Explore More

AI Agents Large Language Models Claude Code OpenAI RAG MCP Fine-tuning Benchmarks Open Source AI AI Safety