editing
30 articles about editing in AI news
ByteDance Open-Sources BAGEL: 7B Multimodal Model for Image Gen, Editing, Understanding
ByteDance open-sourced BAGEL, a 7B multimodal model for image gen, editing, style transfer, and understanding under Apache 2.0.
NemoVideo AI Automates Video Editing Based on Text Prompts
A video creator states NemoVideo AI now automates complex editing tasks like cuts and transitions from simple text descriptions, reducing a 5-hour manual process to a prompt-driven workflow.
GenRecEdit: A Model Editing Framework to Fix Cold-Start Collapse in Generative Recommenders
A new research paper proposes GenRecEdit, a training-free model editing framework for generative recommendation systems. It directly injects knowledge of cold-start items, improving their recommendation accuracy to near-original levels while using only ~9.5% of the compute time of a full retrain.
Mirage's Cappy Edits Video via Text Message with No App
Mirage launched Cappy, a text-based video editing service that delivers fully edited videos via SMS. This first-of-its-kind approach eliminates traditional editing interfaces entirely.
Onlook: Open-Source AI Tool Edits React Code Visually, Hits 23.9K GitHub Stars
Onlook, an open-source desktop app, enables visual editing of live React and Next.js applications, with AI generating and writing code changes directly to the codebase. It has gained 23.9K GitHub stars, positioning itself as a free alternative to paid design tools like Figma.
Microsoft Expands Word Copilot for Legal, Finance, and Compliance Docs
Microsoft is giving its Copilot AI a more significant role within Microsoft Word for editing legal, financial, and compliance documents, indicating a push into specialized, high-stakes enterprise workflows.
Generative World Renderer: 4M+ RGB/G-Buffer Frames from Cyberpunk 2077 & Black Myth: Wukong Released for Inverse Graphics
A new framework and dataset extracts over 4 million synchronized RGB and G-buffer frames from Cyberpunk 2077 and Black Myth: Wukong, enabling AI models to learn inverse material decomposition and controllable game environment editing.
Luma AI Launches Uni-1, a Unified Image Model Priced at $0.09 per 2K Image, Challenging Google Nano Banana
Luma AI released Uni-1, a single transformer model for image understanding and generation. It ranks first in human preference tests for style/editing and reference tasks, and is priced lower than Google's Nano Banana models.
Renoise AI Tool Enables Programmatic Video Generation, Promising Faster Production
Renoise has launched an AI tool that generates videos through code rather than traditional editing. The platform claims to produce high-quality videos more easily and faster than previous methods.
Black Forest Labs Unleashes FLUX.2 klein: Sub-Second AI Image Generation Hits Hugging Face
Black Forest Labs has released FLUX.2 klein on Hugging Face, delivering state-of-the-art image generation and editing in under a second. The model runs on consumer GPUs with just 13GB VRAM, making high-speed AI art creation dramatically more accessible.
Seedream 5.0 Lite Emerges as a Precision Tool for AI Image Generation
Seedream 5.0 Lite has launched on HailuoAI, emphasizing unprecedented user control and consistency in AI image generation. The model introduces features like multi-reference image locking and precise editing, moving beyond random outputs toward reliable creative workflows.
Veeso AI Emerges as Template-Free Design Challenger, Promising Instant Visuals from Raw Text
Veeso AI has launched as a potential competitor to Canva, claiming to transform plain text into complete, polished designs instantly without templates or manual editing. The tool aims to democratize design by eliminating the need for drag-and-drop interfaces or design expertise.
NotebookLM's PowerPoint Integration: AI Research Assistant Evolves into Presentation Creator
Google's NotebookLM has expanded beyond research summarization to include slide generation and editing capabilities with direct PowerPoint export. This transforms the AI research assistant into a complete presentation workflow tool.
The AI Image Generation Revolution Hits a Tipping Point: All Major Models Now Accessible in One Platform
A new platform has emerged that consolidates access to leading AI image models including Sora, Flux, and Seedream 4.5, enabling text-to-image generation, editing, and style swapping without multiple subscriptions or specialized software.
PartRAG Revolutionizes 3D Generation with Retrieval-Augmented Part-Level Control
Researchers introduce PartRAG, a breakthrough framework that combines retrieval-augmented generation with diffusion transformers for precise part-level 3D creation and editing from single images. The system achieves superior geometric accuracy while enabling localized modifications without regenerating entire objects.
Sam Altman: AI inference costs dropped 1000x from o1 to GPT-5.4
Sam Altman stated AI inference costs for solving a fixed hard problem dropped ~1000x from o1 to GPT-5.4 in ~16 months, crediting cross-layer engineering optimizations, not a single breakthrough.
AI editor matches pro on 84% of video cuts in blind test
AI editor matched pro on 84% of video cuts in blind test of 4-hour project. Suggests editorial judgment is partially automatable.
9-Line Agent: Cursor Beats Claude, OpenAI SDKs in Dev Build Test
A developer built the same agent in Cursor (9 lines), Claude Code (47 lines), and OpenAI Codex (31 lines). The gap is in tool orchestration architecture, not model capability.
Claudectl: The Windows Workspace Manager That Makes Claude Code
Claudectl solves Claude Code's biggest pain point on Windows: losing context when switching projects. Install via `pipx install claudectl` for session browsing, CLAUDE.md scaffolding, and per-project MCP/M model configs.
Claude Code's June 15 Agentic Credit Split: How to Avoid Hitting the $20 Wall
Claude Code's June 15 agentic credit split moves `claude -p` and CI workflows to a separate $20/month bucket on Pro. Upgrade to Max 5x or switch to direct API for production pipelines.
Claude Code Generates Production Lottie Animations via Show HN
Claude Code claimed to generate production Lottie animations via Show HN. No demo or code published; 2 points, 0 comments. Unverified.
LTX Studio Turns AI Video Clips Into Editable Scenes
LTX Studio + LTX-2.3 lets users edit AI video scenes, not just generate clips. This shifts AI video from demo to production tool.
Sequential Thinking MCP: Break Down Hard Problems Into Solvable Steps in
Sequential Thinking MCP forces Claude Code into structured multi-step reasoning. Install via npx to decompose architecture decisions, debug distributed systems, and design schemas with iterative analysis.
Claude Code Token Costs Got You Down? Here's How to Cut Usage 40% Without
Claude Code users frustrated by token costs should use /compact, optimize CLAUDE.md, and route cheap models via OpenRouter for simple tasks—no local model matches Claude's quality yet.
Claude Opus 4.8 Launches Dynamic Workflows for Agentic Code
Claude Opus 4.8 launched with dynamic workflows for Claude Code, enabling multi-step agentic coding. The release addresses quality issues after a ~25% instruction miss rate post-4.6.
HAVEN Benchmark Exposes MLLM Gap Between Fluency and Video Understanding
HAVEN benchmark tests MLLMs on hierarchical video understanding across frame, shot, and video levels. Results show top models lack grounded multimodal reasoning despite fluent text generation.
ByteDance Lance 3B MoE Beats 7B Models on Multimodal Benchmarks
ByteDance released Lance, a 3B multimodal MoE model that beats 7B+ models on benchmarks through multi-task synergy and specialized pathways.
SenseTime Open-Sources Omni-Modal Model That Thinks in Pixels and Words
SenseTime open-sourced an omni-modal AI that reasons in pixel-word space without visual encoder or VAE, challenging dominant multimodal architectures.
Claude Code Autonomously Ported Lightroom CC to Linux
Claude Opus 4.7 autonomously ported Adobe Lightroom CC to Linux via Wine after a single prompt, handling DLL patching and cloud sync integration.
MIT Open-Sources AI That Turns Photos Into Editable CAD Models
MIT open-sourced an AI that turns photos into editable CAD files, threatening $150/hour modeling work. No benchmarks or training details disclosed.