gentic.news — AI News Intelligence Platform

editing

30 articles about editing in AI news

ByteDance Open-Sources BAGEL: 7B Multimodal Model for Image Gen, Editing, Understanding

ByteDance open-sourced BAGEL, a 7B multimodal model for image gen, editing, style transfer, and understanding under Apache 2.0.

95% relevant

NemoVideo AI Automates Video Editing Based on Text Prompts

A video creator states NemoVideo AI now automates complex editing tasks like cuts and transitions from simple text descriptions, reducing a 5-hour manual process to a prompt-driven workflow.

85% relevant

GenRecEdit: A Model Editing Framework to Fix Cold-Start Collapse in Generative Recommenders

A new research paper proposes GenRecEdit, a training-free model editing framework for generative recommendation systems. It directly injects knowledge of cold-start items, improving their recommendation accuracy to near-original levels while using only ~9.5% of the compute time of a full retrain.

95% relevant

Mirage's Cappy Edits Video via Text Message with No App

Mirage launched Cappy, a text-based video editing service that delivers fully edited videos via SMS. This first-of-its-kind approach eliminates traditional editing interfaces entirely.

75% relevant

Onlook: Open-Source AI Tool Edits React Code Visually, Hits 23.9K GitHub Stars

Onlook, an open-source desktop app, enables visual editing of live React and Next.js applications, with AI generating and writing code changes directly to the codebase. It has gained 23.9K GitHub stars, positioning itself as a free alternative to paid design tools like Figma.

89% relevant

Microsoft Expands Word Copilot for Legal, Finance, and Compliance Docs

Microsoft is giving its Copilot AI a more significant role within Microsoft Word for editing legal, financial, and compliance documents, indicating a push into specialized, high-stakes enterprise workflows.

85% relevant

Generative World Renderer: 4M+ RGB/G-Buffer Frames from Cyberpunk 2077 & Black Myth: Wukong Released for Inverse Graphics

A new framework and dataset extract over 4 million synchronized RGB and G-buffer frames from Cyberpunk 2077 and Black Myth: Wukong, enabling AI models to learn inverse material decomposition and controllable game environment editing.

85% relevant

Luma AI Launches Uni-1, a Unified Image Model Priced at $0.09 per 2K Image, Challenging Google Nano Banana

Luma AI released Uni-1, a single transformer model for image understanding and generation. It ranks first in human preference tests for style/editing and reference tasks, and is priced lower than Google's Nano Banana models.

95% relevant

Renoise AI Tool Enables Programmatic Video Generation, Promising Faster Production

Renoise has launched an AI tool that generates videos through code rather than traditional editing. The platform claims to produce high-quality videos more easily and faster than previous methods.

85% relevant

Black Forest Labs Unleashes FLUX.2 klein: Sub-Second AI Image Generation Hits Hugging Face

Black Forest Labs has released FLUX.2 klein on Hugging Face, delivering state-of-the-art image generation and editing in under a second. The model runs on consumer GPUs with just 13GB VRAM, making high-speed AI art creation dramatically more accessible.

85% relevant

Seedream 5.0 Lite Emerges as a Precision Tool for AI Image Generation

Seedream 5.0 Lite has launched on HailuoAI, emphasizing unprecedented user control and consistency in AI image generation. The model introduces features like multi-reference image locking and precise editing, moving beyond random outputs toward reliable creative workflows.

85% relevant

Veeso AI Emerges as Template-Free Design Challenger, Promising Instant Visuals from Raw Text

Veeso AI has launched as a potential competitor to Canva, claiming to transform plain text into complete, polished designs instantly without templates or manual editing. The tool aims to democratize design by eliminating the need for drag-and-drop interfaces or design expertise.

85% relevant

NotebookLM's PowerPoint Integration: AI Research Assistant Evolves into Presentation Creator

Google's NotebookLM has expanded beyond research summarization to include slide generation and editing capabilities with direct PowerPoint export. This transforms the AI research assistant into a complete presentation workflow tool.

85% relevant

The AI Image Generation Revolution Hits a Tipping Point: All Major Models Now Accessible in One Platform

A new platform has emerged that consolidates access to leading AI image models including Sora, Flux, and Seedream 4.5, enabling text-to-image generation, editing, and style swapping without multiple subscriptions or specialized software.

85% relevant

PartRAG Revolutionizes 3D Generation with Retrieval-Augmented Part-Level Control

Researchers introduce PartRAG, a breakthrough framework that combines retrieval-augmented generation with diffusion transformers for precise part-level 3D creation and editing from single images. The system achieves superior geometric accuracy while enabling localized modifications without regenerating entire objects.

70% relevant

Sam Altman: AI inference costs dropped 1000x from o1 to GPT-5.4

Sam Altman stated AI inference costs for solving a fixed hard problem dropped ~1000x from o1 to GPT-5.4 in ~16 months, crediting cross-layer engineering optimizations, not a single breakthrough.

85% relevant

AI editor matches pro on 84% of video cuts in blind test

An AI editor matched a professional editor on 84% of video cuts in a blind test on a 4-hour project, suggesting that editorial judgment is partially automatable.

65% relevant

9-Line Agent: Cursor Beats Claude, OpenAI SDKs in Dev Build Test

A developer built the same agent in Cursor (9 lines), Claude Code (47 lines), and OpenAI Codex (31 lines). The gap is in tool orchestration architecture, not model capability.

72% relevant

Claudectl: The Windows Workspace Manager That Makes Claude Code

Claudectl solves Claude Code's biggest pain point on Windows: losing context when switching projects. Install via `pipx install claudectl` for session browsing, CLAUDE.md scaffolding, and per-project MCP/model configs.

90% relevant

Claude Code's June 15 Agentic Credit Split: How to Avoid Hitting the $20 Wall

Claude Code's June 15 agentic credit split moves `claude -p` and CI workflows to a separate $20/month bucket on Pro. Upgrade to Max 5x or switch to direct API for production pipelines.

100% relevant

Claude Code Generates Production Lottie Animations via Show HN

A Show HN post claims Claude Code can generate production Lottie animations. No demo or code was published, and the post drew 2 points and 0 comments. Unverified.

75% relevant

LTX Studio Turns AI Video Clips Into Editable Scenes

LTX Studio with LTX-2.3 lets users edit AI video scenes, not just generate clips. This shifts AI video from demo to production tool.

75% relevant

Sequential Thinking MCP: Break Down Hard Problems Into Solvable Steps in

Sequential Thinking MCP forces Claude Code into structured multi-step reasoning. Install via npx to decompose architecture decisions, debug distributed systems, and design schemas with iterative analysis.

75% relevant

Claude Code Token Costs Got You Down? Here's How to Cut Usage 40% Without

Claude Code users frustrated by token costs should use /compact, optimize CLAUDE.md, and route simple tasks to cheaper models via OpenRouter. No local model matches Claude's quality yet.

90% relevant

Claude Opus 4.8 Launches Dynamic Workflows for Agentic Code

Claude Opus 4.8 launched with dynamic workflows for Claude Code, enabling multi-step agentic coding. The release addresses quality issues after a ~25% instruction miss rate post-4.6.

100% relevant

HAVEN Benchmark Exposes MLLM Gap Between Fluency and Video Understanding

HAVEN benchmark tests MLLMs on hierarchical video understanding across frame, shot, and video levels. Results show top models lack grounded multimodal reasoning despite fluent text generation.

85% relevant

ByteDance Lance 3B MoE Beats 7B Models on Multimodal Benchmarks

ByteDance released Lance, a 3B multimodal MoE model that beats 7B+ models on benchmarks through multi-task synergy and specialized pathways.

90% relevant

SenseTime Open-Sources Omni-Modal Model That Thinks in Pixels and Words

SenseTime open-sourced an omni-modal AI that reasons in pixel-word space without a visual encoder or VAE, challenging dominant multimodal architectures.

87% relevant

Claude Code Autonomously Ported Lightroom CC to Linux

Claude Opus 4.7 autonomously ported Adobe Lightroom CC to Linux via Wine after a single prompt, handling DLL patching and cloud sync integration.

100% relevant

MIT Open-Sources AI That Turns Photos Into Editable CAD Models

MIT open-sourced an AI that turns photos into editable CAD files, threatening $150/hour modeling work. No benchmarks or training details disclosed.

87% relevant