Skip to content
gentic.news — AI News Intelligence Platform
Connecting to the Living Graph…

update

30 articles about update in AI news

Codex Update Cuts GUI Workflow Latency 42%

Codex app update cuts GUI workflow latency 42%, enabling near-human-speed interface operation for autonomous app building and debugging.

84% relevant

OpenAI Codex Update Adds macOS Agent, Browser, Memory; 3M Weekly Users

OpenAI released a major Codex update featuring background macOS automation, an in-app browser, persistent memory, and 90+ plugins. With 3M weekly users and nearly half of usage now non-coding, Codex is being repositioned as a general work agent.

100% relevant

Anthropic Launches @ClaudeDevs X Account for API Developer Updates

Anthropic has launched @ClaudeDevs on X, a new channel for developers to receive direct updates on API releases, changelogs, and community news. This formalizes a direct line of communication for its growing developer ecosystem.

75% relevant

Kimi 2.6 Code Model Teased in Leaked Image, Suggesting Moonshot AI Update

A screenshot circulating online appears to show a 'Kimi 2.6' code model interface, suggesting Moonshot AI is preparing an update to its Kimi Chat platform focused on coding tasks.

85% relevant

Rumor: Anthropic's Next Claude Update May Include AI App Builder

A rumor on X claims the next Claude update will include an app builder, allowing users to create applications through conversational AI. This could significantly lower the barrier to app development.

87% relevant

How to Decode Anthropic's Press Releases for Better Claude Code Updates

Claude Code users should learn to filter Anthropic's technical announcements for actionable updates on model capabilities, context windows, and API pricing that affect daily development.

97% relevant

Google's Cookie Policy Update and the Challenge of AI-Powered Personalization

Google has updated its user-facing cookie and data consent interface, emphasizing its use of data for personalization and ad measurement. This reflects the ongoing tension between data-driven AI services and user privacy, a critical issue for luxury retail's digital transformation.

82% relevant

TensorFlow Playground Interactive Demo Updated for 2026, Enabling Real-Time Neural Network Visualization

The TensorFlow Playground, an educational web tool for visualizing neural networks, has been updated. Users can now adjust hyperparameters and watch the model train and visualize decision boundaries in real-time.

85% relevant

OpenAI's Spring Update Keynote Hits 11M YouTube Views in 4 Days, Signaling Massive Mainstream Interest

OpenAI's Spring Update keynote reached over 11 million views on YouTube in just four days, demonstrating unprecedented public engagement with a technical AI announcement.

85% relevant

New Research Diagnoses LLMs' Struggle with Multiple Knowledge Updates in Context

A new arXiv paper reveals a persistent bias in LLMs when facts are updated multiple times within a long context. Models increasingly favor the earliest version, failing to track the latest state—a critical flaw for dynamic knowledge tasks.

78% relevant

Claude 3.5 Sonnet's Latest Update Redefines AI Agent Capabilities for Real-World Tasks

Anthropic's Claude 3.5 Sonnet 4.6 update demonstrates remarkable improvements in agentic workflows and computer interaction, positioning it as a leading model for practical AI applications. Early adopters report unprecedented efficiency in real-world task automation.

85% relevant

Almanac: Open-Source Wiki Auto-Updates From Claude Code Chats

Almanac auto-generates a markdown wiki from Claude Code chats and repo history, solving the agent context gap. Free open-source tool, MacOS-only.

90% relevant

Newline's 'Skills' Update Shows Where MCP Servers Are Headed

The Newline MCP server now supports modular 'Skills,' allowing developers to customize their Claude Code environment with specific, installable capabilities for more targeted workflows.

100% relevant

Alibaba's Qwen Team Teases Qwen 3.6 Model, Signaling Major Open-Source LLM Update

Alibaba's Qwen team has teased the imminent release of Qwen 3.6, the next major version of its open-source large language model series. This follows the release of Qwen 2.5 in late 2024 and signals continued aggressive competition in the open-weight model space.

85% relevant

AI Shopping Update: OpenAI Focuses on Discovery, Meta Launches Checkout & Shopify Offers Catalog Integration

A trio of major AI shopping announcements: OpenAI shifts focus to product discovery, Meta launches in-app checkout for AI shopping ads, and Shopify opens its catalog integration to any brand. This signals a rapid move from conversational AI to transactional agentic systems.

95% relevant

Memento-Skills Agent System Achieves 116.2% Relative Improvement on Humanity's Last Exam Without LLM Updates

Memento-Skills is a generalist agent system that autonomously constructs and adapts task-specific agents through experience. It enables continual learning without updating LLM parameters, achieving 26.2% and 116.2% relative improvements on GAIA and Humanity's Last Exam benchmarks.

85% relevant

ReCast: A New RL Technique That Fixes Sparse-Hit Learning in Generative

Researchers propose ReCast, a 'repair-then-contrast' framework that fixes a fundamental flaw in group-based RL for generative recommendation: many sampled groups never become learnable. ReCast restores learnability for zero-reward groups and replaces normalization with contrastive updates, achieving up to 36.6% improvement in Pass@1 and 16.6x faster actor updates.

84% relevant

Fanuc robot arms combine AI and computer vision to adopt flexible workflows

Fanuc has updated its robot arms with AI and computer vision, enabling them to handle flexible workflows rather than fixed, repetitive tasks. This shift allows for greater adaptability in manufacturing environments.

74% relevant

Claude Code Security Alert: Patch Now, Stop Using Authentication Helpers

A critical security leak reveals three command injection vulnerabilities in Claude Code. Users must update and stop using authentication helpers to prevent credential theft and supply chain attacks.

100% relevant

OpenAI Launches GPT-Rosalind for Drug Discovery, GPT-5.4-Cyber for Security

OpenAI launched GPT-Rosalind, a life sciences model performing above the 95th percentile of human experts on novel biological data, and GPT-5.4-Cyber, a cybersecurity variant. These releases, alongside a major Agents SDK update, signal a pivot from general AI to specialized, high-stakes enterprise domains.

90% relevant

Opus 4.7's Tokenizer Change: How to Measure Your Real Claude Code Costs

Claude Opus 4.7's updated tokenizer means the same input can cost 40%+ more than 4.6. Use the Claude Token Counter to measure real costs before upgrading.

100% relevant

Claude AI Adds Meal Planning Feature, Aims at Nutritionist Market

Anthropic's Claude AI assistant has been updated to create detailed weekly meal plans tailored to user-defined nutrition targets. This feature expansion moves Claude into the health and wellness productivity space, competing with specialized apps.

85% relevant

GPT-5.5 Limited Rollout Begins, Frontend Improvements Noted

OpenAI has started a limited rollout of GPT-5.5 to select users, with early reports highlighting significant frontend quality improvements. This suggests an incremental update focused on user experience rather than core model capabilities.

85% relevant

Autogenesis Protocol Enables Self-Evolving AI Agents Without Retraining

A new paper introduces Autogenesis, a self-evolving agent protocol. Agents can assess their own shortcomings, propose and test improvements, and update their operational framework in a continuous loop.

89% relevant

Ethan Mollick Proposes AI Model 'Changelog' for Task-Level Performance Tracking

AI researcher Ethan Mollick argues labs should release a 'changelog' alongside model cards, detailing performance changes on individual tasks. This would increase transparency as model updates become more frequent.

85% relevant

llm-anthropic 0.25 Adds Opus 4.7 with xhigh Thinking Effort — Here's How

Update to llm-anthropic 0.25 to access Claude Opus 4.7 with xhigh thinking_effort for tackling your most challenging code problems.

100% relevant

Claude Opus 4.7 Launches with 3.75MP Vision, Agentic Coding, and New Tokenizer

Anthropic launched Claude Opus 4.7 today with 3x higher vision resolution (3.75MP), self-verifying coding outputs, and stricter instruction following. The update targets enterprise agentic workflows and knowledge work benchmarks.

100% relevant

New Research Proposes Lightweight Method to Fix Stale Semantic IDs in

Researchers propose a method to update 'stale' Semantic IDs in generative retrieval systems without full retraining. Their alignment technique improves key metrics and reduces compute costs by ~8-9x, addressing a core challenge in dynamic recommendation environments.

74% relevant

Claude 4.6 Migration Deadline

Anthropic is retiring Opus 4 and Sonnet 4 on June 15, 2026. Migrate to 4.6 models now to gain 1M context and higher output limits, but update your code for adaptive thinking and output_config changes.

100% relevant

Stealth 100B Model Appears on OpenRouter, Possibly DeepSeek or Kimi

A new, unannounced 100-billion-parameter AI model has appeared on the OpenRouter API platform. Its origin is unknown, but observers speculate it could be a variant from DeepSeek or an update to Kimi's code model.

85% relevant