rate limits
30 articles about rate limits in AI news
Anthropic Doubles Claude Code Rate Limits, Leases All of SpaceX's Colossus 1
Anthropic doubled Claude Code's 5-hour rate limits and removed peak-hour throttling for Pro, Max, Team, and seat-based Enterprise plans, then disclosed the source of the new capacity: a lease on the entire Colossus 1 data center — 300 MW and ~220,000 NVIDIA GPUs in Memphis — that SpaceX absorbed when it took over xAI.
Anthropic Permanently Increases API Rate Limits for All Subscribers
Anthropic has permanently increased API rate limits for all subscribers, a move that expands developer capacity without a price hike. This follows a period of high demand and frequent limit adjustments.
How to Bypass Claude Code Rate Limits for $2/Month with a Proxy API
A developer reveals a $2/month proxy setup for unlimited Claude Code API access, crucial for deep work like Linux kernel contributions where rate limits break flow.
OpenAI Reportedly Planning Premium ChatGPT Tiers with Higher Rate Limits
OpenAI appears to be preparing new premium ChatGPT subscription tiers priced at $100 and $200 per month, offering 5x and 20x higher usage rates respectively. This move signals a strategic shift toward serving power users and enterprise customers who require more intensive AI interactions.
Claude Account Switcher: Hot-Swap Accounts in One Click to Bypass Rate Limits
A new macOS menu bar app lets you instantly switch between multiple Claude Code accounts, displaying live usage data and eliminating manual logout/login friction.
How Claude Code Users Are Hitting Usage Limits and What To Do About It
Claude Code power users are hitting rate limits. Here's how to optimize your workflow to stay productive when the meter runs red.
Claude Code Digest — Apr 11–Apr 14
Bypass Claude Code rate limits for just $2/month with a proxy API and unlock unlimited access.
Claude Opus 4.6 Unlimited Access Deal Sparks Developer Interest
A developer reports finding a deal for unlimited Claude Opus 4.6 usage without rate limits, potentially offering significant cost savings for heavy users compared to Anthropic's official API pricing.
Google's Gemini API Goes Free: A Game-Changer for AI Development and Experimentation
Google has removed rate limits and introduced free access to its Gemini API, enabling developers to experiment with AI prompts in CI/CD pipelines and agent systems without billing concerns. This move democratizes access to advanced language models and encourages innovation.
Google Launches Gemini Embedding 2: A New Multimodal Foundation for AI
Google has launched Gemini Embedding 2, a second-generation multimodal embedding model. This technical release, alongside the removal of API rate limits, provides developers with a more powerful and accessible tool for building AI applications that understand text, images, and other data types.
OpenAI Lets Codex Users Bank Rate Limit Resets
OpenAI lets Codex users save rate limit resets. One free saved reset for Go, Plus, Pro, Business tiers.
Claude Code Doubles Usage Limits After 80x Growth Surge
Anthropic doubled Claude Code usage limits after 80x growth. Product lead says no roadmap exists.
Federated Rec System Beats Centralized CTR in 53-Day User Study
A 53-day federated recommender study with 22 users showed user-controlled personalization achieving 65.37% CTR, challenging the privacy-utility tradeoff assumption.
Four years of AI coding: speed gains, cognitive atrophy, rate-limit panic
A developer's four-year account shows speed gains from Copilot to Cursor to Claude Code, but reveals cognitive atrophy and rate-limit dependency as hidden costs.
Datacenter Developers Flee City Zoning for Unincorporated County Land
Datacenter developers are siting projects on unincorporated county land to avoid city zoning delays, redrawing the AI infrastructure map per @SemiAnalysis_.
Tencent Releases MegaStyle: 1.4M AI-Generated Image Style Dataset
Tencent has open-sourced MegaStyle, a 1.4 million image dataset for style transfer and text-to-image fine-tuning. It was generated by systematically pairing 170,000 style prompts with 400,000 content prompts using the Qwen-Image model.
Claude AI Generates Weekly Meal Plans with Nutrition Goals
A prompt library demonstrates Claude's ability to create personalized weekly meal plans that meet specific nutrition targets, potentially saving users hundreds on groceries and dietitian fees.
Principal Engineer: Claude Code Rushes, Codex Deliberate; Guardrails Are Key
A senior engineer with 100 hours in Claude Code and 20 in Codex reports Claude often rushes to patch, while Codex is more deliberate. The real product is the guardrail system—docs and review loops—not the AI itself.
New arXiv Paper Proposes LLM-Generated 'Reference Documents' to Speed Up
A new arXiv preprint introduces a method for efficient LLM-based reranking. It uses LLMs to generate 'reference documents' that help dynamically truncate long ranked lists and optimize batch processing, achieving up to 66% speedup on TREC benchmarks.
Claude Max Price Hike Tested: How to Lock in Your Current Rate
Anthropic is testing higher prices for Claude Max tiers; developers should verify their account's pricing and consider annual billing to lock in rates.
AI Fact-Checks Rated More Helpful, Less Ideological Than Human Ones
A new experiment found LLM-generated fact-checks are rated as more helpful and less ideological than human ones, achieving broader acceptance across political lines. This suggests AI could reduce polarization in online information verification.
Gemma 4 Integrated into Android Studio for AI-Assisted App Development
Google has integrated its Gemma 4 language model into Android Studio's Agent mode, providing developers with AI-assisted coding features like refactoring and feature development within the official Android IDE.
Anthropic Forms Corporate PAC to Influence AI Policy Ahead of Midterms
Anthropic is forming a corporate PAC to lobby on AI policy, signaling a strategic shift towards direct political engagement as regulatory debates intensify in Washington. This move follows similar efforts by OpenAI and Google.
arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference
A comprehensive arXiv review categorizes five principal KV cache optimization techniques—eviction, compression, hybrid memory, novel attention, and combinations—to address the linear memory scaling bottleneck in long-context LLM inference. The analysis finds no single dominant solution, with optimal strategy depending on context length, hardware, and workload.
ChatGPT Launches 'Library' Feature: Persistent Document Storage Across Conversations with 512MB File Limits
OpenAI introduces ChatGPT Library, a persistent storage system that saves uploaded files (PDFs, docs, images) at the account level for reuse across different chats. The feature is rolling out to Plus, Team, and Enterprise users with specific file size and token limits.
Topview Agent V2 Integrates Seedance 2.0 AI Video Model for Text-to-Hollywood-Level Video Generation
Topview has integrated the Seedance 2.0 AI video model into its Topview Agent V2 platform. Users can now generate full-length, high-quality videos from text prompts for any industry.
RAG Fails at Boundaries, Not Search: A Critical Look at Chunking and Context Limits
An analysis argues that RAG system failures are often due to fundamental data boundary issues—chunking, context limits, and source segmentation—rather than search algorithm performance. This reframes the primary challenge for AI practitioners implementing knowledge retrieval.
How to Get 2x Claude Code Usage and Work Around Weekly Limits
Anthropic is offering 2x usage to some Claude Code users. Here's how to check your eligibility and optimize your workflow within the limits.
How to Orchestrate Claude Code with GPT and Gemini Using CLI Calls and Shared Context Files
A developer's system for making Claude Code orchestrate GPT and Gemini via CLI calls, using shared markdown files for persistent context and a session closer agent for compounding knowledge.
Claude Code's Off-Peak Limits Double for Two Weeks: How to Schedule Your Heavy Workloads
Anthropic is doubling Claude's usage limits during off-peak hours for two weeks. Schedule your most demanding coding tasks for these times to maximize output.