
Anthropic Adds Weekly Rate Limits for Claude AI

Anthropic announced weekly rate limits for Claude subscribers to curb around-the-clock usage that has been degrading performance. Starting August 28, the heaviest users, fewer than 5% of subscribers by Anthropic's estimate, will face weekly caps on top of the existing usage limits that reset every five hours, with the option to purchase extra capacity. The change has sparked backlash among developers and raises questions for long-running AI projects and enterprise planning.

Published July 29, 2025 at 01:13 AM EDT in Artificial Intelligence (AI)

Anthropic is introducing weekly rate limits for Claude subscribers starting August 28, aiming to rein in “24/7” usage and keep its AI services running smoothly. By the company's estimate, the new caps affect fewer than 5% of users, primarily heavy users running multiple instances of Claude Code around the clock.

Why Weekly Rate Limits?

Anthropic says a small group of users has been running Claude non-stop, straining system capacity and slowing response times. By adding weekly limits on top of the existing usage limits that reset every five hours, the company hopes to restore reliable service for the majority of subscribers.

  • Prevent “always-on” Claude Code instances from hogging compute
  • Reduce policy violations like account sharing and reselling access
  • Ensure consistent performance for all users

New Rate Limit Details

While Anthropic hasn’t published exact weekly quotas, it notes that most Claude Max 20x users can expect roughly 240–480 hours on the Sonnet 4 model and 24–40 hours on Opus 4 per week. Heavy Opus users or those spinning up multiple coding agents may exhaust these limits sooner.
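To make the "exhaust these limits sooner" point concrete, here is a rough back-of-the-envelope sketch. It assumes, purely for illustration, that usage hours add up linearly across parallel agents; the article does not say how Anthropic actually meters hours, and the function name `days_until_exhausted` is hypothetical.

```python
from typing import Optional

DAYS_PER_WEEK = 7


def days_until_exhausted(
    weekly_cap_hours: float, agents: int, hours_per_day_each: float
) -> Optional[float]:
    """Estimate how many days into the week a cap would be reached.

    Assumption (not confirmed by Anthropic): each agent's running time
    counts against the cap one-for-one, so usage adds linearly.
    Returns None if the cap would not be reached within the week.
    """
    if agents <= 0 or hours_per_day_each <= 0:
        return None
    burn_per_day = agents * hours_per_day_each
    days = weekly_cap_hours / burn_per_day
    return days if days < DAYS_PER_WEEK else None


# Two always-on agents against the low and high ends of the quoted ranges.
print(days_until_exhausted(24, 2, 24))    # low end of the Opus 4 range
print(days_until_exhausted(240, 2, 24))   # low end of the Sonnet 4 range
print(days_until_exhausted(480, 2, 24))   # high end of the Sonnet 4 range
```

Under this simplified model, two always-on agents would use up the low-end Opus 4 estimate within the first day, while the high-end Sonnet 4 estimate would last the full week.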

Developer Backlash and Enterprise Impact

Some developers argue the move penalizes responsible users for the actions of a few who abused the system. Enterprises running long-term AI workflows worry they will hit the caps mid-project and need to purchase additional usage at standard API rates to stay on track.

QuarkyByte Analysis: Optimizing AI Usage

Rate limits are a balancing act between fair access and model performance. QuarkyByte recommends that teams:

  • Profile AI workloads to forecast peak usage windows
  • Choose subscription tiers aligned with project runtimes
  • Set up real-time alerts to flag cap thresholds before they’re reached

By applying these steps, organizations can avoid unexpected interruptions, optimize AI spend, and secure the compute power needed for long-running Claude Code or custom agents.
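The alerting step in the list above can be sketched as a simple threshold check. This is a minimal illustration, not an Anthropic API: the `UsageBudget` class, its field names, and the alert fractions are all hypothetical, and the usage readings are assumed to come from whatever tracking a team already has in place.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class UsageBudget:
    """Hypothetical weekly budget with alert thresholds (illustrative only)."""

    weekly_cap_hours: float  # budget chosen by the team, not an official quota
    alert_fractions: Tuple[float, ...] = (0.5, 0.8, 0.95)

    def crossed(self, previous_hours: float, current_hours: float) -> List[float]:
        """Return the alert fractions crossed between two usage readings."""
        return [
            fraction
            for fraction in self.alert_fractions
            if previous_hours < fraction * self.weekly_cap_hours <= current_hours
        ]


budget = UsageBudget(weekly_cap_hours=24.0)

# Usage moved from 10 to 20 hours since the last check.
for fraction in budget.crossed(previous_hours=10.0, current_hours=20.0):
    print(f"Alert: passed {fraction:.0%} of the weekly budget")
```

Comparing consecutive readings, rather than only the current total, means each threshold fires once as it is crossed instead of on every check.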

