What Happens When You Hit Your Codex Usage Limit?

ChatGPT plan limits reset on rolling windows. How limits behave, and how multi-account and API-key fallback keep production requests flowing.

Hitting a usage limit mid-workload is the first real operational question for anyone moving API traffic onto a ChatGPT subscription. The answer with Codex Hosted is: nothing dramatic. The request that would have failed runs on your next lane, a second ChatGPT account or your own API key, and the subscription lane comes back when the window resets. Here is how limits behave and how to set up the fallback so you never think about it.

How ChatGPT plan limits work

OpenAI enforces Codex usage per plan over rolling windows rather than a monthly meter. The shape, per OpenAI’s docs: limits apply over hours-scale windows, some plans add a weekly component, and the exact message counts vary by plan, by model, and over time as models rotate. OpenAI publishes current numbers on its pricing page; we deliberately do not hardcode them here because they drift.

Two properties matter for production use:

  • Limits are windows, not budgets. You are never “out” for the month. Capacity comes back on a schedule, which is what makes a subscription behave like bandwidth instead of a wallet.
  • Limits are per account. Two Pro accounts have two independent windows. That is the basis of multi-account fallback.

OpenAI also sells on-demand credits to Plus and Pro users who reach a limit and want to continue without upgrading. That is a manual, in-product action; useful as a relief valve, not an automation strategy.

The fallback order

ProxyLLM’s gateway treats your credentials as an ordered list of lanes:

1. Codex · account A      (flat, subscription window A)
2. Codex · account B      (flat, subscription window B)
3. OpenAI API key         (metered, no window)

A request tries the highest available lane. When account A’s window is exhausted, traffic moves to account B without dropping the request; when B exhausts too, traffic runs metered on your API key until a subscription window resets, then moves back up. Every entry in the request log names the lane that served it, so a spike in API-key traffic is visible the day it happens, not on billing day.

You choose the depth. Solo devs commonly run one account plus an API key. Agencies running steady volume add a second account because the math favors it.

The math of a second account

A second account is rarely about doubling capacity; it is about absorbing bursts that would otherwise spill onto metered pricing. Worked example with our planning estimates (treat them as estimates, not guarantees):

SetupMonthly costSustained capacity
Pro 5x + API fallback$100 + metered overflow~$3,500 API-equivalent, overflow at API rates
Pro 5x + Pro 5x + API fallback$200 + rare overflow~$7,000 API-equivalent across two windows

If your overflow regularly costs more than $100/month at API rates, the second account pays for itself. The calculator does this against your actual bill, and the full cost comparison walks the tier math.

Account rules, stated plainly

Each connected account must be your own, with its own paid subscription, used for your own workloads. OpenAI’s terms prohibit sharing an account or making it available to someone else; they do not prohibit one customer owning two subscriptions. Inside ProxyLLM, each account lives in its own isolated container, signed in by you through OpenAI’s device-code flow. We never pool capacity across customers in either direction. The broader policy picture, including why OpenAI keeps the final call, is in our honest reading of the terms.

Setting it up

  1. Connect account A in the dashboard (device-code sign-in, about a minute).
  2. Connect account B the same way, or skip it.
  3. Add your OpenAI API key as the last lane. It is encrypted at rest and only used when subscription lanes are exhausted.
  4. Watch the per-lane breakdown in the request log for a week and size your setup from observed traffic, not guesses.

Limits are the one part of subscription-backed AI that people fear before trying and stop noticing after. Set the fallback order once and the gateway does the worrying. If you have not priced your current bill against a plan tier yet, the calculator takes thirty seconds.

Frequently asked questions

What happens when I hit my ChatGPT plan's Codex limit?

With ProxyLLM, requests keep flowing. The gateway fails over to a second connected ChatGPT account if you have one, then to your own OpenAI API key, until your plan's window resets. Without a fallback, you wait for the reset or buy on-demand credits from OpenAI.

Can I connect multiple ChatGPT accounts to ProxyLLM?

Yes. Each account gets its own isolated container, and you set the fallback order. When one account's limit hits, the next takes over automatically, and the request log shows which account served each call.

How long until a Codex usage limit resets?

OpenAI applies limits over rolling windows (currently hours-scale windows for messages, with weekly components on some plans), and the exact numbers vary by plan and model. Check OpenAI's pricing page for current figures; ProxyLLM's dashboard shows your observed usage and the next reset.

Is it against OpenAI's rules to use two ChatGPT accounts?

Each account must be yours, with its own subscription, used for your own workloads. What OpenAI's terms prohibit is sharing one account between people. Two accounts you own and pay for, each in its own container, is two subscriptions used as intended. OpenAI retains discretion over its services either way.

More on Policies & limits
Codex Hosted · the main feature

Run your AI workloads on your ChatGPT subscription.

ProxyLLM runs OpenAI's Codex for you, signed in with your own ChatGPT account. Your apps call one OpenAI-compatible endpoint and the work bills to your flat plan instead of per-token API pricing.