Is this an API key error?

Usually no. API-key problems more often return 401 or 403. Selected model is at capacity points more directly to model capacity, routing, or an upstream account pool that cannot serve the chosen model.

Should I keep retrying the same model?

No. Repeated retries can amplify a capacity problem. Wait briefly or switch to another available model. If you use an API proxy, ask the provider whether that model route is currently full.

Why does normal chat work while Codex says the model is full?

Codex tasks often carry longer context and tool calls, and they may use a dedicated Codex model or model group. A normal chat model being available does not prove the selected Codex route has capacity.

Model model_at_capacity api proxy

Selected model is at capacity

This message usually means the selected Codex model or model route has no available capacity right now. It is not primarily an API-key error or local configuration failure; switch models, wait briefly, or ask your proxy provider to check model-route capacity.

Snapshot

Error: Selected model is at capacity. Please try a different model. Treat it as model-level capacity or routing congestion first: switch models, wait briefly, reduce context, and ask your proxy provider to check account-pool and model-group availability.

#selected model is at capacity #codex selected model is at capacity #codex model at capacity #codex please try a different model #openai codex model capacity

What This Error Means

Selected model is at capacity. Please try a different model. is mainly about the selected model. The request is not primarily failing at authentication; the selected model, model group, or upstream route cannot currently allocate capacity.

If you are using the official Codex login flow, this may be temporary OpenAI-side model demand. If you are using a proxy, Sub2API, CPA, NewAPI, or a team gateway, it may also mean the proxy’s account pool, model mapping, or route has no healthy upstream available.

Do not reduce this to “Codex is broken.” The sharper diagnosis is: this model route cannot schedule resources right now.

The OpenAI Developer Community had a matching report on April 23, 2026: a user said gpt-5.4 consistently showed this message, and later replies reported long-running workflows being interrupted in Codex Desktop on macOS and Windows. That pattern fits Codex client behavior, model capacity, and streaming stability more than a standard HTTP 429.

In some Sub2API setups, users observed that the upstream failure is closer to Our servers are currently overloaded, and the proxy layer then surfaces it as the Codex at-capacity message. The operational problem is not only the failure itself; Codex may not automatically continue after this message, so long coding tasks get interrupted.

Common Causes

The selected Codex model is under high demand, and the service is asking you to choose another model.
Your proxy maps the selected model to an upstream account pool that is full, rate limited, or cooling down.
Your API key group has no available supplier for that Codex model, or the model is only enabled for some plans.
The current task has too much context or too many tool calls, making a high-cost model more likely to hit capacity controls.
Multiple Codex sessions are sharing the same account, key, or provider route at the same time.

How To Fix It Quickly

Switch to another available model first. Do not keep hammering the same peak-demand Codex, pro, or high-reasoning model.
Wait 1 to 5 minutes before retrying. Capacity errors are often temporary, and immediate repeated retries rarely help.
Reduce context pressure: start a fresh session, attach fewer files, narrow the requested change, or split the work into smaller steps.
If you use an API proxy, switch model groups, backup routes, or providers, and ask support whether this model has schedulable upstream accounts.
If you use the official Codex App or VS Code extension, keep the screenshot, time, model name, and request ID, then report it through Codex feedback or GitHub issues.
If every model fails, check the OpenAI status page, your proxy status page, balance, plan permissions, and Base URL configuration.

Sub2API Mitigation

If you run or administer Sub2API, you can try an error passthrough rule under account management. Match upstream messages containing Our servers are currently overloaded or at capacity, then rewrite them to an error text that your client treats as retryable, such as upstream failed.

This reduces long-task interruptions; it does not create more model capacity. Keep retries bounded, and watch usage, failure rate, and loops. Hiding every capacity error can blind you to a real capacity problem and burn balance on useless retries.

How It Differs From 429 And 503

429 is more about request rate, quota, or throttling. 503 is more about temporary service unavailability or no available upstream account.

Selected model is at capacity is not 429. It more specifically points to the selected model or upstream model route having no available capacity, sometimes underneath an upstream overloaded response or streaming interruption. Your first move should be switching model, switching model route, or using bounded retry, not regenerating keys, reinstalling Codex, or migrating the entire provider.

One hard truth: if you use a proxy, this message does not automatically prove official Codex is unstable. It may simply mean the proxy oversold a popular model route. Run a minimal request and switch-model test before blaming the whole stack.

Selected model is at capacity

Snapshot

What This Error Means

Common Causes

How To Fix It Quickly

Sub2API Mitigation

How It Differs From 429 And 503

Sources

Related errors

Related guides

Related topics

Common questions