> ## Documentation Index
> Fetch the complete documentation index at: https://wb-21fd5541-wbdocs-1882.mintlify.site/llms.txt
> Use this file to discover all available pages before exploring further.

# API error code 503 - The engine is currently overloaded

A 503 error with the message "The engine is currently overloaded, please try again later" means the W\&B Inference server is experiencing high traffic and cannot process your request right now.

## Why this happens

During periods of high demand, the inference engine may become temporarily overloaded. This is a transient condition that typically resolves on its own as traffic subsides.

## What you can do

1. **Retry after a short delay**
   * Wait a few seconds before retrying your request
   * Use exponential backoff to avoid adding to the congestion

2. **Spread out requests**
   * If you're sending many requests, consider spacing them out over time
   * Implement request queuing to smooth traffic spikes

***

<Badge stroke shape="pill" color="orange" size="md">[Server Errors](/support/inference/tags/server-errors)</Badge>
