Troubleshooting | JoyToken

Start by checking whether the request reached JoyToken, then decide whether the issue is authentication, policy, wallet, routing, provider, or client parsing.

Quick Checklist

Check	Expected
Base URL	`https://api-dev.joytokens.ai/openai/v1`
Chat endpoint	`POST /chat/completions`, full path `/openai/v1/chat/completions`
Auth	`Authorization: Bearer $JOY_TOKEN_API_KEY`
Content-Type	`application/json`
Model	Start with `auto`, then move to a concrete `model_key`
Messages	`messages` must be non-empty
Request ID	Send `X-Request-ID` to correlate gateway logs and Usage
Wallet	Personal or organization wallet has balance
Policy	IP, tier, fixed model, and model blacklist allow the request

Status Codes

Status	Common cause	Action
`400`	Invalid JSON, empty `messages`, or body over 8 MiB	Compare against the Chat Completions example
`401`	Missing API key	Check the Authorization header
`403`	Invalid key or policy rejection	Check key status, IP, model, and tier
`402`	Insufficient wallet balance or quota exceeded	Top up, switch billing account, or lower tier
`404`	Endpoint or model does not exist	Check URL and model key
`405`	Wrong HTTP method	Chat Completions must use POST
`502`	Router, provider-adapter, or upstream provider failure	Check request ID, selected model, provider, and upstream error
`503`	Downstream service unavailable, such as admin-bff not configured or gRPC unavailable	Retry later or check service status
`504`	Downstream timeout	Reduce request complexity, enable streaming, or check provider status

Error Shape

Model call errors keep the OpenAI-compatible style:

Error response

1 {
2   "error": {
3     "type": "policy_rejected",
4     "message": "requested tier is not allowed"
5   }
6 }

The model list endpoint /api/v1/models uses the front-gateway response shape:

Models error

1 {
2   "code": 50300,
3   "message": "admin-bff not available"
4 }

Log Correlation

Server-side requests should always include a readable request ID:

Traceable request

$ curl https://api-dev.joytokens.ai/openai/v1/chat/completions \
>   -H "Authorization: Bearer $JOY_TOKEN_API_KEY" \
>   -H "Content-Type: application/json" \
>   -H "X-Request-ID: prod-chat-20260629-0001" \
>   -d '{
>     "model": "auto",
>     "messages": [{ "role": "user", "content": "ping" }]
>   }'

Then check in this order:

api-gateway logs: whether the request arrived and whether key source is Bearer or X-API-Key.
API key validation: valid, status, api_key_id, tier.
Policy: IP, tier, fixed model, blacklist.
Routing: whether selected_model was returned.
Wallet: whether balance precheck and freeze succeeded.
Provider: actual X-DAOE-Used-Model and X-DAOE-Used-Provider.
Usage / Billing: whether usage was recorded and freeze was settled.

Common Issues

Issue	Explanation
Works locally, fails with `403` in production	Production egress IP is not in policy allowlist, or the production key has different tier/model policy
`auto` works, fixed model fails	API key fixed model, model blacklist, or missing model in the model pool
Fixed model works, `auto` fails	No candidate model remains after policy/tier filtering, or router returned no `selected_model`
Streaming UI hangs	Client does not parse SSE correctly, or renders the metadata event as assistant delta
No Usage record	Provider failed, stream usage was incomplete, billing calculate/record usage failed, or async settle is still pending
Wallet has balance but request fails	Check whether balance exists in the same tier as the selected tier

Minimal Reproduction

Start with the smallest request to rule out SDK and business-code issues:

Minimal request

$ curl https://api-dev.joytokens.ai/openai/v1/chat/completions \
>   -H "Authorization: Bearer $JOY_TOKEN_API_KEY" \
>   -H "Content-Type: application/json" \
>   -H "X-Request-ID: debug-minimal-001" \
>   -d '{
>     "model": "auto",
>     "messages": [{ "role": "user", "content": "ping" }],
>     "stream": false
>   }'

If the minimal request works, add back tier, tools, streaming, long context, and fixed model one at a time.