Available models

See the models you can train with Serverless RL.

Serverless RL currently only supports a single open-source foundation model for training.

To express interest in a particular model, contact support.

Model catalog

Model Model ID (for API usage) Type Context Window Parameters Description
Qwen2.5 14B Instruct Qwen/Qwen2.5-14B-Instruct Text 32.8K 14.7B-14.7B (Active-Total) Dense multilingual instruction-tuned model with tool-use and structured output support