| Ollama's cloud is currently in preview. For full documentation, see [Ollama's documentation](https://docs.ollama.com/cloud).
Ollama’s [Turbo](https://ollama.com/turbo) is a new way to run open-source models with acceleration from datacenter-grade hardware.
## Cloud Models
Currently, the following models are available in Turbo:
[Cloud models](https://ollama.com/cloud) are a new kind of model in Ollama that can run without a powerful GPU. Instead, cloud models are automatically offloaded to Ollama's cloud while offering the same capabilities as local models, making it possible to keep using your local tools while running larger models that wouldn’t fit on a personal computer.
-`gpt-oss:20b`
-`gpt-oss:120b`
Ollama currently supports the following cloud models, with more coming soon:
## Get started
-`gpt-oss:20b-cloud`
-`gpt-oss:120b-cloud`
-`deepseek-v3.1:671b-cloud`
-`qwen3-coder:480b-cloud`
### Ollama for macOS & Windows
### Get started
Download Ollama
To run a cloud model, open the terminal and run:
- Select a model such as `gpt-oss:20b` or `gpt-oss:120b`
- Click on **Turbo**. You’ll be prompted to create an account or sign in
### Ollama’s CLI
-[Sign up](https://ollama.com/signup) for an Ollama account
- Add your Ollama key [to ollama.com](https://ollama.com/settings/keys).
On macOS and Linux:
```shell
cat ~/.ollama/id_ed25519.pub
```
On Windows:
```
type "%USERPROFILE%\.ollama\id_ed25519.pub"
```
- Then run a model setting `OLLAMA_HOST` to `ollama.com`: