Ollama

Development8192MB+ RAM

Run large language models locally with an OpenAI-compatible API. Supports Llama, Qwen, Mistral, DeepSeek, Gemma and 100+ open models.

Download Server Compass View Documentation

aillmlocalopenai-compatiblellamaqwenmistral

Deploy Ollama in 3 Steps

Connect Your VPS

Add your server credentials to Server Compass

Select Ollama

Choose from our template library

Deploy & Configure

Fill in settings and click Deploy

No Docker knowledge required

Video Tutorials

Learn How to Deploy Ollama

All tutorials

How to self host LLM to your VPS | Deploy Ollama & OpenWebUI in 1 click

Learn how to self-host large language models on your VPS using Server Compass. Deploy Ollama and Open WebUI with one click for a private ChatGPT-like experience.

Watch tutorial

Step-by-step deployment guide

Deploy Ollama on a VPS with Server Compass

Use the Ollama template in Server Compass to deploy a self-hosted local LLM runtime API on your VPS, then verify the API landing response in a browser.

About 10 minutesBrowser verified

Step 1

Open the server Apps tab

Select the tutorial-vps VPS, open the Apps tab, and start a new app deployment. Keep sensitive server details hidden before capturing or sharing screenshots.

Server Compass Apps tab before creating an Ollama app

Step 2

Choose an app template

Click New App and choose the template deployment path so Server Compass can load the built-in catalog.

Choosing to deploy an app from a Server Compass template

Step 3

Search for Ollama

Use the template picker search to find Ollama in the Server Compass template catalog.

Searching for Ollama in the Server Compass template picker

Step 4

Select the Ollama template

Choose the Ollama template. Server Compass fills the Ollama service, persistent model volume, keep-alive setting, and API port.

Ollama template selected in Server Compass

Step 5

Review the Ollama settings

Confirm the app name and compose services. In this run, the app was named ollama-demo and used host port 11434.

Reviewing Ollama project settings and compose services

Step 6

Deploy Ollama

Review the generated compose settings, confirm the web port is available, and click Deploy Now.

Reviewing Ollama environment variables and port before deployment

Step 7

Watch the deployment progress

Keep the deployment modal open while Server Compass uploads the compose file, pulls the Ollama image, starts the container, and verifies the stack.

Server Compass deploying the Ollama template on the VPS

Step 8

Confirm Ollama is running

After deployment finishes, return to the Apps tab and confirm the Ollama app is marked Running with its application URL available.

Ollama template running in the Server Compass Apps tab

Step 9

Open the Ollama API in the browser

Click Open Application or open the application URL in a browser. The Ollama API landing response confirms the local LLM API is reachable.

The deployed Ollama API landing response loaded in a browser

After Ollama Opens

Do not expose Ollama directly to the public internet without a firewall, VPN, or authenticated reverse proxy.
Use a VPN, IP allowlist, or authenticated reverse proxy for production model API access.
Pull only the models your VPS can support and monitor disk usage as model files accumulate.
Keep the Ollama image updated for runtime and model-serving fixes.
Restrict API access to trusted applications and networks before serving private prompts or data.

Verified Result

The Ollama API endpoint loaded successfully in a browser.

Download Server Compass Docs

Ollama deployment questions

What does the Ollama template deploy?

It deploys the Ollama container with a persistent `/root/.ollama` model volume and exposed Ollama API port.

Which port did the tutorial use?

The tutorial used host port 11434, which maps to the Ollama API on container port 11434.

Why does the browser verification stop at the first API response?

A fresh Ollama deployment is considered reachable when the HTTP API returns the Ollama running response.

Should this become a blog post?

No. The deployment guide should live on the Ollama template detail page and be linked from the reusable template deployment docs page.

Self-Deploy Guide

DIY Ollama Deployment

Learn how to self-host Ollama with this hands-on deployment guide.

Start a Secure Shell Session

Open your terminal and connect to your server. Replace the IP address with your VPS IP.

terminal

# SSH into your server
ssh root@your-server-ip

# Using a custom SSH key
ssh -i ~/.ssh/id_rsa root@your-server-ip

First time? Need Docker? Install it: curl -fsSL https://get.docker.com | sh

Prepare Your Workspace

Set up a clean directory for your application.

terminal

# Create and navigate to project directory
mkdir -p ~/apps/ollama
cd ~/apps/ollama

Set Up Container Configuration

Set up the container stack using this Docker Compose configuration:

docker-compose.yml

services:
  ollama:
    image: ollama/ollama:latest
    ports:
      - "11434:11434"
    volumes:
      - ollama_data:/root/.ollama
    environment:
      - OLLAMA_KEEP_ALIVE=5m
    restart: unless-stopped

volumes:
  ollama_data:

Setup Variables

PORTHost port to expose Ollama API(default: 11434)

KEEP_ALIVEModel keep-alive duration(default: 5m)

Bring Up the Application

Launch your application stack in the background.

terminal

# Start the containers in detached mode
docker compose up -d

# Check if containers are running
docker compose ps

# View logs
docker compose logs -f

Configure Firewall

Configure your firewall to permit external connections.

terminal

# Allow the application port through firewall
sudo ufw allow 11434/tcp
sudo ufw reload

# Access your app at:
# http://your-server-ip:11434

Skip the Terminal

Prefer a visual interface? Use Server Compass.

Let Server Compass handle the complexity. Deploy Ollama with a simple, intuitive interface.

Visual configuration UI
One-click deployment
Automatic SSL setup
Zero-downtime updates
Built-in monitoring
One-click rollbacks

Download Server Compass$29 one-time • Lifetime license

After Deployment

After deploying Ollama with Server Compass, complete these steps to finish setup

Open the Ollama tab in ServerCompass to manage models and test the API

Pull your first model (Qwen3.5-9B recommended)

Use the API section to get endpoint URL and code snippets

Test with the built-in chat interface

Need help? Check out our documentation for detailed guides.

Ollama FAQ

Common questions about self-hosting Ollama

How do I deploy Ollama with Server Compass?

Simply download Server Compass, connect to your VPS, and select Ollama from the templates list. Fill in the required configuration and click Deploy. The entire process takes under 3 minutes.

What are the system requirements for Ollama?

Ollama requires a minimum of 8192MB RAM. We recommend a VPS with at least 16384MB RAM for optimal performance. Any modern Linux server with Docker support will work.

Can I migrate my existing Ollama data?

Yes! Server Compass provides volume mapping that allows you to import existing data. You can also use standard Ollama backup and restore procedures.

How do I update Ollama to the latest version?

Server Compass makes updates easy. Simply click the Update button in your deployment dashboard, and the latest Ollama image will be pulled and deployed with zero downtime.

Is Ollama free to self-host?

Ollama is open-source software. You only pay for your VPS hosting (typically $5-20/month) and optionally Server Compass ($29 one-time). No subscription fees or per-seat pricing.

Related Templates

View all Development

PocketBase

Open-source backend in a single file with realtime database, auth, and file storage

Appwrite

Open-source backend-as-a-service - self-hosted Firebase alternative

Parse Server

Open-source backend framework with dashboard

Supabase

Full Supabase self-hosted with Kong, GoTrue Auth, Realtime, and Studio

Ready to Self-Host Ollama?

Download Server Compass and deploy Ollama to your VPS in under 3 minutes. No Docker expertise required.

Download Server Compass