Contents

  1. What Is OpenClaw? Everyone
  2. Your Setup Journey Everyone
  3. Setting It Up Everyone
  4. What to Do With It Everyone
  5. How OpenClaw Compares Reference
  6. Security for Experts Expert
  7. Deep Dives Reference
  8. Steinberger’s Perspective Reference

The OpenClaw Guide

From “what is this?” to your own personal AI assistant. Every decision, every step, every risk — explained.
Updated May 2026 · Based on OpenClaw v2026.5.5+

Table of Contents

  1. What Is OpenClaw? Everyone
  2. Your Setup Journey: Five Decisions Everyone
  3. Setting It Up Everyone
  4. What to Do With It Everyone
  5. How OpenClaw Compares Reference
  6. Security for Experts Expert
  7. Deep Dives Reference
  8. Steinberger’s Perspective Reference

What Is OpenClaw?

The Elevator Pitch

OpenClaw is a free, open-source program that turns AI models like Claude, ChatGPT, or open-source alternatives into a personal assistant that can actually do things on your behalf — not just answer questions.

Think of it this way: ChatGPT is like having a very smart friend you can text for advice. OpenClaw is like hiring that smart friend as your full-time personal assistant, giving them the keys to your house, your email password, and your calendar, and saying “handle it.”

It was created by Peter Steinberger and released in early 2026. In just 60 days it became the fastest-growing open-source project in history — faster than Linux, WordPress, or any other community-built software before it. By May 2026 it had passed 340,000 GitHub stars and was being run by hundreds of thousands of people. Governance moved to an independent non-profit foundation in February when Steinberger joined OpenAI; the project itself stays MIT-licensed and provider-neutral.

An analogy: Imagine a universal remote control, but instead of controlling just your TV, it can control your email, calendar, smart lights, music, and dozens of other things. And instead of pressing buttons, you just tell it what you want in plain English.

Imagine This

It’s 6:45 AM. Before your alarm goes off, a text arrives on your phone — from your AI assistant, through the same messaging app you use to text your friends:

“Good morning. Two emails need replies today — one from your kid’s school about the field trip permission slip (deadline Friday), one from the dentist confirming Thursday at 2pm. Your Amazon package arrives today by 8pm. Weather: 64°F and sunny, perfect for that run you’ve been skipping.”

Over breakfast, you text back: “What can I make with the chicken thighs in the fridge? We need something quick tonight.” A few seconds later, three recipes appear, sorted by prep time, with a grocery list for the one ingredient you’re missing.

At lunch, your partner texts the assistant in your family group chat: “What time is soccer practice on Saturday and where?” It checks the shared calendar and responds with the time, address, and a note that it conflicts with the dentist appointment it mentioned earlier.

Driving home, you text: “Turn on the AC, I’ll be there in 20 minutes.” The thermostat drops. After dinner, you walk into the kitchen and say “Hey Q, what’s on the calendar tomorrow?” and a voice reads back your schedule from the speaker on the counter. At bedtime, you say “goodnight” and the lights dim, the porch light turns on, and the thermostat adjusts for sleeping.

None of this required opening an app, visiting a website, or remembering a special command. You just texted. That’s the whole point.

This isn’t science fiction. People are running setups exactly like this right now, on a spare laptop, a Raspberry Pi, or a Mac Mini tucked in a closet. The rest of this guide shows you how.

What Can It Actually Do?

OpenClaw uses a skills system — modular add-ons that teach it how to interact with different services. Here’s what people are actually building:

Run Your Household

Morning briefings that summarize your inbox, packages, and schedule before you open your laptop. Smart home control via text message — “I’m heading to bed” dims the lights and drops the thermostat. Family coordination across shared calendars. Meal planning with grocery lists grouped by aisle. Seasonal maintenance reminders.

Take Control of Your Time

Calendar management that finds free slots and respects your preferences. Travel planning with day-by-day itineraries. Deal-finding that compares prices across retailers and alerts you when wishlist items go on sale.

Know Yourself Better

Health insights from your Apple Watch or Fitbit. Workout programming that adapts to how you feel. Text-based journaling compiled into structured entries each evening. Meeting transcription that extracts action items.

Never Stop Learning

Deep research on demand. A personal knowledge base that organizes every article and note. Language practice with a patient conversation partner. Podcast and reading curation that triages your subscriptions.

Your Money, Your Data

Budget tracking that categorizes every transaction. Photo library organization that makes sense of 47,000 photos. Both handle highly sensitive data — prime candidates for local AI models.

Hobbies & Projects

Track where you left off, store reference links and materials, get seasonal reminders. A personal wiki for everything you’re working on.

The key difference from existing AI chatbots: OpenClaw doesn’t just tell you what to do. It does it. When you say “reschedule my 2pm to Thursday,” it actually moves the meeting.

Each use case has different risk levels. We cover the details in Section 4.

How Is It Different from ChatGPT or Claude?

ChatGPT / ClaudeOpenClaw
What it isAI you chat with in a browserAI agent that lives on your computer
What it doesAnswers questions, writes textTakes actions: sends emails, moves meetings, controls devices
Where it runsCompany’s serversComputer you own or rent
How you talk to itCompany’s website or appSignal, Telegram, WhatsApp, Discord
Which AIOnly that company’s modelAny model — Claude, GPT, local, open-source
Cost~$20/month subscriptionFree software + AI API ($3–60/month, or free local)
Your dataGoes to AI companyYou choose: cloud or fully local

For Claude Code, Cowork, and Channels comparison, see Section 5.

The Honest Truth

OpenClaw is genuinely exciting, but you should go in with open eyes.

Security is on you. 170+ security vulnerabilities have been found and patched in OpenClaw’s first six months — including 13 new CVEs in April 2026 alone (one a CVSS 8.7 privilege escalation). That’s a lot — it means you need to keep it updated and properly configured.

The skills marketplace has a trust problem. The “ClawHavoc” attack planted 1,400+ malicious skills, including one with 340,000+ installs that silently exfiltrated credentials. ClawHub added a verified-publisher program in March, but most skills are still unverified. Be careful what you install.

The AI providers are cracking down. In Feb 2026 Google started permanently banning paid AI Pro/Ultra accounts for routing Gemini through OpenClaw. Anthropic followed, then partially reversed — but starting June 15, programmatic Claude usage through OpenClaw lives on a separate metered credit pool (Pro $20, Max 5x $100, Max 20x $200) that doesn’t roll over and won’t spill into your chat allowance. Translation: if you want OpenClaw on a flat-rate plan, you can’t. See Decision 3.

It can make mistakes with real consequences. An AI agent that takes actions can send an email to the wrong person or turn off your alarm system.

It requires some technical comfort. “If you can’t understand how to run a command line, this is far too dangerous of a project for you to use safely.”

That’s why this guide exists Every risk above is manageable. The next section walks through every decision with recommendations and trade-offs. Follow it, and you’ll be fine.

Your Setup Journey: Five Decisions

Before you install anything, you need to make five choices. Each one shapes what your setup looks like, what it costs, and how safe it is.

Decision 1: OpenClaw or NemoClaw?

OpenClaw is the employee. NemoClaw is the building with locked doors, security cameras, and badge readers.

OpenClaw is the AI agent itself. Default: everything allowed unless you restrict it.

NemoClaw is NVIDIA’s security wrapper. Default: everything blocked unless you allow it.

OpenClawNemoClaw
Default stancePermissiveLocked down
Internet accessAgent can reach anythingOnly services you approve
File accessFull accessSandbox only
PasswordsAgent can read themStored outside, invisible to agent
Setup difficultyModerateHarder (requires Docker)
Maturity~6 months, large community~2 months, alpha (early preview since March 16, 2026)

Recommendation

Start with plain OpenClaw. Simpler to set up, larger community. Once comfortable, consider NemoClaw for stronger security defaults.

Decision 2: Where Will It Run?

OpenClaw needs a computer that stays on. The community overwhelmingly favors dedicated local hardware — it keeps your data physically in your house, costs almost nothing to run, and can serve local AI models without paying for a GPU-equipped VPS.

Just want to try it out?

Your existing computer is fine for experimenting. OpenClaw stops when your laptop sleeps, but that’s OK while you’re learning. No cost, no setup hassle.

Want it always on? (Most people)

Dedicate a machine to it. The community favorites:

  • Mac Mini M4 (~$600) — the default recommendation in every OpenClaw forum. Draws 10–15W at idle (~$15/year in electricity), completely silent, runs macOS so iMessage bridge works natively.
  • Raspberry Pi 5 (8GB) (~$80) — the budget pick. Use an NVMe SSD, not an SD card — the performance difference is dramatic for OpenClaw’s constant small reads/writes.
  • Old laptop — anything collecting dust works. Just don’t use the same machine you do banking on.

Want local AI too?

You need more horsepower. A Mac with 64GB RAM or a used RTX 3090 build ($900–1,200) can run Qwen 3.5 27B for free, private inference. See Hardware Deep Dive. You can also use a two-machine setup: a cheap always-on box for the gateway plus a beefier machine for local models.

What about a cloud VPS?

A $5–10/month server (DigitalOcean, Hetzner) works and gives you 24/7 uptime without worrying about home internet or power. But your personal data — emails, calendar, health — lives on someone else’s hardware, which undercuts one of OpenClaw’s main selling points. Best for: people without spare hardware, or as a secondary always-on gateway paired with local models at home.

Recommendation

A dedicated local machine. Mac Mini if you’re buying new, Raspberry Pi 5 if you’re on a budget, old laptop if you have one. Keep it separate from your daily driver. A cloud VPS is a solid alternative if you don’t have spare hardware, but local is where the community has landed.

Decision 3: Which AI Brain?

This comes down to one question: how much do you trust cloud AI with your data?

Path A: Cloud models (easiest, best quality)

ModelProviderBest ForCostNotes
GPT-5.4OpenAI (US)All-rounderModerate90.5% PinchBench. API only.
Grok 4.1 minixAI (US)Budget$0.20/$0.50/1M75x cheaper than Opus. Now supports SuperGrok OAuth login (v2026.5.16-beta.3).
Claude Haiku 4.5Anthropic (US)Budget + qualityLow89.5% PinchBench.
Claude Opus 4.7Anthropic (US)When nothing else will do$15/$75/1MReleased April 16. Highest scores on coding (+13% vs 4.6) and vision (98.5%).
Provider crackdowns — read this before you connect anythingThe frontier labs spent the last three months tightening the rules around exactly what OpenClaw does. Here’s where each one stands as of May 2026:
  • Anthropic (Claude): Initially banned third-party agents from Claude subscriptions in February, then reversed. Starting June 15, 2026, programmatic Claude usage (Agent SDK, OpenClaw, GitHub Actions) runs on a separate monthly credit pool that mirrors your plan price: Pro $20, Max 5x $100, Max 20x $200. Credits don’t roll over and you can’t spill into your interactive chat allowance. Most OpenClaw users will burn through Pro credits in a single afternoon — plan on pay-as-you-go API billing, not a subscription.
  • Google (Gemini): Started permanently banning paid AI Pro and Ultra subscribers in February 2026 for routing Gemini through OpenClaw via the Antigravity OAuth flow. No warnings, no appeals, no refunds. Some accounts lost access to their entire Google ecosystem. If you want Gemini, use the Vertex AI / AI Studio API with a billing-enabled GCP project — never log in with your personal Google account.
  • OpenAI (GPT-5.4, Codex): No subscription bans so far. The API is the supported path. OpenClaw 2026.4.10+ added a bundled Codex provider that uses native OAuth (intended for paid Codex/Plus users), but the ToS still bans “programmatically extracting data” and “using ChatGPT to power third-party services” — same legal trapdoor as everyone else. Safest path: a regular API key.
  • xAI (Grok): Most permissive. v2026.5.16-beta.3 added SuperGrok OAuth login that explicitly authenticates xai/* models for OpenClaw. xAI is currently the only major lab actively building toward agent-via-subscription use rather than away from it.
The pattern: a flat-rate consumer subscription is not a license to run an agent. If you want to use OpenClaw against a frontier model, use the provider’s pay-per-token API. Always read the current ToS before you connect a key.

Path B: Local models (private, free after hardware)

Nothing leaves your machine. Community favorite: Qwen 3.5 27B via Ollama (90% PinchBench). See Qwen Deep Dive.

ollama pull qwen3.5:27b && ollama serve

Path C: Both (the smart move)

Route different tasks to different models. See Model Routing Config.

Data residency Every hosted model runs under its provider’s data-handling terms and regulatory jurisdiction, which vary by provider. Review the provider’s documentation before sending sensitive material to any cloud model. For anything that shouldn’t leave your machine, run it locally.

Recommendation

Start with a cheap cloud model (Grok mini or Claude Haiku). Add local later for sensitive data. ~$10–20/month.

Decision 4: How Will You Talk to It?

Two options: text (via messaging apps) or voice (via smart speakers). Most people start with text and add voice later.

Option A: Messaging Apps

PlatformSetupE2E EncryptedMulti-DeviceNotes
TelegramEasiestNoYesMost popular for beginners.
SignalModerateYesYesStrong privacy.
WhatsAppModerateYesLimitedUnofficial bridge — Meta could break it.
DiscordEasyNoYesGood if already there.
iMessageHarderYesApple onlyRequires Mac + BlueBubbles.

Recommendation

Telegram for most people. Signal for privacy. iMessage if your household is all Apple.

Option B: Voice (The Alexa Replacement)

If you’d rather talk to OpenClaw than text it, you can — but not through Alexa or Google Home directly. Those devices are locked to their cloud services. Instead, you replace them with hardware that runs Home Assistant’s voice pipeline, with OpenClaw as the brain behind it.

The chain works like this:

Custom wake word → speech-to-text (Whisper, runs locally) → OpenClaw processes your request → text-to-speech (Piper or ElevenLabs) → response plays on the device you spoke to

Everything runs on your network. No cloud. No subscription. No data leaves your house.

Hardware Options

DeviceCostNotes
Home Assistant Voice PE~$70Dedicated hardware. Dual mic array, physical mute switch, good speaker. The “real product” option — closest to an Echo experience.
ESP32-S3 satellite$4–15A tiny chip + mic that does wake word detection on its own NPU. Cheap enough to put one in every room. Pairs with any existing speaker.

Either device plugs into Home Assistant’s Assist pipeline. The OpenClaw HA integration registers as a native conversation agent, so voice commands flow straight to your OpenClaw instance.

You pick your own wake word — “Hey Q,” “OK Claw,” whatever you want. You can even clone a voice for responses.

Voice vs Alexa: Honest Trade-offs

Alexa / EchoOpenClaw + HA Voice
IntelligencePattern matching — needs exact phrasesFull LLM — understands context (“I’m heading to bed”)
MusicAmazon Music, Spotify, etc. built inSpotify via HA integration, or cast to speakers
ShoppingBuilt inNot supported by voice (use text for this)
PrivacyAlways listening, cloud-processed100% local. Nothing leaves your network.
SetupPlug in and goRequires Home Assistant + configuration
Per-room cost$25–100 per Echo$4–15 per ESP32 satellite, or $70 for Voice PE
You can run both Keep Alexa for music and shopping. Use OpenClaw voice satellites for everything that benefits from actual intelligence — home control, questions, briefings, family coordination. Many people run both side by side.

Decision 5: What Should It Do First?

RiskUse CasesWhy
LOWMeals, travel, knowledge base, languages, home maintenance, podcasts, pets, hobbiesNo sensitive data. Mistakes are inconvenient, not dangerous.
LOW-MEDWorkout coach, research, shoppingSome personal data. Web browsing risk.
MEDIUMCalendar, family, photos, journaling, meeting notesReveals habits, location, relationships.
HIGHEmail, smart home, healthSensitive data. Physical security implications.
VERY HIGHPersonal financeFinancial data. A breach exposes everything.

Recommendation

Start low-risk. Travel, meals, or hobbies. Build confidence, then move up. Details in Section 4.

Setting It Up

You’ve made your decisions. Now let’s build.

How It Works Under the Hood

OpenClaw runs in the background listening for messages and doing tasks. Three parts:

It also has memory (SOUL.md and MEMORY.md files) so it remembers between conversations.

Installation

Tell us about your setup and we’ll show you exactly the right steps:

Your computer
Platform
Install method
Messaging
AI model
Your hardware

Step 1: Prerequisites

Install Docker

Docker runs OpenClaw inside an isolated container — think of it as a sealed room on your computer. Even if OpenClaw gets compromised, it can’t reach your personal files, passwords, or other programs.

  1. Download Docker Desktop for Mac (free for personal use).
  2. Install it — open the downloaded DMG, drag Docker to Applications, and launch it.
  3. Verify it’s running — look for the whale icon in your menu bar. The first launch takes a minute to start up.
Apple Silicon required NemoClaw requires an M1 or newer Mac. Intel Macs are not supported.
  1. Download Docker Desktop for Windows.
  2. Install it — run the installer. When asked, choose the WSL 2 backend (recommended). You may need to restart your computer.
  3. Verify it’s running — look for the whale icon in your system tray. Open PowerShell and type docker --version to confirm.
  1. Install Docker:
    sudo apt update && sudo apt install docker.io -y
    sudo systemctl enable docker --now
    sudo usermod -aG docker $USER

    The last command lets you run Docker without sudo. Log out and back in for it to take effect.

  2. Verify: docker --version
iMessage requires a Mac iMessage only works with a Mac running BlueBubbles as a bridge. You’ll need access to a Mac somewhere on your network, even if OpenClaw itself runs on Windows. Follow the BlueBubbles setup below on that Mac first.
iMessage requires a Mac iMessage only works with a Mac running BlueBubbles as a bridge. You’ll need access to a Mac somewhere on your network, even if OpenClaw itself runs on Linux. Follow the BlueBubbles setup below on that Mac first.

Set Up iMessage for Your Agent

We’ll give your agent its own iMessage identity so your family can text it like a real person — separate from your personal Messages. This uses BlueBubbles, a free open-source iMessage bridge.

Here’s how the pieces fit together: Your Mac will run two user accounts side by side. The agent’s account runs BlueBubbles + Messages (the bridge). Your account (or Docker) runs OpenClaw (the brain). They talk to each other over a local connection. You never need to sit at the agent’s account after the initial setup.

  1. Create a macOS user for the agent. On your Mac, go to System Settings → Users & Groups and add a new standard user. Call it something like “Q” or “OpenClaw.” This user only exists to run BlueBubbles — OpenClaw itself runs under your own account.
  2. Create an Apple ID for your agent. Log into the new macOS user you just created (Apple menu → Log Out, then log in as the agent user). Then go to System Settings → Sign in with your Apple Account → Create Account. You have two options for the email:
    • Use an existing email you have lying around (a spare Gmail, Outlook, etc.)
    • Create a new @icloud.com email — tap “Don’t have an email address?” and Apple will let you pick a free @icloud.com address

    When it asks for a phone number, use your own — Apple allows the same number on multiple Apple IDs. The phone number is just for security verification, not the agent’s iMessage identity (that’s the email).

    Getting “Your account cannot be created”? Apple rate-limits account creation per device. Make sure you’re creating the account from the agent’s macOS user, not your own — Apple is more likely to allow it on a “fresh” user. If it still fails, wait an hour and try again, or try from appleid.apple.com instead (website only allows existing emails, not new @icloud.com addresses).

    This is free. Your family will text this email address via iMessage to reach the agent.

  3. Set up Messages. Still logged in as the agent user, open Messages. It should already be signed in with the Apple ID you just created. Send a test message to yourself to confirm it works.
  4. Download and install BlueBubbles. Still logged in as the agent user, download BlueBubbles from bluebubbles.app/install. You’ll see two DMG files — pick arm64 if you have an M1, M2, M3, or M4 Mac (most Macs from 2020 onward). Open the DMG and drag BlueBubbles to Applications.
  5. Launch BlueBubbles. macOS will block it with a warning that it “can’t verify the developer.” This is normal for apps downloaded outside the App Store — it just means the developers haven’t paid Apple’s $99/year fee to get the app “notarized.” To get past it: right-click (or Control-click) BlueBubbles in Applications and choose Open. You’ll see the same warning but now with an “Open Anyway” button. If you don’t see it, go to System Settings → Privacy & Security, scroll down, and click “Open Anyway” next to the BlueBubbles message. You only need to do this once.
    Is BlueBubbles safe? Yes. It’s open source (anyone can read the code), has been around since 2021, and is the most widely used iMessage bridge in the OpenClaw and Home Assistant communities. It runs entirely locally — your messages never pass through their servers. It’s the tool the OpenClaw docs officially recommend for iMessage.
  6. Grant permissions. BlueBubbles will ask for Full Disk Access. Go to System Settings → Privacy & Security → Full Disk Access and enable BlueBubbles. This lets it read the agent’s iMessage database.
  7. Set a server password. In BlueBubbles settings, set a strong password. This protects the agent’s iMessage account — don’t skip it.
  8. Enable the web API. In BlueBubbles Settings, toggle on the web API. It will show a local URL like http://localhost:1234 — note this URL and your password. You’ll need both when the OpenClaw wizard asks for iMessage configuration.
  9. Set BlueBubbles to launch at login. In BlueBubbles Settings, enable “Launch at startup.” Then go to System Settings → Battery → Options and enable “Prevent automatic sleeping when the display is off.”
  10. Switch back to your account. Go to Apple menu → Log Out and log back into your normal user. Important: Use fast user switching (System Settings → Control Center → Fast User Switching → show in menu bar) so both accounts stay logged in simultaneously. The agent’s account runs BlueBubbles in the background even while you’re using your own account.
BlueBubbles runs outside Docker BlueBubbles runs as a regular Mac app under the agent’s user account — not inside a container. OpenClaw (inside Docker on your account) connects to BlueBubbles over the local network using the URL and password from step 7.
Add the agent as a contact On everyone’s phone, add the agent’s Apple ID email as a contact named “Q” (or whatever you chose). Now you can text Q just like you’d text a family member. You can also add Q to group chats for family coordination.

Once BlueBubbles shows “Server Running” in its status bar, you’re ready to continue with the OpenClaw installation below. OpenClaw runs under your own user account (or in Docker) — not the agent’s account.

WhatsApp uses an unofficial bridge The connection uses Baileys, a community-maintained library. Meta could break compatibility at any time. It works well today but isn’t guaranteed long-term.

Install Ollama (Local AI)

Ollama is a free tool that runs AI models on your own computer. Your data stays on your machine — nothing is sent to the cloud.

Ollama runs outside Docker Ollama needs direct access to your GPU/CPU, so it runs on your computer as a regular app — not inside the container. OpenClaw (inside Docker) connects to Ollama over the network. We’ll set that up in the model routing step.
  1. Install Ollama:

    Download from ollama.com/download, open the DMG, and drag to Applications. Or install via terminal:

    curl -fsSL https://ollama.com/install.sh | sh

    Download the installer from ollama.com/download and run it. Ollama will start automatically.

    curl -fsSL https://ollama.com/install.sh | sh
  2. Download your AI model:
    ollama pull gemma4:e2b

    Gemma 4 E2B (Google) — only needs ~4 GB. Less capable than cloud models but fine for simple tasks on limited hardware.

    ollama pull gemma4:e4b

    Gemma 4 E4B (Google) — a Mixture of Experts model that only activates ~4B parameters per request, so it runs fast on 16 GB. Great balance of quality and speed.

    ollama pull qwen3.5:27b

    Qwen 3.5 27B — the community’s #1 pick. Scores 90% on PinchBench (the OpenClaw benchmark), within striking distance of the best cloud models. See the full Qwen story.

    ollama pull qwen3.5:27b

    Qwen 3.5 27B — community #1 pick, 90% PinchBench, uses ~20 GB leaving plenty of headroom on your 64 GB machine. See the full Qwen story.

    This download is a few GB — takes a minute or two depending on your internet.

  3. Start Ollama and test it:
    ollama serve

    Ollama now listens at http://localhost:11434. OpenClaw will auto-detect it. Since OpenClaw is running in Docker, you’ll need to point it to your host machine instead — we’ll configure that in the model routing step.

    Test that it works:

    ollama run gemma4:e2b "Hello, what can you do?"
    ollama run gemma4:e4b "Hello, what can you do?"
    ollama run qwen3.5:27b "Hello, what can you do?"

    If you get a response, your local AI is working. Press Ctrl+D to exit.

Hybrid setup You’ll configure OpenClaw to use your local model for sensitive data (health, finances, journals) and a cloud model for everything else. We’ll set up the routing rules in the model routing step after installation.

No prerequisites needed — the installer handles everything.

Step 2: Open a Terminal

Press Cmd + Space to open Spotlight, type Terminal, and hit Enter. A window with a command prompt will appear.

Click the Start menu and search for PowerShell. Right-click it and choose Run as Administrator. A blue window with a command prompt will appear.

Open your terminal emulator.

Step 3: Install OpenClaw

Step 3: Install NemoClaw

Now the main event. Paste this into your terminal and press Enter:

curl -fsSL https://openclaw.ai/install.sh | bash
irm https://openclaw.ai/install.ps1 | iex

This downloads the official setup script. It will install Node.js if you don’t have it and set OpenClaw to run automatically in the background. Takes a few minutes.

Now the main event. This single command downloads OpenClaw and starts it inside a Docker container:

docker run -d --name openclaw \
  -p 127.0.0.1:18789:18789 \
  --read-only \
  --tmpfs /tmp \
  -v ./openclaw-data:/app/data \
  -v ./openclaw-config:/home/node/.openclaw \
  ghcr.io/openclaw/openclaw:latest

Docker will download the OpenClaw image (a few hundred MB, takes a minute), then start it in the background. The container is read-only for security, with two writable folders: openclaw-data for runtime data and openclaw-config for your settings. Both are stored on your computer, not inside the container.

Verify it’s running:

docker ps

You should see openclaw in the list with a status of “Up” or “healthy.” If it’s not there, check what went wrong with docker logs openclaw. Don’t move to the next step until this is working.

Running OpenClaw commands with Docker Since OpenClaw is inside a container, you can’t run openclaw commands directly in your terminal. Prefix them with docker exec openclaw. For example:
# Instead of: openclaw devices list
# Run:
docker exec openclaw openclaw devices list

# Instead of: openclaw security audit --fix
# Run:
docker exec openclaw openclaw security audit --fix
This applies to any openclaw command you see in this guide or that the agent suggests in the chat.

For a fully hardened setup, see Docker Hardening.

Paste this into your terminal:

curl -fsSL https://www.nvidia.com/nemoclaw.sh | bash

Paste this into PowerShell:

irm https://www.nvidia.com/nemoclaw.ps1 | iex

This downloads the NemoClaw sandbox image (~2.4 GB). Takes a few minutes. NemoClaw wraps OpenClaw with network restrictions, credential isolation, and a privacy router that automatically keeps sensitive data on your machine.

Expect some troubleshooting NemoClaw is alpha software. Common issues: Docker not running, not enough RAM, and network policies blocking services you need. The GitHub Issues page is active and helpful.

Step 4: Set Up Your Agent

The installer launches a setup wizard automatically. It asks:

Open http://localhost:18789 in your browser. The dashboard will ask for a Gateway Token — OpenClaw generated one automatically. Find it by running:

cat ./openclaw-config/openclaw.json | grep token

Copy the token value, paste it into the Gateway Token field, leave the Password field blank (it’s optional), and click Connect.

You’ll land on the OpenClaw dashboard. There’s no setup wizard — instead, you configure everything from the sidebar. Here’s what to set up:

Run nemoclaw my-assistant connect to start the setup wizard. It asks:

  • Which messaging app? You’ll need to create a Telegram bot first. Open Telegram, search for @BotFather, send /newbot, and follow the prompts. Copy the token it gives you. You’ll need to register a phone number with Signal for the bot. The wizard walks you through connecting via Signal’s CLI bridge. You’ll need to create a bot in the Discord Developer Portal. Create an application, add a bot, and copy the token. The wizard will ask for your BlueBubbles server URL and password from the setup above. The wizard will show a QR code. Scan it with WhatsApp on your phone (Settings → Linked Devices → Link a Device).
  • Which AI provider? It will ask for an API key — a special password you get from the AI provider’s website. Go to your provider’s site (e.g. OpenAI, Anthropic, or xAI), create an account, and generate an API key. Copy it and paste it when the wizard asks. Choose “Ollama” or “Local” when the wizard asks. It will auto-detect Ollama at localhost:11434. No API key needed.
  • A security password for your OpenClaw setup.

Follow the prompts — it’s like setting up a new phone.

  1. Set up your AI provider first. You need to do this before anything else — even the “Help me configure a channel” button needs a working AI.

    In the left sidebar, click AI & Agents. You’ll need an API key — a special password from the AI provider’s website. Go to your provider’s site (e.g. OpenAI, Anthropic, or xAI), create an account, generate an API key, and paste it in.

    The dashboard UI defaults to OpenAI, but you need to point it to your local Ollama instead. Open this file in a text editor:

    ./openclaw-config/agents/main/agent/models.json

    Replace its contents with:

    {
      "providers": {
        "ollama": {
          "baseUrl": "http://host.docker.internal:11434/v1",
          "apiKey": "ollama",
          "models": [{ "id": "gemma4:e2b", "name": "Gemma 4 E2B" }]
        }
      }
    }
    {
      "providers": {
        "ollama": {
          "baseUrl": "http://host.docker.internal:11434/v1",
          "apiKey": "ollama",
          "models": [{ "id": "gemma4:e4b", "name": "Gemma 4 E4B" }]
        }
      }
    }
    {
      "providers": {
        "ollama": {
          "baseUrl": "http://host.docker.internal:11434/v1",
          "apiKey": "ollama",
          "models": [{ "id": "qwen3.5:27b", "name": "Qwen 3.5 27B" }]
        }
      }
    }

    Then restart the container:

    docker restart openclaw

    Refresh the dashboard — you should see your local model in the top bar dropdown instead of GPT-5.4. host.docker.internal is Docker’s way of saying “the computer this container is running on” — that’s where Ollama lives.

  2. Test the AI. Go back to Chat in the sidebar and type a message like “Hello, what can you do?” If you get a response, your AI is working. If you get an API key error, double-check step 1.
  3. Set up your messaging channel. Now click Communications in the sidebar (under Settings), or type “Help me configure a channel” in the chat. Select Telegram. You’ll need a bot token — open Telegram, search for @BotFather, send /newbot, follow the prompts, and paste the token it gives you. Select Signal. You’ll need to register a phone number for the bot and connect via Signal’s CLI bridge. Select Discord. You’ll need a bot token from the Discord Developer Portal — create an application, add a bot, and paste the token. Select BlueBubbles/iMessage. Enter the server URL and password from the BlueBubbles setup above. Select WhatsApp. A QR code will appear — scan it with WhatsApp on your phone (Settings → Linked Devices → Link a Device).
  4. Save your gateway token in your password manager. If you ever need it again, it’s in ./openclaw-config/openclaw.json.

Step 5: Verify Everything Works

Run these two commands to make sure everything is healthy and secure:

openclaw doctor
openclaw security audit --fix

Run the health check:

nemoclaw my-assistant health

Think of this like a virus scan — run it regularly, ideally once a week.

Running on a dedicated machine? Consider creating a separate user account just for OpenClaw. This limits what the agent can access if something goes wrong. On Mac, go to System Settings → Users & Groups and add a new standard user. On Windows, go to Settings → Accounts → Other users and add a new standard user. On Linux: sudo adduser openclaw

Your First Conversation

Open your messaging app and send: “What day is it?” If OpenClaw responds, you’re connected. Try: “What’s the weather?” and “Set a reminder for tomorrow at 9am.”

If something isn’t working, run openclaw doctor. Most common: messaging bridge not connected.

Connecting Services

Start with these five: Weather, Calendar (read-only), Reminders, Web search, Smart home (lights/thermostat only).

Be careful what you install The ClawHavoc attack planted 1,467 malicious skills before being cleaned up — one had 340,000+ installs and silently exfiltrated credentials. ClawHub added a verified-publisher program (blue checkmark) in March 2026 and now runs automated scans on every submission, but most skills are still unverified and review is largely automated. Filter to verified publishers with public source code, check the rating and install count, and run openclaw security audit --fix after every install.

Setting Up Model Routing

You’re using cloud models only, so no routing needed — OpenClaw sends everything to the provider you chose in the setup wizard. You can always add a local model later and come back to this step.

You’re running everything locally. Open your openclaw.yaml and set your default model:

models:
  default: ollama/gemma4:e2b
models:
  default: ollama/gemma4:e4b
models:
  default: ollama/qwen3.5:27b

Since OpenClaw is running in Docker, point it to Ollama on your host machine:

  providers:
    ollama:
      host: http://host.docker.internal:11434

The hybrid setup routes sensitive data to your local model and everything else to a fast, cheap cloud model. Add this to your openclaw.yaml:

models:
  default: xai/grok-4.1-mini           # Cheap cloud for most tasks
  routing:
    - match:
        categories: [health, finance, journal, personal]
      provider: ollama/gemma4:e2b       # Sensitive data stays local
      force_local: true
models:
  default: xai/grok-4.1-mini           # Cheap cloud for most tasks
  routing:
    - match:
        categories: [health, finance, journal, personal]
      provider: ollama/gemma4:e4b       # Sensitive data stays local
      force_local: true
    - match:
        keywords: [complex, analyze, plan, research]
        fallback_on_failure: true
      provider: openai/gpt-5.4         # Hard tasks get a premium model
models:
  default: ollama/qwen3.5:27b          # Local for most tasks (free)
  routing:
    - match:
        categories: [health, finance, journal, personal]
      provider: ollama/qwen3.5:27b     # Sensitive data never leaves
      force_local: true
    - match:
        keywords: [complex, analyze, plan, research]
        fallback_on_failure: true       # If local struggles, escalate
      provider: openai/gpt-5.4         # Cloud for hard tasks

Add this to connect to Ollama on your host machine:

  providers:
    ollama:
      host: http://host.docker.internal:11434

This keeps health data, financial info, and personal journals on your machine while using cloud models for everything else. See the full routing reference for more options.

Locking the Front Door

By default, OpenClaw might accept connections from anyone on your network — or worse, the internet. These settings lock it down. Open your OpenClaw configuration file (openclaw.yaml) in a text editor:

You’ll find it at ~/.openclaw/openclaw.yaml. Open it with: open -e ~/.openclaw/openclaw.yaml

You’ll find it at %USERPROFILE%\.openclaw\openclaw.yaml. Open it with Notepad or your preferred text editor.

You’ll find it at ~/.openclaw/openclaw.yaml. Open it with: nano ~/.openclaw/openclaw.yaml

1. Only accept local connections. Add this line to your config file:

gateway_host: 127.0.0.1

This tells OpenClaw: “Only accept connections from this computer. Ignore everyone else.”

2. Disable network broadcasting. By default, OpenClaw announces itself to other devices on your Wi-Fi (like AirDrop). Turn this off by adding:

disable_bonjour: true

3. Need remote access? If you want to reach OpenClaw from your phone or another computer, use Tailscale (free for personal use). It creates a private, encrypted tunnel between your devices — much safer than opening OpenClaw to the internet. Never expose OpenClaw’s port directly.

4. Credential hygiene.

Managing Costs

OpenClaw itself is free. The AI it talks to is not (unless you run local models). Each task triggers 5–10 API calls, so costs add up faster than you’d expect. Here’s what real usage looks like:

UsageGrok mini / HaikuGPT-5.4 / SonnetOpusLocal (Qwen)
Light (a few tasks/day)$3–8/mo$15–30/mo$50–100/moFree
Moderate (daily, 2–4 hrs)$10–20/mo$40–80/mo$200+/moFree
Heavy (automated 24/7)$30–60/mo$100–200/mo$500+/moFree

This is why model routing matters — use a cheap model for 90% of tasks and only escalate to a premium model when it actually matters. One community member cut their bill from $200 to under $20/month this way.

Keeping It Updated

170+ security vulnerabilities found in 6 months. 13 new CVEs were patched in April 2026 alone, including a CVSS 8.7 privilege escalation. Keeping OpenClaw updated is not optional — treat it like updating your phone.

To update:

openclaw update

Or rerun the install script — it detects the existing installation and updates it.

To update: Docker doesn’t auto-update. Pull the new image and restart:

docker pull ghcr.io/openclaw/openclaw:latest
docker stop openclaw && docker rm openclaw
docker run -d --name openclaw \
  -p 127.0.0.1:18789:18789 \
  --read-only --tmpfs /tmp \
  -v ./openclaw-data:/app/data \
  -v ./openclaw-config:/home/node/.openclaw \
  ghcr.io/openclaw/openclaw:latest

Your data is safe — it lives in the ./openclaw-data folder, which survives container replacement.

To update:

nemoclaw update

This pulls the latest NemoClaw and OpenClaw images and restarts the sandbox.

Regular maintenance:

What to Do With It

Detailed guidance for each use case, organized from lowest risk to highest. Start at the top.

LOW RISK Start Here

Meal Planning & Recipes

Weekly dinner plans, grocery lists by aisle, recipes from what’s in your fridge.

Tip: Always double-check allergy handling. Use draft mode for grocery delivery orders.

Travel Planning

Day-by-day itineraries, flight monitoring, loyalty point tracking, rainy-day backups.

Tip: Verify restaurants and addresses — AI can confidently recommend places that don’t exist.

Knowledge Base

Searchable library of articles, notes, and highlights. “What was that blue light article?”

Tip: Use local AI for personal notes. Keep original sources.

Language Learning

Always-available conversation partner with grammar corrections and custom flashcards.

Tip: Don’t rely on it for medical or legal translations.

Home Maintenance

Tracks repairs, paint colors, warranty dates, contractor contacts. Seasonal reminders.

Tip: Don’t store alarm codes in the agent’s memory.

Podcast & Reading Curation

Triages subscriptions, sends daily “three things worth your time.” Learns your taste.

Tip: Process locally if uncomfortable with interest profiling.

Pet Care

Vet appointments, medication schedules, vaccination records, food reorder reminders.

Tip: Symptoms are notes for your vet, not diagnoses.

Hobby & Project Tracker

Where you left off, parts lists, reference links, seasonal task reminders.

Tip: Standard precautions sufficient.

LOW-MEDIUM RISK Some Caution

Workout Coach

LOW-MEDIUM 3–5 hrs/week

Periodized training plans that adapt: “I’m sore” triggers active recovery. Pulls watch data.

  • Training advice without medical context could cause injury
  • Local model for fitness data. Don’t follow AI for rehab.

Research & Deep Dives

LOW-MEDIUM 1–2 hrs/day

Structured briefings with sources for big decisions. “Research solar panel economics for my area.”

  • Malicious websites can contain hidden instructions that trick the agent into doing things you didn’t ask for
  • AI can present made-up information as fact — always verify important findings
  • Run the browser in an isolated environment. Cross-reference important findings yourself.

Shopping & Deals

LOW-MEDIUM 2–4 hrs/week

Price comparison, review filtering, wishlist monitoring, sale alerts.

  • Malicious product pages can contain hidden instructions that trick the agent
  • Never give credit card credentials. Verify prices yourself.

MEDIUM RISK Personal Data

Calendar & Scheduling

MEDIUM 15–30 min/day

Finds free slots, respects preferences, daily agenda briefings, conflict warnings.

  • Calendar reveals location, habits, social circle
  • Write access risks accidental changes
  • Start read-only. Require approval before changes.

Family Coordination

MEDIUM 30–60 min/day

Multi-calendar views, reminders, shared grocery lists, pickup coordination.

  • Children’s data is especially sensitive and legally protected
  • Read-only for family. Children’s data local only. Confirm before sending on behalf.

Photo Organization

MEDIUM 3–5 hrs/month

Groups by events, tags people/places, finds duplicates, instant search.

  • Photos contain EXIF (location) and biometric data (faces)
  • Process locally. Strip EXIF before any cloud processing.

Journaling

MEDIUM 15–30 min/day

Text thoughts throughout the day, compiled into structured entries. Mood patterns over time.

  • Journal entries are the most sensitive data you could hand an AI
  • Local model only. Encrypt at rest. Review memory files periodically.

Meeting Notes

MEDIUM 30–60 min/meeting

Transcription, key point extraction, action items. Great for doctor visits.

  • Consent requirements vary by jurisdiction
  • Transcribe on your own machine first (using a tool called Whisper) rather than sending audio to the cloud. Always get consent. Review action items before acting on them.

HIGH RISK Proceed With Care

Email Digest

HIGH 30–45 min/day

Prioritized morning summary. “Two emails need replies, three packages shipping.”

  • Email contents travel to external AI
  • Spam or scam emails can contain hidden text that tricks the agent into doing something harmful
  • Give it read-only email access — it can read but never send or delete. Filter out unknown senders. Consider a local AI model to keep email data on your machine.

Smart Home

HIGH 15–30 min/day

Lights, thermostat, speakers via text. “I’m heading to bed” triggers a scene.

  • Compromised agent could unlock doors or disable cameras
  • Never connect locks or security systems. PIN for anything security-related. Separate hub as buffer.

Health Tracking

HIGH 30–60 min/day

Wearable insights, habit correlation, weekly wellness summaries.

  • Health data is legally protected and among the most sensitive categories
  • Cloud processing risks insurance profiling
  • Local AI only. Never cloud. Never act without consulting a doctor. Read-only access.

VERY HIGH RISK Expert Territory

Personal Finance

VERY HIGH 2–3 hrs/week

Transaction categorization, weekly spending reports, subscription tracking, unusual charge alerts.

  • Financial data breach exposes entire spending history
  • Cloud AI logging risk
  • Read-only bank access only. Never grant transfers.
  • Local AI only. Or export CSV and process offline.

How OpenClaw Compares

The Claude Ecosystem vs OpenClaw

Anthropic has four pieces that overlap with what OpenClaw does:

OpenClaw is like texting an assistant with keys to many rooms. The Claude stack is like texting a specialist at your desk who happens to have a remote control for your laptop. The specialist knows more about what’s in front of them; the assistant can go to more places and works with any model.

OpenClaw wins: 7+ messaging platforms, always-on, any AI model, 50+ life-service integrations, no subscription required (just an API key).

Claude stack wins: security managed by Anthropic, full computer use on your desktop, no DIY infrastructure, official mobile approval flow.

DimensionClaude EcosystemOpenClaw
Chat from phoneChannels (with mobile approval relay) + DispatchNative (7+ platforms)
Desktop agentCowork with computer use (Pro/Max)Not a focus
Background jobsDispatch (can launch Claude Code sessions)Cron, skills, custom workflows
SecurityManaged by AnthropicDIY; 170+ vulnerabilities in 6 months
AI modelsClaude onlyAny (30+ providers + local)
Data sovereigntyData to AnthropicYou choose (cloud or fully local)
Life integrationsLimited50+ services
Cost modelSubscription, plus separate metered agent credits from June 15Free software + your choice of API or local model

When to Use Which

Claude Code + Channels for developers who want to text their dev environment and get tool-approval prompts on their phone. Cowork + Dispatch for non-technical desktop work, especially since Cowork added computer-use for Pro/Max in April. OpenClaw for automating your life across messaging apps with any AI model — including local ones the Claude stack can’t touch.

Many people use both Claude for coding/desk work, OpenClaw for the home, calendar, messaging, and smart-home side of life. The metered-credit shift on June 15 makes “use Claude through Claude products, use OpenClaw with cheaper/local models” the path of least resistance.

vs Hermes Agent

The other open-source personal-agent framework worth taking seriously in May 2026 is Hermes Agent from Nous Research. It launched in February with a different architectural bet — a closed learning loop, agent-curated memory, secure-by-default sandboxing — and as of mid-May has zero published CVEs against OpenClaw’s 170+. It’s the most direct competitor.

I wrote a full, opinionated head-to-head: OpenClaw vs Hermes Agent — The Blunt Comparison →

Security for Experts

If you followed Section 3, you’re already reasonably secure. This is for sysadmins and security professionals.

10 Security Principles

  1. Start small: Low-risk use cases first
  2. Least privilege: Minimum permissions
  3. Human in the loop: Approve all changes
  4. Defense in depth: Container + firewall + auth + scoped tokens
  5. Assume compromise: Limit blast radius
  6. Local AI for sensitive data: Ollama + Qwen 3.5 27B
  7. Regular audits: Weekly security audit, monthly memory review
  8. Stay updated: 170+ CVEs in 6 months; 13 new in April 2026
  9. Vet all skills: Never install unaudited skills
  10. Monitor costs: Spending limits detect abuse

Verification Checklist

  1. openclaw security audit --fix — zero findings
  2. Port scan — 18789 not externally accessible
  3. Gateway rejects unauthenticated WebSocket
  4. API spending limits set
  5. mDNS disabled (OPENCLAW_DISABLE_BONJOUR=1)
  6. Test benign prompt injection for guardrails
  7. SOUL.md/MEMORY.md free of unexpected content
  8. allow_url_actions: false in openclaw.yaml
  9. Version 2026.5.5+ (all CVEs through April 2026 patched)
  10. Non-localhost WebSocket rejected
  11. Docker: non-root, --cap-drop=ALL, --read-only

Critical Attack Vectors

1. The “Lethal Trifecta”

System Access + Execution Power + Untrusted Ingestion. Most setups have all three. Never combine in a single agent.

2. Time-Shifted Memory Poisoning

Malicious SOUL.md/MEMORY.md inputs that “detonate” later. Treat memory files as code. File integrity monitoring.

3. Log Poisoning → Prompt Injection

Malicious content in logs read by the agent. Make logs write-only from agent perspective.

4. Container Escape via API

CVE-2026-25253 WebSocket hijacking works even inside Docker.

5. Localhost Trust (ClawJacked)

Gateway exempted localhost from rate limiting. Browser JS could brute-force at hundreds/second.

Docker Hardening

services:
  openclaw:
    image: ghcr.io/openclaw/openclaw:2026.5.5
    read_only: true
    cap_drop: [ALL]
    security_opt:
      - no-new-privileges:true
      - seccomp=openclaw-seccomp.json
    user: "1000:1000"
    ports: ["127.0.0.1:18789:18789"]
    volumes:
      - ./config:/app/config:ro
      - ./data:/app/data
    mem_limit: 2g
    cpus: 1.0
    networks: [openclaw-internal]

  egress-proxy:
    image: nginx:alpine
    networks: [openclaw-internal, external]

networks:
  openclaw-internal:
    internal: true
  external:
    driver: bridge

Never mount docker.sock. Never use --network=host.

Incident Response

  1. Immediately: Disconnect from network
  2. Within 1 hour: Rotate ALL credentials
  3. Audit: Check logs for unexpected system.run
  4. Review: SOUL.md/MEMORY.md for injected instructions
  5. Scan: Check for backdoor skills
  6. Report: GitHub Security Advisory

Scale of the problem: 135K+ exposed instances across 82 countries (April 2026 scan), 63% of which run without any authentication. ClawHavoc planted 1,467 malicious skills (one with 340K+ installs) before ClawHub added publisher verification. Meta banned OpenClaw from corporate devices. Palo Alto called it “the potential biggest insider threat of 2026.”

Deep Dives

Reference material linked from the main guide.

Hardware for Local Models

Local inference is memory-bandwidth-bound.

LPDDR5X shortage Prices up 50% through mid-2026. 26–39 week lead times.

Mac Studio M4 Max (64GB) — ~$2,700

546 GB/s bandwidth, nearly 2x M5 Pro. The sweet spot. Skip 128GB unless running 70B+ models.

Availability 64GB+ configs backordered to June. RTX 3090 and Strix Halo use different memory, unaffected.

Used RTX 3090 Build — ~$900–1,200

Community “value king.” 24GB VRAM, ~50 tokens/sec on Qwen 3.5 27B. Noisy and power-hungry.

AMD Strix Halo — $1,499–2,700+

Up to 128GB unified memory. ~273 GB/s bandwidth (lower than M4 Max). Good non-Apple option.

Two-Machine Setup

Cheap machine as always-on server. Powerful machine runs Ollama. OpenClaw routes over home network.

Minimum Viable

16GB+ RAM can run Qwen 2.5 Coder 14B. Enough to experiment.

VRAM guide: 8GB = 7-8B models. 12-16GB = 14B. 20-24GB = 27-35B (sweet spot). 64-128GB unified = very large.

The Qwen Story

Qwen (“chwen”) by Alibaba Cloud. Timeline: April 2023 closed beta → Sept 2023 public → June 2024 Qwen 2 → April 2025 Qwen 3 (Apache 2.0, 36T tokens, 119 languages) → Feb 2026 Qwen 3.5 (27B matches GPT-5 Mini on SWE-bench, 600M+ downloads, 170K+ derivative models) → April 16, 2026 Qwen 3.6, with a dense 27B that hits 77.2% on SWE-bench (best dense coding model on the leaderboard) and a 35B-A3B MoE variant tuned for consumer GPUs. Ollama shipped native Qwen 3.6 support on launch day, including a qwen3.6 tag that handles quantization automatically.

Why so good: $53B Alibaba AI investment. Apache 2.0 as ecosystem strategy. Relentless iteration.

BenchmarkQwen 3.5 27BGPT-5.4Opus 4.6
PinchBench90.0%90.5%93.3%
SWE-bench72.4%80.9%
IFEval95.0%
HumanEval92.7%

Pros: Free forever (Apache 2.0). Runs on consumer hardware (~20GB). Privacy by default. 90% of frontier. Massive ecosystem.

Cons: Ollama tool-calling bugs (set reasoning: false). Can overthink/loop. Weaker on complex multi-step reasoning. 128K context (vs 1M+ GPT-5.4).

Local vs cloud nuance Local Qwen = math on your hard drive, nothing leaves. Cloud Qwen (via Alibaba’s hosted API) sends each request to the provider’s servers under their data-handling terms. Fundamentally different — local is the only path where nothing leaves your machine.
ollama pull qwen3.5:27b && ollama serve
export OLLAMA_FLASH_ATTENTION=1
export OLLAMA_CONTEXT_LENGTH=65536

Local Model Options

ModelBest ForVRAM
Qwen 3.6 27BBest all-around (new May 2026 pick)~20 GB
Qwen 3.6 35B-A3B MoEFast on consumer GPUs (only ~3B active)~22 GB
Qwen 3.5 27BStable fallback, still excellent~20 GB
Qwen 2.5 Coder 32BCoding~20 GB
Llama 4 ScoutGeneral purpose16+ GB
DeepSeek-R1 32BReasoning~20 GB
Qwen 2.5 Coder 14BLimited hardware~10 GB

PinchBench Rankings

PinchBench: 23 real tasks, 75+ models (May 2026 snapshot).

#ModelScore
1Claude Opus 4.7~95% (est.)
2Claude Opus 4.693.3%
3Qwen 3.6 27B~92% (est.)
4Trinity Large (Arcee AI)91.9%
5GPT-5.490.5%
6Qwen 3.5 27B90.0%
7MiniMax M2.789.8%
8Claude Haiku 4.589.5%
9Qwen 3.5 397B MoE89.1%
10Nemotron 3 Super 120B88.6%

What changed in May: Opus 4.7 (April 16) and Qwen 3.6 27B (also April 16) reshuffled the top of the list. Qwen 3.6 hit 77.2% SWE-bench as the best dense coding model and is the new community pick if you can spare ~20 GB VRAM — see the Qwen story above.

Model Routing Configuration

models:
  default: ollama/qwen3.5:27b

  routing:
    - match:
        categories: [health, finance, journal, personal]
      provider: ollama/qwen3.5:27b
      force_local: true

    - match:
        keywords: [complex, analyze, plan, research, compare]
        fallback_on_failure: true
      provider: openai/gpt-5.4

    - match:
        keywords: [critical, important, difficult]
        manual_escalation: true
      provider: anthropic/claude-opus-4-7

  providers:
    ollama:
      host: http://192.168.1.50:11434
    openai:
      api_key: ${OPENAI_API_KEY}
    anthropic:
      api_key: ${ANTHROPIC_API_KEY}

Routes to Qwen by default (free), forces sensitive data local, escalates hard tasks to GPT-5.4, reserves Opus for explicit request. ~$10–15/month.

Steinberger’s Perspective

Based on public statements — blog (Feb 15, 2026), Pragmatic Engineer podcast, Lex Fridman #491, Fortune/TechCrunch/CNBC interviews.

His Philosophy

“An AI that actually does things.” Vision: “an agent that even my mum can use.” He calls this “agentic engineering” and frames AI agents as a learned skill: “You pick up the guitar — you’re not going to be good at the guitar in the first day.”

Where He’d Push Back

  1. Absolutist prohibitions: He favors graduated trust over blanket bans.
  2. Model routing: A core design principle of OpenClaw.
  3. Empowerment first: Excitement before caution. Fair — but this guide errs on safety.

Uncomfortable Truths

ClawHavoc response was slow. He joined OpenAI Feb 14, 2026 and transferred OpenClaw to an independent foundation with OpenAI backing. The foundation board still hasn’t published its governance documents as of mid-May 2026, and Steinberger himself hasn’t posted a new public update on OpenClaw’s direction since the February announcement — an unusual silence for a maintainer who used to write weekly. One maintainer’s warning still stands: “If you can’t run a command line, this is too dangerous.”

The Balanced Take

This guide favors security researchers (Microsoft, Palo Alto, Cisco) over the creator’s optimism. That’s right for safety. But Steinberger’s graduated-trust approach is valid for people who understand their risks. The Feb-to-May 2026 stretch — Google account bans, Anthropic’s metered-credit pivot, 13 new CVEs in a single month — vindicates the cautious framing more than the optimistic one.

Sources