Troubleshooting

Most problems come down to one of three things: the server isn’t running, Ollama isn’t reachable, or the context is mis-tuned. Start with the quick reference, then jump to the section you need.

Quick reference

Problem                             | First thing to check
Chat says server offline            | Open the tray, click Start Server, then check Status
VS Code command missing             | Reinstall the VSIX and reload VS Code
Server starts but the model is slow | First response loads the model; use Resume VRAM to warm it
GPU needed for another app          | Click Pause VRAM
Attachments don’t appear            | Drop them on the chat/composer area, or use the + button
Test command blocked                | Add the command name in the tray under Tool Config
Responses look truncated            | Set OLLAMA_CONTEXT_LENGTH=16384 and restart Ollama
Ollama errors                       | Restart Ollama, then run tray Status again

The chat says the server is offline

  1. Open the Riverforge Tray and click Start Server. Wait for the status log to show online.
  2. Check the header dot in VS Code — it turns green when the server is reachable. The extension reconnects on its own.
  3. Still red? Confirm the server is up from a terminal:
    PS> (Invoke-WebRequest -UseBasicParsing http://127.0.0.1:8765/ready).StatusCode  # 200
  4. If the tray also reports the server as offline, start the server again from the tray. If Ollama is down, restart it from its tray icon.
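
If you would rather wait for the server than re-run the check in step 3 by hand, the same probe can be wrapped in a small retry loop. The following is a minimal sketch in Python, not part of Riverforge: the function names `probe_ready` and `wait_until_ready`, the retry count, and the delay are illustrative assumptions. Only the URL and the expected 200 status come from step 3.

```python
import http.client
import time
import urllib.request

READY_URL = "http://127.0.0.1:8765/ready"  # endpoint taken from step 3


def probe_ready(url=READY_URL, timeout=2.0):
    """Return True if the URL answers with HTTP 200, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, or a non-2xx reply all count as "not ready".
        return False


def wait_until_ready(probe=probe_ready, attempts=10, delay=1.0, sleep=time.sleep):
    """Call probe up to `attempts` times, pausing `delay` seconds between tries."""
    for attempt in range(attempts):
        if probe():
            return True
        if attempt < attempts - 1:
            sleep(delay)  # no pause after the final attempt
    return False
```

Calling `wait_until_ready()` after clicking Start Server returns True as soon as the probe succeeds, or False if every attempt fails. In the second case, continue with step 4.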

The VS Code command is missing

The installer can occasionally land the extension in a different VS Code profile than the one you use. Install the bundled VSIX into your window by hand: Extensions view → Install from VSIX… → pick riverforge-vscode.vsix → reload. After an upgrade, if the commands still look old, reinstall the latest VSIX. See Installation.

Responses are slow

Output looks cut off

By default Ollama serves a 2048-token context and silently truncates anything longer. Set OLLAMA_CONTEXT_LENGTH=16384 on the Ollama service and restart it. Riverforge sets this for you during install — this only bites on a hand-tuned or source setup. See Models & Hardware.

“CUDA out of memory” / Ollama crashes

  1. Click Pause VRAM, close other GPU-heavy apps, then Resume VRAM.
  2. Confirm OLLAMA_CONTEXT_LENGTH=16384 and OLLAMA_MAX_LOADED_MODELS=3, then restart Ollama.
  3. Stick with the default model unless you’re deliberately trying a larger one.
  4. If a large model still won’t load, switch back to the 4B default — it’s sized for an 8 GB card.
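
The check in step 2 can be scripted. This is a rough Python sketch under one important assumption: it reads the environment of the process that runs it, which matches what Ollama sees only if the variables were set at user or system level and the shell was reopened afterwards. The helper name `check_ollama_env` is hypothetical; the variable names and values are the ones from step 2.

```python
import os

# Values recommended in step 2 above.
RECOMMENDED = {
    "OLLAMA_CONTEXT_LENGTH": "16384",
    "OLLAMA_MAX_LOADED_MODELS": "3",
}


def check_ollama_env(env=None, recommended=RECOMMENDED):
    """Return {name: (actual, expected)} for every setting that differs."""
    env = os.environ if env is None else env
    mismatches = {}
    for name, expected in recommended.items():
        actual = env.get(name)  # None when the variable is not set
        if actual != expected:
            mismatches[name] = (actual, expected)
    return mismatches
```

An empty result from `check_ollama_env()` means both settings match. Any entry in the result names a variable to fix before restarting Ollama.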

Riverforge can’t see Ollama

It edits files but the tests never pass

Windows Defender flags Ollama

Defender occasionally quarantines ollama.exe after an update. Add %LOCALAPPDATA%\Programs\Ollama\ to your exclusions list, then restore ollama.exe from quarantine or reinstall Ollama.
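
To tell whether the executable has actually gone missing, you can look for it in that folder. The Python sketch below assumes ollama.exe sits directly inside %LOCALAPPDATA%\Programs\Ollama\, which the text above implies but does not state. The helper names `ollama_exe_path` and `ollama_is_present` are hypothetical.

```python
import os
from pathlib import PureWindowsPath


def ollama_exe_path(env=None):
    """Build the assumed ollama.exe path, or None if LOCALAPPDATA is unset."""
    env = os.environ if env is None else env
    local_app_data = env.get("LOCALAPPDATA")
    if not local_app_data:
        return None
    # Assumption: the executable lives directly in the Ollama install folder.
    return PureWindowsPath(local_app_data) / "Programs" / "Ollama" / "ollama.exe"


def ollama_is_present(env=None):
    """Return True only if the assumed ollama.exe path exists on disk."""
    path = ollama_exe_path(env)
    return path is not None and os.path.exists(path)
```

If `ollama_is_present()` returns False on a machine where Ollama was installed, the file has probably been quarantined or removed, and the restore or reinstall step applies.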

Still stuck?

The tray’s Status button gives you a one-look snapshot of the server, Ollama, the loaded model and your tools — start there. For a deeper look, Riverforge keeps logs in your data folder; open it with the tray’s Open Data Folder button. Those logs are the most useful thing to include if you report a problem.