RustyLLM

LLM inference compiled to WebAssembly — runs entirely in your browser

WASM

Load Model

Drop a GGUF file here or click to browse
Recommended: nomic-embed-text, all-MiniLM-L6-v2 (<500 MB)

The server must send Access-Control-Allow-Origin: *.
Many models on Hugging Face GGUF CDN support CORS.

Loading…
Model loaded:
A
B
C
D
Ctrl+Enter
Ctrl+Enter
Copied!