Everything runs on your device. Your messages are not uploaded and there is no account. The model downloads once (hundreds of MB) the first time and is cached by your browser for next time. A WebGPU browser is required.
Private AI Chat
Chat with an AI model that runs entirely on your device, in your browser. It suits quick drafting, brainstorming, rewriting a note, or asking a question you would rather keep off a server.
Smaller models like TinyLlama 1.1B load fastest, while larger ones such as Llama 3.2 3B or Phi-3 mini answer more strongly but take a bigger first download. Replies stream in token by token, generated on your own hardware. The conversation stays in this tab for the current session and is not saved or synced anywhere.
Frequently Asked Questions
Does my chat leave my device?
No. The model runs in your browser and your messages stay on your device.