That's not a constraint. That's the point.
Nemilia is built around the idea of AI sovereignty — your agents, your workflows, your data, your keys, all running on your machine. Nothing phoned home, nothing locked to a platform, nothing that breaks when a SaaS company changes its pricing.
What you get: - Build custom agents with their own roles, personalities, system prompts, and model overrides - Design custom workflows with drag-and-drop pipeline builder — chain agents in any order, set HITL review checkpoints, run parallel DAG execution - MCP tool execution — build agents that call external tools in real time (filesystem, databases, APIs, code execution) over streamableHttp or SSE - Document RAG with hybrid Transformers.js vector embeddings + BM25 keyword search, fully client-side - WebGPU offline mode — run Llama, Qwen, Phi, Gemma entirely in the browser, no API key, no internet - Connect any provider: OpenAI, Anthropic, Groq, Gemini, Mistral, DeepSeek, Ollama, LM Studio — swap anytime, your workflows follow - API keys encrypted AES-256-GCM, never leave your browser - Workspace syncs to a real folder on disk as plain JSON and Markdown — version control it, edit it in VS Code, take it anywhere
One file. Any browser. Your AI team.
Getting started takes about 60 seconds: download the file, open it, paste any API key, point it at LM Studio/JAN, or load up a model directly into your browser via WebGPU.
Released it publicly last week. Would genuinely love feedback.
Site: https://nemilia.com GitHub: https://github.com/luislopez1212/nemilia
P.S. Nemilia means "to think, to remember, to imagine" in Nahuatl, the language of the Aztecs. Felt like the right name for something that does all three.