With this project you can hot-swap entire large models (32B) on demand.
Its great for:
Serverless AI Inference
Robotics
On Prem deployments
Local Agents
And Its open source.
Let me know if anyone wants to contribute :)
With this project you can hot-swap entire large models (32B) on demand.
Its great for:
Serverless AI Inference
Robotics
On Prem deployments
Local Agents
And Its open source.
Let me know if anyone wants to contribute :)