Key features:
Uses docker model run syntax.
Supports Dynamic GGUFs (handling quantization overhead).
Cross-platform support.
Example usage: docker model run ai/gpt-oss:20B
Happy to answer questions about the implementation or the quantization methods used.
ericcurtin•1h ago