https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-26...
Last time I ported a TTS model to Rust using candle, this time I ported an ASR model to Rust with burn.
I was able to lean on the wgpu backend to get the model running in the browser after sharding it.
Here is the HF Space:
https://huggingface.co/spaces/TrevorJS/voxtral-mini-realtime
and here are the model weights (q4 + tokenizer):
https://huggingface.co/TrevorJS/voxtral-mini-realtime-gguf
and the code:
https://github.com/TrevorS/voxtral-mini-realtime-rs
Didn't have a chance to use agent teams with this project, maybe next one! :)