Cohere launches an open-source voice model specifically for transcription

Relatively light at just 2 billion parameters, the model is meant for use with consumer-grade GPUs for those who want to self-host it. It currently supports 14 languages.

Mistral releases a new open-source model for speech generation

Mistral’s new speech model can run on a smartwatch or a smartphone.