2021-01-10, 13:55
Hi there. My name is Nickolay, I've been working on speech technology for many years. I've implemented many things in CMU Pocketsphinx, Kaldi and various other speech libraries. These days I develop speech recognition toolkit called Vosk https://github.com/alphacep/vosk-api. There are many speech libraries these days, some have nice features, some hard to use, here are some unique things:
I personally use Vosk on RPi3B with Respeaker 4-mic Array hat controlling Kodi. We have just released an updated version 0.3.17 with a great speed improvements specifically for small devices.
In this topic I will post news about voice control addon we develop for Kodi. You are also welcome to ask me anything about Vosk or speech technology in general.
- Vosk is an offline open source speech recognition toolkit. It enables speech recognition models for 17 languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino.
- Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification.
- Speech recognition bindings implemented for various programming languages like Python, Java, Node.JS, C#, C++ and others.
- Vosk works on Raspberry Pi3 and Pi4 but it also scales from mobile phones to big callcenter cluster. Vosk can also create subtitles for movies, transcription for lectures and interviews.
I personally use Vosk on RPi3B with Respeaker 4-mic Array hat controlling Kodi. We have just released an updated version 0.3.17 with a great speed improvements specifically for small devices.
In this topic I will post news about voice control addon we develop for Kodi. You are also welcome to ask me anything about Vosk or speech technology in general.