Offline voice control with Vosk speech recognition
#1
Hi there. My name is Nickolay, I've been working on speech technology for many years. I've implemented many things in CMU Pocketsphinx, Kaldi and various other speech libraries. These days I develop speech recognition toolkit called Vosk https://github.com/alphacep/vosk-api. There are many speech libraries these days, some have nice features, some hard to use, here are some unique things:
 
  • Vosk is an offline open source speech recognition toolkit. It enables speech recognition models for 17 languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino.
  • Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification.
  • Speech recognition bindings implemented for various programming languages like Python, Java, Node.JS, C#, C++ and others. 
  • Vosk works on Raspberry Pi3 and Pi4 but it also scales from mobile phones to big callcenter cluster. Vosk can also create subtitles for movies, transcription for lectures and interviews.

I personally use Vosk on RPi3B with Respeaker 4-mic Array hat controlling Kodi. We have just released an updated version 0.3.17 with a great speed improvements specifically for small devices.

In this topic I will post news about voice control addon we develop for Kodi. You are also welcome to ask me anything about Vosk or speech technology in general.
Reply
#2
(2021-01-10, 13:55)nshmyrev Wrote: Hi there. My name is Nickolay, I've been working on speech technology for many years. I've implemented many things in CMU Pocketsphinx, Kaldi and various other speech libraries. These days I develop speech recognition toolkit called Vosk https://github.com/alphacep/vosk-api. There are many speech libraries these days, some have nice features, some hard to use, here are some unique things:
 
  • Vosk is an offline open source speech recognition toolkit. It enables speech recognition models for 17 languages and dialects - English, Indian English, German, French, Spanish, Portuguese, Chinese, Russian, Turkish, Vietnamese, Italian, Dutch, Catalan, Arabic, Greek, Farsi, Filipino.
  • Vosk models are small (50 Mb) but provide continuous large vocabulary transcription, zero-latency response with streaming API, reconfigurable vocabulary and speaker identification.
  • Speech recognition bindings implemented for various programming languages like Python, Java, Node.JS, C#, C++ and others. 
  • Vosk works on Raspberry Pi3 and Pi4 but it also scales from mobile phones to big callcenter cluster. Vosk can also create subtitles for movies, transcription for lectures and interviews.

I personally use Vosk on RPi3B with Respeaker 4-mic Array hat controlling Kodi. We have just released an updated version 0.3.17 with a great speed improvements specifically for small devices.

In this topic I will post news about voice control addon we develop for Kodi. You are also welcome to ask me anything about Vosk or speech technology in general.

Hey. I am facing some issue when I try to integerate a custom model build in kaldi. 
I am using vosk android demo master android app and giving my own model files but I am facing this error.

VoskAPI: ExpectToken():io-funcs.cc:200) Failed to read token [started at file position 0], expected <TransitionModel>

Please help me resolve it.
Below is image which shows my model structure and android studio error
Image
Image
Reply
#3
Hello Fariba

This is not directly related to Kodi, you'd better file an issue on our github about this. As for error, it seems your final.mdl file is corrupted, probably it was a symlink and the original file was moved. You'd better make sure you have a real model file there.
Reply
#4
@nshmyrev, if this addon is still developed, can you please share a link?
Reply
#5
I don't believe there is any Kodi addon utilizing this yet, but it appears the development of Vosk (per initial post) is still very active: https://github.com/alphacep/vosk-api
Reply
#6
Yeah, I tried Vosk and it works nicely, the only missing part is Kodi integration
Reply

Logout Mark Read Team Forum Stats Members Help
Offline voice control with Vosk speech recognition0