speechcat
2022-01-06 22:21:50.841929+01 by Dan Lyke 0 comments
Launching spchcat, an open-source speech recognition tool for Linux and Raspberry Pi
... I’ve been following the Coqui.ai team’s work since they launched, and was very impressed by the quality of the open source speech models and code they have produced. I didn’t have an easy way to run them myself though, especially on live microphone input. With that in mind, I decided my holiday project would be writing a command line tool using Coqui’s speech to text library. To keep it as straightforward as possible I modeled it on the classic Unix
cat
command, where the default would be to read audio from a microphone and output text (though it ended up expanding to system audio and files too) so I called itspchcat
. You can now download it yourself for Pi’s and x86 Linux from speechcat.org!