=== Software ===
Among freely available resources, [[Carnegie Mellon University]]'s [[CMU Sphinx|Sphinx]] toolkit is one place to start both learning about speech recognition and experimenting with it. Another resource (free but copyrighted) is the [[HTK (software)|HTK]] book and the accompanying HTK toolkit. For more recent, state-of-the-art techniques, the [[Kaldi (software)|Kaldi]] toolkit can be used.<ref>Povey, D., Ghoshal, A., Boulianne, G., Burget, L., Glembek, O., Goel, N., ... & Vesely, K. (2011). The Kaldi speech recognition toolkit. In IEEE 2011 workshop on automatic speech recognition and understanding (No. CONF). IEEE Signal Processing Society.</ref> In 2017, [[Mozilla]] launched the open-source project [[Common Voice]]<ref>{{Cite web |title=Common Voice by Mozilla |url=https://voice.mozilla.org/ |url-status=dead |archive-url=https://web.archive.org/web/20200227020208/https://voice.mozilla.org/ |archive-date=27 February 2020 |access-date=9 November 2019 |website=voice.mozilla.org}}</ref> to gather a large database of voices that would help build the free speech recognition project DeepSpeech (available free on [[GitHub]]),<ref>{{Cite web |date=9 November 2019 |title=A TensorFlow implementation of Baidu's DeepSpeech architecture: mozilla/DeepSpeech |url=https://github.com/mozilla/DeepSpeech |via=GitHub |access-date=9 September 2024 |archive-date=9 September 2024 |archive-url=https://web.archive.org/web/20240909053949/https://github.com/mozilla/DeepSpeech |url-status=live }}</ref> using Google's open-source platform [[TensorFlow]].<ref>{{Cite web |date=9 November 2019 |title=GitHub - tensorflow/docs: TensorFlow documentation |url=https://github.com/tensorflow/docs |via=GitHub |access-date=9 September 2024 |archive-date=9 September 2024 |archive-url=https://web.archive.org/web/20240909053830/https://github.com/tensorflow/docs |url-status=live }}</ref> When Mozilla redirected funding away from the project in 2020, it was forked by its original developers as Coqui STT<ref>{{Cite web |title=Coqui, a startup providing open speech tech for everyone |url=https://github.com/coqui-ai |access-date=2022-03-07 |website=GitHub |archive-date=9 September 2024 |archive-url=https://web.archive.org/web/20240909054614/https://github.com/coqui-ai |url-status=live }}</ref> using the same open-source license.<ref>{{Cite magazine |last=Coffey |first=Donavyn |date=2021-04-28 |title=Māori are trying to save their language from Big Tech |url=https://www.wired.co.uk/article/maori-language-tech |access-date=2021-10-16 |magazine=Wired UK |language=en-GB |issn=1357-0978 |archive-date=9 September 2024 |archive-url=https://web.archive.org/web/20240909053950/https://www.wired.com/story/maori-language-tech/ |url-status=live }}</ref><ref>{{Cite web |date=2021-07-07 |title=Why you should move from DeepSpeech to coqui.ai |url=https://discourse.mozilla.org/t/why-you-should-move-from-deepspeech-to-coqui-ai/82798 |access-date=2021-10-16 |website=Mozilla Discourse |language=en-US}}</ref>

Google [[Gboard]] supports speech recognition on all [[Android (operating system)|Android]] applications. It can be activated through the [[microphone]] [[Icon (computing)|icon]].<ref>{{Cite web |title=Type with your voice |url=https://support.google.com/gboard/answer/2781851?hl=en&co=GENIE.Platform%3DAndroid |access-date=9 September 2024 |archive-date=9 September 2024 |archive-url=https://web.archive.org/web/20240909054332/https://support.google.com/gboard/answer/2781851?hl=en&co=GENIE.Platform%3DAndroid |url-status=live }}</ref> Speech recognition can be activated in [[Microsoft Windows]] operating systems by pressing Windows logo key + Ctrl + S.<ref>{{cite web|url=https://support.microsoft.com/en-us/windows/use-voice-recognition-in-windows-83ff75bd-63eb-0b6c-18d4-6fae94050571|title=Use voice recognition in Windows|archive-url=https://web.archive.org/web/20250409223456/https://support.microsoft.com/en-us/windows/use-voice-recognition-in-windows-83ff75bd-63eb-0b6c-18d4-6fae94050571|archive-date=April 9, 2025|url-status=live}}</ref>

Commercial cloud-based speech recognition APIs are broadly available.

For more software resources, see [[List of speech recognition software]].