Speech to Text Conversion in Python

Diff-ETS: Learning a Diffusion Probabilistic Model for Electromyography-to-Speech Conversion

Abstract: Electromyography-to-Speech (ETS) conversion has demonstrated its potential for silent speech interfaces by generating audible speech from Electromyography (EMG) signals during silent ...

Inc42

Gnani.ai Launches Indic Speech-To-Text Model Under IndiaAI Mission

Gnani.ai has launched Vachana STT, a speech-to-text model built for Indian languages, under the IndiaAI Mission. The startup ...

GitHub

Kokoro Web - Free AI Text to Speech

Kokoro Web is powered by hexgrad/Kokoro-82M, an open-weight 82 million parameter Text-to-Speech model available on Hugging Face. Despite its lightweight architecture, it delivers comparable quality to ...

19d

Five thoughts from CEO Matt Garman’s keynote at AWS re:Invent

Amazon Web Services Inc. Chief Executive Matt Garman’s keynote at AWS re:Invent was filled with product updates with vision sprinkled in to help customers understand why the innovation matters.

GitHub

Moshi: a speech-text foundation model for real time dialogue

Finally, the code for the web UI client used in the Moshi demo is provided in the client/ directory. If you want to fine tune Moshi, head out to kyutai-labs/moshi ...

IEEE

From Characters to Subwords: Modeling Unit Conversion for Low-resource Speech Recognition

Abstract: Multilingual automatic speech recognition (ASR) models greatly facilitate recognizing low-resource languages by sharing representations across similar languages. However, the commonly ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results