Transformer Model Algorithm in Speech Recognition

Amazon researchers develop cutting-edge Base TTS text-to-speech model

Amazon.com Inc. researchers have developed a new text-to-speech model, Base TTS, that can pronounce words more naturally than earlier neural networks. TechCrunch reported the project late Wednesday.

Meta Expands AI Speech Recognition to 1,600+ Languages

Omnilingual Automatic Speech Recognition can transcribe speech in over 1,600 languages — including 500 low-resource languages ...

Drax model from aiOla makes AI speech recognition viable and reliable in noisy environments

With Drax, aiOla says, it has come up with a novel technique for training speech recognition systems that’s finally able to ...

InfoQ

Facebook Open-Sources Two Billion Parameter Multilingual Speech Recognition Model XLS-R

Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. This article dives into the happens-before ...

Ars Technica

Whisper AI model automatically recognizes speech and translates it to English

On Wednesday, OpenAI released a new open source AI model called Whisper that recognizes and translates audio at a level that approaches human recognition ability. It can transcribe interviews, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results