New open source tools to unlock speech and audio data

Introducing Lhotse, a Python library for handling speech data. By Piotr Żelasko, Jan Vainer, Tomáš Nekvinda, and Ben Lorica. Introduction: Of the many voice applications for AI, speech recognition is the most widely known and is deployed as a building block of voice assistants. The voice and speech recognition market alone is expected to grow from $9.4… Continue reading “New open source tools to unlock speech and audio data”

Speech synthesis technologies will drive the next wave of innovative voice applications

Deep learning is revolutionizing text-to-speech and speech synthesis technologies. By Yishay Carmiel and Ben Lorica. Introduction: Recent progress in natural language processing (NLP) and speech models has made voice applications accessible to companies across industries. From smartphone applications and personal assistants to sales and customer support to smart home speakers and appliances, voice applications have… Continue reading “Speech synthesis technologies will drive the next wave of innovative voice applications”

Commercial speech recognition systems in the age of big data and deep learning

[A version of this post appears on the O’Reilly Radar.] The O’Reilly Data Show Podcast: Yishay Carmiel on applications of deep learning in text and speech. Subscribe to the O’Reilly Data Show Podcast to explore the opportunities and techniques driving big data, data science, and AI. Find us on Stitcher, TuneIn, iTunes, SoundCloud, RSS. In… Continue reading “Commercial speech recognition systems in the age of big data and deep learning”