Building AI Text to Speech & Speech to Text with Python
Building AI speech to speech translation, AI meeting transcriber & summariser, and voice command recognition system
4.63 (4 reviews)

2 835
students
3 hours
content
May 2025
last update
$44.99
regular price
What you will learn
Learn how to build AI text to speech system using gTTS
Learn how to build AI speech to text system using Open AI Whisper
Learn how to build AI speech to speech translation system using NLP
Learn how to build AI meeting transcriber and summarizer system using DeepSeek
Learn how to build voice command recognition system for smart home automation simulation
Learn the basic fundamentals of AI text to speech synthesis and automatic speech recognition, such as getting to know their use cases and technical limitations
Learn how AI text to speech system works starting from converting written text into phonemes and acoustic features, then generating realistic human like voice
Learn how AI speech to text system works starting from capturing raw audio waveforms, then extracting features like MFCCs and using models like Open AI Whisper
Learn how AI speech to speech translation system works starting from recognizing input in the source language, translating it using NMT, synthesizing the speech
Learn how AI meeting transcriber and summarizer works starting from recording multi-speaker conversations, perform transcription, generate meeting summary
Learn how a voice command recognition system works by analyzing audio input, transcribing speech, and triggering predefined actions based on recognized phrases
Learn how to integrate AI models from Hugging Face library
Course Gallery




Loading charts...
6607643
udemy ID
10/05/2025
course created date
21/05/2025
course indexed date
Bot
course submited by