Audio & Speech

Speech recognition, TTS, voice cloning, audio engineering

46 episodes Page 3 of 3

#598: Audio Engineering as Prompt Engineering: Better Sound, Better AI

Can better audio quality actually make an AI smarter? Discover how audio post-production functions as a new form of prompt engineering.

prompt-engineeringlarge-language-modelsaudio-engineering

#233: How Math Gives Microphones Directional Ears

Discover how math and physics turn simple microphones into "sound spotlights" that can isolate a single voice in even the noisiest environments.

beamforming-technologymicrophone-arraysdigital-signal-processing

#196: Why Your Irish Accent Sounds American

Herman and Corn dive into the mechanics of neural text-to-speech, exploring how AI masters human prosody and the "average voice" accent problem.

neural-text-to-speechvoice-cloninggenerative-modeling

#99: The Mic That Hears You from Across the Desk

Tired of headsets? Herman and Corn explore professional microphone setups for seamless, high-accuracy AI voice dictation from a distance.

voice-dictationai-accuracymicrophonesaudio-qualitysignal-to-noise-ratio

#58: Clean Audio, Messy Reality: Noise Removal for Voice-to-Text

Fussy baby, clean audio? We dive into noise removal for voice-to-text. Discover why cleaner audio can transcribe worse.

noise-removalvoice-to-textaudio-processingsignal-processingreal-time-audio

#57: From Lawyers in Limousines to Developers in Their PJs: The Voice Tech Revolution

From limo-riding lawyers to pajama-clad coders, voice tech is booming. Discover how AI is making it a force for good.

voice-technologyaccessibilityproductivity