Audio & Speech

Speech recognition, TTS, voice cloning, audio engineering

47 episodes Page 3 of 3

#647: The Golden Rule of Audio Engineering

Why does digital data need to become analog? Explore the physics of sound and the critical role of the DAC in modern audio engineering.

audio-engineeringsignal-processingdigital-to-analog

#598: Audio Engineering as Prompt Engineering: Better Sound, Better AI

Can better audio quality actually make an AI smarter? Discover how audio post-production functions as a new form of prompt engineering.

prompt-engineeringlarge-language-modelsaudio-engineering

#233: How Math Gives Microphones Directional Ears

Discover how math and physics turn simple microphones into "sound spotlights" that can isolate a single voice in even the noisiest environments.

beamforming-technologymicrophone-arraysdigital-signal-processing

#196: Why Your Irish Accent Sounds American

Herman and Corn dive into the mechanics of neural text-to-speech, exploring how AI masters human prosody and the "average voice" accent problem.

neural-text-to-speechvoice-cloninggenerative-modeling

#99: The Mic That Hears You from Across the Desk

Tired of headsets? Herman and Corn explore professional microphone setups for seamless, high-accuracy AI voice dictation from a distance.

voice-dictationai-accuracymicrophonesaudio-qualitysignal-to-noise-ratio

#58: Clean Audio, Messy Reality: Noise Removal for Voice-to-Text

Fussy baby, clean audio? We dive into noise removal for voice-to-text. Discover why cleaner audio can transcribe worse.

noise-removalvoice-to-textaudio-processingsignal-processingreal-time-audio

#57: From Lawyers in Limousines to Developers in Their PJs: The Voice Tech Revolution

From limo-riding lawyers to pajama-clad coders, voice tech is booming. Discover how AI is making it a force for good.

voice-technologyaccessibilityproductivity