← All Tags

#speech-to-text

7 episodes

#69: Unsung Hero: The Gooseneck Mic's AI Power

The gooseneck mic: a humble hero with surprising AI power. Discover its secret to crystal-clear speech-to-text accuracy!

gooseneck-micspeech-to-textmicrophoneai-voice-captureaudio-technology

#33: The Unseen Magic of AI's Ears: Decoding VAD

Ever wonder how your AI knows you're talking? We're diving deep into VAD, the unseen magic behind AI's ears.

voice-activity-detectionvadspeech-recognitionasrspeech-to-text

#29: The Multimodal Audio Revolution: A Screen-Free Future?

Is multimodal audio the future? We explore if AI can truly displace traditional speech-to-text for a screen-free world.

multimodal-audiospeech-to-textscreen-freeaudio-aiaccessibility

#28: Your AI, Evolving: Beyond the Static Snapshot

Is your AI an "old suit" that no longer fits? We explore evolving AI that learns and adapts with you.

machine-learningcontinual-learningadaptive-aifine-tuningpersonalized-ai

#10: How ASR Went From Frustration To ... Whisper Magic

Speech to text: from frustrating to fantastic. Uncover the magic behind its rapid rise and connection to the AI boom!

automatic-speech-recognitionspeech-to-textasr-technology

#7: Building Custom ASR Tools

Ever wondered how to build your own ASR tools from scratch? Discover the why and how in this episode!

asrspeech-recognitioncustom-asrmachine-learningspeech-to-text

#3: Safetensors or something else: STT inference formats explained

Unpacking ASR weight formats: Safetensors and beyond. Tune in to understand the distinctions.

safetensorsasrspeech-recognitioninferenceweight-formats