#speech-to-text
7 episodes
#69: Unsung Hero: The Gooseneck Mic's AI Power
The gooseneck mic: a humble hero with surprising AI power. Discover its secret to crystal-clear speech-to-text accuracy!
#33: The Unseen Magic of AI's Ears: Decoding VAD
Ever wonder how your AI knows you're talking? We're diving deep into VAD, the unseen magic behind AI's ears.
#29: The Multimodal Audio Revolution: A Screen-Free Future?
Is multimodal audio the future? We explore if AI can truly displace traditional speech-to-text for a screen-free world.
#28: Your AI, Evolving: Beyond the Static Snapshot
Is your AI an "old suit" that no longer fits? We explore evolving AI that learns and adapts with you.
#10: How ASR Went From Frustration To ... Whisper Magic
Speech to text: from frustrating to fantastic. Uncover the magic behind its rapid rise and connection to the AI boom!
#7: Building Custom ASR Tools
Ever wondered how to build your own ASR tools from scratch? Discover the why and how in this episode!
#3: Safetensors or something else: STT inference formats explained
Unpacking ASR weight formats: Safetensors and beyond. Tune in to understand the distinctions.