← All Tags

#speech-to-text

8 episodes

Unsung Hero: The Gooseneck Mic's AI Power

The gooseneck mic: a humble hero with surprising AI power. Discover its secret to crystal-clear speech-to-text accuracy!

gooseneck micspeech-to-textmicrophoneAI voice captureaudio technology

The Unseen Magic of AI's Ears: Decoding VAD

Ever wonder how your AI knows you're talking? We're diving deep into VAD, the unseen magic behind AI's ears.

voice activity detectionVADspeech recognitionASRspeech-to-text

The Multimodal Audio Revolution: A Screen-Free Future?

Is multimodal audio the future? We explore if AI can truly displace traditional speech-to-text for a screen-free world.

multimodal audiospeech-to-textscreen-freeaudio AIaccessibility

Your AI, Evolving: Beyond the Static Snapshot

Is your AI an "old suit" that no longer fits? We explore evolving AI that learns and adapts with you.

machine learningcontinual learningadaptive aifine-tuningpersonalized ai

Building Custom ASR Tools

Ever wondered how to build your own ASR tools from scratch? Discover the why and how in this episode!

ASRspeech recognitioncustom asrmachine learningspeech to text

Local STT For AMD GPU Owners

AMD GPU? No problem! Dive into local AI adventures like on-device speech to text.

GPUamd-gpuspeech-to-text

How ASR Went From Frustration To ... Whisper Magic

Speech to text: from frustrating to fantastic. Uncover the magic behind its rapid rise and connection to the AI boom!

automatic-speech-recognitionspeech-to-textasr-technology

Safetensors or something else: STT inference formats explained

Unpacking ASR weight formats: Safetensors and beyond. Tune in to understand the distinctions.

safetensorsASRspeech recognitioninferenceweight formats