#edge-computing

20 episodes

May 13

#2811: Cloudflare's Endgame: From CDN to Cloud Platform

How a spam-tracking side project became the CDN that's quietly building a new kind of cloud.

edge-computing, serverless-gpu, cloud-computing

May 12

#2776: Where Does Your Vercel Site Actually Live?

Your Vercel site lives everywhere and nowhere. Here's what's actually happening under the hood.

edge-computing, serverless-gpu, latency

May 6

#2668: When a Flamethrower Is Overkill

Tesseract, EasyOCR, or a cloud vision model? How to build a fast, reliable label scanner for real-world conditions.

computer-vision, edge-computing, latency

Apr 20

#2357: Microsoft's Phi: When Data Quality Beats Model Size

Explore Microsoft AI's Phi family of small language models, designed for edge deployment and high efficiency.

small-language-models, edge-computing, benchmarks

Apr 17

#2288: The Invisible Gatekeeper of Voice Tech

How voice activity detection shapes every step of the voice tech pipeline, and why it’s harder than it seems.

speech-recognition, audio-processing, edge-computing

Apr 17

#2271: Vector Search in a Single File

What if you could do vector search with just SQLite? We explore sqlite-vec, the extension that adds embeddings to the world's simplest database, an...

vector-databases, edge-computing, data-storage

Apr 6

#2067: MoE vs. Dense: The VRAM Nightmare

MoE models promise giant brains on a budget, but why are engineers fleeing back to dense transformers? The answer is memory.

ai-models, fine-tuning, edge-computing

Apr 4

#2019: Local AI vs Cloud AI: The Agent Identity Crisis

Your desktop is becoming a life support system for AI agents. We explore the sharp trade-offs between local-first and cloud-native architectures.

local-ai, ai-agents, edge-computing

Apr 2

#1927: Workers vs. Servers: The 2026 Compute Showdown

Is the persistent server dead? We compare Cloudflare Workers, GitHub Actions, and VPS options for modern app architecture.

edge-computing, serverless-gpu, latency

Apr 2

#1919: Android Dev Without Android Studio: Is It Actually Good?

How to ship an Android app without ever opening Android Studio or touching a line of Java.

open-source-ai, edge-computing, software-development

Apr 1

#1870: Learning to Break Things Safely

Learn how to safely build and test autonomous AI agents using a disposable VPS, Docker containers, and secure networking.

ai-agents, local-ai, edge-computing

Mar 31

#1809: The TTS Developer's Dilemma: Size vs. Speed

Stop guessing. We break down the critical trade-offs between model size, latency, and sample rate for production-ready voice apps.

text-to-speech, gpu-acceleration, edge-computing

Mar 29

#1758: The Internet's Physical Bread Delivery System

Netflix doesn't stream from California to Jerusalem. It uses local boxes in your city. Here's how the internet physically moves data to you.

edge-computing, network-security, fiber-optics

Mar 29

#1705: Microsoft's Phi: The Small Model Bet for Agentic AI

Microsoft is pushing small language models like Phi for agentic AI. Here’s why that strategy matters for speed, cost, and edge computing.

small-language-models, ai-agents, edge-computing

Mar 25

#1541: Why Your Phone Beats Your PC at Video

Explore why mobile devices handle real-time video AI better than desktops and how the NPU gap is finally closing in 2026.

npu, edge-computing, computer-vision

Mar 25

#1539: Escaping the Cloud Dictation Trap

Stop shouting at your phone. Discover how dedicated hardware and local AI are making instant, private voice-to-text a reality.

speech-recognition, edge-computing, hardware-engineering

Mar 12

#1124: The Database Explosion: Why One Size No Longer Fits All

From vector stores to edge computing, discover why the world now has over 1,000 databases and why Postgres isn't always the answer.

vector-databases, data-storage, edge-computing

Feb 22

#772: Beyond the Build: Can Static Sites Truly Scale?

Is your static site hitting a wall? Discover how modern frameworks handle thousands of pages without crashing your build pipeline.

architecture, static-site-generation, edge-computing

Dec 29

#120: Silencing the Siren: Real-Time AI Noise Reduction

How do phones remove sirens and crying babies in real time? Explore the neural networks and hardware making crystal-clear audio possible.

noise-reduction, audio-engineering, mobile-devices, edge-computing, npu

Dec 9

#38: Why Local AI Inference Is Beating the Cloud

AI supercomputers are landing on your desk! Discover why local AI is indispensable for enterprises facing API costs, latency, and privacy.

ai-supercomputers, local-ai, edge-computing, ai-inference, ai-training