AI

Artificial intelligence, machine learning, and everything LLM

1009 episodes Page 14 of 51

#2441: When One Sentence Beats Four Clicks

What happens when you ditch the admin panel and let AI agents manage your systems directly?

ai-agentsmodel-context-protocolapi-integration

#2440: Build Your Own CRM With AI Agents

Off-the-shelf CRMs are built for sales teams, not solo operators. Here's why building your own with AI might be smarter.

ai-agentsdiysmall-language-models

#2439: AI Collapses the Framework Decision

Why Airtable fails for multi-user tools, and how AI builders are changing the framework decision for small businesses.

software-developmentproductivityai-agents

#2436: The One-in-Ten-Thousand Design Constraint

How survey-grade precision and Python tools shape local map projections — and the silent failures that break your analysis.

geodesycoordinate-systemsdata-integrity

#2435: The Hidden Difficulty of Data Modeling

Stop designing database schemas from scratch. Here's where to find ready-made templates for common business apps.

software-developmentdata-integrityopen-source

#2434: From Spreadsheets to Databases: The Mental Shift

Stop treating databases like bigger spreadsheets. Learn the one conceptual shift that actually matters.

data-integrityknowledge-managementsoftware-development

#2431: The 3 Markets in an AI Trench Coat

GPUs, LPUs, and ASICs: why the best hardware for AI depends entirely on what you're trying to do.

gpu-accelerationai-inferenceai-training

#2430: Where Men's Advocacy Crosses Into Misogyny

How to acknowledge real male grievances without falling into the manosphere's woman-hating fringe.

misinformationextremismsocial-engineering

#2426: Why DeepSeek V4's Prose Feels More Vivid Than Claude or GPT

A million-token context window at 2% the KV-cache cost — and prose that actually breathes. Here's what makes V4 different.

large-language-modelsopen-source-aifine-tuning

#2425: Can One Button Solve Your Streaming Frustrations?

A deep dive into JustWatch, Trakt, Letterboxd, and why your ideal streaming app doesn't exist yet.

model-context-protocolgeo-blockingstreaming

#2424: What Feminists Actually Mean by "The Patriarchy

Unpacking the structural concept, the popular shorthand, and where the line gets blurry between critiquing systems and demonizing individuals.

cultural-biasmisinformationfree-speech

#2413: When Your AI Says No to Everything

Why LLMs refuse 73% of harmless prompts — and the trade-off between safety and usefulness.

ai-safetyai-alignmentprompt-engineering

#2412: When AI Caves: Progressive vs. Regressive Sycophancy

Why do LLMs agree with you even when you're wrong? We break down the SycEval benchmark and the 78% persistence problem.

ai-safetyai-alignmenthallucinations

#2411: Are Political Bias Benchmarks Actually Measuring Anything?

Why the Political Compass Test fails, and what researchers are building instead to actually measure model bias.

ai-ethicscultural-biasbenchmarks

#2410: How Researchers Actually Measure Censorship in Chinese LLMs

Beyond headlines: the actual benchmarks, methodologies, and pitfalls in detecting political refusal in Chinese language models.

large-language-modelsai-safetycultural-bias

#2409: When AI Cheats on Cultural Knowledge

Five benchmarks that reveal how AI systems fail at cultural knowledge — and what their methodologies tell us.

cultural-biasbenchmarksmultimodal-ai

#2408: How Backpropagation Actually Unlocks Neural Networks

How error signals flow backward through networks to make learning possible — and why "it's just calculus" misses the point.

transformersai-trainingai-history

#2407: Three Landings in 90 Days: Pilot Automation Dependency

Why pilots aren't hand-flying enough, the regulatory floor that lets it happen, and what airlines are doing about it.

aviation-technologyhuman-factorssituational-awareness

#2406: Why Million-Token Context Windows Can't Handle 3 Reasoning Steps

Needle-in-a-haystack is dead. Here's what actually measures whether models can think across long documents.

context-windowreasoning-modelsbenchmarks

#2405: LLM Benchmarks Are Full of Noise: Statistical Rigor in AI Evals

Why most benchmark claims in AI are statistically indefensible — and what to do about it.

benchmarksinterpretabilityllm-as-a-judge