#prompt-engineering
81 episodes · Page 2 of 4
#2228: Tuning RAG: When Retrieval Helps vs. Hurts
How do you prevent retrieval from suppressing a model's reasoning? We diagnose our own pipeline's four control levers and multi-source fusion strat...
#2219: Spec-Driven Life: How AI Planning Beats Project Paralysis
What makes AI agents reliably productive? A structured spec that externalizes memory and chunks work into manageable pieces. Can the same framework...
#2207: Specs First, Code Second: Inside Agentic AI's New Era
As AI coding agents evolve from autocomplete to autonomous cloud workers, the bottleneck has shifted—now it's about how clearly you specify what ne...
#2192: How We Built a Podcast Pipeline
Hilbert reveals the complete technical architecture behind 2,000+ episodes—from voice memos to GPU-powered TTS, with Claude models, LangGraph workf...
#2191: Making Multi-Agent AI Actually Work
Research from Google DeepMind, Stanford, and Anthropic reveals most multi-agent systems waste tokens and amplify errors. Single agents with better ...
#2175: Let Your AI Argue With Itself
What happens when you let multiple AI personas debate each other instead of asking one model one question? A deep dive into synthetic perspective e...
#2174: Role-Playing as Orchestration
How a role-playing protocol from NeurIPS 2023 became one of AI's most underrated agent frameworks—and what happens when you scale it to a million a...
#2170: Pricing Agentic AI When Nothing's Predictable
How do you charge fixed prices for systems that operate in fundamental uncertainty? Consultants are discovering frameworks that work—but they requi...
#2165: Strip Your Agent to Bash
The frameworks matter less than you think. What separates a working agent from a failing one is the harness—the orchestration, memory, and tool des...
#2133: Engineering Geopolitical Personas: Beyond Caricatures
How to build LLMs that simulate state actors with strategic fidelity, not just surface mimicry.
#2125: Why Agentic Chunking Beats One-Shot Generation
A single prompt can't write a 30-minute script. Here’s the agentic chunking method that fixes coherence.
#2110: Tuning AI Personality: Beyond Sycophancy
AI models swing between obsequious flattery and cold dismissal. Here’s why that happens and how to fix it.
#2069: The Vibe Coding Trap: Why Your Agent Skills Keep Breaking
Stop guessing at the agentskills.io spec. Learn the exact YAML fields, directory structure, and authoring patterns to make Claude Code skills that ...
#2037: The Hidden Hierarchy of Claude Code Extensions
Stop manually typing slash commands. Here’s the definitive hierarchy of Claude Code extensions—from legacy shortcuts to autonomous agents.
#2028: Agent Skills Are the New Apps
AI agents are getting an App Store for brains. Discover how modular skills are replacing massive prompts and what it means for the future of work.
#2026: Prompt Layering: Beyond the Monolithic Prompt
Stop writing giant, monolithic prompts. Learn how to stack modular layers for cleaner, more powerful AI applications.
#1956: AI Skills: From Vibe Coding to Procedural Playbooks
Forget messy system prompts. Agent skills turn AI into a Swiss Army knife of modular, auditable procedures.
#1929: From Vibe Checks to Model Metrics
We stopped "vibe-checking" our AI scripts and built a science fair for models. Here's how we grade them.
#1906: Is Your AI Model Agentic-Ready or Just Wearing a Suit?
Native tool calling is the difference between a working product and a debugging nightmare.
#1894: Engineering Serendipity: Tuning AI for Better Brainstorming
Stop asking chatbots for generic ideas. Learn how to configure AI as a structured, critical partner for business innovation and career pivots.
#1858: Multi-Model Agents: The Instruction & Context Gap
Mixing AI models creates chaos. Learn the practical fixes for context windows, tokenization, and output formats.
#1828: Mastering 2M Token Context in Agentic Pipelines
A massive context window sounds like a dream, but it can quickly become a nightmare for complex AI workflows.
#1804: The Fork in the Road: Why AI Agents Check Old Receipts First
Stop your AI agent from overthinking. Learn why it checks old memories instead of booking flights—and how to fix the "eagerness" problem.
#1777: Claude Called My Prompt "Rambling" and I'm Not Okay
When an AI coding tool critiques your prompt's literary quality, it raises a massive technical question about engineered personality.