#rag
82 episodes · Page 3 of 4
#1731: Why Deep Research Agents Are Being Forgotten
Specialized research agents outperform general orchestrators by 40-60% on verification tasks, yet developer hype is fading. Here's why.
#1728: The AI Carpool: Emergent Collaboration Through Role-Playing
CAMEL AI lets two agents role-play to solve tasks autonomously. No complex code—just emergent teamwork.
#1727: The Great Architectural Heist: LSP as AI's Universal Plumbing
Explore how the Language Server Protocol is being repurposed to integrate AI directly into code editors, unifying development workflows.
#1725: The Death of the Lonely Chatbot
Forget chatbots: AI orchestration is now the key to scaling intelligent agents in the enterprise.
#1713: Why Native AI Search Grounding Still Fails
Native search grounding is expensive and flaky. Here’s why bolt-on tools still win for accurate, real-time AI answers.
#1708: Why Your AI Agent Forgets Everything (And How to Fix It)
Learn how Letta's memory-first architecture solves the AI context bottleneck for long-term agents.
#1700: Can LLMs Learn Continuously Without Forgetting?
We explore a new approach: micro-training updates every few days to keep AI knowledge fresh without constant web searches.
#1666: The Agent Mesh: Shared Context That Changes Everything
Grok 4.20’s native multi-agent architecture cuts token costs by 75% and enables real-time cross-agent reasoning.
#1629: From DAGs to Loops: Why Agents Need Stateful Cycles
Stop building linear chains and start building cycles to create agents that can reason, self-correct, and maintain complex state.
#1601: Cohere: The Switzerland of Enterprise AI
While others chase viral memes, Cohere is quietly building the secure, cloud-agnostic infrastructure powering the global enterprise.
#1592: The Vector Debt Trap: Choosing Embeddings That Last
Stop treating embedding models like plumbing. Learn how to navigate vector debt, multimodal retrieval, and database configuration for RAG.
#1565: Machine-Readable Safety: Markdown for AI Agents
Transform bloated government data into clean Markdown to power life-saving AI agents during emergencies.
#1482: The Hidden Cost of Choosing an Embedding Model
From Matryoshka models to multimodal search, discover how the fundamental units of AI memory are being optimized for efficiency and scale.
#1212: The Postgres Vector Revolution: Killing the Sprawl
Is your tech stack a sprawling suburb of microservices? Discover why a 40-year-old database is winning the AI infrastructure war.
#1123: When One Database Isn't Enough
Can Postgres 18 finally replace the data warehouse? We dive into data gravity, columnar storage, and the physics of scaling in the AI age.
#1103: The Kitchen War: When Theory Meets Messy Reality
Explore the mechanics of LLM context windows and attention, and witness what happens when technical debates collide with household chores.
#1100: The Truth Conflict: Why AI Ignores the Facts You Give It
Discover why AI models ignore provided documents in favor of old training data and how to build a reliable "hierarchy of truth" for RAG systems.
#995: Democratizing Intelligence: From PDFs to Policy
How can AI transform dense government reports into actionable intelligence? Explore the physics of Iranian missiles and the future of OSINT.
#959: The Infinite Content Problem: AI’s War on Truth
Explore how AI is scaling disinformation to an industrial level and what the "liar's dividend" means for the future of shared reality.
#948: Can AI Search Survive the Fog of War and SEO Spam?
Explore how AI is moving from static models to real-time data and whether specialized search tools can survive the rise of the tech giants.
#869: Why Tiny Digital Savants Are Outperforming God-Models
Are massive AI models hitting a wall? Discover why the future belongs to lean, domain-specific "digital savants" and vertical pre-training.
#846: Beyond the Vector: Building Long-Standing AI Memory
Stop relying on basic vector search. Discover how Graph RAG and RAPTOR are creating AI systems with true long-standing memory.
#810: The Agentic Interview: How AI Learns to Know You
Stop dumping data. Discover how agentic interviews are transforming AI from a passive listener into a proactive, structured partner.
#809: Beyond the Prompt: The Shift to AI Context Engineering
Is prompt engineering still magic, or just plumbing? Explore why the field is shifting toward context engineering and systematic evaluation.