AI

Artificial intelligence, machine learning, and everything LLM

940 episodes

#2383: The Blame Gap: Public Anger vs. Breach Reality

How much blame do companies deserve for data breaches? The answer isn't as simple as you think.

cybersecurity, data-security, digital-privacy

#2377: Is Geopolitical Neutrality a Sustainable AI Strategy?

How DeepSeek carved a niche with efficiency, neutrality, and innovative dialogue handling — and what it means for AI's future.

ai-training, ai-models, geopolitical-strategy

#2374: How Granular Can MoE Experts Get?

Exploring the limits of expert granularity in Mixture of Experts models—how narrow can segmentation go before efficiency or accuracy suffers?

large-language-models, transformers, ai-models

#2373: How Facial Recognition Maps Your Face—And Your Rights

The same AI that organizes your photos can track you in a crowd. How does facial recognition work—and why is it so hard to evade?

privacy, digital-privacy, surveillance-technology

#2372: Choosing the Right Sandbox for Your Threat Model

Explore the tools and methods for creating secure, isolated environments to test malware, browse privately, and protect sensitive systems.

cybersecurity, privacy, operating-systems

#2368: The Multi-Stage Pipeline Behind Netflix's Recommendations

Unpacking the multi-stage AI pipeline behind Netflix, Spotify, and Amazon’s "you might also like" suggestions—from candidate generation to real-tim...

ai-models, data-storage, ai-training

#2366: Why LLMs Forget the Middle of Long Conversations

Why do large language models struggle with the middle of long conversations? Explore the science behind attention dilution and practical fixes.

transformers, context-window, model-collapse

#2359: When the Sandbox Doesn't Fit: Sysadmins Using a Dev Tool

Discover why Claude Code excels as a sysadmin tool despite being designed for developers — and the challenges that come with it.

automation, operating-systems, infrastructure

#2357: Microsoft's Phi: When Data Quality Beats Model Size

Explore Microsoft AI's Phi family of small language models, designed for edge deployment and high efficiency.

small-language-models, edge-computing, benchmarks

#2356: Why AI Coding Needs Two Brains

Discover how specialized fast apply models streamline AI-powered code edits, cutting costs and latency while maintaining precision.

software-development, ai-models, productivity

#2355: Why Open-Weight Models Are Winning

Discover how Cogito v2.1 leverages process supervision and MoE architecture to redefine reasoning efficiency in open-weight AI models.

large-language-models, open-source, ai-training

#2354: Profiling a Ghost Model

A deep dive into Amazon Nova, a mysterious AI model family on Bedrock — and the gaps in what we know.

ai-models, cloud-computing, enterprise-hardware

#2353: Evaluating Enterprise AI: Palmyra X5

Explore Palmyra X5, Writer’s flagship AI model designed for enterprise workloads, featuring a million-token context window and agentic capabilities.

ai-models, context-window, ai-orchestration

#2352: The Structured Output Gap in Vision APIs

How do object detection APIs like Gemini, AWS Rekognition, and YOLO compare for automated annotation workflows?

computer-vision, api-integration, benchmarks

#2351: AI Model Spotlight: Aion-2.0

Why is a biopharma AI lab releasing a storytelling-optimized model? We explore Aion-2.0’s architecture, pricing, and niche adoption.

ai-models, pharmacology, israel

#2350: NVIDIA's Strategic Pivot: From Chipmaker to Model Builder

Dive into NVIDIA’s Nemotron 3 Super, a hybrid MoE model combining Mamba, Transformers, and multi-token prediction for cutting-edge efficiency.

transformers, latent-space, ai-models

#2349: The 30-Person Lab Outpacing AI Giants

Discover how Arcee AI’s Trinity Large Thinking delivers cutting-edge reasoning at a fraction of the cost, all from a team of just 30.

ai-models, reasoning-models, benchmarks

#2348: Diffusion Models Take on Text Generation

Explore Inception Labs’ Mercury 2, a groundbreaking diffusion-based language model that rethinks text generation and reasoning.

transformers, parallel-computing, voice-first

#2342: How Python Ate Wall Street

Over 80% of equity trades are now executed algorithmically. How did Python libraries quietly democratize quant finance?

high-frequency-trading, financial-fraud, quantitative-finance

#2336: How ADRs Solve AI's Institutional Memory Problem

Architectural Decision Records (ADRs) aren’t just documentation—they’re a way to give AI coding assistants the context they lack.

software-development, ai-agents, legacy-systems