Episodes

  • Business Physics: How Brand, Pricing, and Product Design Define Success with Erik Swan
    May 8 2025

    Summary
    In this episode, Erik reflects on his long and storied tech career—from the days of punch cards to founding multiple startups, including a stint at Splunk.

    At 61, he offers a unique perspective on how the industry has evolved and shares candid insights into what it takes to build a successful company. He discusses the evolution from building simple tools to creating comprehensive solutions and eventually platforms, emphasizing the importance of starting with a “hammer”—a focused, simple tool—before scaling to a broader offering.

    Eril introduces his concept of the “physics of business,” a framework for understanding go-to-market dynamics, pricing, and the critical role of brand in differentiating a product in a crowded market.

    He also touches on the challenges of product-led growth, the importance of achieving a strong “K value” (viral or network effects), and the pitfalls of allowing short-term quarterly pressures to derail long-term vision. Toward the end, he hints at his current project, Bestimer, which aims to apply lessons from his past ventures and leverage modern AI to tackle a massive, data-intensive problem.

    Chapters

    00:00 Erik's Journey Through Tech History
    04:06 The Philosophy of Designing for Success
    09:49 Understanding the Physics of Business
    14:29 Timing and Luck in Startups
    18:09 Lessons Learned from Splunk
    23:30 The Power of Brand in Business
    28:02 Leveraging AI for Brand Development
    32:04 The Resilience of Splunk
    36:45 Building a Competitive Edge
    37:28 From Tool to Solution
    40:59 The Importance of Onboarding
    44:32 Navigating Growth and Market Fit
    51:11 Innovating with AI: The Next Chapter

    Show more Show less
    1 hr and 2 mins
  • Incremental Materialization: Reinventing Database Views with Gilad Kleinman of Epsio
    Apr 24 2025

    Summary


    In this episode, Gilad Kleinman, co-founder of Epsio, shares his unique journey from PHP development to low-level kernel programming and how that evolution led him to build an innovative incremental views engine.

    Gilad explains that Epsio tackles a common challenge in databases: making heavy, complex queries faster and more efficient through incremental materialization. He describes how traditional materialized views fall short—often requiring full refreshes—and how Epsio seamlessly integrates with existing databases by consuming replication streams (CDC) and writing back to result tables without disrupting the core transactional system.

    The conversation dives into the technical trade-offs and optimizations involved, such as handling stateful versus stateless operators (like group-by and window functions), using Rust for performance, and the challenges of ensuring consistency.

    Gilad also contrasts Epsio’s approach with streaming systems like Flink, emphasizing that by maintaining tight integration with the native database, Epsio can offer immediate, up-to-date query results while minimizing disruption.

    Finally, he outlines his vision for the future of incremental stream processing and materialized views as a means to reduce compute costs and enhance overall system performance.


    Chapters

    00:00 From PHP to Kernel Development: A Journey
    07:30 Introducing Epsio: The Incremental Views Engine
    10:56 The Importance of Materialized Views
    15:07 Understanding Incremental Materialization
    19:21 Optimizing Query Performance with Epsio
    24:53 Integrating Epsio with Existing Databases
    27:02 The Shift from Theory to Practice in Data Processing
    29:42 Seamless Integration with Existing Databases
    32:02 Understanding Epsio Incremental Processing Mechanism
    34:46 Challenges and Limitations of Incremental Views
    36:49 The Complexity of Implementing Operators
    39:56 Trade-offs in Incremental Computation
    41:21 User Interaction with Epsio
    43:01 Comparing EPSIO with Streaming Systems
    45:09 Architectural Guarantees of Epsio
    50:33 The Future of Incremental Data Processing

    Show more Show less
    52 mins
  • From Data Mesh to Lake House: Revolutionizing Metadata with Lakekeeper
    Mar 21 2025

    Summary

    In this episode, Viktor Kessler shares his journey and insights from his extensive experience in data management—from building risk management systems and data warehouses to working as a solutions architect at MongoDB and Dremio, and now co-founding a startup.

    Initially exploring data mesh concepts, Viktor explains how real-world challenges—such as the disconnect between technical data models and business needs, inconsistent definitions across departments, and the difficulty in managing actionable metadata—led him and his co-founder to pivot toward building a lake house solution.

    His startup is developing Lakekeeper, an open source REST catalog for Apache Iceberg, which aims to bridge the gap between decentralized data production and centralized metadata management.

    The conversation also delves into the evolution of data catalogs, the necessity for self-service analytics, and how creating consumption-ready data products can transform data functions from cost centers into profit centers.

    Finally, Viktor outlines ways for interested listeners to get involved with the Lakekeeper community through GitHub, upcoming meetups, and a dedicated Discord channel.

    Chapters

    00:00 Introduction to Viktor Kessler and His Journey
    04:57 Transitioning from Data Mesh to Lake House
    09:15 Understanding Data Mesh: Pain Points and Solutions
    13:47 The Role of Metadata in Data Management
    18:16 The Evolution of Catalogs and Metadata Management
    28:14 Stabilizing the Consumption Pipeline
    31:18 Centralizing Metadata for Decentralized Organizations
    37:09 Bridging the Gap: Technical and Business Perspectives
    43:17 Rethinking Data Products and Consumption
    50:45 Finding Balance: Control and Flexibility in Data Management

    Show more Show less
    57 mins
  • Reinventing Stream Processing: From LinkedIn to Responsive with Apurva Mehta
    Mar 6 2025

    Summary


    In this episode, Apurva Mehta, co-founder and CEO of Responsive, recounts his extensive journey in stream processing—from his early work at LinkedIn and Confluent to his current venture at Responsive.

    He explains how stream processing evolved from simple event ingestion and graph indexing to powering complex, stateful applications such as search indexing, inventory management, and trade settlement.

    Apurva clarifies the often-misunderstood concept of “real time,” arguing that low latency (often in the one- to two-second range) is more accurate for many applications than the instantaneous response many assume. He delves into the challenges of state management, discussing the limitations of embedded state stores like RocksDB and traditional databases (e.g., Postgres) when faced with high update rates and complex transactional requirements.

    The conversation also covers the trade-offs between SQL-based streaming interfaces and more flexible APIs, and how Responsive is innovating by decoupling state from compute—leveraging remote state solutions built on object stores (like S3) with specialized systems such as SlateDB—to improve elasticity, cost efficiency, and operational simplicity in mission-critical applications.

    Chapters

    00:00 Introduction to Apurva Mehta and Streaming Background
    08:50 Defining Real-Time in Streaming Contexts
    14:18 Challenges of Stateful Stream Processing
    19:50 Comparing Streaming Processing with Traditional Databases
    26:38 Product Perspectives on Streaming vs Analytical Systems
    31:10 Operational Rigor and Business Opportunities
    38:31 Developers' Needs: Beyond SQL
    45:53 Simplifying Infrastructure: The Cost of Complexity
    51:03 The Future of Streaming Applications

    Click here to view the episode transcript.

    Show more Show less
    58 mins
  • Semantic Layers: The Missing Link Between AI and Data with David Jayatillake from Cube
    Feb 20 2025

    In this episode, we chat with David Jayatillake, VP of AI at Cube, about semantic layers and their crucial role in making AI work reliably with data.

    We explore how semantic layers act as a bridge between raw data and business meaning, and why they're more practical than pure knowledge graphs.

    David shares insights from his experience at Delphi Labs, where they achieved 100% accuracy in natural language data queries by combining semantic layers with AI, compared to just 16% accuracy with direct text-to-SQL approaches.

    We discuss the challenges of building and maintaining semantic layers, the importance of proper naming and documentation, and how AI can help automate their creation.

    Finally, we explore the future of semantic layers in the context of AI agents and enterprise data systems, and learn about Cube's upcoming AI-powered features for 2025.

    00:00 Introduction to AI and Semantic Layers
    05:09 The Evolution of Semantic Layers Before and After AI
    09:48 Challenges in Implementing Semantic Layers
    14:11 The Role of Semantic Layers in Data Access
    18:59 The Future of Semantic Layers with AI
    23:25 Comparing Text to SQL and Semantic Layer Approaches
    27:40 Limitations and Constraints of Semantic Layers
    30:08 Understanding LLMs and Semantic Errors
    35:03 The Importance of Naming in Semantic Layers
    37:07 Debugging Semantic Issues in LLMs
    38:07 The Future of LLMs as Agents
    41:53 Discovering Services for LLM Agents
    50:34 What's Next for Cube and AI Integration

    Show more Show less
    59 mins
  • From black holes to AI in mathematics: AI Innovation in Mathematics and Health with Yaron Hadad
    Feb 4 2025

    In this episode, we chat with Yaron Hadad, a fascinating individual who transitioned from theoretical physics to entrepreneurship.

    We explore his groundbreaking work on black holes and gravitational waves, and learn about the Ramanujan Machine - an algorithmic system he helped develop that discovers new mathematical formulas and democratizes mathematical research. We'll hear about the scientific community's mixed reactions to this innovative approach.

    The conversation then shifts to his work with Neutrino, a company he founded that uses AI and continuous monitoring devices to understand how food affects individual health. We delve into the complexities of nutrition science, the challenges of processing multiple data streams, and the future of personalized health monitoring.

    Throughout the episode, Yaron shares insights on bridging theoretical research with practical applications, and the role of AI in advancing both pure mathematics and healthcare.

    00:00 Yaron Hadad's Journey: From Physics to AI in Healthcare
    04:50 The Complexity of Einstein's Equations and Their Solutions
    10:12 AI in Mathematics: The Ramanujan Machine and Conjectures
    15:41 Navigating Criticism: The Scientific Community's Response to Innovation
    29:24 The Impact of Algorithms in Mathematics
    35:30 The Planck Machine: A New Approach
    41:15 Neutrino: A Personal Journey in Nutrition
    50:11 Connecting Food Complexity to Health Metrics

    Show more Show less
    59 mins
  • Building a Native Search Engine in PostgreSQL: ParadeDB's Journey to Replace Elasticsearch with Philippe Noël
    Jan 16 2025

    In this episode, we chat with Philippe Noël, founder of ParadeDB, about building an Elasticsearch alternative natively on PostgreSQL.

    We explore the challenges and benefits of extending PostgreSQL versus building a separate system, diving into topics like full-text search, faceted analytics, and why organizations need these capabilities.

    We discuss the emerging bring-your-own-cloud deployment model, the state of the PostgreSQL extension ecosystem, and what makes a truly production-ready database extension.

    Philippe shares insights on the future of search technology and how recent AI developments are actually increasing the demand for traditional search capabilities.

    The conversation also covers the misconceptions around PostgreSQL's scalability and the trade-offs between multi-tenant and single-tenant architectures in modern data infrastructure.

    Chapters

    00:00 Introduction to ParadeDB and Its Mission
    06:35 User-Facing Search and Analytics
    11:45 The Role of Postgres in Modern Data Solutions
    17:30 Future of Multimodal Databases
    31:04 The Rise of Fintech and Data Integrity
    36:36 Deployment Models: BYOC and Control Plane
    43:41 The Evolution of Cloud Infrastructure and Serverless Databases
    49:38 The Future of Search and Community Engagement

    Click here to view the episode transcript.

    Show more Show less
    1 hr
  • Optimizing SQL with LLMs: Building Verified AI Systems at Espresso AI with Ben Lerner
    Jan 3 2025

    In this episode, we chat with Ben, founder of Espresso AI, about his journey from building Excel Python integrations to optimizing data warehouse compute costs.

    We explore his experience at companies like Uber and Google, where he worked on everything from distributed systems to ML and storage infrastructure.

    We learn about the evolution of his latest venture, which started as a C++ compiler optimization project and transformed into a system for optimizing Snowflake workloads using ML.

    Ben shares insights about applying LLMs to SQL optimization, the challenges of verified code transformation, and the importance of formal verification in ML systems. Finally, we discuss his practical approach to choosing ML models and the critical lesson he learned about talking to users before building products.

    Chapters

    00:00 Ben's Journey: From Startups to Big Tech
    13:00 The Importance of Timing in Entrepreneurship
    19:22 Consulting Insights: Learning from Clients
    23:32 Transitioning to Big Tech: Experiences at Uber and Google
    30:58 The Future of AI: End-to-End Systems and Data Utilization
    35:53 Transitioning Between Domains: From ML to Distributed Systems
    44:24 Espresso's Mission: Optimizing SQL with ML
    51:26 The Future of Code Optimization and AI

    Click here to view the episode transcript.

    Show more Show less
    1 hr and 6 mins
adbl_web_global_use_to_activate_T1_webcro805_stickypopup