The Data Stack Show

Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

https://datastackshow.com

Eine durchschnittliche Folge dieses Podcasts dauert 45m. Bisher sind 347 Folge(n) erschienen. Alle 5 Tage erscheint eine Folge dieses Podcasts.

Gesamtlänge aller Episoden: 8 days 9 hours 4 minutes

subscribe
share






Data Council Week: How To Do Self-Service Data Analytics and Business Intelligence Right with Ryan Dolley of GoodData


It’s a special edition of The Data Stack Show as we come to you from the Data Council in Austin, Texas. Brooks and Matthew co-host the show to bring you some bonus episodes from some of the leading voices in the data space. This episode, Ryan Dolley, Vice President of Product Strategy at GoodData joins the show. During the conversation, Ryan shares his journey from creative arts to data, emphasizing the importance of understanding human behavior in both fields...


share








 April 15, 2024  42m
 
 

185: The Evolution of Data Processing, Data Formats, and Data Sharing with Ryan Blue of Tabular


This week on The Data Stack Show, Eric and Kostas chat with Ryan Blue, the Co-Founder and CEO of Tabular, and also creator of Iceberg and former Cloudera and Netflix employee. During the episode, Ryan discusses the challenges of managing large-scale data and the development of Iceberg, a new table format. He explains Iceberg's benefits, such as automatic partitioning and improved metadata management, which simplify data engineers' tasks and enhance query performance...


share








 April 10, 2024  1h29m
 
 

The PRQL: The Two Parallel Tracks of Development In Data Processing with Ryan Blue of Tabular


In this bonus episode, Eric and Kostas preview their upcoming conversation with Ryan Blue of Tabular.


share








 April 8, 2024  4m
 
 

184: Kafka Streams and Operationalizing Event Driven Applications with Aprurva Mehta of Responsive


This week on The Data Stack Show, Eric and Kostas chat with Apurva Mehta, Co-Founder and CEO of Responsive, about event-driven applications and the necessary infrastructure. Apruva shares his journey from LinkedIn to Confluent and eventually founding Responsive, focusing on managing event-driven applications in the cloud. The discussion covers the definition of event-driven applications, the significance of latency and state in event processing, and the evolution of Kafka and Kafka Streams...


share








 April 3, 2024  58m
 
 

The PRQL: Event-Driven Applications: Where Low Latency Meets High Impact with Apruva Mehta of Responsive


In this bonus episode, Eric and Kostas preview their upcoming conversation with Apurva Mehta of Responsive.


share








 April 1, 2024  3m
 
 

183: Why Modern Data Quality Must Move Beyond Traditional Data Management Practices with Chad Sanderson of Gable.ai


This week on The Data Stack Show, Eric and Kostas chat with Chad Sanderson, the CEO at Gable.ai. During the episode, Chad discusses the complexities of managing the data supply chain, emphasizing the importance of data quality, feedback loops, and aligning incentives within organizations. He shares his journey from analyst to data infrastructure leader at companies like Oracle, Sephora, and Microsoft. Chad introduces his company, Gable, which tackles upstream data quality issues...


share








 March 27, 2024  1h2m
 
 

The PRQL: The Data Supply Chain with Chad Sanderson of Gable.ai


In this bonus episode, Eric and Kostas preview their upcoming conversation with Chad Sanderson of Gable.ai.


share








 March 25, 2024  7m
 
 

182: How Can Data Infrastructure Facilitate Efficient Data Sharing? Featuring Kevin Liu of Stripe


This week on The Data Stack Show, Eric and Kostas chat with Kevin Liu, Software Engineer at Stripe. During the episode, Kevin discusses data infrastructure challenges and the development of data products. He also shares insights on the importance of metadata management and the role of catalogs in maintaining data consistency across various systems...


share








 March 20, 2024  1h0m
 
 

The PRQL: Exploring the Intersection of Software Engineering and Data Management with Kevin Liu of Stripe


In this bonus episode, Eric and Kostas preview their upcoming conversation with Kevin Liu of Stripe.


share








 March 18, 2024  6m
 
 

181: Apache Druid and the Next Generation of Business Intelligence with Mike Driscoll of Rill Data


This week on The Data Stack Show, Eric and Kostas chat with Mike Driscoll, the CEO of Rill Data. During the episode, Mike recounts his journey from the Human Genome Project to developing the Druid engine, which was created to handle massive advertising data. He discusses Druid's adoption by major companies and its evolution, emphasizing the importance of speed, simplicity, and scalability in data tools...


share








 March 13, 2024  59m