Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Eine durchschnittliche Folge dieses Podcasts dauert 47m. Bisher sind 92 Folge(n) erschienen. Dieser Podcast erscheint wöchentlich


episode 89: Data Labeling That You Can Feel Good About

An interview about the Cloud Factory platform for data labeling and social good in developing nations



episode 88: Scale Your Analytics On The Clickhouse Data Warehouse

An interview about Clickhouse, an open source, columnar data warehouse built for massive scale and speed to enable interactive analytics



episode 87: Stress Testing Kafka And Cassandra For Real-Time Anomaly Detection

An interview about testing the limits of scaling Kafka and Cassandra for real-time anomaly detection at Instaclustr



episode 86: The Workflow Engine For Data Engineers And Data Scientists

An interview about how the Prefect workflow engine unifies the needs of data engineers and data scientists with a pure Python API



episode 85: Maintaining Your Data Lake At Scale With Spark

A conversation with the architect of Delta Lake on the challenges of building a sustainable data lake at scale



episode 84: Managing The Machine Learning Lifecycle

An interview about how the open source Hydrosphere platform simplifies management of the full machine learning lifecycle


 2019-06-10  1h2m

episode 83: Evolving An ETL Pipeline For Better Productivity

An interview about how and why Greenhouse migrated their homegrown ETL pipeline onto DataCoral


 2019-06-04  1h2m

episode 82: Data Lineage For Your Pipelines

An interview about how the open source Pachdyerm platform makes building flexible data pipelines with first class support for data lineage easy


 2019-05-27  49m

episode 81: Build Your Data Analytics Like An Engineer

An interview about how dbt enables your data teams to build better analytics in your data warehouse


 2019-05-20  56m

episode 80: Using FoundationDB As The Bedrock For Your Distributed Systems

An interview about the FoundationDB project and how it simplifies the work of building custom distributed systems applications


 2019-05-07  1h6m