Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

https://www.dataengineeringpodcast.com

Eine durchschnittliche Folge dieses Podcasts dauert 47m. Bisher sind 92 Folge(n) erschienen. Dieser Podcast erscheint wöchentlich
subscribe
share



 

episode 89: Data Labeling That You Can Feel Good About


An interview about the Cloud Factory platform for data labeling and social good in developing nations


share





   57m
 
 

episode 88: Scale Your Analytics On The Clickhouse Data Warehouse


An interview about Clickhouse, an open source, columnar data warehouse built for massive scale and speed to enable interactive analytics


share





   1h11m
 
 

episode 87: Stress Testing Kafka And Cassandra For Real-Time Anomaly Detection


An interview about testing the limits of scaling Kafka and Cassandra for real-time anomaly detection at Instaclustr


share





   38m
 
 

episode 86: The Workflow Engine For Data Engineers And Data Scientists


An interview about how the Prefect workflow engine unifies the needs of data engineers and data scientists with a pure Python API


share





   1h8m
 
 

episode 85: Maintaining Your Data Lake At Scale With Spark


A conversation with the architect of Delta Lake on the challenges of building a sustainable data lake at scale


share





 2019-06-17  50m
 
 

episode 84: Managing The Machine Learning Lifecycle


An interview about how the open source Hydrosphere platform simplifies management of the full machine learning lifecycle


share





 2019-06-10  1h2m
 
 

episode 83: Evolving An ETL Pipeline For Better Productivity


An interview about how and why Greenhouse migrated their homegrown ETL pipeline onto DataCoral


share





 2019-06-04  1h2m
 
 

episode 82: Data Lineage For Your Pipelines


An interview about how the open source Pachdyerm platform makes building flexible data pipelines with first class support for data lineage easy


share





 2019-05-27  49m
 
 

episode 81: Build Your Data Analytics Like An Engineer


An interview about how dbt enables your data teams to build better analytics in your data warehouse


share





 2019-05-20  56m
 
 

episode 80: Using FoundationDB As The Bedrock For Your Distributed Systems


An interview about the FoundationDB project and how it simplifies the work of building custom distributed systems applications


share





 2019-05-07  1h6m