Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

https://www.dataengineeringpodcast.com

An average episode of this podcast runs 47m. So far, 88 episodes have been published. This is a weekly podcast.



 

episode 85: Maintaining Your Data Lake At Scale With Spark


A conversation with the architect of Delta Lake on the challenges of building a sustainable data lake at scale







   50m
 
 

episode 84: Managing The Machine Learning Lifecycle


An interview about how the open source Hydrosphere platform simplifies management of the full machine learning lifecycle







   1h2m
 
 

episode 83: Evolving An ETL Pipeline For Better Productivity


An interview about how and why Greenhouse migrated their homegrown ETL pipeline onto DataCoral







   1h2m
 
 

episode 82: Data Lineage For Your Pipelines


An interview about how the open source Pachyderm platform makes it easy to build flexible data pipelines with first-class support for data lineage







   49m
 
 

episode 81: Build Your Data Analytics Like An Engineer


An interview about how dbt enables your data teams to build better analytics in your data warehouse







   56m
 
 

episode 80: Using FoundationDB As The Bedrock For Your Distributed Systems


An interview about the FoundationDB project and how it simplifies the work of building custom distributed systems applications







 2019-05-07  1h6m
 
 

episode 79: Running Your Database On Kubernetes With KubeDB


An interview about how to run your database on Kubernetes with the creator of KubeDB







 2019-04-29  50m
 
 

episode 78: Unpacking Fauna: A Global Scale Cloud Native Database


A deep dive on building the Fauna database and how it supports transactions at global scale







 2019-04-22  53m
 
 

episode 77: Index Your Big Data With Pilosa For Faster Analytics


An interview about the Pilosa bitmap index server and how it can be used to run fast, continuous analytics on large and complex data sets







 2019-04-15  43m
 
 

episode 76: Serverless Data Pipelines On DataCoral


An interview about how DataCoral is building an abstraction layer over data pipelines using microservices built on serverless technologies







 2019-04-08  53m