Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

https://www.dataengineeringpodcast.com

An average episode of this podcast runs 53m. So far, 433 episodes have been published. A new episode of this podcast is released every week.

Total length of all episodes: 16 days 2 hours 51 minutes


Ship Faster With An Opinionated Data Pipeline Framework


An interview about how the open source Kedro framework makes it faster and easier to build your end-to-end data pipeline for machine learning projects



 October 1, 2019  35m
 
 

Open Source Object Storage For All Of Your Data

[transcript]


An interview on the open source MinIO platform for fast and flexible object storage for data intensive applications and analytics that runs everywhere



 September 23, 2019  1h8m
 
 

Navigating Boundless Data Streams With The Swim Kernel

[transcript]


An interview about using stateful computation on data streams with the SwimOS kernel to improve your analytics



 September 18, 2019  57m
 
 

Building A Reliable And Performant Router For Observability Data

[transcript]


An interview about building the Vector project to unify delivery of logs and metrics for better system observability



 September 10, 2019  55m
 
 

Building A Community For Data Professionals at Data Council

[transcript]


An interview with Pete Soderling about building and growing the Data Council events and helping engineers build businesses



 September 2, 2019  52m
 
 

Building Tools And Platforms For Data Analytics

[transcript]


An interview on what data engineers need to know about building tools and platforms for data analytics



 August 26, 2019  48m
 
 

A High Performance Platform For The Full Big Data Lifecycle

[transcript]


An interview about the HPCC Systems platform, its journey to open source, and how it handles the full lifecycle of big data for enterprise-scale analytics



 August 19, 2019  1h13m
 
 

Digging Into Data Replication At Fivetran

[transcript]


An interview about how the Fivetran platform is designed to handle data replication as a service



 August 12, 2019  44m
 
 

Solving Data Discovery At Lyft

[transcript]


An interview about the open source Amundsen platform for data discovery and how Lyft is using it to improve their analytics workflow



 August 5, 2019  51m
 
 

Simplifying Data Integration Through Eventual Connectivity

[transcript]


An interview about a new pattern for data integration that reduces the amount of effort required to find connections in numerous data sets



 July 29, 2019  53m