Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

An average episode of this podcast lasts 48m. So far, 109 episodes have been published. This podcast is released weekly.


episode 106: Designing For Data Protection

An interview about data protection regulations and how they can influence the design of your data platform



episode 105: Automating Your Production Dataflows On Spark

An interview about how Ascend provides an autonomous data orchestration platform to simplify your production dataflows



episode 104: Build Maintainable And Testable Data Applications With Dagster

An interview about the Dagster framework and how you can use it to build testable and maintainable data applications



episode 103: Data Orchestration For Hybrid Cloud Analytics

An interview about the emerging category of data orchestration platforms and how they can be used to bridge the gap between modern and legacy analytics systems



episode 102: Keeping Your Data Warehouse In Order

An interview about Dataform and how it helps you to keep your data warehouse in good working order



episode 101: Fast Analytics On Semi-Structured And Structured Data In The Cloud

An interview about the architecture of Rockset and how they built a serverless platform for fast and flexible analytics on your semi-structured data


 2019-10-08  54m

episode 100: Ship Faster With An Opinionated Data Pipeline Framework

An interview about how the open source Kedro framework makes it faster and easier to build your end-to-end data pipeline for machine learning projects


 2019-10-01  35m

episode 99: Open Source Object Storage For All Of Your Data

An interview on the open source MinIO platform for fast and flexible object storage for data intensive applications and analytics that runs everywhere


 2019-09-23  1h8m

episode 98: Navigating Boundless Data Streams With The Swim Kernel

An interview about using stateful computation on data streams with the SwimOS kernel to improve your analytics


 2019-09-18  57m

episode 97: Building A Reliable And Performant Router For Observability Data

An interview about building the Vector project to unify delivery of logs and metrics for better system observability


 2019-09-10  55m