Data Engineering Podcast

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

https://www.dataengineeringpodcast.com

An average episode of this podcast runs 53m. So far, 431 episode(s) have been published. This podcast is released weekly.

Total length of all episodes: 16 days 1 hour 36 minutes







Unlocking The Power of Data Lineage In Your Platform with OpenLineage


Data lineage is the common thread that ties together all of your data pipelines, workflows, and systems. To get a holistic understanding of your data quality, where errors are occurring, or how a report was constructed, you need to track the lineage of the data from beginning to end. The complicating factor is that every framework, platform, and product has its own concepts of how to store, represent, and expose that information...










 May 18, 2021  n/a
 
 

Building Your Data Warehouse On Top Of PostgreSQL


An interview about how you can build your data warehouse on top of PostgreSQL for flexibility and full control over your data.










 May 14, 2021  1h15m
 
 

Making Analytical APIs Fast With Tinybird


A conversation about how Tinybird invested in ClickHouse to power analytical APIs that are fast to build and operate.










 May 11, 2021  54m
 
 

Making Spark Cloud Native At Data Mechanics


A conversation about how the team at Data Mechanics is bringing Apache Spark into the cloud native world and the positive impact that has on your development experience.










 May 7, 2021  40m
 
 

The Grand Vision And Present Reality of DataOps


A conversation about the grand vision and current realities of DataOps and how you can start on the journey toward more maintainable and reliable data systems.










 May 4, 2021  57m
 
 

Self Service Data Exploration And Dashboarding With Superset


An interview with Maxime Beauchemin about how to use Apache Superset as a platform for self-service data exploration and analytics.










 April 27, 2021  47m
 
 

Moving Machine Learning Into The Data Pipeline at Cherre


An interview about how the team at Cherre built an internal machine learning project to use as a service in their data pipelines to make dealing with messy address data less painful.










 April 20, 2021  48m
 
 

Exploring The Expanding Landscape Of Data Professions with Josh Benamram of Databand


An interview with Josh Benamram about the emerging roles across the data ecosystem and how they interact with data systems.










 April 13, 2021  1h8m
 
 

Put Your Whole Data Team On The Same Page With Atlan


In this episode, Prukalpa Sankar discusses how Atlan uses metadata from all of your workflows to get everyone on the same page, letting you deliver on your data projects in record time.










 April 6, 2021  57m
 
 

Data Quality Management For The Whole Team With Soda Data


An interview about the Soda Data platform and the open source components that they are building to level up the quality of your data pipelines.










 March 30, 2021  58m