Python Test

Practical automated testing for software engineers using Python. Mostly. But also so much more.

https://podcast.pythontest.com

Episode 57: What is Data Science? - Vicki Boykis


Data science, data engineering, data analysis, and machine learning are part of the recent massive growth of Python.

But really, what is data science?

Vicki Boykis helps me understand questions like:

  • No really, what is data science?
  • What does a data pipeline look like?
  • What is it like to do data science, data analysis, data engineering?
  • Can you do analysis on a laptop?
  • How big does data have to be to be considered big?
  • What are the challenges in data science?
  • Does it make sense for software engineers to learn data engineering, data science, pipelines, etc.?
  • How could someone start learning data science?

Also covered:

  • Type A work (analysis) vs. Type B work (building)
  • data lakes and data swamps
  • predictive models
  • data cleaning
  • development vs experimentation
  • Jupyter Notebooks
  • Kaggle
  • ETL pipelines

I learned a lot about the broad field of data science from talking with Vicki.

Special Guest: Vicki Boykis.

Sponsored By:

  • DigitalOcean: Get started with a free $100 credit

Links:

  • How to Lie with Statistics, by Darrell Huff
  • Should you replace Hadoop with your laptop?
  • Kaggle
  • Project Jupyter
  • Soviet Art Bot — A bot that finds socialist realism paintings and tweets them out


 December 11, 2018  30m