Python Bytes

Python Bytes is a weekly podcast hosted by Michael Kennedy and Brian Okken. The show is a short discussion on the headlines and noteworthy news in the Python, developer, and data science space.

https://pythonbytes.fm/

subscribe
share





episode 210: Analyzing Kickstarter Campaigns with Python


The live stream recording on YouTube.

Special guest: Jay Miller

Sponsored by us! Support our work through:

  • Our courses at Talk Python Training
  • Test & Code Podcast
  • Patreon Supporters

Brian #1:Analyzing Kickstarter Campaigns with Python Data Science Tools

  • Article title: “Kickstarter Projects — Do They Succeed?”
  • Aditya Patkar
  • Using a Kaggle dataset of 378,661-ish projects up to 2018.
  • Looks at using pandas data frames to explore the data.
  • Using .describe() data frame method to learn a lot.
  • Uses matplotlib and seaborn to analyze the data further.
  • Odd statement that I’m not sure is straight faced or a really dry joke: “The data from 1970 seems to be bad or insignificant data.”
  • Examples of using heat maps, line graphs, bar charts, to look at different aspects.
  • Some results:
    • 35.64% of projects are successful (meaning goal hit)
    • tech asks for the most for goals, and has the highest average per backer.
    • Comics has the lowest pledged amount per backer average.
  • Nice that you can use the techniques to ask your own questions of the data.

Michael #2:GPU Accelerated Python for Machine Learning on Cross-Vendor Graphics Cards

  • Building machine learning algorithms using the Vulkan Kompute Python Framework
  • When you hear “CUDA”, that means Nvidia ????
  • Uses Vulkan Kompute framework
  • A large number of high profile (and new) machine learning frameworks such as Google’s Tensorflow, Facebook’s Pytorch, Tencent’s NCNN, Alibaba’s MNN —between others — have been adopting Vulkan as their core cross-vendor GPU computing SDK.
  • As you can imagine, the Vulkan SDK provides very low-level C / C++ access to GPUs, which allows for very specialized optimizations.
  • The main disadvantage is the verbosity involved, requiring 500–2000+ lines of C++ code to only get the base boilerplate required to even start writing the application logic.
  • The Kompute Python package is built on top of the Vulkan SDK through optimized C++ bindings, which exposes Vulkan’s core computing capabilities. Kompute is the Python GPGPU framework.
  • The main article talks through a couple of numerical computation examples.

Jay#3:Adafruit PyPortal - CircuitPython Powered Internet Display

  • Gift for the tinkering pythonista
  • CircuitPy
  • Use it to make plenty of cool things
  • Screen/speaker/Light Sensor Built-In

Brian#4:Introduction to Linear Algebra for Applied Machine Learning with Python

  • Pablo Caceres
  • Intended as a reference and not a comprehensive review.
  • Still, I very much appreciate it.
  • Includes links to both free and paid resources to thoroughly learn linear algebra
  • Covers
    • sets, ordered pairs, relations, functions,
    • vectors
    • matrices
    • linear and affine mappings
    • matrix decomposition
  • Uses numpy, pandas, and altair for examples
  • Quick (but useful) explanations of concepts, along with how to represent and do it with numpy
  • I’m really just getting into it, but I’m enjoying it and this is the right level of handholding I needed.

Michael#5:How many notebook frameworks? Many, and now +1 with Deepnote

  • Deepnote is a new kind of data science notebook. Jupyter-compatible with real-time collaboration and running in the cloud.
  • Free for individuals, paid for teams and companies
  • Real time collaboration is a key feature
  • Built in versioning coming
  • Code review in the notebook coming
  • “View” your variables as a whole environment
  • Better — real — autocomplete
  • Dashboards coming too

Jay#6:imagekit.io

  • image cdn
  • started using imagekit on my own website and noticed faster load times
  • allows for some responsive “fanciness”
    • Add Blurs
    • Smart Cropping
  • Python API or URL-Schema

Extras

Michael:

  • The Apple M1 mac mini wait continues. :)
  • Talk Python To Me, pro edition
  • PSF Fundraiser for the month of December: https://pythonbytes.fm/psf2020

Jay:

  • Elastic Community YouTube Channel
    • Just posted my lightning talk on looking at open data from the government.
    • Upcoming interview on one of our newest clients - Eland which is python client to create pandas-like dataframes with elasticsearch datastores.
  • My Podcast The PIT Show weekly insights from me on my developer journey and interviews with amazing folks in the tech space.
  • Elastic Blog - Just posted my first Elastic Blog post Elastic Contributor Program: How to create a video tutorial

Joke:

via twitter.com/Spirix3/status/1330611989891207168

  • Q: why can't SQL and NoSQL Developers date one other?
  • A: because they don't agree on relationships.


share







 2020-12-03  31m