DataBŷtes

Welcome to the home of DataBytes, a podcast about statistics and data science.

Apple Podcasts Google Play Stitcher Spotify
17 Jan 2020

Episode 49: Extreme Classification: Going at MACH Speed (Part 1)

In this episode, Dr. Derek Feng drops by to chat about a recent paper on a divide-and-conquer approach (Merged-Averaged Classifiers...

13 Dec 2019

Episode 48: Where Moneyball Meets Footy

We’ve long heard about the waves that statistics has made in baseball. But what about soccer? In this episode, we...

30 Nov 2019

Episode 47: Domoic Acid Testing -- A Crabshoot?

Domoic acid has plagued shellfish and other wildlife along the Pacific coastline in recent years. Testing for domoic acid concentration...

08 Nov 2019

Episode 46: Finding Your (Niche) Board Games

In this episode, we discuss how two statisticians used data from BoardGameGeek.com to put together their own board game recommendation...

01 Nov 2019

Episode 45: Learning Publicly, with Private Data

In this episode, Dr. Derek Feng discusses the general issue of data privacy in the age of big data, including...

25 Oct 2019

Episode 44: A Conversation with Jon Krohn

We sit down with Dr. Jon Krohn to chat about his work as a Chief Data Scientist at untapt, his...

04 Oct 2019

Episode 43: To Google and Back

In this episode, Professor Albert Y. Kim of Smith College describes his post-PhD journey, which included a stint at Google...

27 Sep 2019

Episode 42: Black in the Box

Dr. Derek Feng joins us again to discuss the two metrics by which we align all statistical/machine learning methods –...

20 Sep 2019

Episode 41: What to do with Outliers

Guest Dylan O’Connell joins us today to talk about a recent surprising, but legitimate Democratic primary poll result done by...

13 Sep 2019

Episode 40: Making a DIY ML-Controlled Cat Door

Outdoor-cat owners know all too well the unpleasantries of dealing with what the cat dragged in. A self-proclaimed machine learning...

06 Sep 2019

Episode 39: Rolling in the Deep Patient

We take a deep dive into the poster child for black-box machine learning methods, namely Deep Patient: an unsupervised learning...

30 Aug 2019

Episode 38: The Misuse of Statistics in Court

In this episode, we talk about how a statistical concept that you would learn about in an introductory course was...

23 Aug 2019

Episode 37: Susan Starts a New Job

In this episode, we talk about Susan’s new job as a Data Scientist! She recently transitioned from academia to industry...

16 Aug 2019

Episode 36: What's New in Machine Learning Startups

In this episode, we talk about some machine learning startups to pay attention to this year. Sources 25 Machine Learning...

09 Aug 2019

Episode 35: You Look How You Sound

Deep learning has been useful for lots of applications when it comes to prediction. Yet another is the use of...

02 Aug 2019

Episode 34: Protecting Kids' Digital Privacy

In this episode, we talk about protecting kids’ digital privacy. Sources Is that a kid browsing? This company says its...

26 Jul 2019

Episode 33: Statisticians Hate Post-Hoc Power

Statistics is key to demonstrating the effectiveness of new advancements in science and medicine, but when statistical significance is not...

19 Jul 2019

Episode 32: Amazon's 3D Body Scan Study

In this episode, we talk about Amazon’s 3D body scan study. Sources Amazon is 3D-scanning people’s bodies in exchange for...

12 Jul 2019

Episode 31: What Data Visualizations Do You Care About? It's Personal

In this episode, we talk about how data are personal for those in a rural Pennsylvania community. Sources Data is...

05 Jul 2019

Episode 30: Some Like It Hot -- What Gender Reveals About Our Temperature Preferences

Word on the street is that women prefer warmer temperatures than men do. Researchers designed an experiment to investigate whether...