DataBŷtes

» episode

Articles by category: episode



2020
17 Jan 2020

Episode 49: Extreme Classification: Going at MACH Speed (Part 1)

In this episode, Dr. Derek Feng drops by to chat about a recent paper on a divide-and-conquer approach (Merged-Averaged Classifiers...

2019
13 Dec 2019

Episode 48: Where Moneyball Meets Footy

We’ve long heard about the waves that statistics has made in baseball. But what about soccer? In this episode, we...

30 Nov 2019

Episode 47: Domoic Acid Testing -- A Crabshoot?

Domoic acid has plagued shellfish and other wildlife along the Pacific coastline in recent years. Testing for domoic acid concentration...

08 Nov 2019

Episode 46: Finding Your (Niche) Board Games

In this episode, we discuss how two statisticians used data from BoardGameGeek.com to put together their own board game recommendation...

01 Nov 2019

Episode 45: Learning Publicly, with Private Data

In this episode, Dr. Derek Feng discusses the general issue of data privacy in the age of big data, including...

25 Oct 2019

Episode 44: A Conversation with Jon Krohn

We sit down with Dr. Jon Krohn to chat about his work as a Chief Data Scientist at untapt, his...

04 Oct 2019

Episode 43: To Google and Back

In this episode, Professor Albert Y. Kim of Smith College describes his post-PhD journey, which included a stint at Google...

27 Sep 2019

Episode 42: Black in the Box

Dr. Derek Feng joins us again to discuss the two metrics by which we align all statistical/machine learning methods –...

20 Sep 2019

Episode 41: What to do with Outliers

Guest Dylan O’Connell joins us today to talk about a recent surprising, but legitimate Democratic primary poll result done by...

13 Sep 2019

Episode 40: Making a DIY ML-Controlled Cat Door

Outdoor-cat owners know all too well the unpleasantries of dealing with what the cat dragged in. A self-proclaimed machine learning...

06 Sep 2019

Episode 39: Rolling in the Deep Patient

We take a deep dive into the poster child for black-box machine learning methods, namely Deep Patient: an unsupervised learning...

30 Aug 2019

Episode 38: The Misuse of Statistics in Court

In this episode, we talk about how a statistical concept that you would learn about in an introductory course was...

23 Aug 2019

Episode 37: Susan Starts a New Job

In this episode, we talk about Susan’s new job as a Data Scientist! She recently transitioned from academia to industry...

16 Aug 2019

Episode 36: What's New in Machine Learning Startups

In this episode, we talk about some machine learning startups to pay attention to this year. Sources 25 Machine Learning...

09 Aug 2019

Episode 35: You Look How You Sound

Deep learning has been useful for lots of applications when it comes to prediction. Yet another is the use of...

02 Aug 2019

Episode 34: Protecting Kids' Digital Privacy

In this episode, we talk about protecting kids’ digital privacy. Sources Is that a kid browsing? This company says its...

26 Jul 2019

Episode 33: Statisticians Hate Post-Hoc Power

Statistics is key to demonstrating the effectiveness of new advancements in science and medicine, but when statistical significance is not...

19 Jul 2019

Episode 32: Amazon's 3D Body Scan Study

In this episode, we talk about Amazon’s 3D body scan study. Sources Amazon is 3D-scanning people’s bodies in exchange for...

12 Jul 2019

Episode 31: What Data Visualizations Do You Care About? It's Personal

In this episode, we talk about how data are personal for those in a rural Pennsylvania community. Sources Data is...

05 Jul 2019

Episode 30: Some Like It Hot -- What Gender Reveals About Our Temperature Preferences

Word on the street is that women prefer warmer temperatures than men do. Researchers designed an experiment to investigate whether...

28 Jun 2019

Episode 29: Jeopardy! Meets Statistics

Jeopardy! is a weeknightly televised trivia game show. In recent months, one player, James Holzhauer has taken the Jeopardy! fandom...

21 Jun 2019

Episode 28: Facial Recognition Technology Update and Rating Trustworthiness of AI-Generated Airbnb Profiles

In this episode, we discuss a number of miscellaneous news updates regarding facial recognition technology (concerning San Francisco, Amazon, and...

14 Jun 2019

Episode 27: Does Uber/Lyft Help Or Hurt Traffic Congestion and Machine Learning Interpretability

In this episode, we look at a study about whether ride-sharing services contribute to increased or decreased traffic congestion in...

07 Jun 2019

Episode 26: Household Electronics That See and Google's Reservation AI

In this episode, we talk about a new innovation that enables household electronics to see what’s around them. We then...

31 May 2019

Episode 25: DataFest 2019 and Measuring Migrations from Hurricane Maria

Susan recently served as a judge at a local DataFest competition (a weekend-long data competition for undergraduates). She shares her...

24 May 2019

Episode 24: Predictive Power of Early Polling and Did a TV Show Result in Higher Teenage Suicides?

In this episode, we discuss FiveThirtyEight.com’s analysis of primary election polling over the past 40 years. In particular, we consider...

17 May 2019

Episode 23: Offline Song Identification and Perceptions about AI

In this episode, we discuss how Google’s Now Playing feature can identify songs that are playing around you, using embeddings....

10 May 2019

Episode 22: Betting on the Game of Thrones and the Misfortune of Lefthandedness

In this episode, we discuss how bookmakers price/take bets on outcomes in the Game of Thrones. We then discuss a...

03 May 2019

Episode 21: Pitch Call Accuracy and Predicting the Outcome of the Champions League

Buckle up for a sports-filled episode! We discuss a study that analyzes the accuracy of umpire calls about strikes vs....

26 Apr 2019

Episode 20: Thinking Like Computers and Text Mining the Mueller Report

In this episode, we discuss a study that recruits human researchers to try to predict how computers classify images. We...

19 Apr 2019

Episode 19: Seeing with AI and Detecting Exoplanets

In this episode, we discuss Microsoft’s handy phone application for scanning and reporting on our surroundings, as a way of...

12 Apr 2019

Episode 18: Statistical Anxiety and the Fight Against Statistical Significance

We discuss a survey designed to analyze the extent and root cause of statistical anxiety in the classroom, discussing the...

05 Apr 2019

Episode 17: How Theranos Sinned Statistically

In this episode, Susan Wang is joined by guest Natalie Doss to consider the statistical sins committed by Theranos, the...

15 Mar 2019

Episode 16: Machine-Generated Faces/Text, and Relating Health Outcomes to Skin Tone

We discuss NVIDIA’s AI-generated faces that look incredibly authentic, and relatedly, OpenAI’s text generator that is so capable that it...

08 Mar 2019

Episode 15: Deep Learning to Fold Proteins and Automated Journalism

We discuss opportunities for machines and humans in the prediction of protein structures, a necessary task in new drug discovery....

01 Mar 2019

Episode 14: A Personality Test that Makes Sense and What Does Spotify Know?

FiveThirtyEight.com has provided a free, online personality test that might make more sense than your typical online clickbaity quiz. We...

22 Feb 2019

Episode 13: IBM's Debate Machine and Adopting a 'Data Culture' in Companies

On February 11, IBM showcased its Project Debater in a face-off against debate champion Harish Natarajan. We talk about how...

15 Feb 2019

Episode 12: Super Bowl Stats, Confidence Intervals, and Data Sources

Three topics are featured in this episode: first, statistics about Super Bowl LIII, including what was in the bowls as...

08 Feb 2019

Episode 11: How Machines Might be Biased and the Job Market for Data Scientists

AI and ML algorithms are growing popular – but they can actually perpetuate cognitive biases in our daily lives. We...

01 Feb 2019

Episode 10: AI in Medicine and Racial Bias in College Admissions

Artificial intelligence is starting to make waves in medicine; we look at how technology might potentially change how medical testing...

25 Jan 2019

Episode 9: Lessons Learned from Making a Fitbit Data Visualization Shiny App

Dynamic data visualization widgets can be pretty cool, but it takes more than just statistical chops to build an online...

18 Jan 2019

Episode 8: The French Revolution and the Challenge of Reproducibility

What can machine learning tell us about the French Revolution? This episode describes a brief history lesson of the digital...

11 Jan 2019

Episode 7: The Virtual Maestro and the Most Influential Movie

Have you ever wanted to try your hand at conducting an orchestra? Now you can, with Google’s Semi-Conductor online app....

04 Jan 2019

Episode 6: Probability Games and Amazon's Own Self-Driving Car

What are the odds that a toss of a 10-sided die, followed by a toss of a 20-sided die, and...

2018
28 Dec 2018

Episode 5: The Do's and Don'ts of Data Visualization

Data visualization is an integral pre-cursor to data analysis, providing a way to visually inspect the data for surprising trends...

21 Dec 2018

Episode 4: Meet the Co-hosts (Part 2)

This week, we learn about Jessi Cisewski-Kehe’s background to find out how she went from a Math major to an...

12 Dec 2018

Episode 3: Meet the Co-hosts (Part 1)

This week, we learn about Susan Wang’s background to find out how she went from an Applied Math major to...

04 Dec 2018

Episode 2: Biometric Technology at Airports, Google Smart Replies, Bestselling Books

In this episode, we discuss biometric technology used at airports, Google Smart Replies (and letting AI compose our emails/texts for us),...

29 Nov 2018

Episode 1: Thanksgiving, College Football, International Prize in Statistics

The first episode of the DataBytes Podcast where we discuss popular topics related to data, statistics, data science, machine learning,...