Use sklearn’s neural network to predict on podcast listening times
No matter what I am engaged in, I always try to fit Kaggle’s monthly playground competition into my schedule. This month’s competition required the data scientist to predict the length of time that an individual would listen to a podcast.
While I have used a linear regression model to make predictions in the past, I opted to try out a nonlinear model in this competition.
MLPRegressor in scikit-learn is a multi-layer perceptron (MLP) regressor, which is a type of neural network used for regression tasks. It learns complex relationships between input features and a continuous target variable.
Key Features of MLPRegressor
- Hidden Layers: You can specify the number of hidden layers and neurons in each layer using hidden_layer_sizes=(100,), where 100 represents the number of neurons in the single hidden layer.
- Activation Functions: Supports activation functions like ReLU, tanh, logistic, and identity.
- Optimization Solvers: Uses Adam, SGD, or LBFGS for weight optimization.
- Regularization: Includes L2 regularization (alpha parameter) to prevent overfitting.
- Learning Rate: Can be constant, adaptive, or inverse scaling.
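The parameters above map directly onto the MLPRegressor constructor. As a rough sketch (synthetic data and parameter values here are illustrative, not the ones used in the competition), they fit together like this:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic non-linear data, stand-in for the real competition set.
rng = np.random.default_rng(42)
X = rng.normal(size=(500, 3))
y = X[:, 0] ** 2 + np.sin(X[:, 1]) + 0.1 * rng.normal(size=500)

# Scaling matters: MLPs train poorly on unscaled features.
model = make_pipeline(
    StandardScaler(),
    MLPRegressor(
        hidden_layer_sizes=(100,),  # one hidden layer with 100 neurons
        activation="relu",          # also: 'tanh', 'logistic', 'identity'
        solver="adam",              # also: 'sgd', 'lbfgs'
        alpha=1e-4,                 # L2 regularization strength
        learning_rate="adaptive",   # also: 'constant', 'invscaling'
        max_iter=1000,
        random_state=0,
    ),
)
model.fit(X, y)
preds = model.predict(X[:5])
print(preds.shape)  # one prediction per input row
```

Wrapping the regressor in a pipeline with StandardScaler is a common idiom, since the solver converges much faster on standardized inputs.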
Sklearn’s MLPRegressor is a good choice when:
- the data has non-linear relationships that traditional regression models struggle with,
- you need a flexible model that can learn complex patterns, and
- you have enough data to train a neural network effectively.
I haven’t used sklearn’s neural network for a while, so I thought it would be a good refresher for me to employ this model. I wrote the code in Python, using Kaggle’s Jupyter Notebook, and stored it in my Kaggle account.
The first thing that I did after creating the Jupyter Notebook was to import the libraries that I would need to execute the program, namely:
- NumPy to perform numeric computations on the data,
- Pandas to process and manipulate the data,
- os to interact with the computer’s operating system,