How to Analyze Income Dataset Using Pandas and Visualization Libraries

Nutan
8 min read · Mar 5, 2024

In this blog, we will take the income dataset and analyze it from several angles. The dataset provides predictive features such as age, education, employment status, and marital status, which are used to predict whether a person's salary is greater than $50K.

Photo by Domenico Loia on Unsplash

Download the Income Dataset

We can download the Adult dataset from Kaggle by following the link below:

Adult income dataset
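
Once the file is downloaded, loading it could look roughly like the sketch below. The helper name `load_income_data`, the column names, and the sample rows are made up for illustration, and treating `?` as the missing-value marker is an assumption about the file, so check it against the actual download. An in-memory sample stands in for the real CSV so the snippet runs on its own.

```python
import io

import pandas as pd


def load_income_data(source):
    """Read the income CSV from a file path or a file-like object."""
    # skipinitialspace trims the blank that often follows each comma.
    # na_values="?" is an assumption about how missing values are encoded.
    return pd.read_csv(source, skipinitialspace=True, na_values="?")


# Tiny made-up sample standing in for the downloaded file.
sample_csv = io.StringIO(
    "age,workclass,education,income\n"
    "39, State-gov, Bachelors, <=50K\n"
    "50, ?, HS-grad, >50K\n"
)

df = load_income_data(sample_csv)
print(df.shape)         # number of rows and columns
print(df.isna().sum())  # missing values per column
```

With the real file, the same function would be called with the path to the downloaded CSV instead of `sample_csv`.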

About Income Dataset

An individual’s annual income results from various factors. Intuitively, it is influenced by the individual’s education level, age, gender, occupation, and so on.

This is a widely cited KNN dataset. I encountered it during my course, and I wish to share it here because it is a good starter example for data pre-processing and machine learning practices.

Fields

The dataset contains 16 columns.

Target field: Income. The income is divided into two classes: <=50K and >50K.

Number of attributes: 14. These are the demographics and other features that describe a person.
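
A quick way to confirm the column count and see how the two income classes are balanced is sketched below. The rows are made up, and the column name `income` and the new column `income_gt_50k` are assumptions for illustration; the real file may name the target differently.

```python
import pandas as pd

# Made-up rows; the real dataset has many more rows and columns.
df = pd.DataFrame(
    {
        "age": [25, 38, 52, 44],
        "income": ["<=50K", "<=50K", ">50K", "<=50K"],
    }
)

# Row and column counts, to compare against the dataset description.
n_rows, n_cols = df.shape

# Share of each target class; a strong imbalance matters for modelling later.
class_share = df["income"].value_counts(normalize=True)

# Binary version of the target: 1 for >50K, 0 for <=50K.
df["income_gt_50k"] = (df["income"] == ">50K").astype(int)

print(n_rows, n_cols)
print(class_share)
```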

Attribute Information:

  1. age: continuous.
  2. workclass: Private, Self-emp-not-inc, Self-emp-inc, Federal-gov, Local-gov, State-gov, Without-pay, Never-worked.
  3. fnlwgt…
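
The attribute list mixes continuous columns with categorical ones, and it can help to separate the two groups before analysis. The sketch below uses illustrative rows and uses the pandas numeric dtype as a stand-in for "continuous", which is an assumption that should be checked column by column on the real data.

```python
import pandas as pd

# Illustrative rows using three of the column names from the list above.
df = pd.DataFrame(
    {
        "age": [39, 50, 38],
        "workclass": ["State-gov", "Self-emp-not-inc", "Private"],
        "fnlwgt": [77516, 83311, 215646],
    }
)

# Columns pandas parsed as numbers vs. everything else.
continuous_cols = df.select_dtypes(include="number").columns.tolist()
categorical_cols = df.select_dtypes(exclude="number").columns.tolist()

# Distinct categories observed in one categorical column.
workclass_levels = sorted(df["workclass"].unique())

print(continuous_cols)
print(categorical_cols)
print(workclass_levels)
```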

