Guest Blog — August 27, 2020
Beginner Data Exploration Data Visualization Python Structured Data Technique

Introduction

Exploratory Data Analysis

Exploratory Data Analysis(EDA)

– Handle Missing value
– Removing duplicates
– Outlier Treatment
– Normalizing and Scaling( Numerical Variables)
– Encoding Categorical variables( Dummy Variables)
– Bivariate Analysis

Exploratory Data Analysis - Import Libraries
Box-plot after removing outliers

Box-plot after removing outliers

  1. Exploratory Data Analysis - Data Shape

  2. Exploratory Data Analysis - Data Information

    Exploratory Data Analysis - Data Type

  3. Exploratory Data Analysis - Describe

Exploratory Data Analysis - Sum

Image for postExploratory Data Analysis - Impute Missing values

Exploratory Data Analysis - Impute Missing Values

Image for post

Handling Duplicate records

Image for post

Image for post

Image for post

Handling Outlier

Image for post

Box-plot before removing outliers

Image for post

Box-plot after removing outliers

Bivariate Analysis

  1. Two Categorical Variables

    1. Bar chart
    2. Grouped bar chart
    3. Point plot

Image for post

Correlation between all the variables

Normalizing and Scaling

Image for post

Image for post

ENCODING

Image for post

Image for post

Image for post

About the Author

Ritika Singh – Data Scientist

I am a Data scientist by profession and a Blogger by passion. I have been working on machine learning projects for more than 2 years. Here you will find articles on “Machine Learning, Statistics, Deep Learning, NLP and Artificial Intelligence”.

About the Author

Our Top Authors

  • Analytics Vidhya
  • Guest Blog
  • Tavish Srivastava
  • Aishwarya Singh
  • Aniruddha Bhandari
  • Abhishek Sharma
  • Aarshay Jain

Download Analytics Vidhya App for the Latest blog/Article

Leave a Reply Your email address will not be published. Required fields are marked *