Exploratory Data Visualization in R
Estimated Time Commitment: 3 Hours
In this skill, you will be introduced to a powerful visualization package in ggplot2 to create useful visualizations for exploratory data analysis (EDA).
We will start with a discussion on the role of EDA in the overall data analysis. Then, we will discuss basic types of charts/ plots for univariate, bivariate, and multivariate exploration. We will also illustrate how to create these visualizations in R using ggplot2 package. We will create various charts such as Histogram, Boxplot, Bar-chart, Scatter Plot, Grouped Boxplots, Grouped Histogram, etc.
In the end, we will introduce you to a useful GUI tool that can help you learn ggplot2, if you find ggplot2 codes overwhelming. The tool will allow you to first create charts and then inspect and save the code.
Upon successful completion, you will be able to:
Understand why and how to perform Exploratory Data Analysis (EDA)
Appreciate the type of variables and their explorations
Perform basic functions of ggplot2 package for creating nice visualizations in R
Understand how to explore relationship between two or more variables
Introduction Video
Introduction to the Skill
Glossary
What to Explore in the Data?
Univariate and Bivariate Exploration
Knowledge Check 1
Introduction to the Ggplot2 Package
Introduction to Ggplot Syntax
Univariate Exploration of a Numeric Variable
Univariate Exploration of a Categorical Variable
Knowledge Check 2
Visualizing Relationship between Two Numeric Variables
Visualizing Relationship between a Numeric and a Categorical Variable
Visualizing Relationship between Two Categorical Variables
Knowledge Check 3
Multidimensional Charts
Introduction to Ggplot Builder
Knowledge Check 4
Instructions
Exercise Files
Debriefing
Conclusion Video
Final Quiz
Survey Instructions
Feedback Survey
Survey Verification
Next Steps