Descriptive Statistics
Welcome to the Descriptive Statistics module! In this module, you will learn about:
- Summarizing data: How to calculate summary statistics for numeric data
- Frequency tables: How to summarize categorical data
- Summarizing data with tidyverse: How to calculate summary statistics for numeric data using tidyverse
- Frequency tables with tidyverse: How to summarize categorical data using tidyverse
- pivoting tables: How to reshape and summarize data
Each tutorial will comprise a series of interactive lessons with practice problems embedded throughout.
Learning Goals
After completing this module, you will be able to:
Summarizing Data
- Calculate the mean, the standard deviation and the range of a variable
- Create summary statistics using the
summary()function - Use the
describe()function - Account for missing values in your data when calculating summary statistics
Frequency Tables
- Make frequency tables in
R - Make proportion and cumulative proportion tables
- Make readable tables including various frequency measures
- Making cross tables between two variables
Summarizing Data with the Tidyverse
- Use
summarise()to compute descriptive statistics in a tidy, pipeline-based workflow - Apply summary functions to multiple variables at once using
across() - Create grouped summaries with
group_by() - Compute counts, proportions, and missingness with tidyverse tools
Frequency Tables with the Tidyverse
- Use
count()to generate tidy frequency tables - Add proportions and cumulative proportions using
mutate() - Create grouped or conditional frequency tables
Pivoting tables
- Understand when long or wide format is most appropriate for descriptive analyses
- Use
pivot_longer()to transform repeated measures into tidy long format - Use
pivot_wider()to restructure summaries or prepare matrix-like data - Control naming conventions and handle missing values during pivoting
Click the button below to get started with the first tutorial.