📊 📈 📉
Survey Data Analysis in R: A Crash Course
📖 Course Description
Are you looking for a resource that helps you quickly learn survey data analysis in R?
This goal-oriented crash course discusses the 15 most widely used tasks for survey data analysis in R.
This course differs from most of the publicly available resources in the following ways:
1️⃣ Utilizes real-world survey datasets, not fictional datasets
2️⃣ Focuses on specific tasks, not generic skills
3️⃣ Streamlines learning with just three packages: dplyr, ggplot2, and fixest
4️⃣ Prioritizes quick acquisition of practical skills
🪔 Learning Objectives
By completing this course, you should be able to:
💥 Begin survey data analysis in R for your research purpose
💥 Apply many of the lessons learned from this course in contexts beyond survey data analysis
💥 Tidy up your pre-existing knowledge of coding in R
There are no prerequisites for this course. However, familiarity with some basic operators in R is desirable. I describe them here.
All the materials are provided in English. Video lectures include English transcripts, which can be translated into other languages (on YouTube).
🎞️ Lecture Videos
📋 Course introduction [Video Link]
1️⃣ Importing a .csv file directly from the web [Video Link]
2️⃣ Creating a subset (Selecting specific rows) [Video Link]
3️⃣ Selecting specific columns [Video Link]
4️⃣ Creating a binary (a.k.a. dummy) variable [Video Link]
5️⃣ Renaming the levels of a categorical variable [Video Link]
6️⃣ Creating a new categorical variable [Video Link]
7️⃣ Creating a summary statistics table [Video Link]
8️⃣ Creating a barplot [Video Link]
💥 Extra Lesson: Creating a grouped/stacked barplot [Video Link]
This module uses data from the National Financial Well-Being Survey conducted by the Consumer Financial Protection Bureau
The written version of module 1 is available here.
1️⃣ Importing a .csv file from your local drive [Video Link]
2️⃣ Merging multiple waves of a survey by common columns [Video Link]
3️⃣ Creating a line chart [Video Link]
4️⃣ Saving graphs and tables [Video Link]
5️⃣ Estimating an empirical model using linear regression [Video Link]
💥 Extra Lesson: Creating a grouped line chart [Video Link]
This module uses data from the Survey of Household Economics and Decisionmaking conducted by the Federal Reserve Board
The written version of module 2 is available here.
1️⃣ Merging multiple segments of a survey by common columns (e.g., respondent IDs) [Video Link]
2️⃣ Summarizing a continuous variable [Video Link]
This module uses data from the American Time Use Survey conducted by the U.S. Bureau of Labor Statistics
The written version of module 3 is available here.
🗃️ Course Resources
This course does not require any textbooks.
You can find more content related to survey data analysis on my Medium page.
You should have RStudio (Posit) installed on your computer. Alternatively, you can use the cloud version of RStudio.
To record the lecture videos, I used OBS Studio, which is a free and open source software for video recording and live streaming.
If you have any questions/ concerns/ suggestions regarding any of the topics discussed in this crash course, feel free to reach out to me!