📊 📈 📉
Survey Data Analysis in R: A Crash Course
📖 Course Description
Are you looking for a resource that helps you quickly learn survey data analysis in R?
This goal-oriented crash course discusses the 15 most widely used tasks for survey data analysis in R.
This course differs from most of the publicly available resources in the following ways:
1️⃣ Utilizes real-world survey datasets, not fictional datasets
2️⃣ Focuses on specific tasks, not generic skills
3️⃣ Streamlines learning with just three packages: dplyr, ggplot2, and fixest
4️⃣ Prioritizes quick acquisition of practical skills
🪔 Learning Objectives
By completing this course, you should be able to:
💥 Begin survey data analysis in R for your research purpose
💥 Apply many of the lessons learned from this course in contexts beyond survey data analysis
💥 Tidy up your pre-existing knowledge of coding in R
💪🏼 Prerequisites
There are no prerequisites for this course. However, familiarity with some basic operators in R is desirable. I describe them here.
All the materials are provided in English. Video lectures include English transcripts, which can be translated into other languages (on YouTube).
🎞️ Lecture Videos
📋 Course introduction [Video Link]
Module 1
1️⃣ Importing a .csv file directly from the web [Video Link]
2️⃣ Creating a subset (Selecting specific rows) [Video Link]
3️⃣ Selecting specific columns [Video Link]
4️⃣ Creating a binary (a.k.a. dummy) variable [Video Link]
5️⃣ Renaming the levels of a categorical variable [Video Link]
6️⃣ Creating a new categorical variable [Video Link]
7️⃣ Creating a summary statistics table [Video Link]
8️⃣ Creating a barplot [Video Link]
💥 Extra Lesson: Creating a grouped/stacked barplot [Video Link]
This module uses data from the National Financial Well-Being Survey conducted by the Consumer Financial Protection Bureau
The written version of module 1 is available here.
Module 2
1️⃣ Importing a .csv file from your local drive [Video Link]
2️⃣ Merging multiple waves of a survey by common columns [Video Link]
3️⃣ Creating a line chart [Video Link]
4️⃣ Saving graphs and tables [Video Link]
5️⃣ Estimating an empirical model using linear regression [Video Link]
💥 Extra Lesson: Creating a grouped line chart [Video Link]
This module uses data from the Survey of Household Economics and Decisionmaking conducted by the Federal Reserve Board
The written version of module 2 is available here.
Module 3
1️⃣ Merging multiple segments of a survey by common columns (e.g., respondent IDs) [Video Link]
2️⃣ Summarizing a continuous variable [Video Link]
This module uses data from the American Time Use Survey conducted by the U.S. Bureau of Labor Statistics
The written version of module 3 is available here.
🗃️ Course Resources
This course does not require any textbooks.
You can find more content related to survey data analysis on my Medium page.
👩🏽💻 Software
You should have RStudio (Posit) installed on your computer. Alternatively, you can use the cloud version of RStudio.
To record the lecture videos, I used OBS Studio, which is a free and open source software for video recording and live streaming.
📩 Contact
If you have any questions/ concerns/ suggestions regarding any of the topics discussed in this crash course, feel free to reach out to me!