VIVEKANANDA DAS
  • Curriculum Vitae
  • Research
  • Data Projects
  • Teaching
  • Medium Articles
  • বাংলা
📊 📈 📉​ 

Survey Data Analysis in R: A Crash Course 

📖   Course Description

​This goal-oriented crash course discusses some of the   most widely used tasks for survey data analysis in R.

This course differs from most of the publicly available resources in the following ways:

1️⃣  Utilizes real-world survey datasets, not fictional datasets

2️⃣  Focuses on specific tasks, not generic skills

3️⃣  Streamlines learning with just three packages:   
dplyr,   ggplot2, and    fixest

4️⃣​  Prioritizes quick acquisition of practical skills



​🪔   Learning Objectives

By completing this course, you should be able to: 


💥   Begin survey data analysis in R for your research purpose

​💥   Apply many of the lessons learned from this course in contexts beyond survey data analysis

💥   Tidy up your pre-existing knowledge of coding in R
​

💪🏼   Prerequisites

There are no prerequisites for this course.   However, familiarity with some basic operators in R is desirable. I describe them   here. 

All the materials are provided in English. Video lectures include English transcripts, which can be translated into other languages (on YouTube).


🎞️   Lecture Videos

📋   Course introduction [Video Link]

Module 1


1️⃣   Importing a .csv file directly from the web [Video  Link]

2️⃣   Creating a subset (Selecting specific rows)   [Video  Link]

3️⃣   Selecting specific columns   [Video  Link]

4️⃣​   Creating a binary (a.k.a. dummy) variable   [Video  Link]

5️⃣   Renaming the levels of a categorical variable [Video Link]

6️⃣   Creating a new categorical variable [Video Link]

7️⃣   Creating a summary statistics table [Video Link]

8️⃣   Creating a barplot [Video Link]

💥   Extra Lesson:   Creating a grouped/stacked barplot [Video Link]

​This module uses data from the   National Financial Well-Being Survey   conducted by the   Consumer Financial Protection Bureau .

The written version of module 1 is available   here.


Module 2

1️⃣   Importing a .csv file from your local drive [Video Link] 

2️⃣   Merging multiple waves of a survey by common columns [Video Link]

3️⃣   Creating a line chart [Video Link] 

4️⃣   Saving graphs and tables    [Video Link]

5️⃣   Estimating an empirical model using linear regression [Video Link]

💥   Extra Lesson:   Creating a grouped line chart    [Video Link]

​This module uses data from the   Survey of Household Economics and Decisionmaking   conducted by the    Federal Reserve Board.

The written version of module 2 is available here.


Module 3

1️⃣   Merging multiple segments of a survey by common columns   (e.g., respondent IDs) [Video Link] 

2️⃣   Summarizing a continuous variable [Video Link]

​This module uses data from the    American Time Use Survey    conducted by the   U.S. Bureau of Labor Statistics.

The written version of module 3 is available here.



🗃️​   Course Resources

This course does not require any textbooks. 

You can find more content related to survey data analysis on my Medium page.


👩🏽‍💻   Software

You should have RStudio (Posit) installed on your computer. Alternatively, you can use the cloud version of RStudio.

To record the lecture videos, I used OBS Studio, which is a   free and open source software for video recording and live streaming.



📩   Contact

If you have any questions/ concerns/ suggestions regarding any of the topics discussed in this crash course, feel free to reach out to me!
Submit
© 2025 | Vivekananda Das
  • Curriculum Vitae
  • Research
  • Data Projects
  • Teaching
  • Medium Articles
  • বাংলা