View on GitHub

Data Wrangling with R

QCL Workshop Series, Claremont McKenna College

Download this project as a .zip file Download this project as a tar.gz file

License: CC BY 4.0

Workshop Syllabus

QCL Literacy: Level 2 - Data, Progamming

Instructor: Jeho Park, Director of the Quantitative and Computing Lab at Claremont McKenna College

Summary

Data wrangling is the process of obtaining, cleaning, reshaping, and transforming raw (and messy) data into a useable form of processed (and tidy) data. It is known that data analysists and data scientists spend 80% of their time (or more) on data wrangling. So, it’s essential to get familiar with good data wrangling skills and tools that help you save time and avoid errors. In this hands-on workshop, you will learn basic skills to import, export, clean, reshape, transform, and visualize data using well-known data science package called tidyverse, especially dplyr and ggplot2.

Learning Objectives: (You will learn how to)

Prerequisites

Basic knowledge of R and RStudio (e.g., R Programming for Beginners - Level 1)

RStudio Cloud account; if you don’t have one yet, please create a new account from https://rstudio.cloud site.

Tidyverse package; please make sure that you have installed the tidyverse package in your R environment. See https://www.tidyverse.org for more information.