Introduction to Data Science in the Tidyverse Workshop
1/27/20-1/28/20
9:00 AM-5:00 PM
2-Day Workshop
Hadley Wickham
Chief Scientist
RStudio
Amelia McNamara
Assistant Professor of Computer & Information Sciences
University of St. Thomas
This is a two-day, hands-on workshop designed for people who are brand new to R & RStudio and who learn best in person.
You will learn the basics of R and data science and practice using the RStudio IDE (integrated development environment). We'll cover much of the material from the book R for Data Science, including data visualization (ggplot2), data transformation and tidying (dplyr, tidyr), working with special data types (stringr, forcats, lubridate), and modeling (broom). Throughout the workshop, we'll work in R Markdown documents and learn best practices for data computing.
If you want to transition from coding in base R to the tidyverse, or to jump straight into data science in the tidyverse without any prior R experience, this is the workshop for you! A team of TAs will be on hand to show you the ropes and help you out when you get stuck.
To know whether this workshop is right for you, consider these questions:
- You have a dataset of diamond prices and sizes. Could you make a scatterplot of the two variables using ggplot2?
- You have two datasets, one with information on music genres and age ranges, the other with genres and radio station call names. Can you imagine how you would join them together with a dplyr verb?
- We want to model the wages of people in the United States, using their height and education as predictors. Then, we would like to plot model predictions for each level of educational attainment. Can you imagine how to do this in R?
If you answered "no" to any or all of those questions... great! This workshop is for you. By the end of the two days, you should be able to accomplish all those tasks. If you answered "yes" to all three questions, you may want to consider taking a different workshop.