Executive Editor, Data & Analytics,
Even if you attended RStudio’s pre-conference two-day training last month, you could only attend one workshop—and there were more than half a dozen. Now, though, many materials including slides and R code are available free online. Here’s how to get them.
Most of the code and slides have been posted on GitHub. If you don’t have git version control set up on your system, you can download a zipped file of any repository. But git and GitHub do make it easier and more elegant. Check out episode 33 of Do More with R below if you’d like to learn about git and GitHub in RStudio:
Instructor Rob J. Hyndman, professor statistics at Monash University, literally wrote the book on time series forecasting in R — not to mention the R forecast package. I was torn between attending this one and the machine-learning workshop I ended up taking. Happily, even though it’s not quite as good as being in a classroom in person, the written materials and code are online.
The GitHub repository is at https://github.com/rstudio-conf-2020/time-series-forecasting and his Forecasting Principles and Practice textbook is free online at https://otexts.com/fpp3/.
“You will learn to read, manipulate, and visualize spatial data and you’ll be introduced to functionality that will have you saying, ‘I didn’t know you could do that in R!’” touts this workshop’s overview. This is another one I wish I could have attended.
This class featured the sf, tmap, mapview, raster, and dplyr packages.
Most of the workshop information is not on GitHub directly, but there is a basic repo at https://github.com/rstudio-conf-2020/geospatial with instructions on how to download the rest.
There were two workshops on machine learning this year: an introduction to the still-evolving tidymodels machine learning package ecosystem and a more advanced session with Max Kuhn, creator of the well-known caret package.
Max Kuhn’s session has a website at https://rstudio-conf-2020.github.io/applied-ml/README.html. Toward the top there are links to see parts 1 through 6 separately. There is also a GitHub repo.
Check out the robust GitHub repo which includes a number of R Markdown notebooks with code and explanations as well as links to slides and data. This was taught by Brad Boehmke, director of data science at 84.51°.
Julia Silge, co-author of Text Mining with R, led this workshop. Her slides are at http://bit.ly/silge-rstudioconf-1 (Day 1) and bit.ly/silge-rstudioconf-2 (Day 2). The GitHub repo at https://github.com/rstudio-conf-2020/text-mining includes slides and R Markdown documents with code.
This workshop, taught by RStudio engineer James Blair, focused on using dplyr with data.table, databases, and Spark for large-scale data. It also used the vroom, dtplyr, and DBI packages.
The GitHub repo at https://github.com/rstudio-conf-2020/big-data includes an intro, slides, and workbook directory with R Markdown documents. The workshop exercises and code are also available as on online book at https://rstudio-conf-2020.github.io/big-data/introduction-to-vroom.html.
If you’ve wanted to learn the Shiny R interactive web framework — or if you’ve worked with it but wanted to up your game — Macalester College professor Danny Kaplan’s Shiny workshop GitHub repository features slides and project code. You can also clone the project with a free RStudio Cloud account at https://rstudio.cloud/project/865256.
This two-day workshop by Yihui Xie (creator of numerous R packages including knitr and DT and the co-author of Shiny, R Markdown, and leaflet) and RStudio education director Carl Howe was aimed at helping attendees create powerful interactive documents and dashboards.
The objectives, according to the workshop description, included the following:
The workshop GitHub repo at https://github.com/rstudio-conf-2020/rmarkdown-dashboard includes a materials directory with slides, exercises, cheat sheets, and more.
It sounds like an introductory workshop, but this was actually “designed for experienced R and RStudio users who want to (re)design their R lifestyle,” according to the session overview. “You’ll learn holistic workflows that address the most common sources of friction in data analysis. We’ll work on project-oriented workflows, version control for data science (Git/GitHub), and how to plan for collaboration, communication, and iteration (including R Markdown).” Instructors Kara Woo, Jenny Bryan, and Jim Hester are all well-known in the tidyverse world.
Find the GitHub repository at https://github.com/rstudio-conf-2020/what-they-forgot and “the one true URL that links to everything!” at https://rstd.io/wtf-2020-rsc.
Taught by Charlotte Wickham and Hadley Wickham, this workshop was aimed at “those who have embraced the tidyverse and now want to expand it to meet their own needs,” according to the workshop overview. It discusses API design, functional programming tools, the basics of object design in Amazon S3, and the tidy eval system for non-standard evaluation.
There is a GitHub repo with slides, R Markdown documents, and more.
This workshop covered “basic principles behind effective data visualizations” as well as learning how to build good graphics with ggplot2. It was taught by Duke University professor Kieran Healy, author of Data Visualization: A Practical Introduction. The workshop repo is at https://github.com/rstudio-conf-2020/dataviz.
If you’re interested in creating packages at your workplace for “easier data access, shared functions for data transformation and analysis, and a common look and feel for reporting,” you may want to check out this workshop materials by software engineer Rich Iannone and R developer and Ph.D. student Malcolm Barrett.
You can find the GitHub repo at https://github.com/rstudio-conf-2020/my-org-first-pkg.
R for Excel Users was, not surprisingly, a workshop aimed at power Excel users who want to start incorporating R into their workflow.
And Introduction to Data Science in the Tidyverse, taught by Hadley Wickham and Amelia McNamara, was a “two-day, hands-on workshop designed for people who are brand new to R and RStudio.”
This story, “https://www.itnewsug.com/wp-content/uploads/2020/02/how20to20get20rstudio20conference20202020workshop20materials20free20online.html” was originally published by
Sharon Machlis is Executive Editor, Data & Analytics at IDG, where she works on data analysis and in-house editor tools in addition to writing and editing. Her book Practical R for Mass Communication and Journalism was published in December 2018.
Copyright © 2020 IDG Communications, Inc.