The useR! 2016 Tutorials

By Joseph Rickert

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

by Joseph Rickert

Over the years I have seen several excellent tutorials at useR!conferences that were not only very satisfying “you had to be there” experiences but were also backed up with meticulously prepared materials of lasting value. This year, quite a few useR!20i6 tutorials measure up to this level of quality. My take on why things turned out this way is that GitHub, Markdown, and Jupyter notebooks have been universally adopted as workshop / tutorial creation tools, and that having the right tools encourages creativity and draws out one’s best efforts.

Jenny Bryan’s tutorial Using Git and GitHub with R, Rstudio, and R Markdown and the tutorial by Andrie de Vries and Micheleen Harris: Using R with Jupyter notebooks for reproducible research are two superb, Escheresque self-referencing examples of what I am talking about. Bryan’s tutorial which uses GitHub and R Markdown to teach GitHub and R Markdown is an impressive introduction to these two essential resources. And, the tutorial by de Vries and Harris makes very effective use of GitHub and Jupyter Notebooks. Moreover, this tutorial sets the gold standard for how to set up a system for interactive user participation. Harris and de Vries staged their tutorial on Microsoft’s Azure Data Science VM. The Linux version of this VM comes provisioned with JupyterHub, a set of processes that enables a multi-user Jupyter Notebook server. Once the VM is loaded with the training materials, its only a matter of giving students a username and password to grant them immediate access to the interactive workshop materials. Have a look at notebook 06 to see how to set all of this up.

After seeing this, and comparing it to other tutorials where instructors wasted the better part of an hour trying to get students up and running with local copies of their course materials I can’t see why everyone wouldn’t opt for a cloud solution to this problem. When word gets out, the Data Science VM is going to be the standard for delivering technical workshops.

Unfortunately, I couldn’t get around to see all of the tutorials, but two more that I can heartily recommend are MoRe than woRds, Text and Context: Language Analytics in Finance with R, the introduction to text mining by Sanjiv Das and Karthik Mokashi and Machine Learning Algorithmic Deep Dive by Erin Ledell. Sanjiv Das is an inspired educator and I have never seen a presentation either at R/Finance, useR! or even BARUG, the Bay Area useR Group where he wasn’t on his game and super prepared. The tutorial he and Karthik gave this year at useR!2016 is a self-contained course in text mining.
Erin Ledell also came prepared with more tutorial material then she could ever present in three hours. But because of her thoughtful use of GitHub, Markdown and notebooks we have a machine learning resource that is well worth studying. Just being introduced to this incredible visualization of decision trees by Tony Chu and Stephanie Yee made my day.

Ledell is also a gifted teacher who anticipates where here audience may have have difficulties. Her historical approach to understanding gradient boosting machines provides an opportunity to clarify the differences between various versions of the boosting algorithms. Sometimes understanding how something came to be is halfway towards understanding how it works.

The bar for presenting lectures, tutorials and workshops has been set pretty high. Anyone who is serious about delivering a high quality education probably needs to develop some skills with GitHub, Markdown and Notebooks. Studying the tutorial materials from useR! 2016 is a good place to start.

To leave a comment for the author, please follow the link and comment on their blog: Revolutions.

R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more…

Source:: R News

Leave a Reply

Your email address will not be published. Required fields are marked *

Time limit is exhausted. Please reload CAPTCHA.