by Joseph Rickert
Over the years I have seen several excellent tutorials at useR! conferences that were not only very satisfying “you had to be there” experiences but were also backed up with meticulously prepared materials of lasting value. This year, quite a few useR! 2016 tutorials measure up to this level of quality. My take on why things turned out this way is that GitHub, Markdown, and Jupyter notebooks have been widely adopted as workshop and tutorial creation tools, and that having the right tools encourages creativity and draws out one’s best efforts.
Jenny Bryan’s tutorial Using Git and GitHub with R, RStudio, and R Markdown and the tutorial by Andrie de Vries and Micheleen Harris, Using R with Jupyter notebooks for reproducible research, are two superb, Escheresque self-referencing examples of what I am talking about. Bryan’s tutorial, which uses GitHub and R Markdown to teach GitHub and R Markdown, is an impressive introduction to these two essential resources. The tutorial by de Vries and Harris makes very effective use of GitHub and Jupyter Notebooks. Moreover, this tutorial sets the gold standard for how to set up a system for interactive user participation. Harris and de Vries staged their tutorial on Microsoft’s Azure Data Science VM. The Linux version of this VM comes provisioned with JupyterHub, a set of processes that enables a multi-user Jupyter Notebook server. Once the VM is loaded with the training materials, it’s only a matter of giving students a username and password to grant them immediate access to the interactive workshop materials. Have a look at notebook 06 to see how to set all of this up.
After seeing this, and comparing it to other tutorials where instructors wasted the better part of an hour trying to get students up and running with local copies of their course materials, I can’t see why everyone wouldn’t opt for a cloud solution to this problem. When word gets out, the Data Science VM is going to be the standard for delivering technical workshops.
Ledell is also a gifted teacher who anticipates where her audience may have difficulties. Her historical approach to understanding gradient boosting machines provides an opportunity to clarify the differences between various versions of the boosting algorithms. Sometimes understanding how something came to be is halfway towards understanding how it works.
The bar for presenting lectures, tutorials, and workshops has been set pretty high. Anyone who is serious about delivering a high-quality education probably needs to develop some skills with GitHub, Markdown, and notebooks. Studying the tutorial materials from useR! 2016 is a good place to start.
Source: R News