Site icon R-bloggers

How I Taught Scientific Blogging with R Markdown, Online

[This article was first published on Maëlle's R blog on Maëlle Salmon's personal website, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

Last week I had the pleasure to lead an online course about “Scientific Blogging with R Markdown”, invited by Najko Jahn and Anne Hobert from SUB Göttingen. To follow the example set by the incredible Alison Hill, I’ll write a summary of what I’ve learnt and would like to do better next time.

The topic

The topic of the course was “Scientific Blogging with R Markdown”. For months I would sometimes write down some ideas, from “present distill” to “show web developer console”, that I had whilst reading things online. I was already interested in topics related to R Markdown blogging and website development, but having the course on my agenda made me pay even more attention. I did my note taking in a physical notebook and then I set up a GitHub repo to use its issue tracker.

A bit later I had to start drafting an actual schedule, and decided I would try and not recommend a single tool, but instead show three ways of blogging with R Markdown: using the distill package whose goal is scientific and technical writing; using Hugo since that’s what I use at work; using WordPress since this comes up on Twitter every so often and seems like a good alternative workflow for when one does not use Git yet. I thought it might help each attendee see a tool that suit them better, and that it might mean everyone would learn something. I suppose it also helped reduce my responsability if folks ended up not liking their tool, since I did not recommend a single one. ????

The course summary I wrote was

Are you an R user who works in science? Would you like sharing more online? How about starting a new blog, with R Markdown (Rmd)?

In this 2-hour course with live coding, we’ll go on three short adventures:

We’ll prepare for the adventures by defining what we expect of an Rmd blog. We’ll end the course by reflecting on each adventure as well as mentioning important future paths such as how to promote your blog. You should leave the course ready to start a scientific R Markdown blog with your tool of choice, and knowing where to find more resources and help.

The practical plan

When I first said yes to giving the course a while ago, it was supposed to happen in person in Göttingen in Germany, but unsurprisingly that plan changed a few months ago. Online it would be! With Zoom. Anne Hobert and Najko Jahn, who organized the workshop and taught basic R Markdown on the first day, were technical helpers on the second day.

My course was to take place on the second day, in two hours. In the afternoon attendees would have time to work together in breakout rooms or on their own, and I stayed online for questions, as did Anne and Najko.

Anne had the idea to open my course to more attendees than the workshop first day i.e. open it to 10 attendees outside of the funding stream, bringing the maximum to 20 attendees. We decided to advertise it via R-Ladies communication channels (telling R-Ladies could share the info with their friends and colleagues from under-represented groups), and Anne gave spots on a first come first serve basis.

There was an online pad associated with the course, where attendees could write who they were (so cool to see that) and their questions. We had a short call at the beginning of the week to ensure we could hear each other. The three of us read The Carpentries guidance for teaching online and e.g. used the idea to ask attendees to write “hand” and “hand helper” in the Zoom chat if they wanted to ask a question to the instructor (me) or a helper.

A course website

Since it was a course about blogging, setting up a website seemed natural. It was also an excellent way to productively procrastinate if I’m being honest, but in the end I really liked filling and working with the course website so I’d say it was time well spent?

I used two Hugo themes,

Slides for each section are listed in the menu and opened in a new tab (thanks to a custom menu layout, compared to the original Hugo learn theme).

Some Markdown content was generated with R Markdown, using hugodown.

The website is deployed by Netlify.

Slides could be printed to PDF using Decktape which I had done in a concept but I did not pursue it further.

Why use Hugo for both the website and slidedecks, and not, say Hugo+hugodown for pages and xaringan for slides? This way the source of slides is html produced by Hugo from Markdown content. It allowed me to use:

Also, because slides are in the content, they are indexed by the Hugo learn theme so searchable!

I learnt a few Hugo tricks whilst setting up the website which was fun per se, and I really liked the end product, as mentioned above. Two highlights for me were

The website also had questions for topics not mentioned during the course, where I e.g. stuffed the content from the notes I had taken.

The newest tools ????

In two parts of the course I actually demo-ed packages in development which is not optimal, but I had excuses each time.

I hope my different warning signs will have helped the learners see what’s stable vs what’s not.

What we did

During the course two hours, I shared my whole desktop with the attendees1. I had several slidedecks integrated in my website.

The slidedecks related to the three tools had a slide telling it was time for a demo, and then a few slides ending with a break countdown (I had to remove the last break). For the demos, I had printed my notes after knitting them to PDF. The demos were my doing website things in RStudio and Firefox. Compared to the demos on the website, for hugodown I only showed create_site_academic(), not how to make another theme hugodown-compatible.

In the end I took time to present my decks about reproducibility and about interactions with readers (including blog promotion and analytics).

It was a bit intense, which I need to think about if I teach this again.

In the few hours after my course, some attendees remained in breakout rooms (I got one question I think, about a Pandoc installation issue) and two of them showed their experiments in the final sharing session (where there were more than the two still online, although some people understandably had to leave).

Questions I got

And my tentative answers.

(about distill) Is there a difference between knitting and using the build tab?

I think at that point I realized I’d need to use distill more regularly to differentiate the different parts and the magic happening from RStudio.

Apparently from RStudio the website gets built when one edits the site configuration, and when one knits a post. You need to knit posts; and if you edit about.Rmd you need to either knit it or re-build the website. My impression is that you need to knit with intent, but most often the build part happens magically.

What is the difference (pros/cons) between hugo and distill?

I mentioned it a bit, distill is perfect for scientific/technical blogging but not very flexible. Hugo is very flexible which might be a curse. ???? With Hugo you can really build any website: for this course website I mixed two Hugo themes, with a few custom layouts for instance (so even the slides used hugodown). But it requires time for learning. You could try both before committing, and you could still switch tools one day even if that’d take time.

Is it possible to have one citation style file for all posts/folders with Hugo or does it need to be in the same folder as the post?

So for both distill and hugodown, it’s not possible yet but at least you can store .bib at the root and refer to it as ../../../refs.bib or so in your post YAML.

How to limit the readership of your blog – if i want my website to only be viewable to the people in my company?

For WordPress websites having private posts seem to be built-in.

For Hugo and distill it depends on how you deploy. I.e. from https://hugodown.r-lib.org/articles/deploy.html if you choose say Amazon S3 you could look for “Amazon S3 private website”. Such a feature might not be in the free tier of the service.

Or you could deploy to a server that your team has?

Regarding reproducibility: Do you have any thoughts on how to share your data belonging to the code in such a blogpost?

You could distribute it with your Hugo website (in the post folder or from static – I haven’t tried), otherwise sharing it in a GitHub repo works; but better for bigger datasets would be to use Figshare/Zenodo/etc.

Using utterance.es, you have some sort of notification if someone comments on your post?

Yes, same as other GitHub notifications (so email or web depending on your settings).

Is there a way to collaborate on the same blog?

Yes definitely. At rOpenSci we use GitHub for that (we even have a guide, blogguide.ropensci.org), you could use GitLab too. Adding posts by pull requests is nice because of the pull requests review infrastructure. Without a git platform I think it might be less natural to edit collaboratively but you could still find an alternative workflow.

Is it a good idea to use the distill format for the lab where you work instead of being a personal blog?

I’ve seen distill for groups, I think it works well for that too, if your colleagues are ok editing from R / letting you edit things.

Do you know cheap hosting services?

That question was a good reminder of my privilege: for me a custom domain costing a few dollars a month is not expensive. For others, especially in some countries, it can be a luxury. In that case, using GitHub pages, free Netlify domains, etc. is a good workaround and better than having no website.

Comment by Anne: GDPR, privacy page, in Germany imprint.

I shortly mentioned one should add a privacy page mentioning what data is collected, and Anne Hobert reminded attendees that in Germany all websites need to have a page called Impressum/imprint.

Possible improvements

I might get even more ideas from participants2 but here are a few ones:

A big limitation of the format, and the way I taught (giving a few recipes but not ensuring attendees live with a website), is that I can’t know attendees will actually go and build a website. Or maybe I’ll know, in a few weeks, if/when I receive URLs. I think that with websites, there’s a need for technical knowledge for sure, but maybe also some sort of accountability/support group.

Thank you

I’d like to thank Najko Jahn and Anne Hobert for trusting me to give this course and for being such helpful and nice collaborators. Thanks to the participants, I’m looking forward to seeing new blogs but no pressure. ???? My course website has a credits page where I’m sure I forgot names. ????

Next time?

I’d love to teach the topic again, re-using and building upon my materials. If you’re the organizer of a meetup for under-represented R users, get in touch to see if my skills and availabilities can be a fit for your group. If you’re organizing something else, let’s talk business i.e. in that case I wouldn’t teach for free. ???? In all cases, I’m looking forward to keep learning new things myself.

If you taught or attended a workshop about blogging and R Markdown, feel free to share your own insights and good ideas. ????

< section class="footnotes" role="doc-endnotes">
  1. Sharing my whole screen makes me quite nervous. I had written a list of things to do before hand such as making sure Slack and Mattermost were closed and wouldn’t send any notification. ↩︎

  2. Participants were sent a pin.up link to add feedback on sticky notes à la Carpentries. ↩︎

To leave a comment for the author, please follow the link and comment on their blog: Maëlle's R blog on Maëlle Salmon's personal website.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.