rOpenSci News Digest, October 2021
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Dear rOpenSci friends, it’s time for our monthly news roundup!
You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci!
rOpenSci HQ
Community call about the statistical software review project
A fantastic community call is coming up on Tuesday, 26 October 2021 18:00 UTC: Expanding Software Peer Review: Statistical Package Standards at rOpenSci!
This 1-hour community call will address the bigger picture of how our community-informed development of standards for statistical packages meets a critical need of stakeholders. Noam Ross (EcoHealth Alliance and rOpenSci Software Review Lead) will catch everyone up on the project. Rebecca Killick (Lancaster University and rOpenSci Statistical Software Peer Review advisory committee) will offer insights into standardisation and the potential role our program might play in the future of statistical software and open source software in general. Juliane Manitz (EMD Serono and R Validation Hub) will offer a perspective on the use of open source software in regulated environments. Christoph Sax (cynkra) will share his experience as the first person to submit a package, tsbox, for review and aligning his software with our standards.
Follow the community call page link for practical information. Everyone is welcome, no RSVP needed!
November 2 social coworking and office hours
Join us for monthly coworking Tuesday, November 2nd at 16:00 UTC! No RSVP needed. In addition to the usual quiet and noisy rooms (for Q&A and meeting peers), Mark Pagham, co-Lead of our Statistical Software Peer Review project, will be there to help folks check their own package with our new automated checking system.
rOpenSci at NHS-R conference
Jeroen Ooms, rOpenSci Lead Infrastructure Engineer, will give a talk on The r-universe project at the NHS-R 2021 virtual conference on November 9. Details.
Will Landau, community member, will give a workshop on Reproducible computation at scale in R with targets on November 4.
The conference is free with registration. Conference website.
Find out about more events. You can subscribe to rOpenSci-lead events via our ICS calendar (copy the link, add it to your calendar service e.g. Thunderbird or Google Calendar). Please note that if you use Google Calendar you have to check synchronization is turned on; furthermore, synchronization might be delayed.
Three new editors for software review
We are excited to welcome Emily Riederer, Adam Sparks, and Jeff Hollister to our team of Associate Editors for rOpenSci Software Peer Review. They join Laura DeCicco, Julia Gustavsen, Anna Krystalli, Mauro Lepore, Karthik Ram, Noam Ross, Maëlle Salmon, and Melina Vidoni. Find out more about the three new editors in our introduction blog post.
Software ?
New packages
The following package recently became a part of our software suite:
- rsat, developed by Unai Pérez – Goya together with Manuel Montesino – SanMartin, Ana F Militino, Maria Dolores Ugarte: Downloading, customizing, and processing time series of satellite images for a region of interest. rsat functions allow a unified access to multispectral images from Landsat, MODIS and Sentinel repositories. rsat also offers capabilities for customizing satellite images, such as tile mosaicking, image cropping and new variables computation. Finally, rsat covers the processing, including cloud masking, compositing and gap-filling/smoothing time series of images (Militino et al., 2018 doi:10.3390/rs10030398 and Militino et al., 2019 doi:10.1109/TGRS.2019.2904193). It has been reviewed by Marc Weber, Kelly Hondula.
Discover more packages, read more about Software Peer Review.
New versions
The following nine packages have had an update since the latest newsletter: gert (v1.4.1
), arkdb (v0.0.14
), GSODR (v3.1.4
), lingtypology (v1.1.6
), osmdata (v0.1.7
), rglobi (v0.2.26
), ruODK (v1.3.0
), stplanr (v0.8.4
), USAboundariesData (v0.4.0
).
Software Peer Review
There are fourteen recently closed and active submissions and 4 submissions on hold. Issues are at different stages:
-
One at ‘6/approved’:
- rsat, Tools for Downloading, Customizing, and Processing Time Series of Satellite Images from Landsat, MODIS, and Sentinel. Submitted by Unai Pérez-Goya.
-
Two at ‘5/awaiting-reviewer(s)-response’:
-
excluder, Checks for Exclusion Criteria in Online Data. Submitted by Jeffrey Stevens.
-
mctq, An R Package for the Munich ChronoType Questionnaire. Submitted by Daniel Vartanian.
-
-
Four at ‘4/review(s)-in-awaiting-changes’:
-
gendercoder, Recodes Sex/Gender Descriptions Into A Standard Set. Submitted by Emily Kothe.
-
epair, Grabs data from EPA API, simplifies getting pollutant data. Submitted by Leo Orozco-Mulfinger.
-
healthdatacsv, Access data in the healthdata.gov catalog. Submitted by iecastro.
-
chemspiderapi, R Wrapper for ChemSpider’s API Services. Submitted by Raoul Wolf.
-
-
Three at ‘3/reviewer(s)-assigned’:
-
cffr, Generate Citation File Format (cff) Metadata for R Packages. Submitted by Diego H..
-
ROriginStamp, Interface to OriginStamp API to Obtain Trusted Time Stamps. Submitted by Rainer M Krug.
-
occCite, Querying and Managing Large Biodiversity Occurrence Datasets. Submitted by Hannah Owens.
-
-
Three at ‘2/seeking-reviewer(s)’:
-
tidyqpcr, Quantitative PCR Analysis with the Tidyverse. Submitted by Edward Wallace.
-
tsbox, Class-Agnostic Time Series. Submitted by Christoph Sax. (Stats).
-
phruta, Phylogenetic Reconstruction and Time-dating. Submitted by Cristian Román-Palacios.
-
-
One at ‘1/editor-checks’:
- rdbhapi, Interface to DBH-API. Submitted by Marija Ninic.
Find out more about Software Peer Review and how to get involved.
On the blog
Software Review
- Announcing New Software Peer Review Editors: Emily Riederer, Adam Sparks, and Jeff Hollister by Stefanie Butland. Introducing 3 new editors for rOpenSci software review.
Tech Notes
- Generating and customizing badges in r-universe by Jeroen Ooms.
Use cases
Three use cases of our packages and resources have been reported since we sent the last newsletter.
-
Using jsonvalidate to validate the
packages.json
file from your personal universe. Reported by Hugo Gruson. -
Creating historical Congressional maps with USABoundaries. Reported by Andrew Heiss.
-
Predizendo volume de eucalipto com tidymodels, XGBoost e targets. Reported by Theilon Macêdo.
Explore other use cases and report your own!
Call for maintainers
There’s no open call for new maintainers at this point but you can refer to our contributing guide for finding ways to get involved!
As the maintainer of an rOpenSci package, feel free to contact us on Slack or email [email protected]
to get your call for maintainer featured in the next newsletter.
Package development corner
Some useful tips for R package developers. ?
Naming your package
Do you have a fantastic idea and plan for a package, but no name for it yet? We have some tips on this topic in our dev guide!
-
We strongly recommend short, descriptive names in lower case. If your package deals with one or more commercial services, please make sure the name does not violate branding guidelines. You can check if your package name is available, informative and not offensive by using the
available
package. In particular, do not choose a package name that’s already used on CRAN or Bioconductor. -
A more unique package name might be easier to track (for you and us to assess package use) and search (for users to find it and to find answers to their questions). Obviously a too unique package name might make the package less discoverable (e.g. it might be an argument for naming your package geojson). Also note that removing random letter e.g. vowels from a common word to create a package name might make it less easy to remember!
-
Find other interesting aspects of naming your package in this blog post by Nick Tierney, and in case you change your mind, find out how to rename your package in this other blog post of Nick’s.
Two testthat tips
The testthat package has been supporting snapshot tests that “record expected output in a separate human-readable file instead of using code to describe what the expected output looks like.” since its version 3.0.0. Their implementation is now stable. Note that they are skipped by default on CRAN.
Now what about input files for tests?
Maybe you can create fake ones on the fly (and delete them).
Or you can store them under tests/testthat
and use testthat::test_path()
when using them, to get a file path that “both interactively and during tests”.
So to use tests/testthat/examples/thing
you’d write testthat::test_path("examples", "thing")
.
Note that the vcr package for HTTP testing has a similar function you can use to locate cassettes, vcr_test_path().
Passing CRAN URL checks
CRAN checks URL validity. As recently reminded by Jenny Bryan on Twitter, the urlchecker package by Jim Hester is a great tool to help your package pass these checks.
Styling your package
How to enforce a style guide for your package, without too much human effort? Two suggestions:
-
Having a GitHub Action workflow that allows you to call styler via typing
/style
in a PR comment (or amend the logic to have it work via applying a label to the PR). Or apply this idea to another continuous integration service! If for any reason your style differs from styler’s default, follow one of styler articles for developers and tweak the workflow. Example, corresponding style guide package. -
Using a service like CodeFactor that integrates with GitHub directly. Thanks to Rich FitzJohn for suggesting this in rOpenSci semi-open Slack workspace.
Last words
Thanks for reading! If you want to get involved with rOpenSci, check out our Contributing Guide that can help direct you to the right place, whether you want to make code contributions, non-code contributions, or contribute in other ways like sharing use cases.
If you haven’t subscribed to our newsletter yet, you can do so via a form. Until it’s time for our next newsletter, you can keep in touch with us via our website and Twitter account.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.