rOpenSci News Digest, April 2023
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Dear rOpenSci friends, it’s time for our monthly news roundup!
You can read this post on our blog. Now let’s dive into the activity at and around rOpenSci!
rOpenSci HQ
Meeting the stars of the R-universe: Athanasia Monika Mowinckel
Knowing our community’s stories helps us to learn about the people behind our software, brings us closer and offers us new opportunities. To share some of these community stories, we created the rOpenSci interview series “Meeting the stars of the R-Universe”.
Working with the human brain requires specific software and complex developments. However R appears to be the natural way to collect and analyze the huge amount of data needed, and the R-Universe the most friendly place to share and invite collaborations. Our third stop brings us to Norway to talk with Athanasia Monika Mowinckel. You can also read the post in Spanish.
Multilingual posts
Have you noticed several of our recent blog posts were translated to Spanish, and in one case, French? We’ve started adding actual multilingual infrastructure to our website. You can find all posts in Spanish in https://ropensci.org/es/archive (and the lone French post in https://ropensci.org/fr/archive). From any post that has a translation, the sidebar on the right (or… at the bottom if you’re reading on mobile!) has a link to translations of the post, see this example.
Over time we plan to further improve the multilingualism of our website.
18,000 packages tracked on R-Universe!
Currently R-Universe is tracking and indexing over 18,000 R packages! These are a mix of packages found on popular networks like CRAN or Bioconductor, and packages that were registered by users.
In previous posts we explained how to create your personal CRAN-like repository and publish packages on R-Universe yourself. A new post explains the other part: how the scraper automatically finds packages on CRAN and Bioconductor to include in the R-Universe.
All in all, R-Universe provides a great way to discover and explore many packages!
Help us put together the rOpenSci Community Call calendar for the year!
We’ve had some great community calls lately, 🎉
and now we’d like to know what topics you would like us to cover in the future!
We hold Community Calls about every 3 months to share knowledge that is relevant to our community and consistent with our vision and mission. These are free and open for anyone to attend and provide opportunities for us to connect with rOpenSci community members around the world.
We’ve opened an issue in this repository for each topic we’re considering. We’d like your input and “votes” on these, and your suggestions for other topics we haven’t thought of.
We will also be very grateful if you suggest speakers, resources, or encourage others to weigh in by sharing the link to your favorite topic.
You can read more details on our community calls repository.
We look forward to hearing your ideas. Let’s build the schedule together!
Coworking
Join us for social coworking & office hours monthly on first Tuesdays! Hosted by Steffi LaZerte and various community hosts. Everyone welcome. No RSVP needed. Consult our Events page to find your local time and how to join.
And remember, you can always cowork independently on work related to R, work on packages that tend to be neglected, or work on what ever you need to get done!
- Tuesday, May 2nd, 9:00 Americas Pacific / 16:00 UTC “Spring Cleaning for R packages and scripts” Hosted by Steffi LaZerte and Yanina Bellini Saibene
- Explore how other organizations keep their scripts/packages nice and clean
- Take a look at your R packages and scripts and give them a good spring cleaning*
- Talk to our community host and other attendees and discuss tips for keeping on top of it all.
* in the northern hemisphere at least, otherwise, give them a good fall cleaning!
- Tuesday, June 6th, 9:00 Australia Western / 01:00 UTC “Integrating and merging datasets from different sources” Hosted by community host Cynthia Huang and Steffi LaZerte
- Spend time integrating datasets for your own work
- Explore how others tackle the problem of merging datasets from different sources or different data versions
- Talk to our community host and other attendees and discuss strategies for integrating datasets.
Software 📦
New packages
The following three packages recently became a part of our software suite:
-
concstats, developed by Andreas Schneider: Based on individual market shares of all participants in a market or space, the package offers a set of different structural and concentration measures frequently – and not so frequently – used in research and in practice. Measures can be calculated in groups or individually. The calculated measure or the resulting vector in table format should help practitioners make more informed decisions. It is available on CRAN. It has been reviewed by Sebastian Wojcik, and Christopher T. Kenny.
-
CRediTas, developed by Josep Pueyo-Ros: A tiny package to generate CRediT author statements (https://credit.niso.org/). It provides three functions: create a template, read it back and generate the CRediT author statement in a text file. It is available on CRAN. It has been reviewed by Marcelo S. Perlin, and João Martins.
-
predictNMB, developed by Rex Parsons together with Robin Blythe, and Adrian Barnett: Estimates when and where a model-guided treatment strategy may outperform a treat-all or treat-none approach by Monte Carlo simulation and evaluation of the Net Monetary Benefit. Details can be viewed in Parsons et al. (2023) doi:10.21105/joss.05328. It is available on CRAN. It has been reviewed by Emi Tanaka, and Tinula Kariyawasam.
Discover more packages, read more about Software Peer Review.
New versions
The following nineteen packages have had an update since the last newsletter: commonmark (v1.9.0
), chromer (v0.4
), ckanr (v0.7.0
), concstats (v0.1.6
), CRediTas (Zenodo_v0.2.0
), dbparser (v2.0.1
), dittodb (v0.1.6
), drake (7.13.5
), dynamite (1.3.2
), ezknitr (v0.6.2
), git2r (v0.32.0
), nodbi (v0.9.2
), predictNMB (v0.1.0
), qpdf (v1.3.2
), rgbif (v3.7.7
), rtweet (v1.2.0
), taxizedb (v0.3.1
), tinkr (0.2.0
), and waywiser (v0.3.0
).
Software Peer Review
There are thirteen recently closed and active submissions and 2 submissions on hold. Issues are at different stages:
-
Three at ‘6/approved’:
-
CRediTas, Generate CRediT Author Statements. Submitted by Josep Pueyo-Ros.
-
predictNMB, Evaluate Clinical Prediction Models by Net Monetary Benefit. Submitted by Rex Parsons. (Stats).
-
concstats, Market Structure, Concentration and Inequality Measures. Submitted by Andreas Schneider.
-
-
Three at ‘4/review(s)-in-awaiting-changes’:
-
birdsize, Estimate Avian Body Size Distributions. Submitted by Renata Diaz.
-
wmm, World Magnetic Model. Submitted by Will Frierson.
-
octolog, Better Github Action Logging. Submitted by Jacob Wujciak-Jens.
-
-
Three at ‘3/reviewer(s)-assigned’:
-
dwctaxon, Tools for Working with Darwin Core Taxon Data. Submitted by Joel Nitta.
-
ohun, Optimizing Acoustic Signal Detection. Submitted by Marcelo Araya-Salas.
-
dfms, Dynamic Factor Models. Submitted by Sebastian Krantz.
-
-
Two at ‘2/seeking-reviewer(s)’:
-
NA, NA. Submitted by Alexander Fischer. (Stats).
-
bssm, Bayesian Inference of Non-Linear and Non-Gaussian State Space. Submitted by Jouni Helske. (Stats).
-
-
Two at ‘1/editor-checks’:
-
pangoling, Access to Large Language Model Predictions. Submitted by Bruno Nicenboim.
-
qualtdict, Generating Variable Dictionaries and Labelled Data Exports of Qualtrics. Submitted by lyh970817.
-
Find out more about Software Peer Review and how to get involved.
On the blog
Software Review
- waywiser is Now a Part of rOpenSci by Mike Mahoney. Plus version 0.3.0 now on CRAN, and a new preprint.
Other topics
- Meeting the Stars of the R-Universe: Researching Our Brain with the Magic of the R-Universe by Yanina Bellini Saibene, Alejandra Bellini, Lucio Casalla, and Steffi LaZerte. Working with the human brain requires specific software and complex developments, however R appears as the natural way to collect hundreds of data that need to be analyzed and the R-Universe as the most friendly place to share and invite to collaborate. Our third stop bring us to Norway to talk with Athanasia Monika Mowinckel. Other languages: Conociendo a las estrellas del Universo R: Investigando nuestro cerebro con la magia del universo R (es).
Tech Notes
- How r-universe searches for packages on CRAN / Bioconductor by Jeroen Ooms. How r-universe searches for, and includes, packages on CRAN / Bioconductor that are available on GitHub or GitLab or R-forge.
Call for (co)maintainers
Call for maintainers
If you’re interested in maintaining any of the R packages below, you might enjoy reading our blog post What Does It Mean to Maintain a Package? (or listening to its discussion on the R Weekly highlights podcast hosted by Eric Nantz and Mike Thomas)!
-
rvertnet, Retrieve, map and summarize data from the VertNet.org archives (https://vertnet.org/). Functions allow searching by many parameters, including taxonomic names, places, and dates. In addition, there is an interface for conducting spatially delimited searches, and another for requesting large datasets via email. Issue for volunteering.
-
natserv. Interface to NatureServe (https://www.natureserve.org/). Includes methods to get data, image metadata, search taxonomic names, and make maps. Issue for volunteering.
-
geojsonlint, Tools for linting GeoJSON. Includes tools for interacting with the online tool https://geojsonlint.com, the Javascript library geojsonhint (https://www.npmjs.com/package/geojsonhint), and validating against a GeoJSON schema via the Javascript library (https://www.npmjs.com/package/is-my-json-valid). Some tools work locally while others require an internet connection. Issue for volunteering.
-
citesdb, a high-performance database of shipment-level CITES trade data. Provides convenient access to over 40 years and 20 million records of endangered wildlife trade data from the Convention on International Trade in Endangered Species of Wild Fauna and Flora, stored on a local on-disk, out-of memory ‘DuckDB’ database for bulk analysis. Issue for volunteering.
Call for comaintainers
Refer to our recent blog post to identify other packages where help is especially wished for!
Package development corner
Some useful tips for R package developers. 👀
Caching the results of functions of your R package
The R-hub blog has a post on Caching the results of functions of your R package by Maëlle Salmon and Christophe Dervieux. It presents different approaches. A recent PR by Lukas Wallrich to rcrossref maintained by Najko Jahn illustrates the approach of saving results to an environment (and reminded us about the topic of caching!).
Deprecated data in a package with delayedAssign()
If you need to change something like a function or function arguments in your package, you’ll find many resources in, say, the rOpenSci dev guide, or the lifecycle package docs.
Now what if you want to deprecate a dataset?
Read the solution proposed by Matthijs Berends on Stack Overflow, linking to a 3-step method in Bioconductor guidance.
Key is the usage of delayedAssign()
to save a promise that will serve both a warning, and the data.
We might mention this approach in a future version of our dev guide.
You might remember delayedAssign()
from a strategy to provide data as tibble only when tibble is installed.
Useful function!
Important change for CRAN packages with compiled code
Andy Teucher, R Package Developer Educator at Posit PBC, wrote a post about “New CRAN requirements for packages with C and C++”. Good to know for packages on CRAN, or to be submitted to CRAN, with compiled code!
Let dependabot monitor updates to your GitHub Actions
A tip by Jacob Wujciak-Jens. Dependabot alerts are really easy to set up to monitor for updates to actions, like this example. See the Full GitHub docs.
Last words
Thanks for reading! If you want to get involved with rOpenSci, check out our Contributing Guide that can help direct you to the right place, whether you want to make code contributions, non-code contributions, or contribute in other ways like sharing use cases.
If you haven’t subscribed to our newsletter yet, you can do so via a form. Until it’s time for our next newsletter, you can keep in touch with us via our website and Mastodon account.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.