Site icon R-bloggers

Metrics, Impact and Community Management

[This article was first published on rOpenSci - open tools for open science, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

On June 14 I was invited to present at the CZI Open Science 2024 event. I was asked to participate in “Case Study Session 3: Demonstrating Impact of Open Science” and to explore the challenges of using traditional academic metrics for measuring project impact with an emphasis on alternative approaches.

I was very excited to share our experiences and to learn from others projects. Here I present a summary of this talk.

Metrics

rOpenSci is a non-profit organization and a community of practice that leads a set of interrelated activities around capacity building, community development, and research software development practices to transform science through open data, software & reproducibility.

Similar to many other organizations, we have a set of traditional metrics that we use to measure the impact of our work. We use dashboards, we have automatic processes to calculate metrics and we also use software which provide other metrics. We calculate these metrics for individual projects or activities as well as for the organization overall.

For example, our Software Peer Review is a transparent, open, non-adversarial R package peer review process with the goal of improving software built to do science. Since we started this program, we have had

While those numbers are very useful, we know that they are not enough to fully measure the impact of our work. A community is not just about numbers, but about people, because communities are built on connections.

Frameworks to measure impact

We use two frameworks with a community management lens to capture metrics and try to measure our impact. The first one is a Impact Level framework with four levels: the individual level, the host institution/local level, the rOpenSci community level, and the Open Science or Open Source Community in general level. The second framework we use is based on a community of practice participation model.

Impact Level Framework

To exemplify this framework, lets talk about the Champions Program where I will introduce you to one of our Champions, Marcos Miguel Prunello, a professor of statistics at the Universidad Nacional de Rosario in Argentina.

Individual level

Being part of the Champions Program, Marcos receives training on package development and community outreach. He has individual mentoring, a cohort of colleagues with whom to share experiences and knowledge during the program, and paid time to develop a project in 12 months. He has also access to a network of experts in R and data science through the rOpenSci community.

The program allowed people who aren’t typically represented to have opportunities.
Marcos Prunello

Host institution/local level

As part of his activities in the Champions Program, Marcos organized a workshop in Spanish on how to develop multilingual R packages. He also delivered a talk about rOpenSci and how to participate, sharing his knowledge with his colleagues and students, and the local community.

The workshop and talk were organized jointly with the RenRosario R User Group and the Universidad Nacional de Rosario, who provided the venue for the event.

The revival of our R user group in our city was motivating to me. I organized a workshop. After that, we had another meeting, and another one planned.
Marcos Prunello

Marcos with the workshop and RenRosario’s meetup attendees and material from the event

Community level

Marco’s project in the Champions Program was to get his karel R package in shape and submit it for rOpenSci software peer review.

The package implements Karel the robot to teach introductory programming concepts in both Spanish and English. Marcos’ students speak in Spanish so he want the package in their native language, at the same time he also wants karel to be used by other educators, so for that he provided additional documentation and lessons in English.

As the package is multilingual the editorial team considers that it is best to have one reviewer in English and one in Spanish.
Mauro Lepore, rOpenSci Editor

This package become the first rOpenSci bilingual software peer-review. This experience helped to improve one of the most important projects at rOpenSci.

Open Science Community level

The multilingual aspect of the package goes beyond the documentation and lessons. Marcos design an architecture to have function names and messages in different languages, allowing students to learn to program also using their native language.

This package is pushing the limits of what R can do in term of multilingual functionality.
Joel Nitta, rOpenSci Reviewer

The designed solution allows other educators to translate functions and messages, so they can also teach in their native languages, contributing to removing language barriers to coding.

karel package peer-review messages, bilingual documentation and bilingual functions

Tools

The tools we used to measure the impact of the Champions Program at these different levels were:

Community Participation Model Framework

The second framework we use is the Community Participation Model Framework that is based on the CSCCE Participation Model.

This model describes four modes of community member participation:

Example of the Community Participation Model Framework

To exemplify this framework, let me introduce you to Joel Nitta from Japan and Athanasia Mowinckel from Norway.

In their own words:

Joel was planning to visit Oslo, Norway to teach a workshop.

He reached out to Mo, another active member of rOpenSci, to ask about getting together for a coffee,

Mo asked if Joel would be up for teaching another workshop.

Mo had seen Joel on an rOpenSci community call about targets, and really wanted to take advantage of his experience to start digging into targets (a rOpenSci’s package developed by Will Landau).

Joel realized this was the motivation he needed to fully flesh out the workshop materials, so he happily agreed.
Teaching targets with Penguins, rOpenSci Blog, July 20 2023

Mo and Joel during the workshop and the targets community call webpage

Blog post about the workshop, workshop material and Joel teaching

Tools

The tools we use to capture our members stories and the impact on the community via this Community Participation Model are:

Social network analysis (SNA)

As the goal of this talk was to explore the challenges of using traditional academic metrics for measuring project impact with an emphasis on alternative approaches, I want to share with you some details of how we use social network analysis to measure the impact of our work. We wanted to highlight this particular tool, because although SNA it is well developed, we have not seen many examples of its use in the context of communities of practice in the field of open initiatives with the objective of understanding the community management interventions that are carried out.

In 2022, we mapped the rOpenSci collaboration network across years. If we filter the network to collaborations on writing blog posts, we can compare the network of 2014 to 2022. We can see that the network has grown in both contributions, collaborators and interaction between the authors. The colors identify clusters (also called communities) of authors that wrote blog post together. For example, in 2022, the green cluster is the rOpenSci software peer-review participants and the pink one are community management related topics, like Code of Conducts updates and rOpenSci members interviews. We can identify important players in the network by their number of contributions and collaborations; for example, we can see that there are more community members contributing blog posts in 2022 than in 2014.

Blog post collaboration network 2014 and 2022

If we filter the network in a different way to look at collaborations on writing blog posts in Spanish, we can compare how our network looked before 2022, the year we introduced our Multilingual Publishing project, to how it looks now, in 2024. The network has grown, not only in contributors, but also the connections among them. We can see three clear clusters, the Champions Program participants, the interviewees in the Stars of R-Universe series and translations of important posts for the peer-review process.

Blog posts in Spanish collaboration network mapping in two different moments in time.

Creating the multilingual infrastructure on our website and developing tools for translations has positively impacted the amount of content we can have in other languages, not only by multilingual people but also by localizing our existing content. It also has increased the number of new contributors and opens new paths for contributions: writing and translating content. Using the SNA tool we can see and understand the impact of our work in the community.

Suggestions for using metrics

Use and develop metrics

Conclusion

The main recommendation is to use metrics that combine quantitative and qualitative tools to be able to make decisions about your programs and content. It is important to record the numbers but also the stories. There are many tools which can give us valuable information if we use. Although we will always have an incomplete picture, it can still be a useful guide.

To leave a comment for the author, please follow the link and comment on their blog: rOpenSci - open tools for open science.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Exit mobile version