Mapping research landscapes and dynamics: Some basic bibliometric analyses with R

May 6, 2025 | Vinicius Bastazini

Understanding how scientific knowledge develops requires more than merely counting papers and citations. It requires a careful evaluation of how research topics and themes interconnect and transform over time. This is where bibliometric analysis becomes essential. As the volume of scientific journals and papers continues to grow exponentially, bibliometric analyses ...
[Read more...]

Downsampling for predictive modeling

May 5, 2025 | Jason Bryer

Note that this is cross posted with a vignette in the medley R package. For the most up-to-date version go here: https://jbryer.github.io/medley/articles/downsampling.html Comments can be directed to me on Mastodon at @vis.social@jbryer. To insta...
[Read more...]

Hasler Statistics

May 4, 2025 | R - datawookie

The distances of Hasler kayak races for various divisions are nominally 4, 8 and 12 miles. However, the actual distances vary to some degree from one race venue to another. This makes it difficult to compare race times across different races. Using data from Paddle UK I attempt to estimate the actual distances.
[Read more...]

Simulating A Simple Response Adaptive Randomization – I Have To See It To Believe It

May 3, 2025 | r on Everyday Is A School Day

In my simulations of Response Adaptive Randomization, I discovered it performs comparably to fixed 50-50 allocation in identifying treatment effects. The adaptive approach does appear to work! However, with only 10 trials, I’ve merely scratched the surface. Important limitations exist - temporal bias risks, statistical inefficiency, and complex multiplicity adjustments ...
[Read more...]

Rotation with Modulo

May 2, 2025 | Jonathan Carroll

How well do you know your fundamental operators in different languages? ‘Easy’ examples help to fortify that knowledge, and comparing across languages makes for some neat implementation detail discoveries. I saw this toot from @gregeganSF on Mastodon ... [Read more...]

Model Diagnostics: Statistics vs Machine Learning

April 30, 2025 | Christian Lorentzen

In this post, we show how different use cases require different model diagnostics. In short, we compare (statistical) inference and prediction. As an example, we use a simple linear model for the Munich rent index dataset, which was kindly provided by the authors of Regression – Models, Methods and Applications 2nd ...
[Read more...]

30 Day Chart Challenge 2025

April 30, 2025 | R on Nicola Rennie

The 30 Day Chart Challenge is a data visualisation challenge organised by Cédric Scherer and Dominic Royé. Participants make one chart each day of the challenge, inspired by the daily prompt. The prompts are also split across 5 different categories, wh... [Read more...]

March 2025 Top 40 New CRAN Packages

April 29, 2025 | Joseph Rickert

In March, one hundred eighty-two new packages made it to CRAN. Here are my Top 40 picks in sixteen categories: Agriculture, Archaeology, Biology, Climate Modeling, Computational Methods, Data, Ecology, Epidemiology, Genomics, Machine Learning, M...
[Read more...]
1 2 3 2,172