In online A/B testing, we often face a significant practical challenge: the randomization unit differs from the analysis unit. Typically, control and treatment groups are randomly assigned at the user level, while metrics—such as click-through r...
The ellmer package for using LLMs with R is a game changer for scientists
Why is ellmer a game changer for scientists?
In this tutorial we’ll look at how we can access LLM agents
through API calls. We’ll use this skill for created structued data fro... [Read more...]
The ellmer package for using LLMs with R is a game changer for scientists
Why is ellmer a game changer for scientists?
In this tutorial we’ll look at how we can access LLM agents
through API calls. We’ll use this skill for created structued data fro... [Read more...]
I recently conducted a small time series workshop session for AI+ training hosted by ODSC. It went really well, and I’d be happy to offer longer interactive workshops going forward (please reach out if your team would like one!). One of the examples I shared was derived from the ...
The MAJOR story of F1’s 2025 season is of course the match-up between Sainz and Albon (sorry, did something happen at Ferrari?). Albon’s current level is quite a mystery, because since going up against Verstappen as a rookie he has only ever had...
We are excited to announce that the GitHub - insightsengineering/autoslider.core: autoslideR core {autoslider.core} package is now a part of pharmaverse!
The normal process of creating clinical study slides is that a statistician manually ty... [Read more...]
Hi pharmaverse community,
We’re excited to welcome Novartis as the newest member of the Pharmaverse Council! Representing Novartis on the council will be Orla Doyle.
Novartis has been a strong advocate for integrating open-source languages and... [Read more...]
Here’s another puzzle, from Henry Dudeney’s Perplexities column in Strand Magazine, January 1924.
Arrange the ten digits, 1 2 3 4 5 6 7 8 9 0, in such order that they shall form a number that may be divided by every number from 2 to 18 without ...
This is the second blog in a series about the {sparkline} R package for
inline data visualisations. You can read the first one
here. In this post I
will be demonstrating how you can include sparklines inside HTML tables.
Reactable
{reactable} is ...
Have you ever wondered why your ggplot title is not perfectly aligned, detracting from the impact of your data visualization? As experts in data visualization and r programming, we understand the power of a well-...
The BIST100 was rising before the Turkish Central Bank’s rate cuts; could that be an initial signal of a firm uptrend? The ML model tells us there is much more room to go up. [Read more...]
I am excited to announce that the pharmaverse council has approved additional organizational characteristics required for council membership. These criteria establish the level of DEI (Diversity, Equity & Inclusion) commitment we expect fr...
Your data scientist just spent three days crafting the perfect sales performance report. The sales team loves it. Next month, they’ll need it again. And the month after that. Sound familiar? Here’s a better way: A data scientist and a sales manager sit down for three hours. They ...
Time-series data have become ubiquitous in psychological research because it allows us to study within-person dynamics and their heterogeneity across persons. Vector autoregressive (VAR) models have become a popular choice as a first approximation of w... [Read more...]
A student recently asked whether bootstrap confidence intervals were more robust than confidence intervals estimated using the standard error (i.e. ). In order to answer this question I wrote a function to simulate taking a bunch of random s...
This post is based on the textbook Evidence Synthesis for Decision Making in Healthcare (ESDMH) by Nicky J. Welton, Alexander J. Sutton, Nicola J. Cooper, Keith R. Abrams, and A.E. Ades. This textbook is an exemplary presentation of healthcare d...
Call for Abstracts Deadline Extended
Good news! We’re extending the deadline for abstract submissions for Shiny in Production 2025 by two weeks. You now have until 11:59 PM BST on 3rd April 2025 to submit your proposal.
This extension gives you e...
What is data-driven art?
At first I thought the answer to the question what is data art? would be relatively straightforward. I initially started with the definition that data art lies somewhere between data visualisation and generative art. Where data visualisation aims to accurately represent data to communicate insights, generative ... [Read more...]
R is packed with powerhouse tools—think dplyr for data wrangling, ggplot2 for stunning visuals, or tidyr for tidying up messes. But beyond the headliners, there’s a lineup of lesser-known functions that deserve a spot in your toolkit. These hidd... [Read more...]