Articles by vgherard

Interpreting the Likelihood Ratio cost

November 14, 2023 | vgherard

Intro During the last few months, I’ve been working on a machine learning algorithm with applications in Forensic Science, a.k.a. Criminalistics. In this field, one common task for the data analyst is to present the trier-of-fact (the person or people who determine the facts in a legal ...

[Read more...]

AB tests and repeated checks

July 26, 2023 | vgherard

Intro “How is the experiment going?” Also: “Do we already see something?” And my favorite one: “Did we already hit significance, or do we need more data?” If you have dealt with experiments with relatively high outcome expectations, you will l...

[Read more...]

Testing functional specification in linear regression

July 10, 2023 | vgherard

Another one from the series on “misspecified regression models” (started with Model Misspecification and Linear Sandwiches). Intro Lately I’ve been messing around with the {lmtest} R package, a nice collection of hypothesis tests for classical linea...

[Read more...]

Linear regression with autocorrelated noise

May 25, 2023 | vgherard

Effects of noise autocorrelation on linear regression. Explicit formulae and a simple simulation. [Read more...]

Model Misspecification and Linear Sandwiches

May 13, 2023 | vgherard

Introduction Traditional linear models, such as the output of the R function lm(), are often loaded with a set of strong assumptions. Take univariate regression: \[ Y = q+mX+\varepsilon. (\#eq:lm) \] This equation assumes that: The conditional mean...

[Read more...]

How to get away with selection. Part I: Introduction

November 13, 2022 | vgherard

Introducing the problem of Selective Inference, illustrated through a simple simulation in R. [Read more...]

kgrams v0.1.2 on CRAN

November 11, 2021 | vgherard

Summary Version v0.1.2 of my R package kgrams was just accepted by CRAN. This package provides tools for training and evaluating k-gram language models in R, supporting several probability smoothing techniques, perplexity computations, random text generation and more. Short demo

library(kgrams)
# Get k-gram frequency counts from Shakespeare's "Much Ado About Nothing"
freqs <- kgram_freqs(kgrams::much_ado, N = 4)

# Build modified Kneser-Ney 4-gram model, with discount parameters D1, D2, D3.
mkn <- language_model(freqs, smoother = "mkn", D1 = 0.25, D2 = 0.5, D3 = 0.75)

# Sample sentences from the language model at different temperatures
set.seed(840)
sample_sentences(model = mkn, n = 3, max_length = 10, t = 1)

[1] "i have studied eight or nine truly by your office [...] (truncated output)"
[2] "ere you go : <EOS>"                                                        
[3] "don pedro welcome signior : <EOS>"

sample_sentences(model = mkn, n = 3, max_length = 10, t = 0.1)

[1] "i will not be sworn but love may transform me [...] (truncated output)" 
[2] "i will not fail . <EOS>"                                                
[3] "i will go to benedick and counsel him to fight [...] (truncated output)"

sample_sentences(model = mkn, n = 3, max_length = 10, t = 10)

[1] "july cham's incite start ancientry effect torture tore pains endings [...] (truncated output)"   
[2] "lastly gallants happiness publish margaret what by spots commodity wake [...] (truncated output)"
[3] "born all's 'fool' nest praise hurt messina build afar dancing [...] (truncated output)"

NEWS Overall Software Improvements ... [Read more...]

R Client for R-universe APIs

July 23, 2021 | vgherard

Introduction Following my previous post on how to use your R-universe API to automatically generate a list of the packages on your R-universe, I started working on a simple R client to interact with such APIs. For those who missed it, R-universe is ... [Read more...]

Automatic resumes of your R-developer portfolio from your R-Universe

July 19, 2021 | vgherard

Hi R-bloggers ? Starting from today, all posts from this blog in the R category will also appear on R-bloggers. I would like to thank Tal for aggregating my blog, and say “hi!” to all R-bloggers readers. I’m a particle physicist with a passion for R...

[Read more...]

{r2r} now on CRAN

July 4, 2021 | vgherard

Introduction My package {r2r} (v0.1.1) has been accepted by CRAN, and is now available for download from the public repository. r2r r2r provides a flexible implementation of hash tables in R, allowing for: arbitrary R objects as keys and value...

[Read more...]

Articles by vgherard

Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.)

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)