Articles by vgherard

Interpreting the Likelihood Ratio cost

November 14, 2023 | vgherard

Intro During the last few months, I’ve been working on a machine learning algorithm with applications in Forensic Science, a.k.a. Criminalistics. In this field, one common task for the data analyst is to present the trier-of-fact (the person or people who determine the facts in a legal ...
[Read more...]

AB tests and repeated checks

July 26, 2023 | vgherard

Intro “How is the experiment going?” Also: “Do we already see something?” And my favorite one: “Did we already hit significance, or do we need more data?” If you have dealt with experiments with relatively high outcome expectations, you will l...
[Read more...]

Model Misspecification and Linear Sandwiches

May 13, 2023 | vgherard

Introduction Traditional linear models, such as the output of the R function lm(), are often loaded with a set of strong assumptions. Take univariate regression: \[ Y = q+mX+\varepsilon. (\#eq:lm) \] This equation assumes that: The conditional mean...
[Read more...]

kgrams v0.1.2 on CRAN

November 11, 2021 | vgherard

Summary Version v0.1.2 of my R package kgrams was just accepted by CRAN. This package provides tools for training and evaluating k-gram language models in R, supporting several probability smoothing techniques, perplexity computations, random text generation and more. Short demo
library(kgrams)
# Get k-gram frequency counts from Shakespeare's "Much Ado About Nothing"
freqs <- kgram_freqs(kgrams::much_ado, N = 4)

# Build modified Kneser-Ney 4-gram model, with discount parameters D1, D2, D3.
mkn <- language_model(freqs, smoother = "mkn", D1 = 0.25, D2 = 0.5, D3 = 0.75)

# Sample sentences from the language model at different temperatures
set.seed(840)
sample_sentences(model = mkn, n = 3, max_length = 10, t = 1)
[1] "i have studied eight or nine truly by your office [...] (truncated output)"
[2] "ere you go : <EOS>"                                                        
[3] "don pedro welcome signior : <EOS>"
sample_sentences(model = mkn, n = 3, max_length = 10, t = 0.1)
[1] "i will not be sworn but love may transform me [...] (truncated output)" 
[2] "i will not fail . <EOS>"                                                
[3] "i will go to benedick and counsel him to fight [...] (truncated output)"
sample_sentences(model = mkn, n = 3, max_length = 10, t = 10)
[1] "july cham's incite start ancientry effect torture tore pains endings [...] (truncated output)"   
[2] "lastly gallants happiness publish margaret what by spots commodity wake [...] (truncated output)"
[3] "born all's 'fool' nest praise hurt messina build afar dancing [...] (truncated output)"
NEWS Overall Software Improvements ... [Read more...]

R Client for R-universe APIs

July 23, 2021 | vgherard

Introduction Following my previous post on how to use your R-universe API to automatically generate a list of the packages on your R-universe, I started working on a simple R client to interact with such APIs. For those who missed it, R-universe is ... [Read more...]

{r2r} now on CRAN

July 4, 2021 | vgherard

Introduction My package {r2r} (v0.1.1) has been accepted by CRAN, and is now available for download from the public repository. r2r r2r provides a flexible implementation of hash tables in R, allowing for: arbitrary R objects as keys and value...
[Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)