Articles by R Views

tidyposterior’s Bayesian Approach to Model Comparison

December 15, 2019 | R Views

A task common to many machine learning workflows is to compare the performance of several models with respect to some metric such as accuracy or area under the ROC curve. Standard practice is to try out several different algorithms on a training data set and see which works better. Unfortunately, ...
[Read more...]

IPO Portfolios and a Benchmark

December 10, 2019 | R Views

In two previous posts, we explored IPOs and IPO returns by sector and year since 2004 and then examined the returns of portfolios constructed by investing in IPOs each year. In today’s post, we will add a benchmark so that we can compare our IPO portfolios to something besides themselves. ...
[Read more...]

In-Database Logisitc Regression with R

December 3, 2019 | R Views

Roland Stevenson is a data scientist and consultant who may be reached on Linkedin. In a previous article we illustrated how to calculate xgboost model predictions in-database. This was referenced and incorporated into tidypredict. After learning more about what the tidypredict team is up to, I discovered another tidyverse package ...
[Read more...]

October 2019: “Top 40” New R Packages

November 17, 2019 | R Views

Two Hundred twenty-three new packages made it to CRAN in October. Here are my “Top 40” picks in ten categories: Computational Methods, Data, Genomics, Machine Learning, Mathematics, Medicine, Pharmacology, Statistics, Utilities, and Visualization. Computational Methods admmDensestSubmatrix v0.1.0: Implements a method to identify the densest sub-matrix in a given or sampled binary ...
[Read more...]

IPO Exploration Part Two

November 13, 2019 | R Views

In a previous post, we explored IPOs and IPO returns by sector and year since 2004. Today, let’s investigate how portfolios formed with those IPOs have performed. We will need to grab the price histories of the tickers, then form portfolios, then calculate their performance, and then rank those performances ... [Read more...]

A comparison of methods for predicting clothing classes using the Fashion MNIST dataset in RStudio and Python (Part 1)

November 10, 2019 | R Views

Florianne Verkroost is a PhD candidate at Nuffield College at the University of Oxford. With a passion for data science and a background in mathematics and econometrics. She applies her interdisciplinary knowledge to computationally address societal problems of inequality. In this series of blog posts, I will compare different machine ...
[Read more...]

A First Look at Confidence Distributions

November 4, 2019 | R Views

Using a probability distribution to characterize uncertainty is at the core of statistical inference. So, it seems natural to try to summarize the information about the parameters in statistical models with probability distributions. R. A. Fisher thought so. In fact, he expended a great deal of effort over more than ... [Read more...]

Sept 2019: “Top 40” New R Packages

October 28, 2019 | R Views

One hundred and thirteen new packages made it to CRAN in September. Here are my “Top 40” picks in eight categories: Computational Methods, Data, Economics, Machine Learning, Statistics, Time Series, Utilities, and Visualization. Computational Methods eRTG3D v0.6.2: Provides functions to create realistic random trajectories in a 3-D space between two ...
[Read more...]

IPO Exploration

October 20, 2019 | R Views

Inspired by recent headlines like “Fear Overtakes Greed in IPO Market after WeWork Debacle” and “This Year’s IPO Class is Least Profitable since the Tech Bubble”, today we’ll explore historical IPO data, and next time we’ll look at the the performance of IPO-driven portfolios constructed during the ...
[Read more...]

Productionizing Shiny and Plumber with Pins

October 16, 2019 | R Views

Producing an API that serves model results or a Shiny app that displays the results of an analysis requires a collection of intermediate datasets and model objects, all of which need to be saved. Depending on the project, they might need to be reused in another project later, shared with ...
[Read more...]

Building Interactive World Maps in Shiny

October 8, 2019 | R Views

Florianne Verkroost is a PhD candidate at Nuffield College at the University of Oxford. With a passion for data science and a background in mathematics and econometrics. She applies her interdisciplinary knowledge to computationally address societal problems of inequality. In this post, I will show you how to create interactive ...
[Read more...]

Multiple Hypothesis Testing in R

October 1, 2019 | R Views

In the first article of this series, we looked at understanding type I and type II errors in the context of an A/B test, and highlighted the issue of “peeking”. In the second, we illustrated a way to calculate always-valid p-values that were immune to peeking. We will now ... [Read more...]

August 2019: “Top 40” R packages

September 25, 2019 | R Views

Two hundred and twenty-seven new packages made it to CRAN in August. Quite a few were devoted to medical or genomic applications, and this is reflected in my “Top 40” selections, listed below in nine categories: Computational Methods, Data, Genomics, Machine Learning, Medicine and Pharma, Statistics, Time Series, Utilities, and Visualization. ...
[Read more...]

Accelerate your plots with ggforce

September 18, 2019 | R Views

In this post, I will walk you through some examples that show off the major features of the ggforce package. The main goal is to share a few ideas about customizing visualizations that you may find useful in your everyday work. The ggforce package is an extension to ggplot2 developed ...
[Read more...]

R/Medicine 2019 Workshops

September 11, 2019 | R Views

R/Medicine 2019 kicked off on Thursday with two outstanding workshops. It was difficult to choose between the two, but fortunately both presenters developed rich sets of materials that are available online. Alison Hill delivered R Markdown for Medicine with an elegant HTML exposition masterfully created to cultivate beginners while still ...
[Read more...]

How to Send Custom E-mails with R

September 3, 2019 | R Views

A common business oriented data science task is to programatically craft and send custom emails. In this post, I will show how to accomplish this with R on the RStudio Connect platform (a paid product built for the enterprise) using the blastula package.blastula provides a set of functions for ...
[Read more...]

July 2019 “Top 40” R Packages

August 28, 2019 | R Views

One hundred seventy-six new packages made it to CRAN in July. Here are my “Top 40” picks organized into twelve categories: Data, Data Science, Finance, Genomics, Machine Learning, Mathematics, Medicine, Statistics, Time Series, Topological Data Analysis, Utilities and Visualization. Data eia v0.3.2: Provides API access to data from the US Energy ...
[Read more...]

Calculating Always-Valid p-values in R

August 21, 2019 | R Views

In this post, we will develop a framework for always-valid inference based on the paper Always Valid Inference: Continuous Monitoring of A/B Tests (2019 Johari, Pekelis, Walsh). Using an always-valid p-value allows us to continuously monitor A/B tests, and potentially stop the test early in a valid way1. In ...
[Read more...]

Tech Dividends, Part 2

August 16, 2019 | R Views

In a previous post, we explored the dividend history of stocks included in the SP500, and we followed that with exploring the dividend history of some NASDAQ tickers. Today’s post is a short continuation of that tech dividend theme, with the aim of demonstrating how we can take our ...
[Read more...]
1 6 7 8 9 10 18

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)