Articles by Econometrics and Free Software

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

December 20, 2018 | Econometrics and Free Software

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 5, which presents the {tidyverse} packages and how to use them to compute descriptive statistics and manipulate data. In the text below, I show how ... [Read more...]

Manipulate dates easily with {lubridate}

December 14, 2018 | Econometrics and Free Software

What hyper-parameters are, and what to do with them; an illustration with ridge regression

December 1, 2018 | Econometrics and Free Software

This blog post is an excerpt of my ebook Modern R with the tidyverse that you can read for free here. This is taken from Chapter 7, which deals with statistical models. In the text below, I explain what hyper-parameters are, and as an example I run a ridge regression using ... [Read more...]

A tutorial on tidy cross-validation with R

November 24, 2018 | Econometrics and Free Software

Introduction This blog posts will use several packages from the {tidymodels} collection of packages, namely {recipes}, {rsample} and {parsnip} to train a random forest the tidy way. I will also use {mlrMBO} to tune the hyper-parameters of the random forest. Set up Let’s load the needed packages:

library("tidyverse")
library("tidymodels")
library("parsnip")
library("brotools")
library("mlbench")

Load ... [Read more...]

The best way to visit Luxembourguish castles is doing data science + combinatorial optimization

November 20, 2018 | Econometrics and Free Software

Inspired by David Schoch’s blog post, Traveling Beerdrinker Problem. Check out his blog, he has some amazing posts! Introduction Luxembourg, as any proper European country, is full of castles. According to Wikipedia, “By some optimistic estimates, there are as many as 130 castles in Luxembourg but more realistically there are ... [Read more...]

Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

November 15, 2018 | Econometrics and Free Software

Introduction In this blog post, I’ll use the data that I cleaned in a previous blog post, which you can download here. If you want to follow along, download the monthly data. In my last blog post I showed how to perform a grid search the “tidy” way. As ... [Read more...]

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

November 14, 2018 | Econometrics and Free Software

Introduction In this blog post, I’ll use the data that I cleaned in a previous blog post, which you can download here. If you want to follow along, download the monthly data. In the previous blog post, I used the auto.arima() function to very quickly get a “good-enough” ... [Read more...]

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

November 13, 2018 | Econometrics and Free Software

In this blog post, I will show you how you can quickly and easily forecast a univariate time series. I am going to use data from the EU Open Data Portal on air passenger transport. You can find the data here. I downloaded the data in the TSV format for ... [Read more...]

Analyzing NetHack data, part 2: What players kill the most

November 9, 2018 | Econometrics and Free Software

Link to webscraping the data Link to Analysis, part 1 Introduction This is the third blog post that deals with data from the game NetHack, and oh boy, did a lot of things happen since the last blog post! Here’s a short timeline of the events: I scraped data from ... [Read more...]

Analyzing NetHack data, part 1: What kills the players

November 2, 2018 | Econometrics and Free Software

Abstract In this post, I will analyse the data I scraped and put into an R package, which I called {nethack}. NetHack is a roguelike game; for more context, read my previous blog post. You can install the {nethack} package and play around with the data yourself by installing it ... [Read more...]

From webscraping data to releasing it as an R package to share with the world: a full tutorial with data from NetHack

October 31, 2018 | Econometrics and Free Software

If someone told me a decade ago (back before I'd ever heard the term "roguelike") what I'd be doing today, I would have trouble believing this...Yet here we are. pic.twitter.com/N6Hh6A4tWl— Josh Ge (@GridSageGames) June 21, 2018 Abstract In this post, I am going to show ...

[Read more...]

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

October 26, 2018 | Econometrics and Free Software

Abstract You can find the data used in this blog post here: https://github.com/b-rodrigues/elections_lux This is a follow up to a previous blog post where I extracted data of the 2018 Luxembourguish elections from Excel Workbooks. Now that I have the data, I will create a map ... [Read more...]

Getting the data from the Luxembourguish elections out of Excel

October 20, 2018 | Econometrics and Free Software

In this blog post, similar to a previous blog post I am going to show you how we can go from an Excel workbook that contains data to flat file. I will taking advantage of the structure of the tables inside the Excel sheets by writing a function that extracts ... [Read more...]

Exporting editable plots from R to Excel: making ggplot2 purrr with officer

October 4, 2018 | Econometrics and Free Software

I was recently confronted to the following problem: creating hundreds of plots that could still be edited by our client. What this meant was that I needed to export the graphs in Excel or Powerpoint or some other such tool that was familiar to the client, and not export the ... [Read more...]

How Luxembourguish residents spend their time: a small {flexdashboard} demo using the Time use survey data

September 13, 2018 | Econometrics and Free Software

In a previous blog post I have showed how you could use the {tidyxl} package to go from a human readable Excel Workbook to a tidy data set (or flat file, as they are also called). Some people then contributed their solutions, which is always something I really enjoy when ... [Read more...]

Going from a human readable Excel file to a machine-readable csv with {tidyxl}

September 10, 2018 | Econometrics and Free Software

I won’t write a very long introduction; we all know that Excel is ubiquitous in business, and that it has a lot of very nice features, especially for business practitioners that do not know any programming. However, when people use Excel for purposes it was not designed for, it ... [Read more...]

The year of the GNU+Linux desktop is upon us: using user ratings of Steam Play compatibility to play around with regex and the tidyverse

September 7, 2018 | Econometrics and Free Software

I’ve been using GNU+Linux distros for about 10 years now, and have settled for openSUSE as my main operating system around 3 years ago, perhaps even more. If you’re a gamer, you might have heard about SteamOS and how more and more games are available on GNU+Linux. I ... [Read more...]

Dealing with heteroskedasticity; regression with robust standard errors using R

July 7, 2018 | Econometrics and Free Software

First of all, is it heteroskedasticity or heteroscedasticity? According to McCulloch (1985), heteroskedasticity is the proper spelling, because when transliterating Greek words, scientists use the Latin letter k in place of the Greek letter κ (kappa). κ sometimes is transliterated as the Latin letter c, but only when these words entered the English ... [Read more...]

Missing data imputation and instrumental variables regression: the tidy approach

June 30, 2018 | Econometrics and Free Software

In this blog post I will discuss missing data imputation and instrumental variables regression. This is based on a short presentation I will give at my job. You can find the data used here on this website: http://eclr.humanities.manchester.ac.uk/index.php/IV_in_R The data ... [Read more...]

Forecasting my weight with R

June 23, 2018 | Econometrics and Free Software

I’ve been measuring my weight almost daily for almost 2 years now; I actually started earlier, but not as consistently. The goal of this blog post is to get re-acquaiented with time series; I haven’t had the opportunity to work with time series for a long time now and ... [Read more...]

« 1 … 7 8 9 10 11 12 »

R-bloggers

R news and tutorials contributed by hundreds of R bloggers

Articles by Econometrics and Free Software

Using the tidyverse for more than data manipulation: estimating pi with Monte Carlo methods

Manipulate dates easily with {lubridate}

What hyper-parameters are, and what to do with them; an illustration with ridge regression

A tutorial on tidy cross-validation with R

The best way to visit Luxembourguish castles is doing data science + combinatorial optimization

Using a genetic algorithm for the hyperparameter optimization of a SARIMA model

Searching for the optimal hyper-parameters of an ARIMA model in parallel: the tidy gridsearch approach

Easy time-series prediction with R: a tutorial with air traffic data from Lux Airport

Analyzing NetHack data, part 2: What players kill the most

Analyzing NetHack data, part 1: What kills the players

From webscraping data to releasing it as an R package to share with the world: a full tutorial with data from NetHack

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

Getting the data from the Luxembourguish elections out of Excel

Exporting editable plots from R to Excel: making ggplot2 purrr with officer

How Luxembourguish residents spend their time: a small {flexdashboard} demo using the Time use survey data

Going from a human readable Excel file to a machine-readable csv with {tidyxl}

The year of the GNU+Linux desktop is upon us: using user ratings of Steam Play compatibility to play around with regex and the tidyverse

Dealing with heteroskedasticity; regression with robust standard errors using R

Missing data imputation and instrumental variables regression: the tidy approach

Forecasting my weight with R

Articles by Econometrics and Free Software

Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.)

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)