Articles by arthur charpentier

Classification from scratch, linear discrimination 8/8

June 6, 2018 | arthur charpentier

Eighth post of our series on classification from scratch. The latest one was on the SVM, and today, I want to get back on very old stuff, with here also a linear separation of the space, using Fisher’s linear discriminent analysis. Bayes (naive) classifier Consider the follwing naive classification ...

Classification from scratch, SVM 7/8

June 6, 2018 | arthur charpentier

Seventh post of our series on classification from scratch. The latest one was on the neural nets, and today, we will discuss SVM, support vector machines. A formal introduction Here takes values in . Our model will be Thus, the space is divided by a (linear) border The distance from point ...

Classification from scratch, neural nets 6/8

June 5, 2018 | arthur charpentier

Sixth post of our series on classification from scratch. The latest one was on the lasso regression, which was still based on a logistic regression model, assuming that the variable of interest has a Bernoulli distribution. From now on, we will discuss technique that did not originate from those probabilistic ...

Classification from scratch, penalized Lasso logistic 5/8

June 4, 2018 | arthur charpentier

Fifth post of our series on classification from scratch, following the previous post on penalization using the norm (so-called Ridge regression), this time, we will discuss penalization based on the norm (the so-called Lasso regression). First of all, one should admit that if the name stands for least absolute shrinkage ...

Classification from scratch, penalized Ridge logistic 4/8

June 2, 2018 | arthur charpentier

Fourth post of our series on classification from scratch, following the previous post which was some sort of detour on kernels. But today, we’ll get back on the logistic model. Formal approach of the problem We’ve seen before that the classical estimation technique used to estimate the parameters ...

Classification from scratch, logistic with kernels 3/8

May 31, 2018 | arthur charpentier

Third post of our series on classification from scratch, following the previous post introducing smoothing techniques, with (b)-splines. Consider here kernel based techniques. Note that here, we do not use the “logistic” model… it is purely non-parametric. kernel based estimated, from scratch I like kernels because they are somehow ...

Classification from scratch, trees 9/8

May 30, 2018 | arthur charpentier

Nineth post of our series on classification from scratch. Today, we’ll see the heuristics of the algorithm inside classification trees. And yes, I promised eight posts in that series, but clearly, that was not sufficient… sorry for the poor prediction. Decision Tree Decision trees are easy to read. So ...

Classification from scratch, logistic with splines 2/8

May 30, 2018 | arthur charpentier

Today, second post of our series on classification from scratch, following the brief introduction on the logistic regression. Piecewise linear splines To illustrate what’s going on, let us start with a “simple” regression (with only one explanatory variable). The underlying idea is natura non facit saltus, for “nature does ...

Classification from scratch, logistic regression 1/8

May 30, 2018 | arthur charpentier

Let us start today our series on classification from scratch… The logistic regression is based on the assumption that given covariates , has a Bernoulli distribution,The goal is to estimate parameter . Recall that the heuristics for the use of that function for the probability is that Maximimum of the (log)...

Classification from scratch, overview 0/8

May 29, 2018 | arthur charpentier

Before my course on « big data and economics » at the university of Barcelona in July, I wanted to upload a series of posts on classification techniques, to get an insight on machine learning tools. According to some common idea, machine learning algorithms are black boxes. I wanted to get back ...

Some sort of Otto Neurath (isotype picture) map

May 14, 2018 | arthur charpentier

Yesterday evening, I was walking in Budapest, and I saw some nice map that was some sort of Otto Neurath style. It was hand-made but I thought it should be possible to do it in R, automatically. A few years ago, Baptiste Coulmont published a nice blog post on the ...

Graduate Course on Advanced Tools for Econometrics (2)

March 25, 2018 | arthur charpentier

This Tuesday, I will be giving the second part of the (crash) graduate course on advanced tools for econometrics. It will take place in Rennes, IMAPP room, and I have been told that there will be a visio with Nantes and Angers. Slides for the morning a... [Read more...]

When “learning Python” becomes “practicing R” (spoiler)

March 8, 2018 | arthur charpentier

15 years ago, a student of mine told me that I should start learning Python, that it was really a great language. Students started to learn it, but I kept postponing. A few years ago, I started also Python for Kids, which is really nice actually, with my son. That was ...

Using convolutions (S3) vs distributions (S4)

January 24, 2018 | arthur charpentier

Usually, to illustrate the difference between S3 and S4 classes in R, I mention glm (from base) and vglm (from VGAM) that provide similar outputs, but one is based on S3 codes, while the second one is based on S4 codes. Another way to illustrate is to manipulate distributions. Consider ...

Holt-Winters with a Quantile Loss Function

January 8, 2018 | arthur charpentier

Exponential Smoothing is an old technique, but it can perform extremely well on real time series, as discussed in Hyndman, Koehler, Ord & Snyder (2008)), when Gardner (2005) appeared, many believed that exponential smoothing should be disregarded because it was either a special case of ARIMA modeling or an ad hoc procedure with ... [Read more...]

The myth of interpretability of econometric models

December 8, 2017 | arthur charpentier

There are important discussions nowadays about data modeling, to choose between the “two cultures” (as mentioned in Breiman (2001)), i.e. either econometrics models or machine/statistical learning models. We did discuss this issue recently in Econométrie et Machine Learning (so far only in French) with Emmanuel Flachaire and Antoine ...

Traveling Salesman

September 26, 2017 | arthur charpentier

In the second part of the course on graphs and networks, we will focus on economic applications, and flows. The first series of slides are on the traveling salesman problem. Slides are available online. [Read more...]

Matching, Optimal Transport and Statistical Tests

July 30, 2017 | arthur charpentier

To explain the “optimal transport” problem, we usually start with Gaspard Monge’s “Mémoire sur la théorie des déblais et des remblais“, where the the problem of transporting a given distribution of matter (a pile of sand for instance) into another (an excavation for instance). This problem ...

The U.S. Has Been At War 222 Out of 239 Years

March 19, 2017 | arthur charpentier

This morning, I discovered an interesting statistic, America Has Been At War 93% of the Time – 222 Out of 239 Years – Since 1776, i.e. the U.S. has only been at peace for less than 20 years total since its birth. I wanted to check, get a better understanding and look at other countries ...

Advanced Econometrics: Model Selection

March 7, 2017 | arthur charpentier

On Thursday, March 23rd, I will give the third lecture of the PhD course on advanced tools for econometrics, on model selection and variable selection, where we will focus on ridge and lasso regressions . Slides are available online. The first part w... [Read more...]

« 1 2 3 4 5 6 … 19 »

Copyright © 2025 | MH Corporate basic by MH Themes