Econometrics and Free Software | R-bloggers

Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

February 22, 2020 | Econometrics and Free Software

Intro: the need for microdata, and the risk of disclosure Survey and administrative data are essential for scientific research, however accessing such datasets can be very tricky, or even impossible. In my previous job I was responsible for getting access to such “scientific micro-datasets” from institutions like Eurostat. In general, ...

[Read more...]

Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

February 22, 2020 | Econometrics and Free Software

Intro: the need for microdata, and the risk of disclosure Survey and administrative data are essential for scientific research, however accessing such datasets can be very tricky, or even impossible. In my previous job I was responsible for getting access to such “scientific micro-datasets” from institutions like Eurostat. In general, ... [Read more...]

Dynamic discrete choice models, reinforcement learning and Harold, part 2

February 13, 2020 | Econometrics and Free Software

In this blog post, I present a paper that has really interested me for a long time. This is part2, where I will briefly present the model of the paper, and try to play around with the data. If you haven’t, I suggest you read part 1 where I provid...

[Read more...]

Dynamic discrete choice models, reinforcement learning and Harold, part 2

February 13, 2020 | Econometrics and Free Software

In this blog post, I present a paper that has really interested me for a long time. This is part2, where I will briefly present the model of the paper, and try to play around with the data. If you haven’t, I suggest you read part 1 where I provide ... [Read more...]

Dynamic discrete choice models, reinforcement learning and Harold, part 1

January 25, 2020 | Econometrics and Free Software

Introduction I want to write about an Econometrica paper written in 1987 (jstor link) by John Rust, currently Professor of Economics at Georgetown University, paper which has been on my mind for the past 10 years or so. Why? Because it is a s...

[Read more...]

Dynamic discrete choice models, reinforcement learning and Harold, part 1

January 25, 2020 | Econometrics and Free Software

Introduction I want to write about an Econometrica paper written in 1987 (jstor link) by John Rust, currently Professor of Economics at Georgetown University, paper which has been on my mind for the past 10 years or so. Why? Because it is a seminal paper in the econometric literature, but it is ... [Read more...]

Intrumental variable regression and machine learning

November 8, 2019 | Econometrics and Free Software

Intro Just like the question “what’s the difference between machine learning and statistics” has shed a lot of ink (since at least Breiman (2001)), the same question but where statistics is replaced by econometrics has led to a lot of discussion, as well. I like this presentation by Hal Varian ...

[Read more...]

Intrumental variable regression and machine learning

November 8, 2019 | Econometrics and Free Software

Intro Just like the question “what’s the difference between machine learning and statistics” has shed a lot of ink (since at least Breiman (2001)), the same question but where statistics is replaced by econometrics has led to a lot of discussion, as well. I like this presentation by Hal Varian ... [Read more...]

Multiple data imputation and explainability

November 1, 2019 | Econometrics and Free Software

Introduction Imputing missing values is quite an important task, but in my experience, very often, it is performed using very simplistic approaches. The basic approach is to impute missing values for numerical features using the average of each fe...

[Read more...]

Multiple data imputation and explainability

November 1, 2019 | Econometrics and Free Software

Introduction Imputing missing values is quite an important task, but in my experience, very often, it is performed using very simplistic approaches. The basic approach is to impute missing values for numerical features using the average of each feature, or using the mode for categorical features. There are better ways ... [Read more...]

Cluster multiple time series using K-means

October 12, 2019 | Econometrics and Free Software

I have been recently confronted to the issue of finding similarities among time-series and though about using k-means to cluster them. To illustrate the method, I’ll be using data from the Penn World Tables, readily available in R (inside the {pwt9...

[Read more...]

Cluster multiple time series using K-means

October 12, 2019 | Econometrics and Free Software

I have been recently confronted to the issue of finding similarities among time-series and though about using k-means to cluster them. To illustrate the method, I’ll be using data from the Penn World Tables, readily available in R (inside the {pwt9} package):

library(tidyverse)
library(lubridate)
library(pwt9)
library(brotools)

First, of all, let’s only ... [Read more...]

Split-apply-combine for Maximum Likelihood Estimation of a linear model

October 4, 2019 | Econometrics and Free Software

Intro Maximum likelihood estimation is a very useful technique to fit a model to data used a lot in econometrics and other sciences, but seems, at least to my knowledge, to not be so well known by machine learning practitioners (but I may be wro...

[Read more...]

Split-apply-combine for Maximum Likelihood Estimation of a linear model

October 4, 2019 | Econometrics and Free Software

Intro Maximum likelihood estimation is a very useful technique to fit a model to data used a lot in econometrics and other sciences, but seems, at least to my knowledge, to not be so well known by machine learning practitioners (but I may be wrong about that). Other useful techniques ... [Read more...]

R-bloggers

R news and tutorials contributed by hundreds of R bloggers

Articles by Econometrics and Free Software

What would a keyboard optimised for Luxembourguish look like?

What would a keyboard optimised for Luxembourguish look like?

Explainbility of {tidymodels} models with {iml}

Explainbility of {tidymodels} models with {iml}

Machine learning with {tidymodels}

Machine learning with {tidymodels}

Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

Synthetic micro-datasets: a promising middle ground between data privacy and data analysis

Dynamic discrete choice models, reinforcement learning and Harold, part 2

Dynamic discrete choice models, reinforcement learning and Harold, part 2

Dynamic discrete choice models, reinforcement learning and Harold, part 1

Dynamic discrete choice models, reinforcement learning and Harold, part 1

Intrumental variable regression and machine learning

Intrumental variable regression and machine learning

Multiple data imputation and explainability

Multiple data imputation and explainability

Cluster multiple time series using K-means

Cluster multiple time series using K-means

Split-apply-combine for Maximum Likelihood Estimation of a linear model

Split-apply-combine for Maximum Likelihood Estimation of a linear model

Articles by Econometrics and Free Software

Never miss an update! Subscribe to R-bloggers to receive e-mails with the latest R posts. (You will not see this message again.)

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)