## Remembering the Gist, But Not the Details: One-Dimensional Representation of Consumer Ratings

October 13, 2013
In survey research, it makes a difference how the question is asked.  "How would you rate the service you received at that restaurant?" is not the same as "Did you have to wait to be seated, to order your meal, to be served your food, or to pay yo...

## Logistic regression and categorical covariates

September 26, 2013
A short post, for my non-life insurance course, returning to how to interpret the output of a regression when there is a categorical covariate. Consider the following dataset:

    > db = read.table("http://freakonometrics.free.fr/db.txt",header=TRUE,sep=";")
    > tail(db)
        Y       X1       X2 X3
    995 1 4.801836 20.82947  A
    996 1 9.867854 24.39920  C
    997 1 5.390730 21.25119  D
    998 1...
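As a rough illustration of the interpretation issue, here is a minimal sketch on simulated data. It does not use the post's `db.txt`; the five levels `A` to `E`, the coefficients, and the names `shift`, `fit`, `X3c`, and `fit_C` are assumptions made for the example. With R's default treatment contrasts, each factor coefficient is a log-odds difference from the reference level, so changing the reference level changes the reported coefficients but not the fitted probabilities.

```r
# Simulated stand-in for the post's data (hypothetical values, not db.txt)
set.seed(1)
n  <- 1000
X1 <- runif(n, 0, 10)
X3 <- factor(sample(c("A", "B", "C", "D", "E"), n, replace = TRUE))

# Assumed true model: common slope on X1 plus a level-specific shift in log-odds
shift <- c(A = 0, B = 0.5, C = 1, D = -0.5, E = 0)
eta   <- -1 + 0.3 * X1 + unname(shift[as.character(X3)])
Y     <- rbinom(n, 1, plogis(eta))

# Default treatment contrasts: level "A" is the reference,
# so X3B, X3C, ... are log-odds differences relative to "A"
fit <- glm(Y ~ X1 + X3, family = binomial)
print(coef(fit))

# Same model with "C" as the reference: the coefficients change,
# but the fitted probabilities do not
X3c   <- relevel(X3, ref = "C")
fit_C <- glm(Y ~ X1 + X3c, family = binomial)
print(coef(fit_C))
```

Comparing `fitted(fit)` with `fitted(fit_C)` is a quick way to confirm that the two parameterizations describe the same model.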

## R GIS: Polygon Intersection with gIntersection{rgeos}

September 16, 2013
A short tutorial on doing intersections in R GIS. gIntersection{rgeos} will pick the polygons of the first submitted polygon contained within the second polygon; this is done without cutting the polygon's edges which cross the clip source polygon. For the function that I use to download the example data, url_shp_to_spdf(), please see HERE.

    library(rgeos)
    library(dismo)
    URLs...

## R for Ecologists: RLQ analysis (semi) explained

September 3, 2013
I’ve been reading about RLQ analysis, also known as the fourth corner method, for analyzing relationships between environmental characteristics and species traits. I was interested because I thought I might be using RLQ analysis to answer a specific set of …

## Estimation, prediction, and evaluation of logistic regression models

August 26, 2013
I provide an introduction to using logistic regression for prediction (binary classification), using the Titanic data competition from www.Kaggle.com as an example. I use models to fill in missing data, estimate a logistic regression model on a trai...

## Residuals from a logistic regression

August 23, 2013
I always claim that graphs are important in econometrics and statistics! Of course, it is usually not that simple. Let me come back to a recent experience. I got an email from Sami yesterday, sending me a graph of residuals and asking what could be done with a graph of residuals obtained from a logistic regression?...

## Linear discriminant analysis or logistic regression

July 10, 2013
Suppose we have the IRIS units of Paris (in population >100k inhabitants) and want to classify them by their sociodemographic characteristics: population, unemployment rate, students, socio-professional category (CSP), etc. Once the IRIS units are classified, we ask whether this typology can be carried over to another large city (Lyon, for example). We would then need a model that assigns IRIS units according to their respective characteristics...

## Getting raster histogram in QGIS using SEXTANTE and R

July 6, 2013
The issue with the broken histogram creation tool in QGIS annoyed me for far too long. Sometimes you just need a quick glance at the histogram of a raster to decide how to process it or to assess the distribution of classes. But as you kn...

## R GIS: Terrain Analysis for Polygons as Simple as it Gets!

June 18, 2013
    library(rgdal)
    library(raster)
    alt gadm gadm_sub
    plot(alt)
    plot(gadm_sub, add=T)
    asp slo
    > extract(slo, gadm_sub, fun = mean, na.rm = T, small = T, df = T)
      ID     slope
    1  1  9.959053
    2  2  1.047443
    3  3  7.456165
    4  4  1.673786
    5  5 11.946553
    > extract(asp, gadm_sub, fun = mean, na.rm = T, small...

## Robust logistic regression

June 7, 2013
Corey Yanofsky writes: In your work, you’ve robustificated logistic regression by having the logit function saturate at, e.g., 0.01 and 0.99, instead of 0 and 1. Do you have any thoughts on a sensible setting for the saturation values? My intuition suggests that it has something to do with proportion of outliers expected in the...
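To make the idea of a saturating link concrete, here is a minimal sketch of one way to write it, assuming the predicted probability is a rescaled inverse logit bounded by the two saturation values. The function name `robust_inv_logit` and its arguments `lower` and `upper` are hypothetical, and this is not necessarily the exact form used in the work the question refers to.

```r
# Inverse logit squeezed into [lower, upper] instead of [0, 1].
# Assumed form: lower + (upper - lower) * plogis(eta)
robust_inv_logit <- function(eta, lower = 0.01, upper = 0.99) {
  stopifnot(lower >= 0, upper <= 1, lower < upper)
  lower + (upper - lower) * plogis(eta)
}

# Even for extreme linear predictors the probability stays inside the bounds,
# so a single wildly mislabeled point has a bounded log-likelihood penalty
p <- robust_inv_logit(c(-Inf, 0, Inf))
print(p)
```

Under this form, the worst-case contribution of one observation to the negative log-likelihood is `-log(lower)` (or `-log(1 - upper)`), which is why the choice of saturation values is naturally tied to how many outliers one expects.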