Articles by arthur charpentier

Random points on the Earth

December 7, 2013 | arthur charpentier

The problem with puzzles is that you keep it in your head for days, until you find an answer. Or at least some ideas about a possible answer. This is what happened to me a few weeks ago, when a colleague of mine asked me the following question : Consider points ... [Read more...]

Conditional densities, on one single graph

December 5, 2013 | arthur charpentier

With Stéphane Tufféry we’ve been working on credit scoring1 and we’ve been using the popular german credit dataset, __ myVariableNames credit = read.table( + "http://archive.ics.uci.edu/ml/machine-learning-databases/statlog/german/german.data", + header=FALSE,col.names=myVariableNames) __ credit$class library(RColorBrewer) __ CL=brewer.pal(6, "RdBu") __ ... [Read more...]

Binomial regression model

November 18, 2013 | arthur charpentier

Most of the time, when we introduce binomial models, such as the logistic or probit models, we discuss only Bernoulli variables, . This year (actually also the year before), I discuss extensions to multinomial regressions, where  is a function on some simplex. The multinomial logistic model was mention here. The idea ... [Read more...]

Maximum Likelihood versus Goodness of Fit

November 8, 2013 | arthur charpentier

Thursday, I got an interesting question from a colleague of mine (JP). I mean, the way I understood the question turned out to be a nice puzzle (but I have to confess I might have misunderstood). The question is the following : consider a i.i.d. sample of continuous variables. ... [Read more...]

Generating functions

November 8, 2013 | arthur charpentier

Today, I wanted to publish a post on generating functions, based on discussions I had with Jean-Francois while having our coffee after lunch a couple of times already. The other reason is that I publish my post while my student just finished their Probability exam (and there were a few ... [Read more...]

Smoothing mortality rates

November 4, 2013 | arthur charpentier

This morning, I was working with Julie, a student of mine, coming from Rennes, on mortality tables. Actually, we work on genealogical datasets from a small region in Québec, and we can observe a lot of volatiliy. If I borrow one of her graph, we get something like Since ... [Read more...]

Halloween and candies (a ballot problem)

October 30, 2013 | arthur charpentier

This year, for Halloween, a post on candies (I promise, next year I will write another post on zombies). But I don’t want to focus on the kids problems (last year, we tried to minimize their walking distance to collect as much candies as possible, with part 1 and part 2), ... [Read more...]

More significant? so what…

October 30, 2013 | arthur charpentier

Following my non-life insurance class, this morning, I had an interesting question from a student, that I will try to illustrate, and reformulate as accurately as possible. Consider a simple regression model, with one variable of interest, and one possible explanatory variable. Assume that we have two possible models, with ... [Read more...]

Pricing Reinsurance Contracts

October 24, 2013 | arthur charpentier

In order to illustrate the next section of the non-life insurance course, consider the following example1, inspired from http://sciencepolicy.colorado.edu/…. This is the so-called “Normalized Hurricane Damages in the United States” dataset, for the period 1900-2005, from Pielke et al. (2008). The dataset is available in xls format, so ... [Read more...]

GLM, non-linearity and heteroscedasticity

October 22, 2013 | arthur charpentier

Last week in non-life insurance course, we’ve seen the theory of the Generalized Linear Models, emphasizing the two important components the link function (which is actually the key component in predictive modeling) the distribution, or the variance function Just to illustrate, consider my favorite dataset ­lin.mod = lm(dist~... [Read more...]

Equidistant points on a map

October 17, 2013 | arthur charpentier

This morning, I had a comment on a recent post, regarding a graph I did upload on the blog, which was extracted from a paper now online (see http://hal.archives-ouvertes.fr/hal-00871883). Jo (from KUL, I guess I can share that piece of information) asked me I was wondering ... [Read more...]

Some heuristics about spline smoothing

October 8, 2013 | arthur charpentier

Let us continue our discussion on smoothing techniques in regression. Assume that . where is some unkown function, but assumed to be sufficently smooth. For instance, assume that  is continuous, that exists, and is continuous, that  exists and is also continuous, etc. If  is smooth enough, Taylor’s expansion can be ... [Read more...]

Regression on variables, or on categories?

September 30, 2013 | arthur charpentier

I admit it, the title sounds weird. The problem I want to address this evening is related to the use of the stepwise procedure on a regression model, and to discuss the use of categorical variables (and possible misinterpreations). Consider the following dataset __ db = read.table("http://freakonometrics.free.fr/... [Read more...]

ROC curves and classification

September 30, 2013 | arthur charpentier

To get back to a question asked after the last course (still on non-life insurance), I will spend some time to discuss ROC curve construction, and interpretation. Consider the dataset we’ve been using last week, __ db = read.table("http://freakonometrics.free.fr/db.txt",header=TRUE,sep=";") __ attach(db) ... [Read more...]

Nice tutorials to discover R

September 28, 2013 | arthur charpentier

A series of tutorials, in R, by Anthony Damico. As claimed on http://twotorials.com/, “how to do stuff in r. two minutes or less, for those of us who prefer to learn by watching and listening“. So far, 000 what is r? the lingua statistica, s’il vous plaît 001 ... [Read more...]

Logistic regression and categorical covariates

September 26, 2013 | arthur charpentier

A short post to get back – for my nonlife insurance course – on the interpretation of the output of a regression when there is a categorical covariate. Consider the following dataset __ db = read.table("http://freakonometrics.free.fr/db.txt",header=TRUE,sep=";") __ tail(db) Y X1 X2 X3 995 1 4.801836 20.82947 A 996 1 9.867854 24.39920 C 997 1 5.390730 21.25119 ... [Read more...]

Monty Hall (oh no, not again)

September 13, 2013 | arthur charpentier

Quite frequently, someone on the internet discovers the Monty Hall paradox, and become so enthusiastic that it becomes urgent to publish an article – or a post – about it. The latest example can be http://www.bbc.co.uk/news/magazine-24045598. I won’t blame them, I did the same a ... [Read more...]
1 9 10 11 12 13 19

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)