Monthly Archives: September 2013

Going to Plot Some Proportions? Why not Flog ’em First?

September 23, 2013
By
Going to Plot Some Proportions? Why not Flog ’em First?

Fractions and proportions can be difficult to plot nicely for a number of reasons: If the proportions are based on small counts (e.g., two of his three computing devices were Apple products) then the calculated proportions will only take on a number of discrete values. Depending on what you have measured there might be many proportions close to the...

Read more »

Waiting in One Line or Multiple Lines

September 23, 2013
By
Waiting in One Line or Multiple Lines

Whenever I go to the grocery store it always seems to be a lesson in statistics. I go get the things I need to buy and then  I try to select the checkout register that will decrease the amount of time I have to wait. Inevitably, I select the one line where there is some

Read more »

Introducing parallelRandomForest: faster, leaner, parallelized

September 23, 2013
By
ffffffffffffforst

Together with other members of Andreas Beyer's research group, I participated in the DREAM 8 toxicogenetics challenge. While the jury is still out on the results, I want to introduce my improvement of the R randomForest package, namely parall...

Read more »

Detecting Weak Instruments in R

September 23, 2013
By
Detecting Weak Instruments in R

Any instrumental variables (IV) estimator relies on two key assumptions in order to identify causal effects: That the excluded instrument or instruments only effect the dependent variable through their effect on the endogenous explanatory variable or variables (the exclusion restriction), That the correlation between the excluded instruments and the endogenous explanatory variables is strong enough

Read more »

4th Milano R net meeting to be held on October 24, 2013

September 23, 2013
By

4th Milano R net meeting October 24, 2013 @ 6.00 PM Fiori Oscuri Bistrot & Bar Via Fiori Oscuri, 3 Milano Further details will be published shortly. Stay connected!

Read more »

analyze the home mortgage disclosure act (hmda) microdata with r and monetdb

September 23, 2013
By

back in 1975, congress had it up to here with discriminatory lending practices and decided to require financial organizations originating home mortgages to report some basic operational statistics publicly.  the home mortgage disclosure act mandat...

Read more »

Exploratory Data Analysis: Quantile-Quantile Plots for New York’s Ozone Pollution Data

Exploratory Data Analysis: Quantile-Quantile Plots for New York’s Ozone Pollution Data

Introduction Continuing my recent series on exploratory data analysis, today’s post focuses on quantile-quantile (Q-Q) plots, which are very useful plots for assessing how closely a data set fits a particular distribution.  I will discuss how Q-Q plots are constructed and use Q-Q plots to assess the distribution of the “Ozone” data from the built-in

Read more »

MCMSki IV, Jan. 6-8, 2014, Chamonix (news #9)

September 22, 2013
By
MCMSki IV, Jan. 6-8, 2014, Chamonix (news #9)

This a reminder about the October 15 deadlines for MCMSki IV: First, the early bird rate for the registration ends up on October 15. Second, the young investigator travel support can only be requested up to October 15 as well. Be sure to book your hotel or rental place early too as Chamonix is quite popular in

Read more »

Preview of my book – Modeling data with functional programming in R

September 21, 2013
By
Preview of my book – Modeling data with functional programming in R

As some of you know, I’ve been writing a book (to be published by CRC Press/Chapman & Hall and released …Continue reading »

Read more »

A few gotchas with R date-time classes

September 21, 2013
By

Date and time handling is essential to many modelling and analysis exercises, in R and other languages used for scientific computing. Over the past few months I tackled the mapping of date-time concepts between R and the .NET framework as part of the w...

Read more »