This is a lecture post for my students in the CUNY MS Data Analytics program. In this series of lectures …

After my last post I have repeatedly received two questions: (a) is it worthwhile to analyze GNU R speed in simulations, and (b) how would simulation speed compare between GNU R and Python? In this post I want to address the former question and next ti...

Most books that discuss regression modeling start out, and often finish, with Ordinary Least Squares (OLS) as the technique to use; Generalized Least Squares (GLS) sometimes gets a mention near the back. This is all well and good if the readers’ data has the characteristics required for OLS to be an applicable technique. A lot

In a previous post, I gave a cursory overview of how prior information about covariate measurement error can reduce bias in linear regression. In the comments, Rasmus Bååth asked about estimation in the absence of strong priors. Here, I’ll describe a Bayesian approach for estimation and correction for covariate measurement error using a latent-variable based errors-in-variables...

by Joseph Rickert. In this roundup of R-related news: Domino enables data science collaboration; Plotly adds an R graphics gallery; Revolution Analytics R user group sponsorship applications are open; and Quandl adds new data sets. San Francisco startup takes on collaborative Data Science: Domino, a San Francisco-based startup, is inviting users to sign up to beta test its...

Christian Robert reviewed online a paper that was critical of non-informative priors. Among the points discussed by him and other contributors (e.g. Keith O’Rourke) was the issue of induced priors, i.e. priors which arise from a transformation of original parameters, or of observables. I found this exchange interesting because I did something

I am happy to announce that version 1.0 of the PSAboot package has been released to CRAN. This package implements bootstrapping for propensity score analysis. This deviates from typical implementations such as boot in that it allows for separate sampling specifications for treatment and control units. For example, in the case where the ratio of treatment-to-control units is...

Getting Started with Multilevel Modeling in R. Jared E. Knowles. Introduction: Analysts dealing with grouped data and complex hierarchical structures in their data, ranging from measurements nested within participants to counties nested within states or students nested within classrooms, often find themselves...

This is a gem of a book. From the introduction: We intend this work to be a practitioner’s guide to the predictive modeling process and a place where one can come to learn about the approach and to gain intuition about the many commonly used and modern, powerful models. …it was our goal to be as hands-on as possible, enabling the readers...
