1783 search results for "Ggplot2"

Comparison of ave, ddply and data.table

October 28, 2011
By
Comparison of ave, ddply and data.table

This is a copy of a post by me on the R-statistics blog. Fortran and C programmers often say that interpreted languages like R are nice and all, but lack in terms of speed. How fast something works in R… See more ›

Read more »

Mixed-Effects Models in R with Quantum Forest

October 26, 2011
By

For anyone who wants to estimate linear or nonlinear mixed-effects models (aka random-effects models, hierarchical models or multilevel models) using the R language, the Quantum Forest blog has several recent posts that will be of interest. Written by Luis Apiolaza from the School of Forestry at the University of Canterbury in New Zealand, the blog includes a number of...

Read more »

Machine Learning Ex 5.2 – Regularized Logistic Regression

October 25, 2011
By
Machine Learning Ex 5.2 – Regularized Logistic Regression

Now we move on to the second part of the Exercise 5.2, which requires to implement regularized logistic regression using Newton's Method. Plot the data:

Read more »

Installing the RMySQL package on Windows 7

October 25, 2011
By

So you want to get statistical? Nowadays one of the ways to go is to use R, mostly in combination with ggplot2 for generating the plots. These plots and graphs however need some data, for that we use data sources. There are a lot of data sources availa...

Read more »

Machine Learning Ex 5.1 – Regularized Linear Regression

October 25, 2011
By
Machine Learning Ex 5.1 – Regularized Linear Regression

The first part of the Exercise 5.1 requires to implement a regularized version of linear regression. Adding regularization parameter can prevent the problem of over-fitting when fitting a high-order polynomial. Read More: 194 Words Totally

Read more »

Simple Heatmap in R with Formula One Dataset

October 24, 2011
By
Simple Heatmap in R with Formula One Dataset

Now, that the 2011 F1 season is over I decided to quickly scrub the Formula 1 data of the F1.com website, such as the list of drivers, ordered by the approximate amount of salary driver is getting (top list driver is making the most, approx. 30MM) and position at the end of each race. There

Read more »

Machine Learning Ex4 – Logistic Regression

October 24, 2011
By
Machine Learning Ex4 – Logistic Regression

Exercise 4 required implementing Logistic Regression using Newton's Method. The dataset in use is 80 students and their grades of 2 exams, 40 students were admitted to college and the other 40 students were not. We need to implement a binary classification model to estimates college admission based on the student's scores on...

Read more »

Isarithmic Maps of Public Opinion Data

October 24, 2011
By
Isarithmic Maps of Public Opinion Data

As a follow-up to my isarithmic maps of county electoral data, I have attempted to experiment with extending the technique in two ways. First, where the electoral maps are based on data aggregated to the county level, I have sought to generalize the method to accept individual responses for which only zip code data is … Continue reading →

Read more »

Teaching with R: the switch

October 21, 2011
By

There are several blog posts, websites (and even books) explaining the transition from using another statistical system (e.g. SAS, SPSS, Stata, etc) to relying on R. Most of that material treats the topic from the point of view of i- … Continue reading →

Read more »

Spatial correlation in designed experiments

October 20, 2011
By
Spatial correlation in designed experiments

Last Wednesday I had a meeting with the folks of the New Zealand Drylands Forest Initiative in Blenheim. In addition to sitting in a conference room and having nice sandwiches we went to visit one of our progeny trials at … Continue reading →

Read more »