1651 search results for "ggplot2"

Benchmarking feature selection with Boruta and caret

November 25, 2010
By
Benchmarking feature selection with Boruta and caret

Feature selection is the data mining process of selecting the variables from our data set that may have an impact on the outcome we are considering. For commercial data mining, which is often characterised by having too many variables for model building, this is an...

Read more »

Is there a Market for Premium R Packages?

November 19, 2010
By
Is there a Market for Premium R Packages?

Nathan Yau, of the excellent FlowingData blog, recently asked on his Twitter stream: I wonder if there’s a market for premium R packages, like there is for say, @wordpress themes and plugins There are some great packages available for R, all of which are currently free. I think it would be great if authors like

Read more »

Competitive Data Science: An Update

November 18, 2010
By

A quick reminder that two competitions based around data analysis, both very suited to R, are currently underway. First, there's still plenty of time to enter the competition to predict popular R packages, announced by the The Dataists and hosted at Kaggle. According to organizer Drew Conway, the competition has already received 114 entries from 21 teams. But with...

Read more »

Visualizing US House Results with a Seats-Votes curve

November 16, 2010
By
Visualizing US House Results with a Seats-Votes curve

A few weeks ago I wrote about ways to compare major-party returns in US House elections. I experimented with several visualizations, none as useful as the seats-votes curve. A traditional seats-votes cure measures average party performance against individual US House results. Our simplified curve uses a density plot to measure major-party (Democratic, in this case)

Read more »

Feature selection: Using the caret package

November 16, 2010
By
Feature selection: Using the caret package

Feature selection is an important step for practical commercial data mining which is often characterised by data sets with far too many variables for model building. In a previous post we looked at all-relevant feature selection using the Boruta package while in this post we consider the same (artificial, toy) examples using the caret package. ...

Read more »

Feature selection: Using the caret package

November 16, 2010
By
Feature selection: Using the caret package

Feature selection is an important step for practical commercial data mining which is often characterised by data sets with far too many variables for model building. In a previous post we looked at all-relevant feature selection using the Boruta package while in this post we consider the same (artificial, toy) examples using the caret package. ...

Read more »

In case you missed it: October Roundup

November 16, 2010
By

In case you missed them, here are some articles from October of particular interest to R users. Reviews of the winners and finalists of the 2010 ggplot2 case study competition. We have published a new article "R is Hot", with interviews from a dozen R users in industry and academia. A new code highlighting tool for displaying R code...

Read more »

Isarithmic History of the Two-Party Vote

November 15, 2010
By
Isarithmic History of the Two-Party Vote

A few weeks ago, I shared a series of choropleth maps of U.S. presidential election returns, illustrating the relative support for Democratic, Republican, and third Party candidates since 1920. The granularity of these county level results led me to wonder whether it would be possible to develop an isarithmic map of presidential voting using the … Read more

Read more »

Help Mozilla visualize how people use Firefox

November 11, 2010
By
Help Mozilla visualize how people use Firefox

You might recall we posted a couple of weeks ago this chart summarizing the times of the day Firefox users switch on Private Browsing mode: The chart, based on data from the Mozilla Test Pilot program tells an interesting story about the habits of Web users. But what other interesting stories could be told, to reveal more insights into...

Read more »

Mortgage Calculator (and Amortization Charts) with R

November 10, 2010
By
Mortgage Calculator (and Amortization Charts) with R

Mortgage rates have been at historic lows recently.  The rates are posted various places online along with simple mortgage calculators.  Such calculators illustrate the payment schedule for a mortgage based upon selected terms. But with less ...

Read more »