38 search results for "rattle"

Big Data Sets you can use with R

August 22, 2013

by Joseph Rickert
The world may indeed be awash with data; however, it is not always easy to find a suitable data set when you need one. As the number of people becoming involved with R and data science increases, so does the need for interesting data sets for creating examples, showcasing machine learning algorithms and developing statistical analyses....

K-means Clustering (from “R in Action”)

August 7, 2013

In R’s partitioning approach, observations are divided into K groups and reshuffled to form the most cohesive clusters possible according to a given criterion. There are two methods: K-means and partitioning around medoids (PAM). In this article, based on chapter 16 of R in Action, Second Edition, author Rob Kabacoff discusses K-means clustering.

R talks to Weka about Data Mining

July 14, 2013

R provides us with excellent resources to mine data, and there are some good overviews out there: Yanchang’s website, with examples and a nice reference card; the rattle package, which introduces a nice GUI for R, and Graham Williams’s compendium of tools; and the caret package, which offers a unified interface to running a multitude of model builders.

Draw nicer Classification and Regression Trees with the rpart.plot package

June 19, 2013

by Joseph Rickert
The basic way to plot a classification or regression tree built with R’s rpart() function is just to call plot(). However, in general, the results just aren’t pretty. As it turns out, for some time now there has been a better way to plot rpart() trees: the prp() function in Stephen Milborrow’s rpart.plot package. This function...

Innovation Will Never Be At The Push Of A Button

May 17, 2013

"@randyzwitch @benjamingaines @usujason I am envisioning the data science equivalent of an autonomous vehicle pileup." (Todd Belcher, @toddmetrics, May 16, 2013)

Recently, I’ve been getting my blood pressure up reading (marketing) articles about “big data” and “data science”. What saddens me about the whole discussion is that there is the underlying premise that Innovation Will Never...

Forecast Update: Will 2014 be the Beginning of the End for SAS and SPSS?

May 14, 2013

I recently updated my plots of the data analysis tools used in academia in my ongoing article, The Popularity of Data Analysis Software. I repeat those here and update my previous forecast of data analysis software usage. Learning to use …

Video: Data Mining with R

February 15, 2013

Yesterday's Introduction to R for Data Mining webinar was a record setter, with more than 2000 registrants and more than 700 attending the live session presented by Joe Rickert. If you missed it, I've embedded the video replay below, and Joe's slides (with links to many useful resources) are also available. During the webinar, Joe demoed several examples of...

Learn about R through data mining

February 5, 2013

If you're in San Francisco for this week's DeveloperWeek conference, our own Joe Rickert will also be giving a presentation on Wednesday at 2:10PM on Predictive Modeling with Big Data in R, which will feature several demos of data mining massive data sets using Revolution R Enterprise. Incidentally, the whole team at Revolution Analytics was proud to receive the Top...

Video: SQL queries in R using sqldf package

December 17, 2012

This video covers how to run SQL queries using the ‘sqldf’ package within R. This sqldf tutorial was part of a Keystone Solutions podcast discussion about data science and what skills beginning analysts should be learning to improve their skill set. The example files from this tutorial can be downloaded from this link: Example Data Video: SQL...

Predictive Modeling using R and the OpenScoring-Engine – a PMML approach

December 13, 2012

On November 27th, a special post caught my interest. Scott Mutchler presented a small framework for predictive analytics based on PMML (Predictive Model Markup Language) and a Java-based REST interface. PMML is an XML-based standard for the description and exchange of analytical models. The idea is that every piece of software which supports the corresponding...
