169 search results for "iris"

Pro Grammar and Devel Hoper

August 22, 2014

I've been teasing about this post for some time now.

Romain François (@romain_francois), August 3, 2014: My next blog post is "Pro Grammar and Devel Hoper". And this not just an empty pun. Stay tuned.

Romain François (@romain_francois), to @stefanbache: another teaser. https://t.co/i2ubfOyjIO
iris >> filter( Sepal.Length > 7 )
iris |> filter( Sepal.Length > 7 )

Read more »

Do your "data janitor work" like a boss with dplyr

August 20, 2014

Data “janitor-work” The New York Times recently ran a piece on wrangling and cleaning data: “For Big-Data Scientists, ‘Janitor Work’ Is Key Hurdle to Insights” Whether you call it “janitor-work,” wrangling/munging, cleaning/cleansing/scrubbing, tidying, or something else, the article above is worth a read (even though it implicitly denigrates the important work that your housekeeping staff does). It’s...

Read more »

sort.data.frame

August 15, 2014
I came across this post on SO, where several solutions to sorting data.frames are presented. It must have been solved a million times, but here's a solution I like to use. It benefits from the fact that sort is an …

Read more »

GrapheR: A GUI for base graphics in R

August 12, 2014

How did I miss the GrapheR package? The author, Maxime Hervé, published an article about the package in the same issue of the R Journal as we did on googleVis. Yet it took a package update notification on CRANberries for me to look into GrapheR in more detail, 3 years later! And...

Read more »

A Conversation with Max Kuhn – The useR! 2014 Interview

August 11, 2014

The Interview: In the video above, Max provides some amazing insights into the why and...

Read more »

A Few Notes on UseR! 2014

July 25, 2014

It has been a month since the UseR! 2014 conference, and I'm probably the last one who writes about it. UseR! is my favorite conference because it is technical and not too big. I have completely lost interest in big and broad conferences like JSM (to me, it has become Joint Sightseeing Meetings). Karl has written two blog posts...

Read more »

There’s no mistake in the barley data

July 21, 2014

Statistics has many canonical data sets. For classification statistics, we have Fisher's iris data. For Big Data statistics, the canonical data set used in many examples is the Airlines data. And for dotplots, we have the barley data, first popularized by Bill Cleveland in the landmark 1993 text Visualizing Data. Cleveland's innovations in data visualization were hugely influential...

Read more »

Reflections on useR! 2014

July 7, 2014

UseR! 2014, the R user conference held last week in LA, was the most successful yet. Around 700 R users from around the world converged on the UCLA campus to share their experiences with the R language and to socialize with other data scientists, statisticians and others using R. The week began with a series of 3-hour tutorials on...

Read more »

str Implementation for Data Frames

June 5, 2014
The str function is perhaps the most useful function in R. It provides great information about the structure of an object. When I teach R, especially to those coming from SPSS, the str function for data frames provides the information they are used to seeing on the variable view tab. However, sometimes I want to display the information str...

Read more »

RMOA: Massive online data stream classifications with R & MOA

For those of you who don't know MOA: it stands for Massive On-line Analysis and is an open-source framework for building and running machine learning or data mining experiments on evolving data streams. The MOA website (http://moa.cms.waikato.ac.nz) indicates it contains machine learning algorithms for classification, regression, clustering, outlier detection and recommendation engines. For R users...

Read more »