Monthly Archives: April 2013

Splitting a Large CSV File into Separate Smaller Files Based on Values Within a Specific Column

April 3, 2013
By
Splitting a Large CSV File into Separate Smaller Files Based on Values Within a Specific Column

One of the problems with working with data files containing tens of thousands (or more) rows is that they can become unwieldy, if not impossible, to use with “everyday” desktop tools. When I was Revisiting MPs’ Expenses, the expenses data I downloaded from IPSA (the Independent Parliamentary Standards Authority) came in one large CSV file

Read more »

CFP: the 11th Australasian Data Mining Conference (AusDM 2013), submission due 15 July

April 3, 2013
By
CFP: the 11th Australasian Data Mining Conference (AusDM 2013), submission due 15 July

********************************************************************* The 11th Australasian Data Mining Conference (AusDM 2013) Canberra, Australia, 13-15 November 2013, http://ausdm13.togaware.com Join us on LinkedIn: http://www.linkedin.com/groups/AusDM-4907891 ********************************************************************* Data mining, the art and science of intelligent analysis of (usually large) data sets for meaningful (and previously unknown) … Continue reading →

Read more »

Beware of grep with a list

April 2, 2013
By
Beware of grep with a list

Another R tip: beware of as.character applied to a list. Really, beware of grep with a list: You might have thought that the result would be just 1, but grep expects a vector of character strings. If the input is not that, it uses as.character(). Since the result of that starts with "c(", grep finds

Read more »

apply vs for

April 2, 2013
By
apply vs for

It’s widely understood that, in R programming, one should avoid for loops and always try to use apply-type functions. But this isn’t entirely true. It may have been true for Splus, back in the day: As I recall, that had to do with the entire environment from each iteration being retained in memory. Here’s a

Read more »

x[[c(5,3)]]

April 2, 2013
By
x[[c(5,3)]]

An R tip: Did you know that x] is the same as x]]? I should make more thorough use of this. In the help file for ] is equivalent to alist]...] providing all but

Read more »

Personal R Packages

April 2, 2013
By
Personal R Packages

I came across this R package on GitHub, and it made me so excited that I decided to write a post about it. It’s a compilation by Karl Broman of various R functions that he’s found helpful to write throughout … Continue reading →

Read more »

2013-3 Generating unique names in gridSVG

April 2, 2013
By

The gridSVG package exports grid images to the SVG image format for viewing on the web. This article describes the problems associated with retaining grid object names in SVG element id attributes. In addition, new features in gridSVG that allow … Continue reading →

Read more »

Revisiting MPs’ Expenses

April 2, 2013
By
Revisiting MPs’ Expenses

I couldn’t but notice the chatter about Iain Duncan Smith claiming he’d have no problem “living on 53 pounds a dayweek“, which made me wonder not only how many meal catered events he attends each week (and how many of his scheduled meeting also have complementary tea and biscuits (a bellweather of the extent of

Read more »

a brief on naked statistics

April 2, 2013
By
a brief on naked statistics

Over the last Sunday breakfast I went through Naked Statistics: Stripping the Dread from the Data. The first two pages managed to put me in a prejudiced mood for the rest of the book. To wit: the author starts with some math bashing (like, no one ever bothers to tell us about the uses of

Read more »

R Beginners – Plotting Locations on to a World Map

April 2, 2013
By
R Beginners – Plotting Locations on to a World Map

This post is targeted at those who are just getting started plotting on maps using R.The relevant libraries are: maps, ggplot2, ggmap, and maptools. Make sure you install them.The Problem Let's take a fairly simple use case: We have a few points on the...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)