4901 search results for "git"

Introducing dplyr

January 20, 2014
By
Introducing dplyr

dplyr is a new package which provides a set of tools for efficiently manipulating datasets in R. dplyr is the next iteration of plyr, focussing on only data frames. dplyr is faster, has a more consistent API and should be easier to use. There are three key ideas that underlie dplyr: Your time is important,

Read more »

Second NYC R classes(announcement and teaching experience)

January 20, 2014
By
Second NYC R classes(announcement and teaching experience)

(The photo was from our first offering of R classes) We are going to offer our Data Science by R (beginner level) course again in February. The goal of this class is to get students to a point where they are self-sufficient in R, are proficient at analyzing data and can take these skills back to their full-time jobs....

Read more »

Visualization Series: Using Scatterplots and Models to Understand the Diamond Market (so You Don’t Get Ripped Off)

January 19, 2014
By
Visualization Series: Using Scatterplots and Models to Understand the Diamond Market (so You Don’t Get Ripped Off)

My last post railed against the bad visualizations that people often use to plot quantitive data by groups, and pitted pie charts, bar charts and dot plots against each other for two visualization tasks.  Dot plots came out on top. … Continue reading →

Read more »

Evolution hospital beds, resources and consumption

January 19, 2014
By
Evolution hospital beds, resources and consumption

For this analysis publicly available data from the OECD is used. The time series run from 1975 to 2013 (where available). Let’s focus on Belgium, Germany, France, The Netherlands, United Kingdom, United States, Norway and Switzerland and have a look at their number of hospital beds per 1000 population. The blue line is a trend The post

Read more »

Le Monde puzzle [#849]

January 18, 2014
By
Le Monde puzzle [#849]

A straightforward Le Monde mathematical puzzle: Find a pair (a,b) of integers such that a has an odd number d of digits larger than 2 and ab is written as 10d+1+10a+1. Find the smallest possible values of a and of b. I ran the following R code which produced a=137 (and b=83) as the unique

Read more »

Convering plots to data

January 18, 2014
By
Convering plots to data

It is a problem which occurs ever so often in applied work, you have a plot, but you want the data. There are at least two programs which can help you there; PlotDigitizer and Engauge Digitizer. I got both on my openSuse machine. Both are available for...

Read more »

Using the plyr package

January 18, 2014
By
Using the plyr package

Introduction The base R system provides lapply() and related functions, and the package plyr provides alternatives that are worth considering. It will be assumed that readers are familiar with lapply() and are willing to spend a few moments reading the plyr documentation, to see why the illustration here will use the ldply() function. The test task will be extraction of...

Read more »

R Users, Meet Joyent Manta; Manta Users, Meet R

January 17, 2014
By

When I joined Joyent last year, I jumped on the opportunity to make R work with Joyent Manta. If you are new to Joyent, we are the High-Performance Cloud Infrastructure Company. If you are new to data analytics, the R language is a rich open-source env...

Read more »

R Users, Meet Joyent Manta; Manta Users, Meet R

January 17, 2014
By

When I joined Joyent last year, I jumped on the opportunity to make R work with Joyent Manta. If you are new to Joyent, we are the High-Performance Cloud Infrastructure Company. If you are new to data analytics, the R language is a rich open-source env...

Read more »

R Users, Meet Joyent Manta; Manta Users, Meet R

January 17, 2014
By

When I joined Joyent last year, I jumped on the opportunity to make R work with Joyent Manta. If you are new to Joyent, we are the High-Performance Cloud Infrastructure Company. If you are new to data analytics, the R language is a rich open-source env...

Read more »