Monthly Archives: June 2013

cran2deb4ubuntu Updated for R 3.0.1 and Ubuntu 13.04

June 12, 2013
By

It has taken a long time, but cran2deb4ubuntu has been updated for R 3.0.1. Over 1000 R packages are available as .deb files (with dependicies) for Ubutnu 13.04 (raring), 12.10 (quantal) and 12.04 (precise). These packages can be found at the c2d4u PPA. Instructions on how to install the PPA can be found on this...

Read more »

Cluster NHL Teams Based on 2012/13 Regular Season Performance

June 12, 2013
By
Cluster NHL Teams Based on 2012/13 Regular Season Performance

Since tonight kicks off Game 1 of the Stanley Cup Finals, I thought it would be fun to do a very quick and dirty cluster analysis of the league based on regular season performance. Tonight, the Chicago Blackhawks square off against my hometown team, the Boston Bruins.  Even though it was a lockout-shortened season, the

Read more »

Twitter Twitter on the Web, Who is the Most Popular of All? Interactively Determining Popularity of Two Entitites on Twitter

June 12, 2013
By
Twitter Twitter on the Web, Who is the Most Popular of All? Interactively Determining Popularity of Two Entitites on Twitter

Code updated based on feedback (see list of changes at the very end)Okay, that was a take on the mirror mirror on the wall quote from Snow White. This continues my saga of learning from the superb work done by the R-community and building on their...

Read more »

The Reorderable Data Matrix and the Promise of Pattern Discovery

June 12, 2013
By
The Reorderable Data Matrix and the Promise of Pattern Discovery

We typically start with the data matrix, a rectangular array of rows and columns.  If we type its name on the R command line, it will show itself.  But the data matrix is hard to read, even when there are not many rows or columns.  The heat map is a visual alternative.  All you need is the R function...

Read more »

Data imputation I

June 12, 2013
By

I recently entered kaggle titanic learning competition for fun and to see where my out of the box utilization of random forest would rank me (303 out of 5,882). It was interesting to see that much of the scoring differentiation came from score imputation, that is filling missing values based on other data. For example, we might have

Read more »

Using Quandl in R

June 12, 2013
By
Using Quandl in R

Image by Jan Zander Our mantra here at Quandl is making data easy to find and easy to use. Following that goal we (and subsequently the community) have created packages that integrate Quandl’s API into a number of software platforms. Today we’ll take a look at R. R is a free statistical computing language created

Read more »

More fun with data frames

June 12, 2013
By
More fun with data frames

Data frames are such a straightforward and essential element of R that it’s easy to lose sight of some of their peculiarities. Last week, I developed some code which would tear apart some data frames and create new ones based on columns specified by the user. This would allow me to dynamically create new data

Read more »

R to Oracle Database Connectivity: Use ROracle for both Performance and Scalability

June 12, 2013
By
R to Oracle Database Connectivity: Use ROracle for both Performance and Scalability

Normal 0 false false false EN-US X-NONE X-NONE ...

Read more »

Plotting average read and write operation size by ASM disk for Oracle

June 12, 2013
By
Plotting average read and write operation size by ASM disk for Oracle

  Throughput, throughput, throughput – for many databases, this is the performance measure of importance.  When you are working with a fixed number of IOPS but see mixed workload types, system health can be assessed through the average read and…Read more ›

Read more »

Introducing GTrendsR

June 12, 2013
By

Just another R blog has beed added to r-bloggers!In a paper, to be soon published in Conservation Biology and entitle Googling trends in conservation biology, we developed a package named GTrendsR that provides an interface for retrieving and displaying the information returned online...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)