Advanced settings in ggplot2 (R)

June 13, 2013

Here is a post with some information on controlling the layout of graphs produced by the ggplot2 library: http://rpubs.com/Lionel/6557 Enjoy!

inline 0.3.12

June 12, 2013

Another maintenance release of inline is now on CRAN and is already included in Debian. This release was triggered by a change in the development version of R which removed an argument to package.skeleton(). The complete NEWS entry is below. Chan...

The null model for age effects with overdispersed infection

June 12, 2013

How does overdispersion of infections affect the behavior of the multiple-infection model? I redefine the model to account for overdispersion, assuming the same overdispersion occurs in both age classes. The parameter varies inversely with the degree of overdispersion. Again, the classes are demographically identical, and infection affects mortality but not growth: \[\begin{aligned} \frac{dJ}{dt}...

Mini-tutorial for Quandl: How to access financial data with R

June 12, 2013

by Joseph Rickert. Quandl.com, the open source website for financial data, made rapid progress earlier this year in becoming an R-friendly source for financial time series data. Tammer Kamel, Quandl’s founder, introduced the site on the Revolutions blog in late February as a “search engine” for numerical data and explained how Quandl’s “Q-bot” can take data from almost any...

cran2deb4ubuntu Updated for R 3.0.1 and Ubuntu 13.04

June 12, 2013

It has taken a long time, but cran2deb4ubuntu has been updated for R 3.0.1. Over 1000 R packages are available as .deb files (with dependencies) for Ubuntu 13.04 (raring), 12.10 (quantal) and 12.04 (precise). These packages can be found at the c2d4u PPA. Instructions on how to install the PPA can be found on this...

Cluster NHL Teams Based on 2012/13 Regular Season Performance

June 12, 2013

Since tonight kicks off Game 1 of the Stanley Cup Finals, I thought it would be fun to do a very quick and dirty cluster analysis of the league based on regular season performance. Tonight, the Chicago Blackhawks square off against my hometown team, the Boston Bruins.  Even though it was a lockout-shortened season, the

Twitter Twitter on the Web, Who is the Most Popular of All? Interactively Determining Popularity of Two Entities on Twitter

June 12, 2013

Code updated based on feedback (see list of changes at the very end). Okay, that was a take on the “mirror, mirror on the wall” quote from Snow White. This continues my saga of learning from the superb work done by the R community and building on their...

The Reorderable Data Matrix and the Promise of Pattern Discovery

June 12, 2013

We typically start with the data matrix, a rectangular array of rows and columns.  If we type its name on the R command line, it will show itself.  But the data matrix is hard to read, even when there are not many rows or columns.  The heat map is a visual alternative.  All you need is the R function...

Data imputation I

June 12, 2013

I recently entered the Kaggle Titanic learning competition for fun and to see where my out-of-the-box use of random forest would rank me (303 out of 5,882). It was interesting to see that much of the scoring differentiation came from imputation, that is, filling missing values based on other data. For example, we might have
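
The idea mentioned in the excerpt, filling missing values from other data, can be sketched in base R as follows. This is not the post author's approach: the toy data frame, its column names, and the choice of a per-group median are all assumptions made purely for illustration.

```r
# Toy data invented for illustration (not the Kaggle Titanic data)
passengers <- data.frame(
  class = c(1, 1, 1, 3, 3, 3),
  age   = c(40, NA, 50, 20, 22, NA)
)

# Median age within each class, ignoring missing values;
# ave() returns one value per row, aligned with the original rows
class_median <- ave(
  passengers$age,
  passengers$class,
  FUN = function(x) median(x, na.rm = TRUE)
)

# Keep observed ages, fill the gaps with the class median
passengers$age_filled <- ifelse(is.na(passengers$age), class_median, passengers$age)

print(passengers)
```

Here the missing first-class age is filled with 45 and the missing third-class age with 21. A model-based imputation would replace the median with a prediction from the other columns.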

Using Quandl in R

June 12, 2013

Image by Jan Zander. Our mantra here at Quandl is making data easy to find and easy to use. In pursuit of that goal, we (and subsequently the community) have created packages that integrate Quandl’s API into a number of software platforms. Today we’ll take a look at R. R is a free statistical computing language created

More fun with data frames

June 12, 2013

Data frames are such a straightforward and essential element of R that it’s easy to lose sight of some of their peculiarities. Last week, I developed some code which would tear apart some data frames and create new ones based on columns specified by the user. This would allow me to dynamically create new data

R to Oracle Database Connectivity: Use ROracle for both Performance and Scalability

June 12, 2013

Plotting average read and write operation size by ASM disk for Oracle

June 12, 2013

Throughput, throughput, throughput – for many databases, this is the performance measure of importance. When you are working with a fixed number of IOPS but see mixed workload types, system health can be assessed through the average read and…

Introducing GTrendsR

June 12, 2013

Just another R blog has been added to r-bloggers! In a paper soon to be published in Conservation Biology and entitled “Googling trends in conservation biology”, we developed a package named GTrendsR that provides an interface for retrieving and displaying the information returned online...

Survey on R and education

June 12, 2013

Main takeaways from the survey on R in education: There is a need to train students in R, since a large majority of respondents (using R professionally) expect its market share to increase further. There is large interest from both academics and students in online interactive R and statistics courses. Highest interest in free courses and only small

Updated age calculation function

June 12, 2013

I had previously posted a function for calculating age from two dates. This was for the whole-number 'age', where we assume you don’t want someone to be recorded as '18' until their 18th birthday (so more than just …
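
As a rough illustration of the whole-number age described above, here is a minimal base R sketch. It is not the post author's function: the name `age_years` and its arguments are hypothetical, and it assumes that someone born on February 29 becomes a year older on March 1 in non-leap years.

```r
# Hypothetical helper: whole-number age on `ref` for someone born on `dob`.
# Both arguments may be Date objects or "YYYY-MM-DD" strings.
age_years <- function(dob, ref = Sys.Date()) {
  dob <- as.POSIXlt(as.Date(dob))
  ref <- as.POSIXlt(as.Date(ref))

  # Difference in calendar years
  age <- ref$year - dob$year

  # The birthday has not happened yet in the reference year
  before_birthday <- (ref$mon < dob$mon) |
    (ref$mon == dob$mon & ref$mday < dob$mday)

  # Do not count the current year until the birthday is reached
  age - as.integer(before_birthday)
}

age_years("1995-06-13", "2013-06-12")  # day before the 18th birthday: 17
age_years("1995-06-13", "2013-06-13")  # on the 18th birthday: 18
```

Comparing month and day directly, rather than dividing a day count by 365.25, avoids off-by-one results around birthdays.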

useR2013 data analysis contest: data exploration

June 12, 2013

Description: The useR2013 conference is organizing a data analysis contest; check the rules here. They have a package called useR2013DAC with two data sets: one from La Liga and the other from Formula 1. Once you download and install the package (available here), you can quickly explore the data using the following R commands: Data exploration ## Load the...

Fear and Loathing in Data Science 2013-06-11 20:25:00

June 11, 2013

Visual Exploration of Time Series. A couple of weeks ago, I stumbled across the following post on using R to discover patterns in time series: http://dahtah.wordpress.com/2013/05/17/finding-patterns-in-time-series-using-regular-expressions/#comments The au...

A Null Model for Age Effects in Disease with Multiple Infections

June 11, 2013

Here’s a little thought exercise I did that has caused me to go back and restart my Sudden Oak Death modeling in a new framework. Feedback welcome. I’m especially interested in relevant literature – I haven’t found many good examples of macroparasite/multiple infection models with age structure. Introduction Cobb et al. (2012) develop two models of forest stand demography...

New book: Seamless R and C++ Integration with Rcpp

June 11, 2013

A new book from Dirk Eddelbuettel, co-author of the Rcpp package, is now available. Seamless R and C++ Integration with Rcpp can be ordered from Springer and from Amazon. The book provides the first comprehensive introduction to Rcpp, the R package that makes it easy to integrate C++ code with R and speed up R code. If you haven't...

More explorations of Shiny

June 11, 2013

I have continued to explore the functionality of the Shiny package released by the RStudio team and I have been increasingly impressed. The code fits together very cleanly and is easy to manipulate or extend. If you have some knowledge of html o...

Answering “How many people use my R package?”

June 11, 2013

The question “How many people use my R package?” is a natural question that (I imagine) every R package developer asks himself at some point or another. After many years in the dark, a silver lining has now emerged thanks …

Rickshaw d3.js from R with rCharts

June 11, 2013

Shutterstock’s open source Rickshaw provides a very nice tutorial to get started building interactive time series d3.js charts with Rickshaw.  I just could not resist rebuilding the tutorial but this time ENTIRELY in R with the amazing packages slidify and rCharts.  I have embedded the tutorial below,...

Visualizing densities of spatial processes

June 11, 2013

We recently uploaded to http://hal.archives-ouvertes.fr/hal-00725090 a revised version of our work with Ewen Gallic (a.k.a. @3wen), “Visualizing spatial processes using Ripley’s correction: an application to bodily-injury car accident location”. In this paper, we investigate (and extend) Ripley’s circumference method to correct the bias of density estimates at the edges (or frontiers) of regions. The idea of the method was theoretical...

Biostatisticians… beware of fuzzy researchers!

June 11, 2013

Over the last few days I have been thinking about how researchers could collaborate efficiently with their statistics experts. With the increasing complexity of science, exchanging information can be crucial to getting the best results. But what happens when a …

Thursday: Webinar on video game analytics

June 11, 2013

Video games are big business today: Electronic Arts (EA) generated more than 4 billion dollars in revenue last year, and they're not even the biggest player on the block. In addition to big bucks, video games also generate Big Data: 50 terabytes per day at EA alone. So there's an obvious need to apply predictive analytics to these massive...

R2leaflet (v0.1) – make interactive online maps from R

June 11, 2013

I have been working on a simple R function to take latitude and longitude of points of interest, and text for pop-up labels, and produce an interactive online map. Interactive graphics are incredibly useful in getting people interested in your …

Finally! Tracking CRAN packages downloads

June 11, 2013

The guys from RStudio now provide CRAN download logs (see also this blog post). Great work! I have always asked myself how many people actually download my packages. Now I finally can

R package development

June 11, 2013

Building R packages is not particularly hard, but it can be a bit of a daunting endeavour at the beginning, particularly if you are more of a statistician than a computer scientist or programmer. Some concepts may appear foreign or like red tape, yet man...
