## EPL Table Motion Chart

February 28, 2013
By

The Shiny package provides great user interactivity and another boost to its attractiveness has come with its integration with googleVis. Markus Gesman provides some background in a blog article with coded examples which he along with fellow googleVis creator, Diego de Castillo and lead Shiny developer Winson Chang have furnished There are at least three

## Using R in LaTeX with knitr and RStudio

February 28, 2013
By

Hi, I presented today at INSEE R user group (FLR) how to use knitr (Sweave evolution) for writing documents which are self contained with respect to the source code: your data changed? No big deal, just compile your .Rnw file again and you are done with an updated version of your paper! is easy. Some

## Summary of My First Trip to Strata #strataconf

February 28, 2013
By

In this post I am goIing to summarize some of the things that I learned at Strata Santa Clara 2013. For now, I will only discuss the conference sessions as I have a much longer post about the tutorial sessions that I am still working on and will post at a later date. I will add to this post...

## Slides from "Big Data Real Time Predictive Analytics"

February 28, 2013
By

At Tuesday's Data Driven Business Day at the Strata conference I gave my talk, Real-time Big Data Predictive Analytics: From Deployment to Production. My goal in the talk was to explain the buzz-phrases "real time", "big data" and "predictive examples" in the context of a specific example: why are some web ads today uncannily targeted at our personal interests...

## Pollination effectiveness landscape

February 28, 2013
By

I want to show you a pollination landscape, but this is not a pollinator landscape with flowers and nesting sites, but a plot showing two components of pollination. Quantity and quality. A recent paper by Pedro Jordano (see here for other … Continue reading →

## Classifying Emails as Spam or Ham using RTextTools

February 28, 2013
By

Recently, I had read an article on R-bloggers, titled Classifying Breast Cancer as Benign or Malignent using RTextTools by Timothy P. Jurka, who is the author of both that article and the RTextTools package. Having reproduced the results using the...

## System Testing

February 28, 2013
By

R at Work...crunching numbersIf you wanna find out how your trading idea would have coped in the previous years you would perform a backtest. For years I have used commercial software but could not find a package, which I would really recommend. I have...

## The MaxDiff Killer: Rank-Ordered Logit Models

February 27, 2013
By

Compared to MaxDiff (Sawtooth Software), ranked-order logit modeling:simplifies data collection without needing additional software to generate experimental designsreduces respondent burden making the task easier and seemingly ...

## inline 0.3.11

February 27, 2013
By

A maintenance release of inline is now on CRAN, and is being uploaded to Debian. The release fixes two minor bugs kindly reported by users. As the two previous releases appear to not have been announced here, their NEWS entries are included as well. ...

## The Paul Tol 21-color salute

February 27, 2013
By

You may or may not know that PerformanceAnalytics contains a number of specific color schemes designed for charting data in R (they aren’t documented well, but they show up in some of the chart examples). I’ve been collecting color palates for years in search of good combinations of attractiveness, relative weight, and distinctiveness, helped along

## Graphical Tools (rgl) on a Headless Shiny Server

February 27, 2013
By

If you’ve encountered errors such as Warning in rgl.init(initValue) : RGL: unable to open X11 display Warning in fun(libname, pkgname) : error in rgl_init or Error: rgl_dev_getcurrent when trying to use a graphically based package like RGL with Shiny Server, then you’re in the right spot.  The issue is likely that you’re running R or

## Non-Linear Curve Fitting is Nature Publication Worthy?

February 27, 2013
By

Someone needs to explain to me why this is a Nature publication ...Granted it is Nature Protocols which started its publication run when I was in grad school and is likely less high-impact as full blown Nature.  Seriously tough - using the So...

## Workshops on Modelling Choices using R in Toronto

February 27, 2013
By

Making choices is inherently human. We choose between brands of cereal or amongst candidates in an election. At times, choices may be influenced by the characteristics of the decision maker, such as age, income and sex. Choices may also be influenced by the attributes of competing alternatives, such as the cost of travelling between two cities by air or...

## UPDATE Multiple postgreSQL Table Records in Parellel

February 27, 2013
By

Unfortunately the RpostgreSQL package (I’m pretty sure other SQL DBs as well) doesn’t have a provision to UPDATE multiple records (say a whole data.frame) at once or allow placeholders making the UPDATE a one row at a time ordeal, so I built a work around hack to do the job in parellel.  The big problem

February 27, 2013
By

On Revolution Analytics partner Cloudera's blog, Uri Laserson has posted an excellent guide to resampling from a large data set in Hadoop. Resampling is an important step in fitting ensemble models (including random forests and other bagging techniques), and Uri provides a step-by-step guide to implementing resampling methods using RHadoop. He provides the complete map-reduce code in the R...

## Installing Pandoc from R (on Windows) – using the {installr} package

February 27, 2013
By

The R blogger Rolf Fredheim has recently wrote a great piece called “Reproducible research with R, Knitr, Pandoc and Word“, where he advocates for Pandoc as an essential part of reproducible research workflow in R, in helping to turn documents …Read more »

## Whats new in rugarch (ver 1.01-5)

February 27, 2013
By

Since the last release of rugarch on CRAN (ver 1.0-16), there have been many changes and new features in the development version of the package (ver 1.01-5). First, development of the package (and svn) has been moved to google code from r-forge. Second, the package now features exclusive use of xts based time series for

## Fast factor generation with Rcpp

February 27, 2013
By

Recall that factors are really just integer vectors with ‘levels’, i.e., character labels that get mapped to each integer in the vector. How can we take an arbitrary character, integer, numeric, or logical vector and coerce it to a factor with Rcpp? It’s actually quite easy with Rcpp sugar: #include <Rcpp.h> using namespace Rcpp; template <int RTYPE> IntegerVector fast_factor_template( const Vector<RTYPE>& x )...

## How to make a scientific result disappear

February 27, 2013
By
$How to make a scientific result disappear$

Nathan Danneman (a co-author and one of my graduate students from Emory) recently sent me a New Yorker article from 2010 about the “decline effect,” the tendency for initially promising scientific results to get smaller upon replication. Wikipedia can summarize the phenomenon as well as I can: In his article, Lehrer gives several examples where

## New R Workshop in the Bay Area

February 26, 2013
By

Practical Data Visualization with R Saturday March 9th, 2013 8:30-5:00pm EBay 2161 North 1st Street San Jose, California I will be presenting a one day professional development workshop on modern data visualization with R, sponsored by the ACM San Francisco Bay … Continue reading →

## New ways to Hadoop with R

February 26, 2013
By

Today, there are two main ways to use Hadoop with R and big data: 1. Use the open-source rmr package to write map-reduce tasks in R (running within the Hadoop cluster - great for data distillation!) 2. Import data from Hadoop to a server running Revolution R Enterprise, via Hbase, ODBC (for high-performance Hadoop/SQL interfaces), or streaming data direct...

## Stop Sign Project Post1: Some GIS stuff done in R

February 26, 2013
By

(This article was first published on bRogramming, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on their blog: bRogramming. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

## Job for life ? Bishop of Rome ?

February 26, 2013
By

The job of Bishop of Rome – i.e. the Pope – is considered to be a life-long commitment. I mean, it usually was. There have been 266 popes since 32 A.D. (according to http://oce.catholic.com/…): almost all popes have served until their death. But that does not mean that they were in the job for long… One can easily extract...

## Make Your Date Folder Clean with Function unzip & unz

February 26, 2013
By

I am a somewhat minimalist R user. I feel uncomfortable if something is not in a good order, such as the names of variables and documents, the structures of my codes and projects. I prefer my data stored in .txt or .csv so I can load them to R using read.table or read.csv. For most of the time we...

## Automatic Notice When Vacancy Available

February 26, 2013
By

Today, I visited a webpage inadvertently and found several job positions that I am competent with, unfortunately all of them has expired. How many chances we lost in this way?! So I decide to do somthing to limit this kind of loss, and of course using our smart R! The idea is simple: check the job vacancy webpages reguarly, if...

## The stringdist package

February 26, 2013
By

String metrics have important applications in web search, spelling correction and computational biology amongst others. Many different metrics exist, but the most well-known are based on counting the number of basic edit operations it takes to turn one string into … Continue reading →

## First steps of using googleVis on shiny

February 26, 2013
By

The guys at RStudio have done a fantastic job with shiny. It is really easy to build web apps with R using shiny. With the help of Joe Cheng from RStudio we figured out a way to make googleVis work on shiny as well. This allows you to make use of the Google Charts...

## Normal distribution functions

February 25, 2013
By

Ah, the Central Limit Theorem.  The basis of much of statistical inference and how we get those 95% confidence intervals.  It's just so beautiful!  Lately, I have found myself looking up the normal distribution functions in R.  They...

## What is Revolution R Enterprise?

February 25, 2013
By

Let us explain, in 90 seconds: Want a more in-depth introduction to R and Revolution R Enterprise? I'll be giving the webinar Revolution R Enterprise: 100% R and More on March 14. Just follow the link below to secure your seat for the live presentation, and to receive notification of the replay. Revolution Analytics webinars: Revolution R Enterprise: 100%...