EPL Table Motion Chart

February 28, 2013
By

The Shiny package provides great user interactivity and another boost to its attractiveness has come with its integration with googleVis. Markus Gesman provides some background in a blog article with coded examples which he along with fellow googleVis creator, Diego de Castillo and lead Shiny developer Winson Chang have furnished There are at least three

Read more »

Using R in LaTeX with knitr and RStudio

February 28, 2013
By
Using R in LaTeX with knitr and RStudio

Hi, I presented today at INSEE R user group (FLR) how to use knitr (Sweave evolution) for writing documents which are self contained with respect to the source code: your data changed? No big deal, just compile your .Rnw file again and you are done with an updated version of your paper! is easy. Some

Read more »

Summary of My First Trip to Strata #strataconf

February 28, 2013
By
Summary of My First Trip to Strata #strataconf

In this post I am goIing to summarize some of the things that I learned at Strata Santa Clara 2013. For now, I will only discuss the conference sessions as I have a much longer post about the tutorial sessions that I am still working on and will post at a later date. I will add to this post...

Read more »

Slides from "Big Data Real Time Predictive Analytics"

February 28, 2013
By

At Tuesday's Data Driven Business Day at the Strata conference I gave my talk, Real-time Big Data Predictive Analytics: From Deployment to Production. My goal in the talk was to explain the buzz-phrases "real time", "big data" and "predictive examples" in the context of a specific example: why are some web ads today uncannily targeted at our personal interests...

Read more »

Pollination effectiveness landscape

February 28, 2013
By
Pollination effectiveness landscape

I want to show you a pollination landscape, but this is not a pollinator landscape with flowers and nesting sites, but a plot showing two components of pollination. Quantity and quality. A recent paper by Pedro Jordano (see here for other … Continue reading →

Read more »

Pollination effectiveness landscape

February 28, 2013
By
Pollination effectiveness landscape

I want to show you a pollination landscape, but this is not a pollinator landscape with flowers and nesting sites, but a plot showing two components of pollination. Quantity and quality. A recent paper by Pedro Jordano (see here for other … Continue reading →

Read more »

Classifying Emails as Spam or Ham using RTextTools

February 28, 2013
By

Recently, I had read an article on R-bloggers, titled Classifying Breast Cancer as Benign or Malignent using RTextTools by Timothy P. Jurka, who is the author of both that article and the RTextTools package. Having reproduced the results using the...

Read more »

System Testing

February 28, 2013
By
System Testing

R at Work...crunching numbersIf you wanna find out how your trading idea would have coped in the previous years you would perform a backtest. For years I have used commercial software but could not find a package, which I would really recommend. I have...

Read more »

The MaxDiff Killer: Rank-Ordered Logit Models

February 27, 2013
By

Compared to MaxDiff (Sawtooth Software), ranked-order logit modeling:simplifies data collection without needing additional software to generate experimental designsreduces respondent burden making the task easier and seemingly ...

Read more »

inline 0.3.11

February 27, 2013
By

A maintenance release of inline is now on CRAN, and is being uploaded to Debian. The release fixes two minor bugs kindly reported by users. As the two previous releases appear to not have been announced here, their NEWS entries are included as well. ...

Read more »

The Paul Tol 21-color salute

February 27, 2013
By
The Paul Tol 21-color salute

You may or may not know that PerformanceAnalytics contains a number of specific color schemes designed for charting data in R (they aren’t documented well, but they show up in some of the chart examples). I’ve been collecting color palates for years in search of good combinations of attractiveness, relative weight, and distinctiveness, helped along

Read more »

Graphical Tools (rgl) on a Headless Shiny Server

February 27, 2013
By

If you’ve encountered errors such as Warning in rgl.init(initValue) : RGL: unable to open X11 display Warning in fun(libname, pkgname) : error in rgl_init or Error: rgl_dev_getcurrent when trying to use a graphically based package like RGL with Shiny Server, then you’re in the right spot.  The issue is likely that you’re running R or

Read more »

Non-Linear Curve Fitting is Nature Publication Worthy?

February 27, 2013
By
Non-Linear Curve Fitting is Nature Publication Worthy?

Someone needs to explain to me why this is a Nature publication ...Granted it is Nature Protocols which started its publication run when I was in grad school and is likely less high-impact as full blown Nature.  Seriously tough - using the So...

Read more »

Workshops on Modelling Choices using R in Toronto

February 27, 2013
By

Making choices is inherently human. We choose between brands of cereal or amongst candidates in an election. At times, choices may be influenced by the characteristics of the decision maker, such as age, income and sex. Choices may also be influenced by the attributes of competing alternatives, such as the cost of travelling between two cities by air or...

Read more »

UPDATE Multiple postgreSQL Table Records in Parellel

February 27, 2013
By
UPDATE Multiple postgreSQL Table Records in Parellel

Unfortunately the RpostgreSQL package (I’m pretty sure other SQL DBs as well) doesn’t have a provision to UPDATE multiple records (say a whole data.frame) at once or allow placeholders making the UPDATE a one row at a time ordeal, so I built a work around hack to do the job in parellel.  The big problem

Read more »

Resampling data in Hadoop with RHadoop

February 27, 2013
By

On Revolution Analytics partner Cloudera's blog, Uri Laserson has posted an excellent guide to resampling from a large data set in Hadoop. Resampling is an important step in fitting ensemble models (including random forests and other bagging techniques), and Uri provides a step-by-step guide to implementing resampling methods using RHadoop. He provides the complete map-reduce code in the R...

Read more »

Installing Pandoc from R (on Windows) – using the {installr} package

February 27, 2013
By
Installing Pandoc from R (on Windows) – using the {installr} package

The R blogger Rolf Fredheim has recently wrote a great piece called “Reproducible research with R, Knitr, Pandoc and Word“, where he advocates for Pandoc as an essential part of reproducible research workflow in R, in helping to turn documents …Read more »

Read more »

Whats new in rugarch (ver 1.01-5)

February 27, 2013
By
Whats new in rugarch (ver 1.01-5)

Since the last release of rugarch on CRAN (ver 1.0-16), there have been many changes and new features in the development version of the package (ver 1.01-5). First, development of the package (and svn) has been moved to google code from r-forge. Second, the package now features exclusive use of xts based time series for

Read more »

Whats new in rugarch (ver 1.01-5)

February 27, 2013
By
Whats new in rugarch (ver 1.01-5)

Since the last release of rugarch on CRAN (ver 1.0-16), there have been many changes and new features in the development version of the package (ver 1.01-5). First, development of the package (and svn) has been moved to google code from r-forge. Second, the package now features exclusive use of xts based time series for

Read more »

Fast factor generation with Rcpp

February 27, 2013
By
Fast factor generation with Rcpp

Recall that factors are really just integer vectors with ‘levels’, i.e., character labels that get mapped to each integer in the vector. How can we take an arbitrary character, integer, numeric, or logical vector and coerce it to a factor with Rcpp? It’s actually quite easy with Rcpp sugar: #include <Rcpp.h> using namespace Rcpp; template <int RTYPE> IntegerVector fast_factor_template( const Vector<RTYPE>& x )...

Read more »

Fast factor generation with Rcpp

February 27, 2013
By
Fast factor generation with Rcpp

Recall that factors are really just integer vectors with ‘levels’, i.e., character labels that get mapped to each integer in the vector. How can we take an arbitrary character, integer, numeric, or logical vector and coerce it to a factor with Rcpp? It’s actually quite easy with Rcpp sugar: #include <Rcpp.h> using namespace Rcpp; template <int RTYPE> IntegerVector fast_factor_template( const Vector<RTYPE>& x )...

Read more »

How to make a scientific result disappear

February 27, 2013
By
How to make a scientific result disappear

Nathan Danneman (a co-author and one of my graduate students from Emory) recently sent me a New Yorker article from 2010 about the “decline effect,” the tendency for initially promising scientific results to get smaller upon replication. Wikipedia can summarize the phenomenon as well as I can: In his article, Lehrer gives several examples where

Read more »

How to make a scientific result disappear

February 27, 2013
By
How to make a scientific result disappear

Nathan Danneman (a co-author and one of my graduate students from Emory) recently sent me a New Yorker article from 2010 about the “decline effect,” the tendency for initially promising scientific results to get smaller upon replication. Wikipedia can summarize the phenomenon as well as I can: In his article, Lehrer gives several examples where

Read more »

New R Workshop in the Bay Area

February 26, 2013
By
New R Workshop in the Bay Area

Practical Data Visualization with R Saturday March 9th, 2013 8:30-5:00pm EBay 2161 North 1st Street San Jose, California I will be presenting a one day professional development workshop on modern data visualization with R, sponsored by the ACM San Francisco Bay … Continue reading →

Read more »

New ways to Hadoop with R

February 26, 2013
By

Today, there are two main ways to use Hadoop with R and big data: 1. Use the open-source rmr package to write map-reduce tasks in R (running within the Hadoop cluster - great for data distillation!) 2. Import data from Hadoop to a server running Revolution R Enterprise, via Hbase, ODBC (for high-performance Hadoop/SQL interfaces), or streaming data direct...

Read more »

Stop Sign Project Post1: Some GIS stuff done in R

February 26, 2013
By

(This article was first published on bRogramming, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on their blog: bRogramming. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL,...

Read more »

Job for life ? Bishop of Rome ?

February 26, 2013
By
Job for life ? Bishop of Rome ?

The job of Bishop of Rome – i.e. the Pope – is considered to be a life-long commitment. I mean, it usually was. There have been 266 popes since 32 A.D. (according to http://oce.catholic.com/…): almost all popes have served until their death. But that does not mean that they were in the job for long… One can easily extract...

Read more »

Make Your Date Folder Clean with Function unzip & unz

February 26, 2013
By

I am a somewhat minimalist R user. I feel uncomfortable if something is not in a good order, such as the names of variables and documents, the structures of my codes and projects. I prefer my data stored in .txt or .csv so I can load them to R using read.table or read.csv. For most of the time we...

Read more »

Automatic Notice When Vacancy Available

February 26, 2013
By

Today, I visited a webpage inadvertently and found several job positions that I am competent with, unfortunately all of them has expired. How many chances we lost in this way?! So I decide to do somthing to limit this kind of loss, and of course using our smart R! The idea is simple: check the job vacancy webpages reguarly, if...

Read more »

Sponsors