June 2013

A Shiny App Goes Viral

June 7, 2013 | Francis Smart

I am not sure how many of you have seen this Business Insider article. It is basically about a Shiny app created by Joshua Katz at NC State. It is really fun playing with the Shiny app. With nearly a million Facebook likes this web app buil... [Read more...]

Income Distribution in London

June 7, 2013 | leisuretronic

Inspired by the Institute of Fiscal Studies' "Where do you fit in" application, where people can find out their position in the UK's income distribution, I wanted to find out what the picture in London looks like. Quite different. If you are in a very high percentile nationwide, high incomes ... [Read more...]

Symmetric set differences in R

June 7, 2013 | xi'an

My .Rprofile contains a collection of convenience functions and function abbreviations. These are either functions I use dozens of times a day and prefer not to type in full:## my abbreviation of head() h Or problems that I'd rather figure out once, and only once: ## example: ## between( 1:10, 5.5, 6.5 ) between = low & x ... [Read more...]
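
As a rough illustration of the post's title topic, a symmetric set difference can be sketched in base R as below. The function name `sym_diff` is hypothetical and is not taken from the post; only `union()` and `setdiff()` from base R are used.

```r
# Hypothetical helper (not the post's code): elements found in exactly
# one of the two input vectors.
sym_diff <- function(a, b) {
  union(setdiff(a, b), setdiff(b, a))
}

# Values 4 and 5 appear in both inputs, so they are dropped.
res <- sym_diff(1:5, 4:8)
print(res)
```

A helper like this could sit in an `.Rprofile` next to other convenience functions, which is the habit the excerpt describes.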

Comrades Marathon Attrition Rate

June 7, 2013 | andrew

It is a bit of a mission to get the complete data set for this year’s Comrades Marathon. The full results are easily accessible, but come as an HTML file. Embedded in this file are links to the splits for individual athletes. So with a bit of scripting wizardry ... [Read more...]

Data Class Conversion

June 6, 2013 | Al-Ahmadgaid Asaad

Data in R can be converted from one class to another. The function is prefixed with as. and followed by the name of the data class that we wish to convert to. Data classes in R are the following: numeric - as.numeric; vector - as.vector; character - as.cha... [Read more...]
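
A minimal sketch of the `as.` conversion pattern described above, using only base R. The variable names and sample values are made up for illustration.

```r
# Illustrative values; all variable names here are hypothetical.
x_chr  <- c("1", "2.5", "10")
x_num  <- as.numeric(x_chr)     # character -> numeric
x_back <- as.character(x_num)   # numeric -> character

m <- matrix(1:4, nrow = 2)
v <- as.vector(m)               # matrix -> plain vector, column by column

print(class(x_num))
print(v)
```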

How likely is the NSA PRISM program to catch a terrorist?

June 6, 2013 | Corey Chivers

Recent revelations about PRISM, the NSA’s massive program of surveillance of civilian communications, have caused quite a stir. And rightfully so, as it appears that the agency has been granted warrantless direct access to just about any form of digital communication engaged in by American citizens, and that their ... [Read more...]

Feature Selection 3 – Swarm Mentality

June 6, 2013 | Max Kuhn

"Bees don't swarm in a mango grove for nothing. Where can you see a wisp of smoke without a fire?" - Hla Stavhana In the last two posts, genetic algorithms were used as feature wrappers to search for more effective subsets of predictors. Here, I will do the same with ... [Read more...]

Intro to Parallel Random Number Generation with RevoScaleR

June 6, 2013 | Joseph Rickert

by Joseph Rickert Random number generation is fundamental to doing computational statistics. As you might expect, R is very rich in random number resources. The R base code provides several high quality random number generators including: Wichmann-Hill, Marsaglia-Multicarry, Super-Duper, Mersenne-Twister, Knuth-TAOCP-2002 and L’Ecuyer-CMRG. (See Random for details.) And, there ... [Read more...]
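
The base R generators listed above can be selected with `RNGkind()`. The sketch below uses base R only and does not show any RevoScaleR functions; the seed value and variable names are arbitrary choices for illustration.

```r
# Remember the current settings so they can be restored afterwards.
old_kind <- RNGkind()

# Switch to one of the generators named in the excerpt.
RNGkind("L'Ecuyer-CMRG")
kind_used <- RNGkind()[1]

# The same seed under the same generator reproduces the same stream.
set.seed(42)
a <- runif(3)
set.seed(42)
b <- runif(3)
print(identical(a, b))

# Restore the previous uniform generator.
RNGkind(old_kind[1])
```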

Box-plot with R – Tutorial

June 6, 2013 | Fabio Veronesi

Yesterday I wanted to create a box-plot for a small dataset to see the evolution of 3 stations over a 3-day period. I like box-plots very much because I think they are one of the clearest ways of showing trends in your data. R is extremely good for this type of ... [Read more...]

Inputting Data in Matrix Format

June 6, 2013 | Al-Ahmadgaid Asaad

A matrix in R is formed using the matrix, rbind, or cbind function. These functions have the following descriptions: matrix - used to transform concatenated data into matrix form of compatible dimensions. rbind - short for row bind, that binds a conca... [Read more...]
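
A short sketch of the three functions named above, using base R. The object names and input values are hypothetical.

```r
# matrix(): reshape a concatenated vector; filling is column by column
# unless byrow = TRUE is given.
by_col <- matrix(1:6, nrow = 2, ncol = 3)

# rbind(): stack vectors as rows (result is 2 x 3).
by_row <- rbind(1:3, 4:6)

# cbind(): place vectors side by side as columns (result is 3 x 2).
side_by_side <- cbind(1:3, 4:6)

print(by_col)
print(by_row)
print(side_by_side)
```

Note that `rbind(1:3, 4:6)` gives the same layout as `matrix(1:6, nrow = 2, byrow = TRUE)`.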

At what sample size do correlations stabilize?

June 6, 2013 | FelixS

Maybe you have encountered this situation: you run a large-scale study over the internet, and out of curiosity, you frequently check the correlation between two variables. My experience with this practice is usually frustrating, as in small sample sizes (and we will see what “small” means in this context) correlations ... [Read more...]

Hillslope Position by Soil Series

June 5, 2013 |

Soil survey data are typically built upon a foundation of soil-landscape relationships that have been verified in the field. SSURGO data contain several geomorphic descriptions of landscape, landform, hillslope position, and surface shape for each... [Read more...]

KDNuggets 2013 software poll results

June 5, 2013 | David Smith

The results of the 2013 KDNuggets software poll are in, with RapidMiner and R in a near-tie for first place. Of a record 1880 respondents, 737 reported using Rapid-I RapidMiner/RapidAnalytics, and 704 reported using R. Excel came in third: with 527 respondents, it was the lone commercial tool in the top 5. You can see ... [Read more...]

Running R Scripts Directly From Dropbox

June 5, 2013 | Francis Smart

I have written a little function that allows users to run R scripts out of Dropbox directly from any location.  It was aided by this post on biobucket.  The reason I am particularly interested in this feature is because I am often using a ser... [Read more...]

Oracle R Distribution for R 2.15.3 is released

June 5, 2013 | Sherry Lamonica

We are pleased to announce that Oracle R Distribution (ORD) for R 2.15.3 is available for download today. This update consists of mostly minor bug fixes, and is the final release of the R 2.x series. Oracle recommends using yum to install ORD from our public yum server. To install ORD 2.15.3, ... [Read more...]

The Frisch–Waugh–Lovell Theorem for Both OLS and 2SLS

June 5, 2013 | diffuseprior

The Frisch–Waugh–Lovell (FWL) theorem is of great practical importance for econometrics. FWL establishes that it is possible to re-specify a linear regression model in terms of orthogonal complements. In other words, it permits econometricians to partial out right-hand-side, or control, variables. This is useful in a variety of ... [Read more...]
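
The partialling-out idea can be checked numerically for the OLS case with a small simulation. This is a sketch on made-up data, not the post's code: the variable names, coefficients, and seed are assumptions, and the 2SLS case is not covered.

```r
# Simulated data; every name and number here is illustrative.
set.seed(1)
n  <- 200
x2 <- rnorm(n)                    # control variable
x1 <- 0.5 * x2 + rnorm(n)         # regressor of interest, correlated with x2
y  <- 1 + 2 * x1 - 3 * x2 + rnorm(n)

# Full regression.
full <- lm(y ~ x1 + x2)

# FWL: partial x2 (and the intercept) out of both y and x1,
# then regress residuals on residuals.
ry <- resid(lm(y ~ x2))
rx <- resid(lm(x1 ~ x2))
partial <- lm(ry ~ rx - 1)

b_full    <- unname(coef(full)["x1"])
b_partial <- unname(coef(partial)["rx"])
print(c(full = b_full, partial = b_partial))
```

The two coefficients should agree up to floating-point error, which is what the theorem states for the coefficient on the variable that was not partialled out.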

RcppArmadillo 0.3.900.0

June 5, 2013 | Thinking inside the box

An Armadillo release 3.900.0 was provided by Conrad yesterday. It has been rolled into a new RcppArmadillo release 0.3.900.0 which is now on CRAN and in Debian. It has a number of nice changes, mostly on the performance side of things (see below) an... [Read more...]

A Big Data introduction

June 5, 2013 | Michele Usuelli

Since R uses the computer's RAM, it can handle only rather small sets of data. Nevertheless, some packages make it possible to handle larger volumes, and the best solution is to connect R with a Big Data environment. [Read more...]