Exploring distributions of Ensatina salamander subspecies using rvertnet by Neil Kelly

August 9, 2012
By
Exploring distributions of Ensatina salamander subspecies using rvertnet by Neil Kelly

This week we have a guest blog post by Neil Kelley Last week, I stumbled on Vijay’s blog post demonstrating his new package rvertnet. Although I am a paleontologist, some of my research involves anatomical comparison between extinct species and extant relatives or ecological analogs, so I have some experience using VertNet to track down

Read more »

Exploring distributions of Ensatina salamander subspecies using rvertnet by Neil Kelly

August 9, 2012
By
Exploring distributions of Ensatina salamander subspecies using rvertnet by Neil Kelly

This week we have a guest blog post by Neil Kelley Last week, I stumbled on Vijay’s blog post demonstrating his new package rvertnet. Although I am a paleontologist, some of my research involves anatomical comparison between extinct species and extant relatives or ecological analogs, so I have some experience using VertNet to track down

Read more »

How Many Data Scientists Are There?

August 9, 2012
By
How Many Data Scientists Are There?

How Many Data Scientists Are There? I've seen a lot of articles lately about “Big Data” and the looming “talent gap.” This article from the Wall Street Journal is a good example. It cites a McKinsey estimate that states that we will need 1.5 million more managers and analysts who are conversant with “big data.” Of course, some of...

Read more »

Interviews

August 9, 2012
By

I’ve been interviewed twice in the last year: For DecisionStats, 9 August 2012. For Data Mining Research, 21 October 2011. Republished in Amstat News, 1 December 2011. Some readers of this blog might find them interesting. I said a few things in t...

Read more »

Show me yours and I’ll show you mine

August 9, 2012
By
Show me yours and I’ll show you mine

I remember when I started with R, there was little processing power directed toward an IDE. I had enough problems with the syntax, loops and the like and R gui seemed adequate. When I started working on a heavy project, I had to knock it up a notch (bam!). After weeks of trial and error

Read more »

Minimum Expected Shortfall Portfolio, Part 1

August 8, 2012
By

A few days ago, I wrote a piece on finding the minimum expected shortfall portfolio.  A few astute commenters quickly picked up where I was going with this -- using this as an alternative to low/minimum volatility portfolios.  What follo...

Read more »

RcppBDT 0.2.1

August 8, 2012
By

A new bug-fix release of the RcppBDT package appeared on CRAN earlier today. David Reiner noticed that the functions getEndOfMonth and getEndOfBizWeek were not working right. These are convenience wrappers around the real functionality provided as a ...

Read more »

The top 10 critical packages on CRAN

August 8, 2012
By
The top 10 critical packages on CRAN

While most R packages on CRAN are designed to be used by an R user directly, a few packages are designed to be used by other package developers. (And some packages are so useful that they're regularly used by both camps.) When a package author publishes a package to CRAN, she must list those packages that provide functions her...

Read more »

“Trend is Not Your Friend” Applied to 48 Industries

August 8, 2012
By
“Trend is Not Your Friend” Applied to 48 Industries

Please see previous post Crazy RUT in Academic Context Why Trend is Not Your Friend. I’ll repeat the intro to the post mentioned above, so we can all get caught back up. In response to Where are the Fat Tails?, reader vonjd very helpfully referred me...

Read more »

If you are into large data and work a lot with package ff

August 8, 2012
By
If you are into large data and work a lot with package ff

The ff package is a great and efficient way of working with large datasets.  One of the main reasons why I prefer to use it above other packages that allow working with large datasets is that it is a complete set of tools. When comparing it to the other open source 'bigdata' packages in R It is not...

Read more »

Get a path to your Dropbox folder

August 8, 2012
By

I am currently designing my RStudio - Dropbox - Mardown/Knitter/Wordpress - Github workflow. One problem is that working on multiple machines with different version of Windows means I somehow need to tell R where my Dropbox folder is located.I used to ...

Read more »

200 Countries animation revisited

August 8, 2012
By
200 Countries animation revisited

Here’s a new animated graph, again made using only stats software – R this time – to recreate Hans Rosling’s famous 200 Countries animation. Have you ever noticed how Prof Rosling had a team of graphics people beavering away in … Continue reading →

Read more »

Rainfall Amount Flooding Quezon City Philippines

August 8, 2012
By
Rainfall Amount Flooding Quezon City Philippines

The rainfall received by Quezon City, Philippines was almost double of what the city normally receive for the entire month of August, causing flooding and land slide to various villages in the area.

Read more »

Manipulating Data Frames Using sqldf – A Brief Overview

August 8, 2012
By

By Josh MillsIntroductionFor those who are learning R and who may be well-versed in SQL, the sqldf package provides a mechanism to manipulate R data frames using SQL. Even for experienced R programmers, sqldf can be a useful tool for data manipulation. This site provides a useful introduction to SQL. The...

Read more »

Project Euler — problem 17

August 8, 2012
By

It has been two weeks since my last post on the 16th Euler problem. Now, since I just need a break after supper, I’m coming the 17th problem. If the numbers 1 to 5 are written out in words: one, … Continue reading →

Read more »

Creating Olympic Medal Treemap Visualisations Using OTS R Libraries

August 7, 2012
By
Creating Olympic Medal Treemap Visualisations Using OTS R Libraries

In London Olympics 2012 Medal Tables At A Glance? I posted some treemap visualisations of the Olympics medal tables generated using a Google Visualisation Chart treemap component. I thought it might be worth posting a quick R generated example too, using the off-the-shelf/straight out of CRAN treemap component. (If you want to play along, download

Read more »

An analysis of the r-help mailing list

August 7, 2012
By
An analysis of the r-help mailing list

Even though forums and question-and-answer services like StackOverflow are emerging as the place to find crowdsourced technical help when using software like R, the traditional r-help email list is still going strong. UCLA grad student and R user Richard Kwock presented a poster at last month's JSM conference with an analysis of traffic on the list, showing it's still...

Read more »

How to load the {rJava} package after the error “JAVA_HOME cannot be determined from the Registry”

August 7, 2012
By

In case you tried loading a package that depends on the {rJava} package (by Simon Urbanek), you might came across the following error: Loading required package: rJava library(rJava) Error : .onLoad failed in loadNamespace() for ‘rJava’, details: call: fun(libname, pkgname) …Read more »

Read more »

Customizing ggplot graphs

August 7, 2012
By

There are many things I love about the R package ggplot2. For the most part, they fall into two categories:The "grammar of graphics" approach builds a hierarchical relationship between the data and the graphic, which creates a consistent, int...

Read more »

48 Industries Since 1963

August 7, 2012
By
48 Industries Since 1963

Please see http://timelyportfolio.blogspot.com/search/label/horizonplot for all horizon plot posts. Once more thanks to Ken French for his data, we can accomplish something I think is fairly amazing.  In 640x800, we can see 250 day rollling return...

Read more »

August 6-7 Rainfall on Metro Manila

August 7, 2012
By
August 6-7 Rainfall on Metro Manila

Majority of Metro Manila is affected by floods. Looking at the hourly data from PAGASA weather  station located at Bicutan, Taguig, the graph below will gave the viewers of this blog on the rainfall situation in Manila from August 6 (12:00am)-7(11...

Read more »

R for Ecologists: Simulating Species-Area Curves (linear vs. nonlinear regression)

August 7, 2012
By
R for Ecologists: Simulating Species-Area Curves (linear vs. nonlinear regression)

This post is about basic model simulation so we can get a feel for how curves are supposed to look given certain processes assumed by the model. One of the most prevalent patterns in ecology is the species-area (SAR) curve, which … Continue reading →

Read more »

Unify R plots with pander

August 7, 2012
By
Unify R plots with pander

MotivationR has a great variety of plotting tools (just to mention a few: the base graphics and e.g. lattice and ggplot2 packages building on grid) and most R user has a preference for either of them.I think all of you would agree with me: each package...

Read more »

For the Stupid Password Rules at Iowa State

August 7, 2012
By

The Fall semester is coming, which means it is time to log into several stupid systems to be prepared for the new semester. Time and time again I'm annoyed by the bullshit password rules at Iowa State University. I wrote to the IT staff once but no one...

Read more »

The New 60/40

August 6, 2012
By
The New 60/40

I want to share a brilliant idea and a great example from the You’re Looking at the Wrong Number post at the GestaltU blog. Today, I will focus on the section of this post that outlines simple steps to improve a typical 60/40 stock/bond portfolio by using risk allocation instead of dollar allocation, and targeting

Read more »

Careless comparison bites back (again)

August 6, 2012
By
Careless comparison bites back (again)

When running stats labs I like to allocate a slightly different subset of data to each student, which acts as an incentive for people to do their own work (rather than copying the same results from a fellow student). We … Continue reading →

Read more »

Vim-R-plugin: Installation

August 6, 2012
By
Vim-R-plugin: Installation

On Ubuntu 12.04, I took to the following steps to install Vim-R-plugin: First I had to install timux: sudo apt-get installl tmux Having already installed vim-pathogen, I used git to clone to install vim-r-plugin and vim-screen plugin, which is also … Continue reading →

Read more »

A bunch of R (and JAGS) scripts

August 6, 2012
By
A bunch of R (and JAGS) scripts

I finally (nearly) got around to prepare the R code to replicate the examples in the book. I divided the examples by chapter and then linked to the R scripts and, for those involving Bayesian analysis, the associated JAGS models.At the moment, the scri...

Read more »

Predicting the 100m sprint: results

August 6, 2012
By

Last week, Markus Gesmann used a log-linear model in R to predict the Olympic gold-medal winning 100m sprint time to be 9.68 seconds. The actual time was 9.63 seconds. Not bad! Meanwhile, the New York Times put Usuain Bolt's olympic record in context, comparing him in a virtual race with other gold medal-winners over the past century (via FlowingData).

Read more »