Bio7 1.6 for Windows and Linux released!

August 1, 2012

01.08.2012 I have finally released a new version of Bio7 with many improvements and new features. Updated tutorials are available, too. The new Bio7 1.6 release can be downloaded here. Please also download the examples *.zip file from the SourceForge website, which contains new examples for Bio7 1.6 (e.g. an example to cluster an image folder with

Read more »

Hadley Wickham’s ggplot2 basics

August 1, 2012

If you haven't yet taken the plunge into making R graphics with Hadley Wickham's ggplot2 package, his "ggplot2 basics" slides (from the recent Introduction to Data Visualization and Analysis course at JSM) are a good place to start. Once you get the hang of the "grammar of graphics" notation, you'll be building beautiful data visualizations like this or this...

Read more »

Creating a text grob that automatically adjusts to viewport size

August 1, 2012

I recently wanted to construct a dashboard widget that contains some text and other elements using the grid graphics system. The size available for the widget will vary. When the sizes for the elements of the grobs in the widget are specified as Normalised Parent Coordinates, the size adjustments happen automatically. Text does not automatically adjust, though. The

Read more »

Olympic body match and 1:1 BMI

August 1, 2012

In my morning attempt to read the whole internet before beginning work, I came across a program on the BBC website which allows you to see which Olympic athletes are your body doubles. Or rather, which athletes share your height and weight, and therefore your body mass index. Being a Canadian, I exist in an

Read more »

Building a presentation, report or paper in R

August 1, 2012

If you need to build a presentation, you obviously have the following options: a PowerPoint-like presentation, online engines, or LaTeX. The first two are beloved by business people and the third is widely used in academia. The objective of the first group is a shiny presentation, in contrast to the second, where asceticism and demand for automation are

Read more »

Examples of profiling R code

August 1, 2012

By Yanchang Zhao, RDataMining.com. Below are simple examples of profiling R code, which helps to find out which steps or functions are the most time-consuming. This is very useful for improving the efficiency of R code. # profiling of running time … Continue reading →

Read more »

Trying Julia

August 1, 2012

In my previous post I tried building Williams designs in R. Since that code was running a bit slow, this was an ideal test for Julia. Big enough to be at least slightly realistic, small enough that it is doable. I am very impressed. Almost twenty fold s...

Read more »

Rook rocks! Example with googleVis

August 1, 2012

What is Rook? Rook is a web server interface for R, written by Jeffrey Horner, the author of rApache and brew. But unlike other web frameworks for R, such as brew, R.rsp (which I have used in the past), Rserve, gWidgetsWWW or sumo (which I haven't used...

Read more »

Highlights from the useR! 2012 conference

August 1, 2012

Video (screencast) of the presentation by Szilard Pafka at the Los Angeles R users group. I summarized (with short demos) a few of the talks from the useR! 2012 conference. We are planning one more meetup to cover more talks. … Continue reading →

Read more »

RcppCNPy 0.2.0

July 31, 2012

Version 0.2.0 of the recently introduced RcppCNPy package for reading/writing NumPy data in R arrived on CRAN earlier today. The main changes are the added ability to also write gzip-ed npy files, to suppress an automatic transposition as well as th...

Read more »

The Environmental Performance Index, visualized with R

July 31, 2012

The Environmental Performance Index (EPI) ranks countries on performance indicators for environmental public health and ecosystem vitality. Yale University hosts the EPI website, which was used to present the 2012 EPI Rankings to world leaders at the 2012 World Economic Forum at Davos. The Country Profiles section of the website allowed members to browse the performance characteristics of their...

Read more »

Twitter analysis of air pollution in Beijing

July 31, 2012

One of the air pollution detection machines in Beijing (at the American Embassy) is connected to Twitter and tweets about the air quality in real time. By default, the machine in Beijing outputs the 24hr summary PM2.5 air pollution information. What PM2.5 is is defined here. Next will be to compare the...

Read more »

Fun with geocoding and mapping in JGR

July 31, 2012

For a recent project I had to do some mapping of addresses, but I didn’t have their lat/lons to use the Deducer and DeducerSpatial packages in R JGR. After frustrating myself trying to adapt this code from stackoverflow.com, I found a much easier way of geocoding using the dismo and XML packages in R. First

Read more »

Text and symbol size in multi-panel figures in R

July 31, 2012

In R, there are a couple of packages that allow you to create multi-panel figures (see examples here and here), but, of course, you can also make multi-panel figures in the base package*. Below I provide a simple example for creating a multi-panel figure in the R base package with the focus on making the

Read more »

Edge Prediction in a Social Graph: My Solution to Facebook’s User Recommendation Contest on Kaggle

July 31, 2012

A couple weeks ago, Facebook launched a link prediction contest on Kaggle, with the goal of recommending missing edges in a social graph. I love investigating social networks, so I dug around a little, and since I did well enough to score one of the coveted prizes, I’ll share my approach here. (For some background, the contest provided...

Read more »

Application of Horizon Plots

July 31, 2012

For background, please see the prior posts Horizon Plot Already Available and Cubism Horizon Charts in R. Good visualization simplifies, and stories are better told with effective and pretty visualizations. Although horizon plots are not immediately intuitiv...

Read more »

Multidimensional Scaling and Company Similarity

July 30, 2012

Background and idea: Often we are looking at a particular sector, and want to get a quick overview of a group of companies relative to one another. I thought I might apply Multidimensional Scaling (MDS) to various financial ratios and see if it...

Read more »

Making R graphics legible in presentation slides

July 30, 2012

I only visited a few JSM sessions today, as I’ve been focused on preparing for my own talk tomorrow morning. However, I went to several talks in a row which all had a common problem that made me cringe: graphics … Continue reading →

Read more »

Yet Another Forecast Dashboard

July 30, 2012

Recently, I came across quite a few examples of time series forecasting using R. Here are some examples: Time series cross-validation 4: forecasting the S&P 500; Holt-Winters forecast using ggplot2; Autoplot: Graphical Methods with ggplot2; Large-Scale Parallel Statistical Forecasting Computations in R (2011) by M. Stokely, F. Rohani, E. Tassone; Forecasting time series data ARIMA

Read more »

Split-plot 2: let’s throw in some spatial effects

July 30, 2012

Disappeared for a while collecting frequent flyer points. In the process I ‘discovered’ that I live in the middle of nowhere, as it took me 36 hours to reach my conference destination (Estoril, Portugal) through Christchurch, Sydney, Bangkok, Dubai, Madrid … Continue reading →

Read more »

Big data, big analytics, big opportunity

July 30, 2012

Data, data, every where / Nor any byte to think. The world today is awash with data. Corporations, governments, and individuals are busy generating petabytes of data on culture, economy, environment, religion, and society. While data has become abundant and ubiquitous, data analysts needed to turn raw data into knowledge are in fact in short...

Read more »

Forecasting the Olympics

July 30, 2012

Forecasting sporting events is a growing research area. The International Journal of Forecasting even had a special issue on sports forecasting a couple of years ago. The London 2012 Olympics has attracted a few forecasters trying to predict medal counts, world records, etc. Here are some of the articles I’ve seen. Which Olympic records get shattered?, Nate Silver, New...

Read more »

A prediction for the Olympic men’s 100m sprint

July 30, 2012

R user Markus Gesmann used the gold-winning times from the Olympic Men's 100m sprint since 1990 as the basis of the following prediction for the London Games: My simple log-linear model forecasts a winning time of 9.68 seconds, which is 1/100 of a second faster than Usain Bolt's winning time in Beijing in 2008, but still 1/10 of a...

Read more »

Archetypal Analysis

July 30, 2012

Thinking Strategically about Customer Heterogeneity. Ironically, market segmentation, whose motto is "one size does not fit all," seems to rely almost exclusively on one definition of what constitutes a segment. Borrowing its definition f...

Read more »

Machine learning for better homicide counts in Ciudad Juarez

July 30, 2012

Photo Credit: Jesús Villaseca Pérez. In March 2008, Ciudad Juárez began to register an alarming number of homicides, becoming Mexico's most violent city. According to the Mexican vital statistics system, Ciudad Juárez (coterminous with the Juárez municipality) went from having just 202 murders in 2007 to 1,616 in 2008, 2,397 in...

Read more »

Blue Jay and Scrub Jay: Using rvertnet to check the distributions in R

July 30, 2012

As part of my Google Summer of Code, I am also working on another package for R called rvertnet. This package is a wrapper in R for VertNet websites. VertNet is a distributed vertebrate database network consisting of FishNet2, MaNIS, HerpNET, and ORNIS. Of those, currently FishNet, HerpNET and ORNIS have their v2 portals serving data. rvertnet now has functions to access

Read more »

Returns with negative net asset values

July 30, 2012

How are returns calculated when net asset value goes negative? Previously: in “A tale of two returns” we highlighted the similarities and differences of log returns versus simple returns. Positive valuation: we create, in R, an example of net asset value at four times: > nav1 <- c(1000, 900, 950, 1010) > nav1 … Continue reading...
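The two kinds of return named in this excerpt can be sketched in a few lines of base R. This is only an illustration under stated assumptions, not the code from the linked post: `nav1` is the vector shown in the excerpt, while the names `simple_ret`, `log_ret` and `nav_neg`, and the negative value itself, are hypothetical.

```r
# Net asset values at four times, as given in the excerpt
nav1 <- c(1000, 900, 950, 1010)

# Simple returns: change in value divided by the previous value
simple_ret <- diff(nav1) / head(nav1, -1)

# Log returns: difference of the logs of successive values
log_ret <- diff(log(nav1))

# For positive values the two are linked by log_ret = log(1 + simple_ret)
print(simple_ret)
print(log_ret)

# Hypothetical series (not from the post) that ends below zero.
# log() of a negative number is NaN in R, so the log return is undefined here.
nav_neg <- c(1000, -100)
neg_log_ret <- suppressWarnings(diff(log(nav_neg)))
print(is.nan(neg_log_ret))
```

The last step hints at why the post's question matters: once a value in the series is negative, the log return can no longer be computed, and the simple return needs careful interpretation.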

Read more »

unsupervised classification of a raster in R: the layer-stack or part one.

July 29, 2012

In my last post I explained how to use QGIS to build a layer stack of a Landsat scene. Because further research and experimentation resulted in frustration, I decided to stick with software I know well: R. So download the needed layers here and open up your favourite version of

Read more »