R training: Visualization, Big Data, Data Mining, and Marketing Analytics

August 2, 2012
By

Revolution Analytics is hosting several live and online courses over the next couple of months that will be of interest to R users looking to hone their skills: Visualization in R with ggplot2. Garrett Grolemund and Winston Chang instruct how to use the ggplot2 package to make, format, label and adjust graphs using R. (August 28, Redwood City, CA.)...

Read more »

plotting raster data in R: adjusting the labels and colors of a classified raster

August 2, 2012
By
plotting raster data in R: adjusting the labels and colors of a classified raster

Thank’s to Andrej who wrote this comment: “Is it possible to to color the resulting 12 clusters within your original image to get a feel for visual separation?” You can do so: But how to get values at a location? You will need these values to determine whether the defined class is representing a water

Read more »

Who wants to maintain pgfSweave?

August 2, 2012
By

So the time has come for me to face the fact that I have no time to maintain pgfSweave. It was recently archived because I didn’t make necessary changes to comply with some CRAN policies. SO, I need someone to step up to the plate to make some tweakes, put it back up on CRAN

Read more »

Spacing of multi-panel figures in R

August 2, 2012
By
Spacing of multi-panel figures in R

In a previous post, I showed how to keep text and symbols at the same size across figures that have different numbers of panels. The figures in that post were ugly because they used the default panel spacing associated with the mfrow argument of the par( ) function. Below I will walk through how to

Read more »

How do you say “We Will Do Whatever It Takes” in Thai?

August 2, 2012
By
How do you say “We Will Do Whatever It Takes” in Thai?

As the market has already started to poke holes in Draghi’s promise, I thought it would be good to continue the series of posts that I began with the British version “We Will Do Whatever it Takes” with my favorite article written during the Asia ...

Read more »

Data Parallelism Using Oracle R Enterprise

August 2, 2012
By

Modern computer processors are adequately optimized for many statistical calculations, but large data operations may require hours or days to return a result.  Oracle R Enterprise (ORE), a set of R packages designed to process large data computations in Oracle Database, can run many R operations in parallel, significantly reducing processing time. ORE supports parallelism through the transparency layer,...

Read more »

Multivariate Data Analysis Work Flow

August 2, 2012
By
Multivariate Data Analysis Work Flow

Here is an example of a data analysis work flow supported in imDEV. This network visualization was made using CmapTools.

Read more »

Units and metadata

August 2, 2012
By

Handling meta-data is not natural in R, or any traditional rectangular shaped type data storage system.There are several tricks and packages which attempt to solve this problem, with Hmisc using the atrribute feature and the IRange package having its o...

Read more »

CFP: AusDM 2012, deadline extended to 31 August 2012

August 2, 2012
By
CFP: AusDM 2012, deadline extended to 31 August 2012

The Tenth Australasian Data Mining Conference (AusDM 2012) Sydney, Australia 5-7 December 2012 http://ausdm12.togaware.com/ Deadline extended to 31 August 2012 The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data … Continue reading →

Read more »

unsupervised classification of a Landsat image in R: the whole story or part two

August 1, 2012
By
unsupervised classification of a Landsat image in R: the whole story or part two

The main question when using remote sensed raster data, as we do, is the question of NaN-treatment. Many R functions are able to use an option like rm.NaN=TRUE to treat these missing values. In our case the kmeans function in R is not capable to use such a parameter. After reading the tif-files and creating

Read more »

More on Horizon Charts

August 1, 2012
By
More on Horizon Charts

for background please see prior posts Application of Horizon Plots, Horizon Plot Already Available, and Cubism Horizon Charts in R Some feedback has led me to think that I might have been a little ambitious with my last post on horizon charts. I though...

Read more »

Genetic algorithms: a simple R example

August 1, 2012
By
Genetic algorithms: a simple R example

Genetic algorithm is a search heuristic. GAs can generate a vast number of possible model solutions and use these to evolve towards an approximation of the best solution of the model. Hereby it mimics evolution in nature. GA generates a population, the individuals in this population (often called chromosomes) have  Read more »

Analytics for Marketing online training 25 – 28 September 2012

August 1, 2012
By
Analytics for Marketing online training 25 – 28 September 2012

I am excited to be giving the Analytics for Marketing online training course on 25-28 September 2012. Sign up before 25 August 2012 for the early bird discount. Our friends at Revolution Analytics who will provide the infrastructure to host the event. Update: For clarification, this is an online, instructor led training course. We are...

Read more »

Analytics for Marketing online training 25 – 28 September 2012

August 1, 2012
By
Analytics for Marketing online training 25 – 28 September 2012

I am excited to be giving the Analytics for Marketing online training course on 25-28 September 2012. Sign up before 25 August 2012 for the early bird discount. Our friends at Revolution Analytics who will provide the infrastructure to host the event. Update:...

Read more »

Genetic algorithms: a simple R example

August 1, 2012
By
Genetic algorithms: a simple R example

Genetic algorithm is a search heuristic. GAs can generate a vast number of possible model solutions and use these to evolve towards an approximation of the best solution of the model. Hereby it mimics evolution in nature. GA generates a population, the individuals in this population (often called chromosomes) have a given state. Once the population is generated, the state of these individuals is evaluated...

Read more »

Bio7 1.6 for Windows and Linux released!

August 1, 2012
By
Bio7 1.6 for Windows and Linux released!

01.08.2012 Finally i released a new version of Bio7 with many improvements and new features. Updated tutorials are available, too. The new Bio7 1.6 release can be downloaded here. Please also download the examples *.zip file from the sourceforge website which contains new examples for Bio7 1.6 (e.g. an example to cluster an image folder with

Read more »

Hadley Wickham’s ggplot2 basics

August 1, 2012
By

If you haven't made the plunge yet to making R graphics with Hadley Wickham's ggplot2 package, his "ggplot2 basics" slides (from the recent Introduction to Data Visualization and Analysis course at JSM) is a good place to start. Once you get the hang of the "grammar of graphics" notation, you'll be building beautiful data visualizations like this or this...

Read more »

Creating a text grob that automatically adjusts to viewport size

August 1, 2012
By
Creating a text grob that automatically adjusts to viewport size

I recently wanted to construe a dashboard widget that contains some text and other elements using the grid graphics system. The size available for the widget will vary. When the sizes for the elements of the grobs in the widget are specified as Normalised Parent Coordinates the size adjustments happen automatically. Text does not automatically adjust though. The

Read more »

Olympic body match and 1:1 BMI

August 1, 2012
By
Olympic body match and 1:1 BMI

In my morning attempt to read the whole internet before beginning work, I came across a program on the BBC website which allows you to see which Olympic athletes are your body doubles. Or rather, which athletes share your height and weight, and therefore your body mass index. Being a Canadian, I exist in an

Read more »

Building a presentation, report or paper in R

August 1, 2012
By

If you need to build a presentation, obviously you have following options: Powerpoint alike presentation Online engines LaTex The first two are beloved by business people and the third one is widely used in academia. The objective of the first group is shiny presentation, contrary to the second where asceticism and demand for automation are

Read more »

Examples of profiling R code

August 1, 2012
By
Examples of profiling R code

by Yanchang Zhao, RDataMining.com Below are simple examples of profiling R code, which help to find out which steps or functions are most time consuming. It is very useful for improving efficiency of R code. # profiling of running time … Continue reading →

Read more »

Trying Julia

August 1, 2012
By
Trying Julia

In my previous post I tried building Williams designs in R. Since that code was running a bit slow, this was an ideal test for Julia. Big enough to be at least slightly realistic, small enough that it is doable.I am very impressed. Almost twenty fold s...

Read more »

Rook rocks! Example with googleVis

August 1, 2012
By
Rook rocks! Example with googleVis

What is Rook?Rook is a web server interface for R, written by Jeffrey Horner, the author of rApache and brew. But unlike other web frameworks for R, such as brew, R.rsp (which I have used in the past1), Rserve, gWidgetWWWW or sumo (which I haven't used...

Read more »

Highlights from the useR! 2012 conference

August 1, 2012
By
Highlights from the useR! 2012 conference

Video (screencast) of the presentation by Szilard Pafka at the Los Angeles R users group. I summarized (with short demos) a few of the talks from the useR! 2012 conference. We are planning one more meetup to cover more talks. … Continue reading →

Read more »

RcppCNPy 0.2.0

July 31, 2012
By

Version 0.2.0 of the recently introduced RcppCNPy package for reading/writing NumPy data in R arrived on CRAN earlier today. The main change are the added ability to also write gzip-ed npy files, to suppress an automatic transposition as well as th...

Read more »

The Environmental Performance Index, visualized with R

July 31, 2012
By
The Environmental Performance Index, visualized with R

The Environmental Performance Index (EPI) ranks countries on performance indicators for environmental public health and ecosystem vitality. Yale University hosts the EPI website, which was used to present the 2012 EPI Rankings to world leaders at the 2012 World Economic Forum at Davos. The Country Profiles section of the website allowed members to browse the performance characteristics of their...

Read more »

Twitter analysis of air pollution in Beijing

July 31, 2012
By
Twitter analysis of air pollution in Beijing

One of the air pollution detection machine in Beijing (at the American Embassy) is connected to Twitter and tweet about the air quality in real time. By default the machine in Beijing output the 24hr summary PM2.5 air pollution information. What is PM2.5 is define here Next will be to compare the...

Read more »

Fun with geocoding and mapping in JGR

July 31, 2012
By
Fun with geocoding and mapping in JGR

For a recent project I had to do some mapping of addresses, but I didn’t have there lat/lons do use the Deducer and DeducerSpatial packages in R JGR.  After frustrating myself trying to adapt this code from stackoverflow.com, I found a much easier way of geocoding using the dismo and XML packages in R. First

Read more »

Text and symbol size in multi-panel figures in R

July 31, 2012
By
Text and symbol size in multi-panel figures in R

In R, there are a couple of packages that allow you to create multi-panel figures (see examples here and here), but, of course, you can also make multi-panel figures in the base package*. Below I provide a simple example for creating a multi-panel figure in the R base package with the focus on making the

Read more »