Blog Archives

A nice short article on memory in R

November 28, 2011

There is a nice short article on memory issues in R at http://www.matthewckeller.com/html/memory.html. If you use R to process large data, you might find it helpful. It introduces:
- checking how much memory an object is taking;
- the memory …
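As a minimal sketch of the first point, base R's object.size() gives an estimate of the memory an object occupies. The vector x below is a made-up example and is not taken from the linked article.

```r
# Hypothetical example object: one million doubles (8 bytes each, plus a small header)
x <- numeric(1e6)

# Estimated size of the object in bytes
size <- object.size(x)
print(size)

# The same estimate in a more readable unit
print(size, units = "Mb")

# Estimated sizes of all objects in the current workspace, largest first
sizes <- sapply(ls(), function(name) object.size(get(name)))
print(sort(sizes, decreasing = TRUE))

# Remove the object and ask R to run garbage collection
rm(x)
invisible(gc())
```

Note that object.size() is only an estimate, so it is best used to compare objects rather than to account for every byte.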

Using Text Mining to Find Out What @RDataMining Tweets are About

November 8, 2011

This post shows an example of text mining Twitter data with the R packages twitteR, tm and wordcloud. Package twitteR provides access to Twitter data, tm provides functions for text mining, and wordcloud visualizes the result as a word cloud. …

Help: stemming and stem completion with package tm in R

November 3, 2011

I came across the problem below when doing stemming and stem completion with package tm in R. The word “mining” was stemmed to “mine” with stemDocument(), and then completed to “miners” with stemCompletion(). However, I would prefer to keep “mining” intact. For stemCompletion(), …

R Cookbook with examples

October 27, 2011

An R Cookbook can be found at http://code.ca-net.org/R%20Cookbook. It is a short web document presenting dozens of examples on:
- Accessing Database with packages RSQLite, RMySQL, RdbiPgSQL and RODBC;
- Reading and Writing Data;
- Date/Time variable;
- Graphics;
- …

Interactive charts with googleVis package and R

October 4, 2011

The examples at the link below illustrate interactive charts created with the googleVis package and R.
http://code.google.com/p/google-motion-charts-with-r/wiki/GadgetExamples
Some amazing features are: a motion chart shows changes over time, an AnnotatedTimeLine shows a zoom-in/zoom-out view of a time series, a TreeMap supports drill-down …

Obama recruiting analysts and R is one preferred skill

September 27, 2011

Barack Obama is recruiting analysts for his 2012 re-election campaign. The role is to analyze the campaign’s data to guide election strategy and develop quantitative, actionable insights that drive decision-making. R is mentioned as one of the tools to use. Analytics …

Datasets to Practice Your Data Mining

September 16, 2011

Many datasets are freely available online for research use. Some of them are listed below.
- The R Datasets Package: There are around 90 datasets available in the package. Most of them are small and easy to feed …
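For the R Datasets Package, a quick way to see what is available is to list its contents from within R. This is only a sketch using base R; the choice of iris as the dataset to inspect is an assumption made for illustration.

```r
# List the datasets shipped with the built-in 'datasets' package
info <- data(package = "datasets")$results

# Number of listed entries (the exact count depends on the R version)
print(nrow(info))

# Names and short descriptions of the first few datasets
print(head(info[, c("Item", "Title")]))

# Datasets in this package are available by name without extra loading
str(iris)
print(summary(iris$Sepal.Length))
```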

Slides of 10+ talks at R Users Groups

August 29, 2011

Links to slides of 10+ talks at R Users Groups in Australia are provided below. The slides can be downloaded at the links, including R code if any.
MelbURN: Melbourne Users of R Network: Experiences with using R in …

Examples on Clustering with R

August 25, 2011

R code examples on various clustering techniques are available as “Clustering in R” in Chapter 4 of the R & Bioconductor Manual by Thomas Girke, UC Riverside. It provides R examples on:
- Hierarchical Clustering, including tree cutting/coloring and heatmaps,
- …
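To give a flavour of hierarchical clustering with tree cutting, here is a minimal sketch in base R. It is not taken from the manual; the iris data, the average linkage and the choice of three clusters are all assumptions made for illustration.

```r
# Use the four numeric columns of the built-in iris data (an assumed example dataset)
measurements <- scale(iris[, 1:4])

# Hierarchical clustering on Euclidean distances with average linkage
hc <- hclust(dist(measurements), method = "average")

# Cut the tree into three clusters
groups <- cutree(hc, k = 3)

# Compare the clusters with the known species labels
print(table(groups, iris$Species))

# Draw the dendrogram and outline the three clusters
plot(hc, labels = FALSE, main = "Average-linkage clustering of iris")
rect.hclust(hc, k = 3, border = "red")
```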

Time Series Analysis and Mining with R

August 23, 2011

Time series data are widely seen in analytics. Some examples are stock indexes/prices, currency exchange rates and electrocardiograms (ECG). Traditional time series analysis focuses on smoothing, decomposition and forecasting, and there are many R functions and packages available for those …
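The three traditional tasks named above can be sketched with functions from base R's stats package. The AirPassengers series, the multiplicative model and the 12-month horizon are assumptions chosen for illustration, not the post's own example.

```r
# AirPassengers is a monthly time series shipped with R (assumed example data)
passengers <- AirPassengers

# Smoothing: a centred 12-month moving average
weights <- c(0.5, rep(1, 11), 0.5) / 12
smoothed <- stats::filter(passengers, weights, sides = 2)

# Decomposition into trend, seasonal and random components
parts <- decompose(passengers, type = "multiplicative")
plot(parts)

# Forecasting: Holt-Winters exponential smoothing, 12 months ahead
fit <- HoltWinters(passengers, seasonal = "multiplicative")
forecast <- predict(fit, n.ahead = 12)
print(forecast)
```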
