Blog Archives

A tutorial on outlier detection techniques

July 4, 2012
By
A tutorial on outlier detection techniques

by Yanchang Zhao, RDataMining.com There is an excellent tutorial on outlier detection techniques, presented by Hans-Peter Kriegel et al. at ACM SIGKDD 2010. It presents many popular outlier detection algorithms, most of which were published between mid 1990s and 2010, … Continue reading →

Read more »

An example on sentiment analysis with R

June 21, 2012
By
An example on sentiment analysis with R

by Yanchang Zhao, RDataMining.com There is a nice example on sentiment analysis with R at <http://viksalgorithms.blogspot.com.au/2012/06/tracking-us-sentiments-over-time-in.html>. In the example, the Wikileaks cable corpus is analyzed to track US sentiments of other countries and their presidents over time. The example describes … Continue reading →

Read more »

PDF slides and R code examples on Data Mining and Exploration

June 4, 2012
By
PDF slides and R code examples on Data Mining and Exploration

by Yanchang Zhao, RDataMining.com There are some nice slides and R code examples on Data Mining and Exploration at http://www.inf.ed.ac.uk/teaching/courses/dme/, which are listed below. PDF Slides: - Overview of Data Mining http://www.inf.ed.ac.uk/teaching/courses/dme/2012/slides/datamining_intro4up.pdf - Visualizing Data http://www.inf.ed.ac.uk/teaching/courses/dme/2012/slides/visualisation4up.pdf - Decision trees http://www.inf.ed.ac.uk/teaching/courses/dme/2012/slides/classification4up.pdf … Continue reading →

Read more »

CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

May 20, 2012
By
CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

The Tenth Australasian Data Mining Conference (AusDM 2012) Sydney, Australia 5-7 December 2012 http://ausdm12.togaware.com/ Data mining, the art and science of intelligent analysis of (usually large) data sets for meaningful (and previously unknown) insights, is now being actively applied in … Continue reading →

Read more »

An Example of Social Network Analysis with R using Package igraph

May 16, 2012
By
An Example of Social Network Analysis with R using Package igraph

by Yanchang Zhao, RDataMining.com This post presents an example of social network analysis with R using package igraph. The data to analyze is Twitter text data of @RDataMining used in the example of Text Mining, and it can be downloaded … Continue reading →

Read more »

Book “R and Data Mining: Examples and Case Studies” on CRAN

May 9, 2012
By
Book “R and Data Mining: Examples and Case Studies” on CRAN

by Yanchang Zhao, RDataMining.com My book in draft titled “R and Data Mining: Examples and Case Studies” is now available on CRAN at http://cran.r-project.org/other-docs.html. It is scheduled to be published by Elsevier in late 2012. Its latest version can be … Continue reading →

Read more »

A simple example of parallel computing on a Windows (and also Mac) machine

May 8, 2012
By
A simple example of parallel computing on a Windows (and also Mac) machine

by Yanchang Zhao, RDataMining.com With a Mac, parallel computing can be achieved with package multicore. Unfortunately, it does not work under Windows. A simple way for parallel computing under Windows (and also Mac) is using package snowfall, which can work … Continue reading →

Read more »

Online resources for handling big data and parallel computing in R

May 6, 2012
By
Online resources for handling big data and parallel computing in R

by Yanchang Zhao, RDataMining.com Compared with many other programming languages, such as C/C++ and Java, R is less efficient and consumes much more memory. Fortunately, there are some packages that enables parallel computing in R and also packages for processing … Continue reading →

Read more »

2nd round of call for chapter proposals for book Data Mining Applications with R: due by 31 May

May 2, 2012
By
2nd round of call for chapter proposals for book Data Mining Applications with R: due by 31 May

2nd CALL FOR CHAPTERS: proposals due by 31 May 2012 Data Mining Applications with R A book to be published by Elsevier http://www.RDataMining.com/books/book2 Introduction —————— R is one of the most widely used data mining tools in scientific and business … Continue reading →

Read more »

R Tips: lots of tips for R programming

April 26, 2012
By
R Tips: lots of tips for R programming

by Yanchang Zhao, RDataMining.com There are more than 100 R tips at http://pj.freefaculty.org/R/Rtips.html, which provide quick examples to small challenges in everyday R programming, especially for users switching from other languages to R. There is also a .PDF version for … Continue reading →

Read more »