Blog Archives

2nd CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

July 10, 2012
By
2nd CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

The Tenth Australasian Data Mining Conference (AusDM 2012) Sydney, Australia, 5-7 December 2012 http://ausdm12.togaware.com/ The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. This year’s conference, AusDM’12, co-hosted … Continue reading →

Read more »

Data Mining In Excel: Lecture Notes and Cases

July 10, 2012
By
Data Mining In Excel: Lecture Notes and Cases

by Yanchang Zhao, RDataMining.com It is a 270-page book on data mining with Excel. It can be downloaded as a PDF file at http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.83.1393&rep=rep1&type=pdf. Below is its table of contents. - Overview of the Data Mining Process - Data Exploration … Continue reading →

Read more »

A tutorial on outlier detection techniques

July 4, 2012
By
A tutorial on outlier detection techniques

by Yanchang Zhao, RDataMining.com There is an excellent tutorial on outlier detection techniques, presented by Hans-Peter Kriegel et al. at ACM SIGKDD 2010. It presents many popular outlier detection algorithms, most of which were published between mid 1990s and 2010, … Continue reading →

Read more »

An example on sentiment analysis with R

June 21, 2012
By
An example on sentiment analysis with R

by Yanchang Zhao, RDataMining.com There is a nice example on sentiment analysis with R at <http://viksalgorithms.blogspot.com.au/2012/06/tracking-us-sentiments-over-time-in.html>. In the example, the Wikileaks cable corpus is analyzed to track US sentiments of other countries and their presidents over time. The example describes … Continue reading →

Read more »

PDF slides and R code examples on Data Mining and Exploration

June 4, 2012
By
PDF slides and R code examples on Data Mining and Exploration

by Yanchang Zhao, RDataMining.com There are some nice slides and R code examples on Data Mining and Exploration at http://www.inf.ed.ac.uk/teaching/courses/dme/, which are listed below. PDF Slides: - Overview of Data Mining http://www.inf.ed.ac.uk/teaching/courses/dme/2012/slides/datamining_intro4up.pdf - Visualizing Data http://www.inf.ed.ac.uk/teaching/courses/dme/2012/slides/visualisation4up.pdf - Decision trees http://www.inf.ed.ac.uk/teaching/courses/dme/2012/slides/classification4up.pdf … Continue reading →

Read more »

CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

May 20, 2012
By
CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

The Tenth Australasian Data Mining Conference (AusDM 2012) Sydney, Australia 5-7 December 2012 http://ausdm12.togaware.com/ Data mining, the art and science of intelligent analysis of (usually large) data sets for meaningful (and previously unknown) insights, is now being actively applied in … Continue reading →

Read more »

An Example of Social Network Analysis with R using Package igraph

May 16, 2012
By
An Example of Social Network Analysis with R using Package igraph

by Yanchang Zhao, RDataMining.com This post presents an example of social network analysis with R using package igraph. The data to analyze is Twitter text data of @RDataMining used in the example of Text Mining, and it can be downloaded … Continue reading →

Read more »

Book “R and Data Mining: Examples and Case Studies” on CRAN

May 9, 2012
By
Book “R and Data Mining: Examples and Case Studies” on CRAN

by Yanchang Zhao, RDataMining.com My book in draft titled “R and Data Mining: Examples and Case Studies” is now available on CRAN at http://cran.r-project.org/other-docs.html. It is scheduled to be published by Elsevier in late 2012. Its latest version can be … Continue reading →

Read more »

A simple example of parallel computing on a Windows (and also Mac) machine

May 8, 2012
By
A simple example of parallel computing on a Windows (and also Mac) machine

by Yanchang Zhao, RDataMining.com With a Mac, parallel computing can be achieved with package multicore. Unfortunately, it does not work under Windows. A simple way for parallel computing under Windows (and also Mac) is using package snowfall, which can work … Continue reading →

Read more »

Online resources for handling big data and parallel computing in R

May 6, 2012
By
Online resources for handling big data and parallel computing in R

by Yanchang Zhao, RDataMining.com Compared with many other programming languages, such as C/C++ and Java, R is less efficient and consumes much more memory. Fortunately, there are some packages that enables parallel computing in R and also packages for processing … Continue reading →

Read more »