266 search results for "pCA"

Rd2roxygen: Convert Rd to roxygen documentation

December 1, 2010
By

must admit that I have been tired of maintaining my R packages for a long time, and the main reason is I feel really uncomfortable with writing R documentations (Rd). The required structure of an R package mainly includes two directories R and man — the former for the R source code (typically functions), and

Read more »

Principal Component Analysis: Which variables contribute most to principal components ?

November 23, 2010
By

Principal component analysis (PCA) is a mathematical transformation of possibly(correlated) variables into a number of uncorrelated variables called principal components. The resulting components from this transformation is defined in such a way that t...

Read more »

Principal Component Analysis: Which variables contribute most to principal components ?

November 23, 2010
By

Principal component analysis (PCA) is a mathematical transformation of possibly(correlated) variables into a number of uncorrelated variables called principal components. The resulting components from this transformation is defined in such a way that t...

Read more »

My Day at ACM Data Mining Camp III

November 13, 2010
By
My Day at ACM Data Mining Camp III

My first time at ACM Data Mining Camp was so awesome, that I was thrilled the make the trip up to San Jose for the November 2010 version. In July, I gave a talk at the Emerging Technologies for Online Learning Symposium conference with a faculty member in the Department of Statistics, at the Fairmont. The place was amazing,...

Read more »

An analysis of the Stackoverflow Beta sites

November 1, 2010
By
An analysis of the Stackoverflow Beta sites

In the last six months or so, the behemoth of Q & A sites stackoverflow, decided to change tack and launch a number of other non-computing-language sites. To launch a site in the stackoverflow family, sites have to spend time gathering followers in Area51. Once a site has gained a critical mass, a new StackExchange

Read more »

How to Start Using (pgf)Sweave in LyX in One Minute

October 30, 2010
By

regor Gorjanc published an interesting article “Using Sweave with LyX” in R News in 2008, which (I believe) makes it much easier to use Sweave. I use command-line tools a lot every day, but I am still “GUI-addicted”. (I don’t want to comment more about Microsoft Word here.) LyX is a somewhat WYSIWYG tool based

Read more »

Grabbing Tables in Webpages Using the XML Package

October 24, 2010
By

ables are pretty common in web pages as data sources, and the most direct way to get these data is probably to copy and paste. This is OK if there are only two or three tables, and when we need to grab 5000 tables in 1000 web pages, we may not really wish to fulfill

Read more »

On the Gory Loops in R

October 17, 2010
By

his blog post is mainly for Stat 579 students on the homework for week 7, since I received too many “gory” loops in the homework submissions and I think it would help a bit to write my thoughts on R loops for beginners. The immortal motto for newbies in programming is: If you want to

Read more »

SyntaxHighlighter Brush for the R Language

September 11, 2010
By

al Galili requested in the R-help mailing list for a SyntaxHighlighter brush for the R language, so that WordPress users can highlight their R code easily. I promised to contribute a few minutes on this task, and here is the result: /** * Author: Yihui Xie * URL: http://yihui.name/en/2010/09/syntaxhighlighter-brush-for-the-r-language * License: GPL-2 | GPL-3 */

Read more »

Eigenimages: The AT&T Cambridge Faces Database

September 7, 2010
By
Eigenimages: The AT&T Cambridge Faces Database

I picked up the AT&T Laboratories Cambridge database of faces for a clustering application. The database consists of images of 40 distinct subjects, each in 10 different facial positions and expressions. Typically, the goal of clustering in these data is to recover the ‘true’ partition, or that which isolates images of distinct subjects. Each image

Read more »