243 search results for "PCA"

Examples on Clustering with R

August 25, 2011
By
Examples on Clustering with R

R code examples on various clustering techniques are available as “Clustering in R” in Chapter 4 of R & Bioconductor Manual by Thomas Girke, UC Riverside. It provides R examples on - Hierarchical Clustering, including tree cutting/coloring and heatmaps, - … Continue reading →

Read more »

Statistical Analysis Functions in R

August 20, 2011
By

Lately, I've been using statistical tests on a daily basis. I've noticed that I have to format my data the same way in order to get it into R (tab-delimited flat file essentially). Every other change in order to prep that data structure for any sort of...

Read more »

Making Stuff is Scary

August 15, 2011
By
Making Stuff is Scary

My daughter's best friend lives just down the street. Her mother runs a cupcake shop that's just a little further down the street. Being eleven going on sixteen, my daughter fancies herself a "quote" -- worker at the shop. She's not paid in actual mone...

Read more »

GDAT 2011 in Review

August 13, 2011
By
GDAT 2011 in Review

As usual, the Guerrilla Data Analysis Techniques (GDAT) class was a total blast. Motivated students always guarantee that. It would really help our scheduling, however, if people didn't wait until the last nanosecond to register for the class. But give...

Read more »

Image Data from ImageJ to R and Vice Versa

August 5, 2011
By

In recent years many R packages have been developed to enable image analysis in R. As an alternative the combination of R with a powerful image analysis software like ImageJ offers many advanced image analysis interfaces and algorithms not yet available in R. Bio7 integrates both applications in a Rich Client Plattform based on Eclipse

Read more »

Regional differences on what drives CO2 emissions

July 20, 2011
By
Regional differences on what drives CO2 emissions

If you are investigating the change of CO2 emissions, then you might ask: Where do the changes occur? Well here is the answer.The staircase plots show the contributing factors to CO2 emissions for each continent. population refers to population effects, gdp_pcap refers to income per capita, energy_intensity refers to energy used per dollar added value, and carbon intensity...

Read more »

In case you missed it: June Roundup

July 11, 2011
By

In case you missed them, here are some articles from June of particular interest to R users. Highlights of presentations from the R/Finance 2011 conference. Trulia uses R and statistical models to map local crime. Resources for data mining with R. K-means clustering on large data sets with the RevoScaleR package. Revolution Analytics' CTO David Champagne writes on real-time...

Read more »

ARMA Models for Trading, Part VI

July 5, 2011
By
ARMA Models for Trading, Part VI

All posts in this series were combined into a single, extended tutorial and posted on my new blog. In the fourth posting in this series, we saw the performance comparison between the ARMA strategy and buy-and-hold over the last approximately 10 years. Over the last few weeks (it does take time, believe me) I back-tested

Read more »

Five things Biologists should know about Statistics

June 21, 2011
By

In a thoughtful blog post, Bioinformatician Ewan Birney (Head of Nucleotide Data at the European Bioinformatics Institute) talks about the importance of Statistics to biologists: Biology is really about stats. Indeed, the foundation of much of frequentist statistics - RA Fisher and colleagues - were totally motivated by biological problems. He also cites the "Five statistical things I wished...

Read more »

Importing Nanotoxicity Data with SPARQL into R for analysis

June 14, 2011
By

Not so long ago I wrote about mporting RDF input in R for analysis. I am collecting nanotoxicology data in a Semantic MediaWiki with the RDFIO extension installed (by Samuel), allowing me to SPARQL that data directly from R. There is nothing much structural to visualize at this moment, so I'm skipping the Bioclipse...

Read more »