Monthly Archives: February 2012

"R": PLS Regression (Gasoline) – 003

February 3, 2012
By
"R": PLS Regression  (Gasoline) – 003

The gasoline data set has the spectra of 60 samples acquired by diffuse reflectance from 900 to 1700 nm. We saw how to plot the spectra in the previous post.
Now, following the tutorial of Bjorn-Helge Mevik published in "R-News Volume 6/3, August 2006", we will do the PLS regression:

gas1 <- plsr(octane~NIR, ncomp = 10,data = gasoline, validation...

Read more »

Accelerating analytics at MSU with Revolution R Enterprise

February 3, 2012
By

Erik Sigur, Information Technologist for the Department of Statistics and Probability at Michigan State University, writes at ReadWriteWeb about using Revolution R Enterprise to provide high-performance computation in R to the researchers in his department: Our search for a more effective version of R ultimately brought us to a product called Revolution R Enterprise by Revolution Analytics, which provides...

Read more »

Monty Hall by simulation in R

February 3, 2012
By
Monty Hall by simulation in R

(Almost) every introductory course in probability introduces conditional probability using the famous Monte Hall problem. In a nutshell, the problem is one of deciding on a best strategy in a simple game. In the game, the contestant is asked to select one of three doors. Behind one of the doors is a great prize (free

Read more »

Forbes: Top 20 influencers in Big Data

February 3, 2012
By

Haydn Shaughnessy at The Forbes blog provides a list of the "Top 20 Influencers in Big Data", and I'm humbled to report that yours truly is listed there at #2. It's an instantaneous ranking based on the social-media tracking tool Traakr, but it's still great to be listed alongside writers for SiliconAngle, GigaOM, and KDNuggets (and even Mashable!). I...

Read more »

New R User Groups in Austin, Adelaide

February 3, 2012
By

It's awesome to see so many local R user groups kicking off in 2011! Yet another is the Austin R User Group in Austin, Texas. They've already held their first informal get-together, and the first formal meeting on February 23 will be devoted to data management techniques in R. Props to Sandy Donlon for organizing the group! And I'm...

Read more »

Why don’t we hear more about Adrian Dantley on ESPN? This graph makes me think he was as good an offensive player as Michael Jordan.

February 3, 2012
By
Why don’t we hear more about Adrian Dantley on ESPN? This graph makes me think he was as good an offensive player as Michael Jordan.

In my last post I complained about efficiency not being discussed enough by NBA announcers and commentators. I pointed out that some of the best scorers have relatively low FG% or TS%. However, via the comments it was pointed out that top scorers need ...

Read more »

Large search spaces using R

February 3, 2012
By

I'm working on some really interesting stuff at the moment, the details of which I can't discuss for reasons of national security (not really). However, one of the things I've been doing a lot of is searching though lots of different combina...

Read more »

How many pages in Scott Walker Recall Petition PDF files?

February 3, 2012
By

Computer Assisted Reporting In an online press release on Tuesday the Wisconsin Government Accountability Board announced they would put all 153,335 pages of PDF copies of the Scott Walker recall petition online later that day. The GAB announced the PD...

Read more »

Green Disk Sizing

February 3, 2012
By
Green Disk Sizing

I finally got around to completing item 5 on my 2011 list concerning electrical power consumed by a magnetic hard disk drive (HDD). The semi-empirical statement is:

Power ∝ Nplatters × Ω2.8 × D4.6    . . .    (1)

where Nplatters is the number of platters on the spindle, Ω is the rotational speed in revolutions per minute (RPM) and D...

Read more »

Japan Quake Map 2010-2011

February 2, 2012
By
Japan Quake Map 2010-2011

1 Introduction “The 3.11 Tohoku Earthquake in Japan”, It did serious damage to Japan. I have attempted gaining

Read more »