I'm working on some really interesting stuff at the moment, the details of which I can't discuss for reasons of national security (not really). However, one of the things I've been doing a lot of is searching through lots of different combina...

Computer Assisted Reporting In an online press release on Tuesday the Wisconsin Government Accountability Board announced they would put all 153,335 pages of PDF copies of the Scott Walker recall petition online later that day. The GAB announced the PD...

I finally got around to completing item 5 on my 2011 list concerning electrical power consumed by a magnetic hard disk drive (HDD). The semi-empirical statement is: $\text{Power} \propto N_{\text{platters}} \times \Omega^{2.8} \times D^{4.6}$ (1) where $N_{\text{platters}}$ is the number of platters on the spindle, $\Omega$ is the rotational speed in revolutions per minute (RPM) and D...
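As a rough illustration of how the exponents in (1) play out, here is a minimal R sketch. Because (1) is only a proportionality, the sketch computes power ratios between two drives and not absolute watts. The function name `hdd_power_ratio` and the example drive figures are assumptions made for illustration, not part of the original post.

```r
# Relative HDD power implied by the proportionality in (1).
# Only ratios are meaningful here: the constant of proportionality is unknown.
# hdd_power_ratio() and the example values below are illustrative assumptions.
hdd_power_ratio <- function(n_platters, rpm, diameter,
                            n_platters_ref, rpm_ref, diameter_ref) {
  (n_platters / n_platters_ref) *
    (rpm / rpm_ref)^2.8 *
    (diameter / diameter_ref)^4.6
}

# Same platter count and diameter, 7200 RPM versus 5400 RPM
hdd_power_ratio(1, 7200, 3.5, 1, 5400, 3.5)

# Same platter count and speed, 3.5 inch versus 2.5 inch platters
hdd_power_ratio(1, 7200, 3.5, 1, 7200, 2.5)
```

Under (1), the step from 5400 to 7200 RPM alone multiplies power by (4/3)^2.8, a bit more than double, and the platter diameter has an even steeper effect because of the 4.6 exponent.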

When I say Ease of Use Improved, I mean you can simply copy, paste and run the code in this post, without referring to other places and without downloading a data file and reading it into R. This is how I like a blog article to be. You don’t need to read the whole article. You

My Paris colleague (and fellow-runner) Aurélien Garivier has produced an interesting comparison of 4 (or 6 if you consider scilab and octave as different from matlab) computer languages in terms of speed for producing the MLE in a hidden Markov model, using EM and the Baum-Welch algorithms. His conclusions are that matlab is a lot

"R" has a package called "ChemometricsWithR", which provides data from different analytical instruments, including near infrared (NIR). Follow the steps to plot the spectra of a gasoline data set: In this other case we plot the spectra of the NIR shootout 2002:
> data(shootout)
> wavelengths <- seq(600, 1898, by = 2)
> matplot(wavelengths, t(shootout$calibrate.1), xlab = "wavelength (nm)", ylab = "log(1/R)")
>...

Page 7 of Facebook's 213-page S-1 filing for their record-breaking IPO includes the following chart, under the headline: "Our Mission: To make the world more open and connected". This chart was created using the R language and Hadoop by Facebook intern Paul Butler. (Thanks to the blog IOER Tools for first noticing the inclusion of the chart.) And speaking...

This is a quick update to announce my new blog Serious Stats. This is a companion to my forthcoming book of the same name: Baguley, T. (2012, in press). Serious stats: A guide to advanced statistics for the behavioral sciences. Basingstoke: Palgrav...

In my previous HANA and R blogs, I have been forced to create .csv files from HANA and read them in R...an easy but also boring procedure...especially if your R report is supposed to be run on a regular basis...having to create a .csv file every time you need to run your report is not a nice thing...After...

I just received this announcement for the opening of a (tenured/civil servant) position in the national research institute in biostatistics, genetics, and agronomy, INRA: Position opening with profile Approximate inference techniques in complex systems Key activities and required skills: You will develop methodological research in the field of statistical inference for models used in environmental

01.02.2012 Landscape metrics were developed to analyze spatial patterns of landscapes (e.g. composition and spatial arrangement). In R it is possible to calculate these metrics with the “SDMTools” package. Bio7 offers an easy-to-use interface to R and ImageJ and can use these tools to simplify a workflow for analyzing image data (e.g. vegetation

Ken Rice and Thomas Lumley will give a course on advanced R programming in two locations this summer. 1. In Edinburgh, June 13-15 (the week before the International Conference in Quantitative Genetics). See http://www.eisg2012.org.uk/ 2. In Seattle, July 23-25, as part of the Summer Institute in Statistical Genetics. See http://www.biostat.washington.edu/suminst/sisg/general The course is about 60% lecture and 40% lab session (BYO R),...

Ventana Research analyst David Menninger was on the judging panel for the Applications of R in Business contest. In a post on the Ventana Research blog, he offers his perspectives on the contest, noting that R, as a statistical package, includes many algorithms for predictive analytics, including regression, clustering, classification, text mining and other techniques. The contest submissions supported...

Like other software, "R" has nice tools for looking at the data before developing the calibration. Statistics for the "Y" variable (in this case octane number) such as maximum, minimum, ..., standard deviation, ... are important:
> library(ChemometricsWithR)
> data(gasoline)
> summary(gasoline$octane)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
  83.40   85.88   87.75   87.18   88.45   89.60
> sd(gasoline$octane)
[1] 1.530078
And of course the histogram:
> hist(gasoline$octane)

The birthday problem (i.e. looking at the distribution of the birthdates in a group of n persons, assuming a uniform distribution of the calendar dates of those birthdates) is always a source of puzzlement! For instance, here is a recent post on Cross Validated: I have 360 friends on facebook, and, as
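For readers who want to experiment with the uniform-birthdate assumption, here is a minimal R sketch of the classical version of the problem: the probability that at least two of n people share a birthday. It is not taken from the Cross Validated post, whose exact question is cut off above. The helper names `p_shared_birthday` and `simulate_shared_birthday` are made up for illustration, while `pbirthday` is the function shipped in R's stats package.

```r
# Exact probability that at least two of n people share a birthday,
# assuming 'days' equally likely birthdates (the uniform assumption above).
# Helper names are illustrative, not from the original post.
p_shared_birthday <- function(n, days = 365) {
  if (n > days) return(1)
  1 - prod((days - seq_len(n) + 1) / days)
}

# Monte Carlo check of the same quantity (n_sims is an arbitrary choice).
simulate_shared_birthday <- function(n, days = 365, n_sims = 10000) {
  hits <- replicate(
    n_sims,
    anyDuplicated(sample.int(days, n, replace = TRUE)) > 0
  )
  mean(hits)
}

set.seed(1)
p_shared_birthday(23)         # exact value under the uniform assumption
pbirthday(23)                 # built-in counterpart from the stats package
simulate_shared_birthday(23)  # should land close to the exact value
```

With 23 people the exact probability is already slightly above one half, which is the usual source of the puzzlement.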

I like having all my important documents and scripts in one single place. This saves me from having to synchronize them between the different workplaces I have, and makes backing up much less of a pain. One way of achieving this…