Monthly Archives: July 2012

Citing R or SAS

July 2, 2012
By
Citing R or SAS

One of us recently read a colleague's first draft of a paper, in which she had written: "All analyses were done in R 2.14.0." We assume we're preaching to the converted here, when we say that the enormous amount of work that goes into R needs to be re...

Read more »

My first competition at Kaggle

July 2, 2012
By
My first competition at Kaggle

For me Kaggle becomes a social network for data scientist, as stackoverflow.com or github.com for programmers. If you are data scientist, machine learner or statistician you better off to have a profile there, otherwise you do not exist. Nevertheless, I won’t bet on rosy future for data scientist as journalists suggest (sexy job for next

Read more »

Popularity of R continues

July 2, 2012
By
Popularity of R continues

No doubt those that read my blog know that the tools I use to do my Industrial Engineering and Operations Research work heavily rely on the open source side of software.  That is why I try to support as many open source projects such as COIN-OR, G...

Read more »

Moving beyond hopeless graphics

July 2, 2012
By

I was at a talk awhile ago where the speaker presented tables with 4, 5, 6, even 8 significant digits even though, as is usual, only the first or second digit of each number conveyed any useful information. A graph would be better, but even if you’re too lazy to make a plot, a bit The post Moving...

Read more »

Random portfolios versus Monte Carlo

July 2, 2012
By
Random portfolios versus Monte Carlo

What is the difference between Monte Carlo — as it is usually defined in finance — and random portfolios? The meaning of “Monte Carlo” The idea of “Monte Carlo” is very simple.  It is a fancy word for “simulation”. As usual, it is all too possible to find incredibly muddied explanations of such a simple … Continue reading...

Read more »

Simple distribution plot in R

July 2, 2012
By
Simple distribution plot in R

Plot the distribution of a sample as bars and add a histogram line for visualizing the sample characteristics.No related posts.

Read more »

MatLab, SAS, STATA, SPSS, Excel users: Try R, damn it!

July 2, 2012
By
MatLab, SAS, STATA, SPSS, Excel users: Try R, damn it!

Due to my work with a multitude of statistical packages in my career I may be able to evaluate a lot of them. I’ve first used Excel for my calculations as most of the normal users do. I like the idea behind a spreadsheet and the combination of data and click-to-do functions. Nevertheless I’ve often

Read more »

Olive vs. Sunflower oil Spectra – 002 (ChemoSpec)

July 1, 2012
By
Olive vs. Sunflower oil Spectra – 002 (ChemoSpec)

I add other data set of “sunflower oil” to import together with the olive oil into ChemoSpec R package. Before, as I showed in a video (Preparing spectra to import into ChemoSpec), every sample has been acquired with a NIR instrument (in transmitta...

Read more »

Visualizing uncertainty using Jackknife

July 1, 2012
By
Visualizing uncertainty using Jackknife

Once again, I (re)discovered last week at the Rmetrics conference that old toolds can be extremely interesting to illustrate complex ideas, like uncertainty in fnancial markets, and stock prices. For instance a 99.5% quantile: we look for the scena...

Read more »

FAO statistical areas in Google Earth

July 1, 2012
By
FAO statistical areas in Google Earth

Some time ago I did a blog describing how to get ICES and NAFO statistical areas, originally as shapefiles into Google Earth readable format (ICES, NAFO). These areas are the primary fisheries areas upon which nominal fisheries catch statistics have be...

Read more »