analyze the national health and nutrition examination survey (nhanes) with r

October 30, 2012
By

nhanes is this fascinating survey where doctors and dentists accompany survey interviewers in a little mobile medical center that drives around the country.  while the survey folks are interviewing people, the medical professionals administer labo...

Read more »

"Advanced R" Course – November 15-16, 2012

October 30, 2012
By

This is the last post about the course. As places are limited, please register as soon as possible! Milano R net, in collaboration with Quantide, organizes "Advanced R" Course November 15-16, 2012 Course description This course is designed for those … Continue reading →

Read more »

Introducing R and Biostatistics to first year LCG students (2012 version)

October 30, 2012
By
Introducing R and Biostatistics to first year LCG students (2012 version)

On Friday November 9th I’ll be giving a talk to the first year students from the Undergraduate Program on Genomic Sciences (LCG in Spanish) during their “Seminar 1: Introduction to Bioinformatics” course. It’s just like I did a year ago as I documented in my post Introducing Biostatistics to first year LCG students. Well, this time I’ll change things...

Read more »

Can We Live Without Backslashes?

October 30, 2012
By
Can We Live Without Backslashes?

Two months ago there was a discussion in the ESS mailing list about Emacs/ESS started by Paul Johnson, who claimed "Emacs Has No Learning Curve". While this sounds impossible, he really has some good points, e.g. he encourages beginners to look at the ...

Read more »

Tracking Hurricane Sandy with Open Data and R

October 29, 2012
By
Tracking Hurricane Sandy with Open Data and R

Hurricane Sandy is shaping up to be a major, and very dangerous, meteorological event for the US's East coast. Naturally, everyone is looking for the latest information and forecasts. Fortunately, the wealth of public meteorological data available on the open web, combined with real-time on-the-ground updates via social media, means that an ecosystem of on-line apps is now available...

Read more »

Working with Shootout – 2012 in R (001)

October 29, 2012
By
Working with Shootout – 2012 in R (001)

I have downloaded (from the IDRC) the ASCI files of the Shootout 2012 (see: Shootout 2012 files), so I can work with the data  to develop a model and predict a Validation Set.For that task I have a "Calibration Set", and a  "Test Se...

Read more »

Temporal network of information diffusion in Twitter

October 29, 2012
By

Millions of tweets, retweets and mentions are exchanged in Twitter everyday about very different subjects, events, opinions, etc. While aggregating this data over a time window might help to understand some properties of those processes in online social networks, the … Continue reading →   Related posts: Temporal...

Read more »

Pull Yahoo Finance Key-Statistics Instantaneously Using XML and XPath in R

October 29, 2012
By
Pull Yahoo Finance Key-Statistics Instantaneously Using XML and XPath in R

This two-part blog post I published a day ago required key-stats from Yahoo Finance for all the companies in the control group I created for my research.  I wanted all the key-stats pulled, arranged in a data-frame and then present them side-...

Read more »

ggplot2 Pinterest

October 29, 2012
By

I don’t understand the website Pinterest, but it looks pretty (especially on the iPad), and an undergraduate student said it was the greatest thing since Facebook, so I thought I would give it a shot. The idea is that Pinterest … Continue reading →

Read more »

lag function for data frames

October 29, 2012
By
lag function for data frames

When applying the stats::lag() function to a data frame, you probably expect it will pad the missing time periods with NA, but lag() doesn’t. For example: Nothing happened. Here is an alternative lag function made for this situation. It pads … Continue reading →

Read more »

Charting Wikipedia interest in GOP candidates with googleVis

October 29, 2012
By

I recently posted an article on how to collate Wikipedia page views As there is a time component to this, it seemed appropriate to use the googleVis Package to visualize changes in page hits in the Google Motion chart For this exercise, I ran the wikiFun function covered in the last post to collate page

Read more »

Montreal R User Group meetup Nov. 14th

October 29, 2012
By
Montreal R User Group meetup Nov. 14th

After a bit of a summer lull, the Montreal R User Group is meeting up again! We’re trying out a new venue this time. Notman House is the home of the web in Montreal. They hold hackathons and other tech user group meetups, and they are all around great people in an all around great

Read more »

Terrain Attributes with the raster package

October 29, 2012
By

Terrain attributes can be derived by elevation values on a small neighbourhood of each DEM point.The major attributes can be calculated from the derivatives of the topographic surface. These derivatives measure the rate of change in elevation in relati...

Read more »

Installing Routino under OSX

October 29, 2012
By
Installing Routino under OSX

Routino is a set of libraries that enable road based route calculations to be conducted over OpenStreetMap data. I have been using them extensively over the past six months for a project looking at CO2 emissions and the commute to school. Although Routino was designed to run under Linux, it can also be compiled and installed under OSX (sorry...

Read more »

Installing Routino under OSX

October 29, 2012
By
Installing Routino under OSX

Routino is a set of libraries that enable road based route calculations to be conducted over OpenStreetMap data. I have been using them extensively over the past six months for a project looking at CO2 emissions and the commute to school. Although Routino was designed to run under Linux, it can also be compiled and installed under OSX (sorry...

Read more »

Generate stock option prices – How to simulate a Brownian motion

October 29, 2012
By
Generate stock option prices  – How to simulate a Brownian motion

The Brownian motion is certainly the most famous stochastic process (a random variable evolving in the time). It has been the first way to model a stock option price (Louis Bachelier's thesis in 1900).The reason why is easy to understand, a Brownian mo...

Read more »

Computing for Data Analysis (Simply Statistics Edition)

October 29, 2012
By

As the entire East Coast gets soaked by Hurricane Sandy, I can’t help but think that this is the perfect time to…take a course online! Well, as long as you have electricity, that is. I live in a heavily tree-lined … Continue reading →

Read more »

Computing for Data Analysis (Simply Statistics Edition)

October 29, 2012
By

As the entire East Coast gets soaked by Hurricane Sandy, I can’t help but think that this is the perfect time to…take a course online! Well, as long as you have electricity, that is. I live in a heavily tree-lined area and so it’s onl...

Read more »

Example 10.7: Fisher vs. Pearson

October 29, 2012
By
Example 10.7: Fisher vs. Pearson

In the early days of the discipline of statistics, R.A. Fisher argued with great vehemence against Egon Pearson (and Jerzy Neyman) over the foundational notions supporting statistical inference. The personal invective recorded is somewhat amusing an...

Read more »

More football

October 29, 2012
By

Given that Sampdoria have lost their fourth game in a row, I am not really interested in football any more (as of yesterday, I actually find it a very boring game and think we should focus on real sports $-$ of course if we manage to break the cra...

Read more »

‘Sandy’ Code Up On Github

October 29, 2012
By
‘Sandy’ Code Up On Github

UPDATE: As indicated in the code comments, Google took down the cone KML files. I’ll be changing the code to use the NHC archived cone files later tonight I will (most likely) not be littering the blog with any more updates to the ‘Sandy’ code unless they are really significant. You can follow along at

Read more »

Volatility from daily or monthly: garch evidence

October 29, 2012
By
Volatility from daily or monthly: garch evidence

Should you use daily or monthly returns to estimate volatility? Does garch explain why volatility estimated with daily data tends to be bigger than if it is estimated with monthly data? Previously There are a number of previous posts — with the variance compression tag — that discuss the phenomenon of volatility estimated with daily … Continue reading...

Read more »

How Scenic is the HS2 Route?

October 29, 2012
By
How Scenic is the HS2 Route?

It is fairly clear from the duration between this and my last post that various other things have been getting in the way of updates. Anyway, I shall try and post a few updates on news and things I have been working on recently in the coming weeks before getting back to regular posting! Back

Read more »

googleVis 0.3.2 is released: Better integration with knitr

October 29, 2012
By
googleVis 0.3.2 is released: Better integration with knitr

After last week's kerfuffle I hope the roll out of googleVis version 0.3.2 will be smooth. To test the water I release this version into the wild here and if it doesn't get shot down in the next days, then I shall try to upload it to CRAN. I am mindful...

Read more »

How Scenic is the HS2 Route?

October 29, 2012
By
How Scenic is the HS2 Route?

It is fairly clear from the duration between this and my last post that various other things have been getting in the way of updates. Anyway, I shall try and post a few updates on news and things I have been working on recently in the coming weeks befo...

Read more »

Visiting FHCRC, JHSPH and Meeting Xi’an

October 29, 2012
By
Visiting FHCRC, JHSPH and Meeting Xi’an

I have been traveling during the last two weeks. I visited Fred Hutchinson Cancer Research Center on Oct 16 and the Department of Biostatistics at Johns Hopkins at the invitation of Simply Statistics on Oct 23. Today Christian Robert was visiting our department at Iowa State, and I also talked to him. It is really cool...

Read more »

Build Static HTML Help Pages for R Packages

October 29, 2012
By
Build Static HTML Help Pages for R Packages

Many R users may still remember the good old days when we had static HTML documentation for R packages. That was probably before R 2.10.0 (in 2009). Then we had the fancy dynamic HTML help based on the built-in httpd server, but it has never really mad...

Read more »

How Scenic is the HS2 Route?

October 28, 2012
By
How Scenic is the HS2 Route?

It is fairly clear from the duration between this and my last post that various other things have been getting in the way of updates. Anyway, I shall try and post a few updates on news and things I have been working on recently in the coming weeks befo...

Read more »

the large half now

October 28, 2012
By
the large half now

The little half puzzle proposed a “dumb’ solution in that players play a minimax strategy. There are 34 starting values less than 100 guaranteeing a sure win to dumb players. If instead the players maximise their choice at each step, the R code looks like this: and there are now 66 (=100-34, indeed!) starting values

Read more »

Sponsors