Blog Archives

userR2013 data analysis contest: data exploration

June 12, 2013
By
userR2013 data analysis contest: data exploration

Description The useR2013 conference is organizing a data analysis contest, check the rules here. They have a package called useR2013DAC with two data sets: one from La Liga and the other one from the Formula 1. Once you download and install the package (available here), you can quickly explore the data using the following R commands: Data exploration ## Load the...

Read more »

Reading an R file from GitHub

Reading an R file from GitHub

Lets say that I want to read in this R file from GitHub into R. The first thing you have to do is locate the raw file. You can do so by clicking on the Raw button in GitHub. In this case it’s https://raw.github.com/lcolladotor/ballgownR-devel/master/ballgownR/R/infoGene.R One would think that using source() would work, but it doesn’t as shown below: source("https://raw.github.com/lcolladotor/ballgownR-devel/master/ballgownR/R/infoGene.R") ##...

Read more »

Using plyr and doMC for quick and easy apply-family functions

April 26, 2013
By
Using plyr and doMC for quick and easy apply-family functions

A few weeks back I dedicated a short amount of time to actually read what plyr (Wickham, 2011) is about and I was surprised. The whole idea behind plyr is very simple: expand the apply() family to do things easy. plyr has...

Read more »

Predicting who will win a NFL match at half time

March 23, 2013
By
Predicting who will win a NFL match at half time

It was great to have a little break, Spring break, although the weather didn’t feel like spring at all! During the early part of the break I worked on my final project for Jeff Leek’s data analysis class, which we call 140.753 here. Continuing my previous posts on the topic, this time I’ll share the results of my...

Read more »

And so begins English Composition I

March 21, 2013
By
And so begins English Composition I

This week started the English Composition I: Achieving Expertise course (Comer, 2013) that I have been looking forward to. I am not sure yet how long I will last, but I hope to enjoy it as much as I can. Plus, it should help me with my...

Read more »

FBit: GitHub repo for posts with R code for this blog

March 11, 2013
By
FBit: GitHub repo for posts with R code for this blog

This is a test post since I want to improve upon Jeffrey Horner’s strategy for posting R code in Tumblr. The only minor improvement I wanted to try out is hosting the images directly on the web. I mean, right now the images won’t show in RSS readers. I’m not doing anything new at all, just using the...

Read more »

Analyzing SimplyStatistics visits info

March 9, 2013
By
Analyzing SimplyStatistics visits info

Recently we had to analyze the data of the number of visits per day to SimplyStatistics.org. There were two goals: Estimate the fraction of visitors retained after a spike in the number of visitors Identify (if any) any factors that influence the fraction estimated in 1. For me it was a fun project in part because I like SimplyStatistics but also...

Read more »

Sharing my work for “Advanced Methods III”

February 13, 2013
By
Sharing my work for “Advanced Methods III”

This semester I’m taking the live version of the Data Analysis class by Jeff Leek. His more popular version of the course is available through Coursera.  One of the things that Jeff promotes is reproducibility and sharing code. I share that tendency and thus created a Git repository for my homework and code for the class: lcollado753. I’m...

Read more »

Introduction to R and Biostatistics (2012 version): presentation

November 12, 2012
By
Introduction to R and Biostatistics (2012 version): presentation

To follow my Introducing R and Biostatistics to first year LCG students (2012 version) post,  you can now find the presentation online from my site either in presentation format, in a single webpage format, or the raw Rmd file. To prove the point that publishing to RPubs is super easy, you can also find the single...

Read more »

Introducing R and Biostatistics to first year LCG students (2012 version)

October 30, 2012
By
Introducing R and Biostatistics to first year LCG students (2012 version)

On Friday November 9th I’ll be giving a talk to the first year students from the Undergraduate Program on Genomic Sciences (LCG in Spanish) during their “Seminar 1: Introduction to Bioinformatics” course. It’s just like I did a year ago as I documented in my post Introducing Biostatistics to first year LCG students. Well, this time I’ll change things...

Read more »