Monthly Archives: August 2011

Outlier Detection with DPM Slides from JSM 2011

August 5, 2011
By
Outlier Detection with DPM Slides from JSM 2011

Here are the 14 slides I used during my talk at the Joint Statistical Meetings 2011: shotwell-jsm-2011.pdf. I'm trying hard to minimize the text in my presentation slides. But, this usually requires that I practice more. Hence, you will know which talks I have practiced thoroughly by the amount of text in the slides .

Read more »

Friday Links: R, OpenHelix Bioinformatics Tips, 23andMe, Perl, Python, Next-Gen Sequencing

August 5, 2011
By
Friday Links: R, OpenHelix Bioinformatics Tips, 23andMe, Perl, Python, Next-Gen Sequencing

I haven't posted much here recently, but here is a roundup of a few of the links I've shared on Twitter (@genetics_blog) over the last two weeks.Here is a nice tutorial on accessing high-throughput public data (from NCBI) using R and Bioconductor.Cloud...

Read more »

New Rcpp master classes scheduled for New York and San Francisco

August 4, 2011
By

Together with Revolution Analytics, I will be offering two more one-day classes on the Rcpp package for seamless integration of R and C++. The format will follow the workshop Romain and I gave during the tutorial day preceding this year's R/Financ...

Read more »

Aug 4, 2011 "plunge" headlines are in the air tonight

August 4, 2011
By
Aug 4, 2011 "plunge" headlines are in the air tonight

Today's financial headlines are littered with the word 'plunge.'  Considering today's (cl-cl) drop on the S&P500 was just about -5%, I don't know that I would exactly call that a plunge.         &nb...

Read more »

CHCN: Canadian Historical Climate Network

August 4, 2011
By
CHCN: Canadian Historical Climate Network

A reader asked a question about data from   environment canada.  He wanted to know if that data could somehow be integrated into the RGhcnV3 package.  That turned out to be a bit more challenging that I expected.  In short order I’d found a couple other people who had done something similar.  DrJ of course was

Read more »

Statisticians at JSM consider themselves "Data Scientists"

August 4, 2011
By
Statisticians at JSM consider themselves "Data Scientists"

At the JSM 2011 conference in Miami earlier this week, we conducted an informal poll of attendees on their attitudes to respect to Big Data, statistical software, and data science. JSM is the largest gathering of statisticians in North America, and attendees were invited to complete a survey after logging into the Wi-Fi network. Of the 190 respondents to...

Read more »

Lattice-xyplot without Border/Box, with Axes at Bottom & Left Side Only, with Custom Ablines/Grid & Axis-Labelling

August 4, 2011
By
Lattice-xyplot without Border/Box, with Axes at Bottom & Left Side Only, with Custom Ablines/Grid & Axis-Labelling

Here's how you do a lattice-xyplot without border/box, with axes at bottom & left side only, with custom ablines/grid & axis-labelling Read more »

Read more »

Does Jon Skeet have mental powers that make us upvote his answers? (The effect of reputation on upvotes)

August 4, 2011
By
Does Jon Skeet have mental powers that make us upvote his answers? (The effect of reputation on upvotes)

Of course since we all know Jon Skeet does have various powers, I will move onto unanswered questions, whether a users reputation makes them receive more upvotes for answers. I’ve seen this theory mentioned in multiple places (see any of the comments to Jon Skeet’s answer that are along the lines of “If this was

Read more »

Q-Q Plots for Multi-modal Performance Data

August 3, 2011
By
Q-Q Plots for Multi-modal Performance Data

I'm in the process of putting together some slides on how to apply Quantile-Quantile plots to performance data. Q-Q plots are a handy tool for visually inspecting how well your data matches a known probability distribution (prob dsn). If the match is g...

Read more »

Hotness

August 3, 2011
By
Hotness

We have an internal image that floated around work several years ago that details network utilization of TCP over a wide variety of configurations. It is a heatmap created in matlab that is just sweet, sweet eye candy. We actually hung it on the outside of a cube for a short while and people couldn't help but stop and...

Read more »