Sunday Data/Statistics Link Roundup (11/18/12)

November 18, 2012

An interview with Brad Efron about scientific writing. I haven’t watched the whole interview, but I do know that Efron is one of my favorite writers among statisticians. Slidify, another approach for making HTML5 slides directly from R. I love … Continue reading →

The new definitive guide for setting up Eclipse, StatET, and R on Windows 7

November 17, 2012

Quite a while back I wrote some tutorials on getting the StatET plugin for Eclipse running, so that you can write R code and run it within the Eclipse development environment. The developers of all of these pieces of software have kept marching on with...

Datacentric product development and the rebirth of engineering

November 17, 2012

An old irony in New York is the ubiquity of the ‘gourmet deli’. It is hard to find a deli … Continue reading »

More sense of random effects

November 17, 2012

I can’t exactly remember how I arrived at Making sense of random effects, a good post in the Distributed Ecology blog (go over there and read it). Incidentally, my working theory is that I follow Scott Chamberlain (@recology_), who follows … Continue reading →

Get the exit polls from CNN using R and Python

November 17, 2012

Yesterday I posted an example of plotting 2012 U.S. presidential exit poll results using ggplot2. There I took for granted that a data.frame containing all we need resides in a file called "PresExitPolls2012.Rdata". Today I want to show how I scraped t...

Visualizing Missing Data

November 17, 2012

There are several graphics available for visualizing missing data, including those in the VIM package. However, I wanted a plot specifically for looking at the nature of missingness across variables and a clustering variable of interest to support data preparati...

Using R — Packaging a C library in 15 minutes

November 16, 2012

This entry is part 14 of 12 in the series Using R. Yes, this post condenses 50+ hours of learning into a 15-minute tutorial. Read ’em and weep. (That is, you read while I weep.) OK. For the last week … read more ...

RcppArmadillo 0.3.4.4

November 16, 2012

A minor bug-fix release 3.4.4 of Armadillo came out upstream a few days ago. RcppArmadillo, our wrapper for R and Armadillo, is now on CRAN with its corresponding version 0.3.4.4. No R level or interface changes were made and the upstream changes are ...

The Race to the F1 2012 Drivers’ Championship – Initial Sketches

November 16, 2012

In part inspired by the chart described in The electoral map sans the map, I thought I’d start mulling over a quick sketch showing the race to the 2012 Formula One Drivers’ Championship. The chart needs to show tension somehow, so in this first really quick and simple rough sketch, you really do have to

Parallelized Back Testing

November 16, 2012

As mentioned earlier, currently I am playing with trading strategies based on Support Vector Machines. At a high level, the approach is quite similar to what I have implemented for my ARMA+GARCH strategy. Briefly, the simulation goes as follows: we step through the series one period (day, week, etc.) at a time. For each period,

Making sense of random effects

November 16, 2012

The other night in my office I got into a discussion with my office mate, the brilliant scientist / amazing skier Dr. Thor Veen, about how to understand the random effect variance term in a mixed-effects model. Thor teaches the R statistics course here at UBC, and last night a student came to the office...

VIDEO: Looking to the regression coefficients in R

November 16, 2012

(This article was first published on NIR-Quimiometria, and kindly contributed to R-bloggers)

Which programming language is the most concise?

November 16, 2012

An expressive programming language allows developers to implement algorithms quickly, by using high-level concepts and leaving the details to the language implementation. The result is clearer, more maintainable code that can be created in less time. (Although shorter code isn't always better, especially when taken to extremes.) So which programming languages use the least code, when compared on an...

Simulating Sudden Oak Death Dynamics

November 16, 2012

I am working on a project with the Rizzo Lab examining the dynamics of Sudden Oak Death (SOD). I really have to write more about this, but today I’m just going to post the results of an initial exercise. Here I attempt to replicate model results from Cobb et al. (2012). The model in that paper simulates...

Revolution Newsletter: November 2012

November 16, 2012

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full November edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Now Available: Revolution R Enterprise 6.1 The latest release of Revolution Analytics' enterprise-ready data...

Excel + Cytoscape + R = ExCytR

November 16, 2012

My new project is coming along nicely and should be released early 2013. It builds on the structures developed in imDEV to link Excel, Cytoscape and R using RExcel, RCytoscape, and CytoscapeRPC. This trio can be used to rapidly generate beautiful and informative network representations of data. Here is an example of an undirected Gaussian graphical

Logo Contest Winner

November 16, 2012

Congratulations to Bradley Saul, the winner of the Simply Statistics Logo contest! We had some great entries which made it difficult to choose between them. You can see the new logo to the right of our home page or the … Continue reading →

Hjust and Vjust

November 16, 2012

So, when you’re setting the position of text in ggplot, you may have to use the hjust and vjust arguments. Depending on your demands, and if you don’t understand what they’re doing, they might seem hard to use. I found one script that...

Exit PEBOS – Enter exit polls

November 16, 2012

PEBOS is over. Time to look at the details of the Election. The final results are not yet in, but the exit polls are there, and up for grabs. Just to get warm: here's a tiny example. Obviously Romney had an age problem. But for now I don't want to specu...

Network visualization and meaning shifting due to algorithm settings

November 15, 2012

Data visualizations are useful for exploratory work and as an aid in communicating findings. Data visualizations also seem to be in demand these days as a kind of eye candy for capturing attention. But when we look at one engaging enough to hold our attention, we want to know what it means. In other words,

Want to win "Guess who?" – Have an institutional neural network approach

November 15, 2012

Have you ever played the board game "Guess who?" For those who have not experienced childhood (because it might be the only reason to ignore this board game), this is a game that consists of trying to guess who the opponent player is thinking o...

GEE QIC update

November 15, 2012

Here is improved code for calculating QIC from geeglm in geepack in R (original post). Let me know how it works. I haven’t tested it much, but it seems that QIC may select overparameterized models. In the code below, I … Continue reading →

What does R do? Bring people together, of course!

November 15, 2012

Last night we had a great meet up of the Montreal R User Group. I got things started with a little presentation asking the question “What does R do?” (slides). I made the presentation using Montreal R User Group member Ramnath Vaidyanathan’s Slidify package. Slidify allows you to generate rather handsome HTML5 slides directly using

Bottom-up creation of data-driven capabilities: automate your work

November 15, 2012

My previous post on how to transform an organization into a more data-driven version of itself made a pretty big assumption that often doesn’t hold true. I assumed that people in the organization wanted their company or agency to become more data-driven. I think almost everyone says they want that if asked. I even think

Create elegant, interactive presentations from R with Slidify

November 15, 2012

If you often find yourself cutting-and-pasting charts or tables generated in R into PowerPoint or Keynote, you might want to take a look at Slidify. Created by R user Ramnath Vaidyanathan, Slidify is an R package that allows you to use R Markdown to define the slide content and automatically embed R output. This is especially useful if the...

Innovation in Statistical Computing

November 15, 2012

In A Capitalist’s Dilemma, Whoever Wins on Tuesday, Clayton Christensen lays out three kinds of innovations through which an industry cycles: Empowering Innovations - those that offer products and services to a new customer base. The classic empowering (or disruptive) innovation is Ford Motor Company’s introduction of the low-cost Model T coupled with the ability of Ford’s own...

Reproducible Research: With Us or Against Us?

November 15, 2012

Last night this article by Chris Drummond of the Canadian National Research Council (Conseil national de recherches Canada) popped up in my Google Scholar alert. The title of the article, “Reproducible Research: a Dissenting Opinion” would seem to indicate that he disagrees … Continue reading →

Textual Healing Part II

November 15, 2012

Yesterday’s post showed a number of quick coding options for changing text in a basic ggplot. Here are another two tricks for fine-tuning faceted plots. Again, let’s load our made-up data about tooth growth (a real dataset in R, ToothGr...
