New functions for linear model inference in Revolution R Enterprise 4.3

April 26, 2011
By

The latest release of Revolution R Enterprise shows how Revolution Analytics’ package for big data, RevoScaleR, is continuing add new capabilities for Big Data statistics. RevoScaleR removes the limits on the size of the data that can be processed in R through the use of the highly efficient .Xdf binary file format. Xdf stores data by rows within columns...

Read more »

Great FAJ Article on Statistical Measure of Financial Turbulence Part 2

April 26, 2011
By
Great FAJ Article on Statistical Measure of Financial Turbulence Part 2

I did not intend for this to be a multi-part series, but after some clear thinking at the beach over the weekend, I decided that it needed some more analysis.  For those of you that read the article or know Mahalanobis distance, the measure I pre...

Read more »

Automatically Save Your Plots to a Folder

April 26, 2011
By
Automatically Save Your Plots to a Folder

Suppose you're working on a problem that involves a loop for calculations. At each iteration inside the loop, you want to construct a plot. Not only do you want to see the plot, but you would like to save each plot for a presentation, report or paper...

Read more »

Automatically Save Your Plots to a Folder

April 26, 2011
By
Automatically Save Your Plots to a Folder

Suppose you're working on a problem that involves a loop for calculations. At each iteration inside the loop, you want to construct a plot. Not only do you want to see the plot, but you would like to save each plot for a presentation, report or paper...

Read more »

Bayesian job in Cambridge

April 26, 2011
By
Bayesian job in Cambridge

Here is an email that could appeal to some readers: Job in Cambridge MRC-BSU – Bayesian statistician Career development fellow MRC Biostatistics Unit, Cambridge We are offering an exciting opportunity to work on Bayesian models for infectious disease dynamics. A statistician is required to contribute to a programme of research to develop inferential approaches to

Read more »

R Bloggers

April 26, 2011
By
R Bloggers

I recently found a great resource for R in the blogosphere, the R Bloggers Blog Aggregator. Basically, the site aggregates posts from a bunch of blogs about R (like this one!) into a giant feed of uses for R. If you are interested in learning more ab...

Read more »

R Bloggers

April 26, 2011
By
R Bloggers

I recently found a great resource for R in the blogosphere, the R Bloggers Blog Aggregator. Basically, the site aggregates posts from a bunch of blogs about R (like this one!) into a giant feed of uses for R. If you are interested in learning more ab...

Read more »

Running Phylip’s contrast application for trait pairs from R

April 26, 2011
By
Running Phylip’s contrast application for trait pairs from R

Here is some code to run Phylip's contrast application from R and get the output within R to easily manipulate yourself. Importantly, the code is written specifically for trait pairs only as the regular expression work in the code specifically grabs da...

Read more »

Statistical Practice in Epidemiology using R

April 26, 2011
By
Statistical Practice in Epidemiology using R

This is a long running course which usually takes place in Tartu, Estonia. This year we are hosting it at IARC in Lyon, France. The course is intended for epidemiologists and statisticians who wish to use R  for statistical modelling … Continue reading →

Read more »

Adonis (PERMANOVA) – Assumptions

April 26, 2011
By
Adonis (PERMANOVA) – Assumptions

Before you use PERMANOVA (R-vegan function adonis) you should read the user notes for the original program by the author (Marti J. Anderson) who first came up with this method. An important assumtption for PERMANOVA is same "multivariate spread&qu...

Read more »

Designing and Analyzing Studies with Optmatch and RItools (Part 1)

April 25, 2011
By

I am currently writing a brief “how-to” for the APSA Section on Experimental Research newsletter on using Optmatch and RItools. The complete paper (a work in progress) can be found on my github page. I have the basics of the paper sketched in, but I would love to get feedback from the online R community,...

Read more »

A Tiny Model of Evolution

April 25, 2011
By
A Tiny Model of Evolution

I've always wanted to write a(n overly) simple model of evolution. The assumptions are minimalistic: only one species, for which each individual's genotype is represented as a one-dimensional real number, e.g. 7.4. Now, the fun stuff: I define a fu...

Read more »

4 lines of R to get you started using the Rook web server interface

April 25, 2011
By
4 lines of R to get you started using the Rook web server interface

Jeffrey Horner's new Rook package provides a new interface for developing R-based web applications. Rook allows the same application to run in R's built-in web server or (soon) in the rApache module. This post shows how easy it is to use the package's Rhttpd class to get started with Rook.

Read more »

Job Search Part 4: Timing Beveridge Curve Movements During A Recession

April 25, 2011
By
Job Search Part 4: Timing Beveridge Curve Movements During A Recession

This economics blogger feels like he would be cheating the reader if he did not include recent work done by Barnichon and Figura (2010) on timing movements in the unemployment rate during recessions. That is why this is part 4 of my special 5 part mini...

Read more »

Milktrader: Quantitative finance in R

April 25, 2011
By

The blog Milktrader has been on a roll recently with a series of posts with practical examples of quantitative in finance, from backtesting to automated trading, and option pricing to data acquisition. The latest post focuses on calculating returns, with an example of downloading data for a silver ETF and calculating daily returns with the dailyReturn function in the...

Read more »

Kickstarter Data Analysis: Success and Pricing

April 25, 2011
By
Kickstarter Data Analysis: Success and Pricing

Kickstarter is an online crowdfunding platform for launching creative projects. When starting a new project, project owners specify a deadline and the minimum amount of money they need to raise. They receive the money (less a transaction fee) only if … Continue reading →

Read more »

Calculating DV01 using futile.paradigm

April 25, 2011
By
Calculating DV01 using futile.paradigm

Here’s a short example of using the futile.paradigm for calculating the DV01 of a bond. The basic idea is to …Continue reading »

Read more »

Annotated Manhattan plots and QQ plots for GWAS using R, Revisited

April 25, 2011
By
Annotated Manhattan plots and QQ plots for GWAS using R, Revisited

Last year I showed you how to create manhattan plots, and later how to highlight regions of interest, using ggplot2 in R. The code was slow, required a lot of memory, and was difficult to maintain and modify.I finally found time to rewrite the code usi...

Read more »

Annotated Manhattan plots and QQ plots for GWAS using R, Revisited

April 25, 2011
By

Last year I showed you how to create manhattan plots, and later how to highlight regions of interest, using ggplot2 in R. The code was slow, required a lot of memory, and was difficult to maintain and modify.I finally found time to rewrite the code u...

Read more »

Optimizing My R Code

April 25, 2011
By

Thanks to gappy3000 I optimized my code in few several ways. The original script lasted ~30 minutes thanks to using pure vectors instead of zoo objects. First of all I changed all lm functions for lm.fit functions. Although lm.fit function cannot handl...

Read more »

Let’s try this again: Introducing Rook for #rstats!

April 25, 2011
By

Well I totally botched my launch last week. Thankfully, my wife straightened me out: I decided to change the name of Rack to Rook so there won’t be any future confusion. And instead of rewriting all my posts last week, just read them again and every...

Read more »

R workshop in Hamilton, Ontario, May 24 and 25

April 25, 2011
By

John Fox will be teaching a two-day introductory R workshop at McMaster University in Hamilton, Ontario, on May 24 and 25. The workshop will largely be based on materials from Fox and Weisberg, An R Companion to Applied Regression, Second Edition (Sage, 2011). Further information about the workshop is available at: https://www.socialsciences.mcmaster.ca/registrations/john-fox-introduction-to-r/fg_base_view_p3. A few spaces in the workshop are reserved for non-McMaster attendees and made...

Read more »

intuitive visualizations of categorization for non-technical audiences

April 25, 2011
By
intuitive visualizations of categorization for non-technical audiences

For a project I’m working on at work, I’m building a predictive model that categorizes something (I can’t tell you what) into two bins. There is a default bin that 95% of the things belong to and a bin that the business cares a lot about, containing 5% of the things. Some readers may be

Read more »

Merging Data Video Tutorial

April 25, 2011
By
Merging Data Video Tutorial

Here's a video tutorial where I walk through some code that does what the previous post describes.The FRED data is used extensively for macroeconomics. You might these data useful for joining in graph fights in the blogosphere.

Read more »

Merging Data Video Tutorial

April 25, 2011
By
Merging Data Video Tutorial

Here's a video tutorial where I walk through some code that does what the previous post describes.The FRED data is used extensively for macroeconomics. You might these data useful for joining in graph fights in the blogosphere.

Read more »

Further Adventures in Visualisation with ggplot2

April 25, 2011
By
Further Adventures in Visualisation with ggplot2

So I previously took a look at some data of player performance from a computer game. In this post, I’m going to do some further visualisations using ggplot2. The data consists of different types of player character, different roles for those characters, and their overall damage output (the unit here is damage per second, or

Read more »

RInside 0.2.4

April 25, 2011
By

After several months, it was time for a new release 0.2.4 of RInside which is now on CRAN. RInside is a set of convenience classes which facilitate embedding of R inside of C++ applications and programs, using the classes and functions provided by th...

Read more »

Merging Multiple Data Files into One Data Frame

April 24, 2011
By
Merging Multiple Data Files into One Data Frame

We often encounter situations where we have data in multiple files, at different frequencies and on different subsets of observations, but we would like to match them to one another as completely and systematically as possible. In R, the merge() comma...

Read more »

Merging Multiple Data Files into One Data Frame

April 24, 2011
By
Merging Multiple Data Files into One Data Frame

We often encounter situations where we have data in multiple files, at different frequencies and on different subsets of observations, but we would like to match them to one another as completely and systematically as possible. In R, the merge() comma...

Read more »