## sab-R-metrics: Basics of LOESS Regression

May 11, 2011
Last week, I left you off at logistic regression. This week, I'll be pushing the limits of regression analysis a bit more with a smoothing technique called LOESS regression. There are a number of smoothing methods that can be used, such as Smoothing ...

## One-way ANOVAs in R – including post-hocs/t-tests and graphs

May 11, 2011
In this post, I go over the basics of running an ANOVA using R. The dataset I’ll be examining comes from this website, and I’ve discussed it previously (starting here and then here). I’ve not seen many examples where someone runs through the …

## Multivariate probit regression using (direct) maximum likelihood estimators

May 11, 2011
Consider a random pair of binary responses, i.e. with taking values 1 or 2. Assume that probability can be a function of some covariates. The Gaussian vector latent structure: A standard model is based on a latent Gaussian structure, i.e. there exi...

## EC2 Trials and Tribulations, Part 1 (Web Crawling)

May 11, 2011
Elastic Compute Cloud (EC2) is a service provided by Amazon Web Services that allows users to leverage computing power without the need to build and maintain servers, or spend money on special hardware. The idea is simple: the user “boots” up one or more machines and then accesses those machines as if they were logged into any other machine...

## An essential vocabulary for the R language

May 11, 2011
The Oxford English Dictionary includes more than 600,000 words, yet most of us get by in our day-to-day lives with a vocabulary of just a few thousand. In a similar vein, the R language includes thousands of functions: when you start up R 2.13, you have 2832 functions at your disposal: > length(apropos(".", mode="function")) 2382 This includes only...

## Comparison of functions for comparative phylogenetics

May 11, 2011
With all the packages (and beta stage groups of functions) for comparative phylogenetics in R (tested here: picante, geiger, ape, motmot, Liam Revell's functions), I was simply interested in which functions to use in cases where multiple functions exis...

## Defining Custom Model Priors in BMS

Bayesian Model Averaging (BMA) allows for any kind of model prior distribution. While the R package BMS has built-in support for several types of commonly used priors, there may be a need to construct a custom model prior in a particular exerci...

## A clock utility, via console hackery

May 11, 2011
A discussion on StackOverflow today shows an interesting use of special characters inside the cat function. The most common special characters that you may have come across are the tab and newline characters, represented by \t and \n respectively. Try them for yourself. cat("Red\tlorry\nYellow\tlorry\n") cat also respects the backspace character, \b, and the carriage return

## High Low Clustering on intraday high frequency sampled data

May 10, 2011
Nothing unusually exciting in this post, but I happened to be engaged in some particle-based methods recently and made some simple visual observations as I was setting up some of the sampling environment in R. I am also using Rkward and Ubuntu to...

## Publishing in Veterinary Academic Journals

May 10, 2011
Following the post by Arthur Charpentier (Freakonometrics), I wondered what would be the outcome considering my current engagement (veterinary medicine, epidemiology, bovine mastitis). Briefly, Arthur Charpentier’s post looked at clusters of journals publishing the same kind of papers. So I looked at 25 journals (Journal of Dairy Science, Canadian Journal of Veterinary Medicine, Preventive Veterinary

## ABC model choice by DIC

May 10, 2011
Yet another paper on ABC model choice was posted on arXiv a few days ago, just prior to the ABC in London meeting that ended in the pub above (most conveniently located next to my B&B!). It is written by Olivier Francois and Guillaume Laval and the approach relies on DIC for running model selection.

## Late to the party for R in Finance blogging

May 10, 2011
I meant to blog about the R/Finance conference during a lull, but I didn’t find too many. Unlike many conferences I’ve been to, the structure of R/Finance was simple: one room and one speaker at a time. Relying on each …

## Data Science Toolset discussion at Data Scientist Summit

May 10, 2011
Heads-up to anyone attending the sold-out Data Scientist Summit in Las Vegas this week: I'll be there tomorrow and Thursday for the conference and to discuss R on the panel discussion "Data Science Toolset - Recipes That Win" (more details about the panel discussion below). I'm looking forward to meeting with the other R users there -- tweet or...

## Day #38-39 Data-manipulation Part 1

May 10, 2011
Last week I created some plots, always for 1 feature. Today I started working on the full script that creates all these plots, 1 per feature. This means using for loops in R. Let’s see how this is going to work out. Today I mostly worked on data...

## Problems with plyr — the memory/complexity trade-off

May 10, 2011
Two types of R users. My overwhelming impression from UseR 2010 is that, generally speaking, there are 2 types of regular R users -- those who have heard and are made uncomfortable by the idea of the *apply() functions, and those who really get it. In ...

## Extending mtable() to ivreg, gls and robust standard errors

May 9, 2011
I have written several extensions of the mtable() command in the memisc library that may come in handy. The methods are available in a package I have written called tonymisc (now available on CRAN). A zipped folder with my package files is available...

## First-Cut Approach to Synchronizing Field Notes with GPS Data

May 9, 2011
After a week's worth of work in the field, I typically have several pages of hand-written field notes that are associated with GPS waypoints-- badly in need of some kind of transcription/organization. I have yet to find a simple approach for bringing t...

## Importing Weather Data from Wunderground

May 9, 2011
Wunderground Example. The Wunderground.com website offers several creative interfaces to current and historic weather information. One of the more interesting features is the URL-based interface to personal weather stations. As far as I can tell, the Wu...

## Registration open for Rmetrics Workshop on Computational Finance

May 9, 2011
The Rmetrics Association is once again holding its annual Workshop and Summer School on Computational Finance and Financial Engineering at Meielisalp (on Lake Thun in Switzerland) from June 26-30. Now in its fifth year, the workshop consists of Summer School-like tutorial sessions and a user/developer meeting; both focus on topics from "Computational Finance and Financial Engineering" and on the...

## Accessing Databases From R

May 9, 2011
Jeffrey Breen put together a useful slideshow on accessing databases from R. I use RODBC every single day to access my own local MySQL server from R. I've had trouble with RMySQL, so I've always used RODBC instead after setting up my localhost MySQL se...

## making meat shares more efficient with R and Symphony

May 9, 2011
In my previous post, I motivated a web application that would allow small-scale sustainable meat producers to sell directly to consumers using a meat share approach, using constrained optimization techniques to maximize utility for everyone involved. In this post, I’ll walk through some R code that I wrote to demonstrate the technique on a small

## Comments on an R Connections API

May 9, 2011
I wrote this post months ago but never hit 'Publish'. But, the subject has changed little since then. So, here's to cleaning out the draft folder... R's connections are the heart of data/code/text input and output. Without connections, R would be crippled. Additional connections make R more ... connected with potential data sources and output

## Multiple Y-axis in a R plot

May 9, 2011
I often have to plot multiple time-series with different scales of values for comparative purposes, and although placing them in different rows is useful, placing them on the same graph is still useful sometimes... I searched a bit about this, and found som...

## One minor detail for getting 64-bit R-2.13 running with Eclipse/StatET

May 9, 2011
Upgrading from R-2.12 to R-2.13 was fairly painless, except for one minor hiccup in trying to get the 64-bit version running on my installation of Eclipse + StatET under Windows 7. The setup instructions are almost entirely the same as I have outlined ...