March 2012

Gastwirth’s location estimator

March 3, 2012 | Ron Pearson (aka TheNoodleDoodler)

The problem of outliers – data points that are substantially inconsistent with the majority of the other points in a dataset – arises frequently in the analysis of numerical data.  The practical importance of outliers lies in the fact that even a few of these points can badly distort the results of ... [Read more...]

R versus Stata Redux

March 3, 2012 | Ethan Fosse

I've used both R and Stata for a long time, but these days I use Stata much more frequently than R. While R is useful for some kinds of graphics (especially three-dimensional graphics) and some statistical procedures (for example, finite mixture models... [Read more...]

New data visualization features in ggplot2 update

March 2, 2012 | David Smith

Hadley Wickham has just released an update to the ggplot2 graphics package for R. Version 0.9.0 significantly speeds up the process of rendering graphics, and the documentation is much improved (including the addition of many new examples). This update also adds a bunch of new features, which are documented in this 40... [Read more...]

What is R-index

March 2, 2012 | Wingfeet

The R-index was developed for interpreting signal detection data in human perception studies. In sensory research it is used to interpret ranking data. The value obtained from an R-index calculation is interpreted as a confusion between samples tested. It has bee... [Read more...]

A terrible 2000 words

March 2, 2012 | Josef Fruehwald

I've only just started looking at the homicide data made available by the Philadelphia Inquirer in my free time (which is hard to come by lately). I've been thinking about what sorts of statistics I could do, or what kinds of additional data sets I cou... [Read more...]

ABC in Roma [R lab #2]

March 2, 2012 | xi'an

Here is the R code for the second R lab organised by Serena Arima as a supplement to my lectures (now completed!). This morning I covered ABC model choice and the following example is the benchmark used in the course (and in the paper) about the impact of summary statistics. (Warning! ... [Read more...]

Modeling Trick: the Signed Pseudo Logarithm

March 1, 2012 | John Mount

Much of the data that the analyst uses exhibits extraordinary range. For example: incomes, company sizes, popularity of books and any “winner takes all” process (see: Living in A Lognormal World). Tukey recommended the logarithm as an important “stabilizing transform” (a transform that brings data into a more usable form ... [Read more...]

Download and Parse NAREIT Data

March 1, 2012 | Peter Carl

This is the first post of a series that describes how to download and parse specific data sets into R. These kinds of scripts can be functionalized further, but I doubt that these will ever find their way into a formal package. They are intended to be helpful to those ... [Read more...]

NIT: Fatty acids study in R – Part 001

March 1, 2012 | jrcuesta

This time I'm going to use my own data to develop a model to predict some fatty acid in the solid fat (pork). The samples were analyzed in a NIT (Near Infrared Transmittance) instrument. The range of the wavelengths is from 850 to 1048 nm (100 data poi... [Read more...]

Poisson approximation of binomial probabilities

March 1, 2012 | enayet

This is yet another experiment to see how good the approximation of binomial probabilities is when we use Poisson and normal distributions for scenarios with large $n$ and $p$ close to zero or one. Consider a problem where the random variable $X$ follows a binomial distribution with a known probability ... [Read more...]
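
As a rough illustration of the idea in this teaser (not the code from the linked post), the sketch below compares binomial probabilities with their Poisson approximation using only the Python standard library. The function names, the choice $\lambda = np$, and the three $(n, p)$ pairs are assumptions made for this example. The normal approximation is left out, and $p$ close to one can be handled by applying the same comparison to $n - X$.

```python
import math

def binom_pmf(k, n, p):
    """Binomial P(X = k) for 0 < p < 1, computed in log space for stability."""
    log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
               + k * math.log(p) + (n - k) * math.log(1 - p))
    return math.exp(log_pmf)

def poisson_pmf(k, lam):
    """Poisson P(X = k) with mean lam > 0, computed in log space."""
    return math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))

def max_abs_error(n, p):
    """Largest pointwise gap between Binomial(n, p) and Poisson(n * p)."""
    lam = n * p  # assumption: match the Poisson mean to the binomial mean
    return max(abs(binom_pmf(k, n, p) - poisson_pmf(k, lam))
               for k in range(n + 1))

# Hypothetical scenarios: the mean n * p stays at 5 while p shrinks.
for n, p in [(20, 0.25), (100, 0.05), (1000, 0.005)]:
    print(f"n={n:5d}  p={p:<6}  max |binomial - Poisson| = {max_abs_error(n, p):.6f}")
```

With the mean held fixed, the printed gap shrinks as $n$ grows and $p$ falls, which is the regime where the Poisson approximation is usually recommended.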

Parallelizing Voting simulation

March 1, 2012 | Bogumił Kamiński

Last week I compared synchronous and asynchronous implementations of the NetLogo Voting model. An interesting afterthought is that the synchronous model implementation can easily be made much faster using vectorization. The two versions of the Voting synchr... [Read more...]

I see high frequency data

March 1, 2012 | Dzidorius Martinaitis

In the previous post I shared an example of how to get high frequency data from the IB broker (well, it is a retail version of HFD – it has only best bid/ask and the trades). Now, once you have saved some data – what should you do next? Next logical step would be data ... [Read more...]

Bad Science at Strata 2012

March 1, 2012 | David Smith

Ben Goldacre, the physician and biostatistician behind the always-excellent Bad Science column in the Guardian, gave a barnburner of a talk at Strata 2012 yesterday, "The Information Architecture of Medicine is Broken". For anyone not aware of the problems caused by publication bias in clinical trials (for example, ineffective drugs with ... [Read more...]

Example 9.22: shading plots and inequalities

March 1, 2012 | Nick Horton

A colleague teaching college algebra wrote in the R-sig-teaching list asking for assistance in plotting the solutions to the inequality x^2 - 3 __ 0. This type of display is handy in providing a graphical solution to accompany an analytic one. RThe plot... [Read more...]
