Webinar on Revolution R Enterprise

December 7, 2010
By

R evangelist David Smith, marketing VP at Revolution R, will be giving a webinar showing off some of the finer features of Revolution R Enterprise - an integrated development environment (IDE) for R that has an enhanced script editor with syntax highli...

Statistique de l’assurance STT6705V, partie 12 bis

December 7, 2010
By

In the previous post (here) discussing forecasts of actuarial quantities, I did not mention much how to forecast the temporal component in the Lee-Carter model. Actually, many things can be done. Consider here some exponential smoothing techniques ...

Le Monde puzzle [49]

December 7, 2010
By

Here is a quick-and-dirty solution to Le Monde puzzle posted a few days ago: the R code counts the number of winning tickets between 1 and N, and stops when there is a proportion of 10% of winning tickets. #winning ticket win=function(n){ #decimal digits decomposition x=rep(0,4) x=n%%10 m=(n-x)/10 x=m%%10 m=(m-x)/10 x=m%%10 m=(m-x)/10 x=m%%10 tic=0 for

highlight 0.2-5

December 7, 2010
By

I pushed highlight 0.2-5 on CRAN. This release improves the latex renderer and the sweave driver so that multiple lines character strings are properly rendered. This example vignette shows it: \documentclass{report} \begin{document} <<echo=FALSE,results=hide>>= old.op <- options( prompt = " ", continue = " " ) @ <<>>= require( inline ) require( Rcpp ) convolve <- cxxfunction( signature( a = "numeric", b...

R 2.12.1 scheduled for December 16

December 6, 2010
By

The next update to R will be a patch release: R 2.12.1 will be released on December 16, as announced today by the R Core Team. As is typical for a patch release, this version will include some minor bug fixes plus a few new features (from the current build's NEWS file): The DVI/PDF reference manual now includes the...

R 2.12.1 scheduled for December 16

December 6, 2010
By

The next update to R will be a patch release: R 2.12.1 will be released on December 16, as announced today by the R Core Team. As is typical for a patch release, this version will include some minor bug fixes plus a few new features (from the current build's NEWS file): The DVI/PDF reference manual now includes the...

Jeromy Anglim on Reproducible Research and R

December 6, 2010
By

Jeromy Anglim, fellow social scientist and R aficionado from across the globe, gave a great talk to the Melbourne R Users Group last week on the joys of creating reproducible results. A subject near and dear to me, but not one that is given enough attention in research training. Jeromy discusses tools for generating reproducible

What my R code looks and feels like (Vanilla)

December 6, 2010
By

Geoff Robinson discusses how to write simple and reusable R code. He provides several examples with code.

Reproducible Research and R Workflow

December 6, 2010
By

Jeromy Anglim discusses best practices and tools used to support reproducible research in R. He discusses tools for R scripting and version control; and provides examples of SWeave. This presentation was given to the Melbourne R Users Group (MelbURN)...

3 weak days in a row

December 6, 2010
By

Recently, Trading the odds posted one of many flavors of mean reverting strategies and I decided to get my hands dirty by writing R code and testing it. You can find full description of the strategy by following latter link above. Long story short – if SPY shows lower open, high and close 3 days in

JAGS – Bayesian Analysis

December 6, 2010
By

JAGS is used for Bayesian analysis using MCMC and stands for Just Another Gibbs Sampler.  It is an alternative to WinBUGS and can be accessed through R just like WinBUGS (via R2jags or RJags).  It will work on a Mac unlike WinBUGS.  The ...

Using the "Divide by 4 Rule" to Interpret Logistic Regression Coefficients

December 6, 2010
By

I was recently reading a bit about logistic regression in a book on hierarchical/multilevel modeling when I first learned about the "divide by 4 rule" for quickly interpreting coefficients in a logistic regression model in terms of the predicted probabilities of the outcome. The idea is pretty simple. The logistic curve (predicted probabilities) is steepest at the center where...

Example 8.17: Logistic regression via MCMC

December 6, 2010
By

In examples 8.15 and 8.16 we considered Firth logistic regression and exact logistic regression as ways around the problem of separation, often encountered in logistic regression. (Re-cap: Separation happens when all the observations in a category sha...

Electoral Marimekko Plots

December 6, 2010
By

To be reductive, visual displays of quantitative information might be reasonably categorized on a continuum between “data display” and “statistical graphics.” By statistical graphics, I mean a plot that displays some summary of or relationship amongst several variables, likely having undergone some processing or analysis. This may be as simple as a scatterplot of a … Read more

Bear hunting

December 6, 2010
By

When were there bear and bull markets in US stocks since 1950? Smoothing While we’d really like to estimate the expected return at each point in time, finding bear markets is ambitious enough.  The plan starts by smoothing the daily returns through time, as in Figure 1. Figure 1: Smoothed returns with a 4 year … Continue reading...

Forecasting workshop: Switzerland, June 2011

December 6, 2010
By

I will be running a workshop on Statistical Forecasting: Principles and Practice in Switzerland, 20-22 June 2011. Check out the venue: Waldhotel Doldenhorn, Kandersteg! So if you fancy a trip to the beautiful Swiss Alps next June, read on… Outline Forecasting is required in many situations: deciding whether to build another power generation plant in

R 2.12.0 and Eclipse with StatET installation

December 5, 2010
By

A while back I outlined the setup process for running R under the Eclipse integrated development environment with the help of WalWare.de’s StatET plugin for Eclipse, in Windows 7. This post is an update on that procedure for the newer versions of...

December 5, 2010
By

In the continuation of my earlier post on computing evidence, I read a very interesting paper by Merlise Clyde, Joyee Ghosh and Michael Littman, to appear in JCGS. It is called  Bayesian adaptive sampling for variable selection and model averaging. The sound idea at the basis of the paper is that, when one is doing

Rethinking ‘loess’ for Binomial-Response Pitch F/X Strike Zone Maps

December 5, 2010
By

So after a long hiatus, I'm back for today. I've been crazy busy with a number of different things--including getting engaged and helping plan out wedding dates and things of that sort--and unfortunately have not kept up here on this blog (or on Fanta...

Pareto plot party!

December 5, 2010
By

A Pareto plot is an enhanced bar chart. It comes in useful for deciding which bars in your bar chart are important. To see this, take a look at some made up DVD sales data. set.seed(1234) dvd_names <- c("Toy Tales 3", "The Dusk Saga: Black Out", "Urban Coitus 2", "Dragon Training for Dummies", "Germination", "Fe

Genetic optimization for Trading Strategies using Rapidminer and R

December 5, 2010
By

That is the second tutorial of Rapidminer and R extension for Trading and the first in Video. In the last example the ROC obtained is not as good as it should be to make money in this business, To improve the strategy we will try to optimize the trading strategy. Different methods of optimization and objective functions...

Genetic optimization for Trading Strategies using Rapidminer and R

December 5, 2010
By

That is the second tutorial of Rapidminer and R extension for Trading and the first in Video. In the last example the ROC obtained is not as good as it should be to make money in this business, To improve the strategy we will try to optimize the trading strategy. Different methods of optimization and objective functions...

GLMM and R issues

December 4, 2010
By

I have been trying to run a Generalized Linear Mixed Model (GLMM) for some count data with repeated measures on sub-sampled sites and fixed effects at the site level with covariates at both the sub-plot and time levels.  Plus there are different n...

Comparison of results

December 4, 2010
By

I am doing a simple comparison of different estimation procedures in dealing with a simple binomial model. Here is where I got started:---------------------------------------------library(INLA)library(npmlreg)library(MCMCglmm)library(DPpackage)data(See...

Comparison of results

December 4, 2010
By

I am doing a simple comparison of different estimation procedures in dealing with a simple binomial model. Here is where I got started:---------------------------------------------library(INLA)library(npmlreg)library(MCMCglmm)library(DPpackage)data(See...

Root finding

December 4, 2010
By

Numerical root finding methods use iteration, producing a sequence of numbers that hopefully converge towards a limits which is a root. In this post, only focus four basic algorithm on root finding, and covers bisection method, fixed point method, Newton-Raphson method, and secant method. Read More: 1886 Words Totally

Root finding

December 4, 2010
By

Numerical root finding methods use iteration, producing a sequence of numbers that hopefully converge towards a limits which is a root. In this post, only focus four basic algorithm on root finding, and covers bisection method, fixed point method, Newton-Raphson method, and secant method. Read More: 1896 Words Totally

Le Monde puzzle [48: resolution]

December 4, 2010
By
$Le Monde puzzle [48: resolution]$

The solution to puzzle 48 given in Le Monde this weekend is rather direct (which makes me wonder why the solution for 6 colours is still unavailable..) Here is a quick version of the solution: Consider one column, 1 say. Since 326=5×65+1, there exists one value c with at least 66 equal to c. Among