Example 8.16: Exact logistic regression

November 30, 2010
By

In example 8.15, on Firth logistic regression, we mentioned alternative approaches to separation troubles. Here we demonstrate exact logistic regression. The code for this appears in the book (section 4.1.2) but we don't show an example of it there. ...

Read more »

Sweave Tutorial 3: Console Input and Output – Multiple Choice Test Analysis

November 30, 2010
By

This post provides an example of using Sweave to perform an item analysis of a multiple choice test. It is designed as a tutorial for learning more about using Sweave in a mode where console input and output are displayed. Copies of all source code a...

Read more »

Hägerstrand Time-Space Cube

November 30, 2010
By

With the rgl package it’s possible to interact with the 3D visualization of the time-space tracks. Code example: plot3d(lon, lat, timedate, xlim=range(lon), ylim=range(lat), zlim=range(timedate), ticktype="detailed", xlab="longitude", ylab="latitude", zlab="Date", col=as.POSIXlt(daten)$mday, type="l", main=plottitle). In the posted example, individual waypoints were added by drawing vertical lines.

Read more »

GOSemSim redesign in terms of S4 classes

November 30, 2010
By

I started developing the GOSemSim package two years ago, when I was not very familiar with R. I am very happy to see that people use it and find it helpful. Over the past two weeks I have been learning S4 and redesigning GOSemSim with S4 classes and methods, and the very first version is now implemented. As I’m not...

Read more »

Analysis of retractions in PubMed

November 30, 2010
By

As so often happens these days, a brief post at FriendFeed got me thinking about data analysis. Entitled “So how many retractions are there every year, anyway?”, the post links to this article at Retraction Watch. It discusses ways to estimate the number of retractions and in particular, a recent article in the Journal of

Read more »

Data visualization videos

November 29, 2010
By

Probably everyone has seen Hans Rosling’s famous TED talk by now. I recently came across a couple of other exceptional talks on data visualization: Hans Rosling again: Let my dataset change your mindset. If only all statistics lecturers were this dynamic! David McCandless: The beauty of data visualization. Not so exciting as Hans, but some great

Read more »

Initializing the Holt-Winters method

November 29, 2010
By

The Holt-Winters method is a popular and effective approach to forecasting seasonal time series. But different implementations will give different forecasts, depending on how the method is initialized and how the smoothing parameters are selected. In this post I will discuss various initialization methods. Suppose the time series is denoted by and the seasonal period
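The dependence on initialization can be seen in a small experiment. The sketch below is not taken from the post; it uses base R's `HoltWinters()` on the built-in `co2` series, fixes the smoothing parameters (the values 0.5, 0.1 and 0.3 are arbitrary assumptions), and changes only `start.periods`, the number of seasonal periods used to compute the starting values.

```r
# Sketch: same series and smoothing parameters, two different initializations.
# Assumptions: built-in co2 data, arbitrary fixed alpha/beta/gamma,
# start.periods of 2 versus 3.
y <- window(co2, end = c(1964, 12))   # six years of monthly data

fit_a <- HoltWinters(y, alpha = 0.5, beta = 0.1, gamma = 0.3, start.periods = 2)
fit_b <- HoltWinters(y, alpha = 0.5, beta = 0.1, gamma = 0.3, start.periods = 3)

# Forecast one year ahead from each fit
fc_a <- as.numeric(predict(fit_a, n.ahead = 12))
fc_b <- as.numeric(predict(fit_b, n.ahead = 12))

# With the smoothing parameters held fixed, any gap between the two
# forecasts comes from the initial states alone
print(round(fc_a - fc_b, 3))
```

Because no parameters are optimized here, the comparison isolates the effect of the starting values; letting `HoltWinters()` estimate the parameters would mix in a second source of variation.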

Read more »

Sorting out Sweave in Eclipse/StatET

November 29, 2010
By

Using Sweave to produce pretty-looking documentation for R is awfully handy. It takes a little tweaking to get set up in Eclipse and StatET though. I followed the information in Jeromy Anglim’s webpage to originally get Sweave set up. The followi...

Read more »

Slices and crumbs [arXiv:1011.4722]

November 29, 2010
By

An interesting note was arXived a few days ago by Madeleine Thompson and Radford Neal. Besides the nice touch of mixing crumbs and slices, the neat idea is to have multiple-try proposals for simulating within a slice and to decrease the dimension of the simulation space at each try. This dimension diminution is achieved via

Read more »

The Joy of Visualizations

November 29, 2010
By

This is a clip from the forthcoming BBC4 program, The Joy of Stats: The clip shows Hans Rosling (who we've profiled here before), plotting life expectancy versus income for various countries, and animating over time. The clip amply demonstrates that with the right presentation and story, even the simplest of scatterplots can be a joy to behold. I'm looking...

Read more »

Altering Eclipse user name

November 29, 2010
By

Using StatET’s code generation templates is handy, but the standard Eclipse variable for ${user} just inserts the currently logged-in user’s account name. If you want a different name to appear when ${user} is called, one method is to alter...

Read more »

John Chambers on R and Multilingualism

November 29, 2010
By

John Chambers, one of the creators of R's predecessor S and a current member of the R Core Group, gave a seminar at Stanford last week titled "R, other languages and object-oriented programming". Unfortunately, I was away for the Thanksgiving break and couldn't make it myself, but John has kindly made his slides (PDF) available for download. The talk...

Read more »

Statistique de l’assurance STT6705V, partie 11

November 29, 2010
By

The last course will be uploaded soon (the links will be here and there). The R code considered is given below. First, we had to work a little bit on the datasets, tabB=read.table("http://perso.univ-rennes1.fr/arthur.charpentier/tabB.csv", sep=";",head...

Read more »

SAS and R joins SAS-x

November 29, 2010
By

Tal Galili, organizer of the R-bloggers blog aggregator, has opened a new aggregator for people blogging about SAS. If you're unfamiliar with the concept, an aggregator is a single blog which republishes (with permission, in this case) the entries fro...

Read more »

INFORMS Data Mining Competition leaders used Open Source software

November 29, 2010
By

The 2010 INFORMS data mining competition recently finished, and the leaders were presented at the 2010 Annual INFORMS Conference. The goal of the competition was to determine short-term movements in stock price...

Read more »

Oct 2010 Year-To-Date Global Temperature Anomaly 1st in 2 Series, 2nd in 3 Series

November 29, 2010
By

This post shows the YTD global land-ocean temperature anomaly (LOTA) trends for the 5 major series through October 2010 and how 2010 YTD ranks over the entire record for each series. The source data file link is provided. …

Read more »

Joy of Stats coming soon

November 29, 2010
By

The Joy of Stats really is a joy. It will be shown on BBC4, apparently scheduled for December 7. (That date comes from Hans Rosling on Twitter; I haven’t found scheduling evidence at the BBC.) I saw its debut at the Royal Statistical Society on World Statistics Day. Here is a five minute excerpt: You …

Read more »

Sweave Tutorial 2: Batch Individual Personality Reports using R, Sweave, and LaTeX

November 29, 2010
By

This post documents an example of using Sweave to generate individualised personality reports based on responses to a personality test. Each report provides information on both the responses of the general sample and responses of the specific respond...

Read more »

Getting Started with Git, EGit, Eclipse, and GitHub: Version Control for R Projects

November 28, 2010
By

This post provides information on (a) installing Git using the Eclipse plugin EGit, (b) uploading repositories to GitHub, and (c) links to resources on Git, Git and LaTeX, and Git and R. The focus is on version control for people working on R, Sweave, ...

Read more »

Computing evidence

November 28, 2010
By

The book Random effects and latent variable model selection, edited by David Dunson in 2008 as a Springer Lecture Note, contains several chapters dealing with evidence approximation in mixed effect models. (Incidentally, I would be interested in the story behind the Lecture Note as I found no explanation on the back cover or in the preface.

Read more »

Analyst First – SURF

November 28, 2010
By

This presentation is aimed at all those working in commercial and government analytics, irrespective of what tools they use, and also at students intending to pursue such a career. R and other open source tools play a powerful, unique and …

Read more »

Random variable generation (Pt 1 of 3)

November 28, 2010
By

As I mentioned in a recent post, I’ve just received a copy of Advanced Markov Chain Monte Carlo Methods. Chapter 1.4 in the book (very quickly) covers random variable generation. Inverse CDF Method A standard algorithm for generating random numbers is the inverse cdf method. The continuous version of the algorithm is as follows: 1.
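As a rough illustration of the inverse CDF method named in the excerpt (a sketch, not the post's or the book's own code): draw U from Uniform(0, 1) and return F^{-1}(U). The example assumes an exponential target, whose inverse CDF is -log(1 - u) / rate; the function name `rexp_inverse_cdf` is hypothetical.

```r
# Inverse CDF sampling for an Exponential(rate) distribution.
# Assumption: the target distribution and the function name are chosen
# purely for illustration.
rexp_inverse_cdf <- function(n, rate = 1) {
  stopifnot(n >= 0, rate > 0)
  u <- runif(n)          # step 1: draw uniforms on (0, 1)
  -log(1 - u) / rate     # step 2: apply the inverse CDF F^{-1}(u)
}

set.seed(1)
x <- rexp_inverse_cdf(10000, rate = 2)
mean(x)  # should be close to the theoretical mean 1 / rate = 0.5
```

The same two steps work for any distribution whose CDF can be inverted in closed form; only the last line of the function changes.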

Read more »

parser 0.0-12

November 28, 2010
By

I've pushed a new version of the parser package to CRAN. This is the first release that depends on Rcpp, which allowed me to reduce the code size and increase its maintainability. This also features a faster version of nlines, a function that r...

Read more »