Monthly Archives: July 2012

This is *huge*: SAScii package

July 10, 2012
By

http://blog.revolutionanalytics.com/2012/07/importing-public-data-with-sas-instructions-into-r.html Q: How do you make a hairless primate? Answer 1: Take a hairy primate, wait a few million years and see if Darwin was right. Answer 2: Make them work i...

Read more »

Importing public data with SAS instructions into R

July 10, 2012
By

Many public agencies release data in a fixed-format ASCII (FWF) format. But with the data all packed together without separators, you need a "data dictionary" defining the column widths (and metadata about the variables) to make sense of them. Unfortunately, many agencies make such information available only as a SAS script, with the column information embedded in a PROC...

Read more »

introduction to R: learning by doing (part 2: plots)

July 10, 2012
By
introduction to R: learning by doing (part 2: plots)

Lets go one with the second part of learning R by doing R (you will find the first part here. As we have used vectors, matrices and loops in the first part, we will concentrate on graphics in this one. but first we will need data to plot: Sometimes you will need several plots in

Read more »

simulation, an ubiquitous tool

July 10, 2012
By
simulation, an ubiquitous tool

(This article was first published on Xi'an's Og » R, and kindly contributed to R-bloggers) After struggling for quite a walk on that AMSI public lecture talk, and dreading its loss with the problematic Macbook, I managed to complete a first draft last night in Adelaide, downloading a final set of images from the Web...

Read more »

Visualizing Graphical Models

July 10, 2012
By
Visualizing Graphical Models

I'm anticipating presenting research of mine based on Bayesian graphical models to an audience that might not be familiar with them. When presenting ordinary regression results, there's already the sort of statistical sniper questions along the lines o...

Read more »

SAS Beats R on July 2012 TIOBE Rankings

July 10, 2012
By
SAS Beats R on July 2012 TIOBE Rankings

The TIOBE Community Programming Index ranks the popularity of programming languages, but from a programming language perspective rather than as analytical software (http://www.tiobe.com). It extracts measurements from blogs, entries in Wikipedia, books on Amazon, search engine results, etc. and combines them into a single index. … Continue reading →

Read more »

2nd CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

July 10, 2012
By
2nd CFP: the 10th Australasian Data Mining Conference (AusDM 2012)

The Tenth Australasian Data Mining Conference (AusDM 2012) Sydney, Australia, 5-7 December 2012 http://ausdm12.togaware.com/ The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. This year’s conference, AusDM’12, co-hosted … Continue reading →

Read more »

Data Mining In Excel: Lecture Notes and Cases

July 10, 2012
By
Data Mining In Excel: Lecture Notes and Cases

by Yanchang Zhao, RDataMining.com It is a 270-page book on data mining with Excel. It can be downloaded as a PDF file at http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.83.1393&rep=rep1&type=pdf. Below is its table of contents. - Overview of the Data Mining Process - Data Exploration … Continue reading →

Read more »

Sourcing Code from GitHub

July 10, 2012
By
Sourcing Code from GitHub

In previous posts I described how to input data stored on GitHub directly into R. You can do the same thing with source code stored on GitHub. Hadley Wickham has actually made the whole process easier by combining the getURL, textConnection, and source commands into one function: source_url. This is in his devtools...

Read more »

Package JM — version 1.0-0

July 10, 2012
By

(by Dimitris Rizopoulos) Dear R-users, I’d like to announce the release of version 1.0-0 of package JM (already available from CRAN) for the joint modeling of longitudinal and time-to-event data using shared parameter models. These models are applicable in mainly two settings. First, when focus is in the survival outcome and we wish to account for the effect of an...

Read more »