Presidential Debates with qdap-beta

October 4, 2012
By
Presidential Debates with qdap-beta

qdap brief intro For the past year I’ve been working on a package (qdap) to assist my field in quantitative discourse analysis; basically looking at patterns in language. It’s still a ways from being finished and lacks documentation (roxygen2 is … Continue reading →

Read more »

Garmin data visualization

October 4, 2012
By
Garmin data visualization

People go on rage, when governments initiate surveillance projects like CleanIT, nevertheless share very private data without a doubt. I have to admit, that some data leaks are well buried in the process. Take for example Garmin which produces GPS training devices for runners. In order to see your workouts you are forced to upload

Read more »

Parse pdf files with R (on a Mac)

October 4, 2012
By

Inspired by this blog post from theBioBucket, I created a script to parse all pdf files in a directory. Due to its reliance on the Terminal, it’s Mac specific, but modifications for other systems shouldn’t be too hard (as a start for Windows, see BioBucket’s script). First, you have to install the command line tool

Read more »

Log odds ratios and an indicator matrix from categorical data

October 4, 2012
By
Log odds ratios and an indicator matrix from categorical data

A long title, but there are a couple of handy things in this Gist. The first, and more obscure, is the conversion of a data.frame of categorical variables into a matrix of dummy/binary/indicator variables, one for each category of each original variab...

Read more »

Graphing Non-Proportional Hazards in R

October 3, 2012
By

(This article was first published on Christopher Gandrud (간드루드 크리스토파), and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on their blog: Christopher Gandrud (간드루드 크리스토파). R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps,...

Read more »

Tips on accessing data from various sources with R

October 3, 2012
By

Jeffrey Breen (the man behind the Twitter airline sentiment analysis example) recently posted a collection of slides with some great tips for accessing data from R. "Tapping the Data Deluge" includes information on: Using the XLConnect package to read data from Excel spreadsheets Using the foreign package to read SPSS, SAS, Stata and dBase data files Using SQL queries...

Read more »

Have I chosen the right power company?

October 3, 2012
By
Have I chosen the right power company?

Do you always wonder if I have chosen the right power company and have not been over charged? Your questions may be answered here (if you reside in Wellington, New Zealand). Power costs per day and per month_Oct_2012 shows which company … Continue reading →

Read more »

A post about greater blogs

October 3, 2012
By

I'm very glad if you use or at least read sometimes my blog. However, I'd like to give you a list of wonderful blogs which are useful if you are interested in applied mathematics and programming for simulation.My favorite one is written by Arthur Charp...

Read more »

Perculiar behaviour of the sum function

October 3, 2012
By

The sum function in R is a special one in contrast to other summary statistics functions such as mean and median. The first distinguish is that it is a Primitive function where the others are not (Although you can call mean using .Internal). This ...

Read more »

Transforming a color scale

October 3, 2012
By
Transforming a color scale

In developing plots, I often use color (or “colour” in ggplot2 parlance) to reflect values of a third, non-X/Y, variable. Depending on the distribution of this Z variable, however, the effective color range can be narrow, making it difficul...

Read more »

A Quick Note On Large 2D Data

October 3, 2012
By
A Quick Note On Large 2D Data

Two months ago I was told one of my old blog posts was borrowed to this post: Finding patterns in big data with SAS/GRAPH. I wrote my blog post four years ago just for fun. The over-plotting issue is pretty boring to me now, but what caught my attentio...

Read more »

Oracle R Enterprise Tutorial Series on Oracle Learning Library

October 2, 2012
By

Oracle Server Technologies Curriculum has just released the Oracle R Enterprise Tutorial Series, which is publicly available on Oracle Learning Library (OLL). This 8 part interactive lecture series with review sessions covers Oracle R Enterprise ...

Read more »

Emerging as Low Vol

October 2, 2012
By
Emerging as Low Vol

Extending the series begun with When Russell 2000 is Low Vol, I thought I should take a look at Emerging Market stocks during periods of low relative volatility to the S&P 500.  So you can replicate even without access to expensive data, let

Read more »

Clegg vs Pleb: An XKCD-esque chart

October 2, 2012
By
Clegg vs Pleb: An XKCD-esque chart

I saw an interesting “challenge” on StackOverflow last night to create an XKCD style chart in R. A couple of hours later & going in a very similar direction to a couple of the answers on SO, I got to something that looked pretty good, using the sin and cos curves for simple and reproducible … Continue reading...

Read more »

Loading Packages and Functions Automatically in R

October 2, 2012
By
Loading Packages and Functions Automatically in R

For a long time, I was wondering how to get R to automatically load packages I use every time I open the program. For example, for a variety of reasons, I use the ‘Cairo’ package to save all my figures as … Continue reading →

Read more »

R 2.15.2 scheduled for October 26

October 2, 2012
By

The next minor update to R — version 2.15.2 "Trick or Treat" — will be released on October 26, R-core member Peter Dalgaard announced today. You can find the planned updates in the current NEWS file (scroll down to the section 'CHANGES IN R VERSION 2.15.1 patched'; the changes at the top of the file are planned for the...

Read more »

City Size and SUHI

October 2, 2012
By
City Size and SUHI

In the course of putting together data for my kriging project with the CRN stations, I got another idea related to a small but potentially important corner of the concerns over UHI in the global temperature index. For clarity I suppose I should make it clear that my position is that the UHI bias is

Read more »

Slides from “Tapping the Data Deluge with R” lightning talk #rstats #PAWCon

October 2, 2012
By
Slides from “Tapping the Data Deluge with R” lightning talk #rstats #PAWCon

Here is my presentation from last night’s Boston Predictive Analytics Meetup graciously hosted by Predictive Analytics World Boston. The talk is meant to provide an overview of (some) of the different ways to get data into R, especially supplementary data sets to assist with your analysis. All code and data files are available at github:

Read more »

From Lavaan to OpenMx

October 2, 2012
By
From Lavaan to OpenMx

Joel Caldwell gave an example of Confirmatory factor analysis in lavaan. The purpose of this post is to show you how to express this model in OpenMx. The data being modeled are in an object called ratings (see below for the code to create this object as described here. Here’s Joe’s example structural equation model, as implemented in lavaan: require(lavaan) bifactor <- "general.factor =~ Easy_Reservation +...

Read more »

R for SAS, SPSS, Stata Users Workshop Redesigned

October 2, 2012
By
R for SAS, SPSS, Stata Users Workshop Redesigned

My workshop R for SAS, SPSS and Stata Users has been popular over the years, but it’s time for an overhaul. A common request has been to simplify it, so I have moved data management to a separate 4-hour workshop, … Continue reading →

Read more »

A replacement for theme_blank()

October 2, 2012
By
A replacement for theme_blank()

ggplot2 has just hit 0.9.2, and with the change comes a new theme system. Previous versions of ggplot2 offered a theme_blank(), which was a stripped-down, essentially blank plotting canvas, but it is now deprecated. github user jrnold has produced a s...

Read more »

“Advanced R” Course – November 15-16, 2012

October 2, 2012
By

This two days course shows wide variety of “Advanced R” topics ranging from S4 methods and classes to parallel computing in order to provide an exhaustive overview of R capabilities. Continue reading →

Read more »

Connecting the real world to R with an Arduino

October 2, 2012
By
Connecting the real world to R with an Arduino

If connecting data to the real world is the next sexy job, then how do I do this? And how do I connect the real world to R? It can be done as Matt Shottwell showed with his home made ECG and a patched version of R at useR! 2011. However, there are ot...

Read more »

Scraping pages and downloading files using R

October 1, 2012
By
Scraping pages and downloading files using R

I have written a few posts discussing descriptive analyses of evaluation of National Standards for New Zealand primary schools.The data for roughly half of the schools was made available by the media, but the full version of the dataset is … Continue reading →

Read more »

analyze the area resource file (arf) with r

October 1, 2012
By

the arf is fun to say out loud.  it's also a single county-level data table with about 6,000 variables, produced by the united states health services and resources administration (hrsa).  the file contains health information and statistics fo...

Read more »

Where in the world is R and RStudio

October 1, 2012
By
Where in the world is R and RStudio

Using the web logs collected when users download RStudio, we’ve prepared the following two maps showing where RStudio is being used, over the whole globe and just within the continental USA. Obviously this data is somewhat biased, as it reflects the number of downloads of RStudio, rather than the number of users of R (which

Read more »

Ordinal football

October 1, 2012
By
Ordinal football

I've had a quick look at this article on R-bloggers $-$ I don't think I've followed the whole exchange, but I believe they have discussed what models should/could be applied to estimate football scores (specifically, in this case they are using the Dut...

Read more »

Designing real-world 3-D objects with R

October 1, 2012
By
Designing real-world 3-D objects with R

The Maker Movement has led to the production of open-source 3-D printers and other manufacturing machines that allow hobbyists to design, create and produce real-world objects affordably. Now R user Ian Walker, in a post at the Psychological Statistics blog, shows how to use the R language to transform 3-D surfaces into real-world physical objects with a 3-D printer....

Read more »

A Brief Tip on Generating Fractional Factorial Designs in R

October 1, 2012
By

A number of marketing researchers use the orthoplan procedure in SPSS to generate fractional factorial designs.  It is not surprising, then, that I received a number of questions concerning the recent article in the Journal of Statistical Software by Hideo Aizaki on “Basic Functions for Supporting an Implementation of Choice Experiments in R.”  To summarize their issues,...

Read more »

Sponsors