Functions ddply and melt make plotting summary stats in R more tolerable

May 15, 2012
By
Functions ddply and melt make plotting summary stats in R more tolerable

The main reason why I have usually chosen to use excel to make my plots at work is because I had difficulty feeding the summary stats in R into a plotting function.  One thing I learned this week is how … Continue reading →

Read more »

R solvements to Project Euler — problem 1

May 15, 2012
By

Things have been going wild since I opened this blog. Tasks were piled up while I was tight on time. At present, I’m facing a major challenge in my life. However, I decide to spare some time for self-improvements. R … Continue reading →

Read more »

GitHub data analysis

May 15, 2012
By
GitHub data analysis

Few weeks ago GitHub announced, that its timeline data is available on bigquery for analysis. Moreover, it offers prizes for the best visualization of the data. Despite my art skills and minimal chances to win beauty contest, I decided to crunch GitHub data and run data analysis. After initial trial of bigquery service, I found hard

Read more »

Blog aggregators

May 15, 2012
By

A very useful way of keeping up with blogs in a particular area is to subscribe to a blog aggregator. These will syndicate posts from a large number of blogs and provide links back to the original sources. So you only need to subscribe once to get all the good stuff in that area. There are now several blog...

Read more »

Setting up StatET & Eclipse in Windows

May 15, 2012
By
Setting up StatET & Eclipse in Windows

A view of the StatET plugin in the Juno Eclipse. The environment is perfect for developing R packages and creating more complex functions. I wanted to write about creating R-packages in Windows but after trying to get StatET to work seamlessly...

Read more »

Plotting data and distribution simultaneously (with ggplot2)

May 14, 2012
By
Plotting data and distribution simultaneously (with ggplot2)

Ever wanted to see at a glance the distribution of your data across different axes? It happens often to me, and R allows to build a nice plot composition - This is my latest concoction. I used ggplot2 here, but equivalent graphics can be made...

Read more »

Multiple Sclerosis Tweet-Chat: Review

May 14, 2012
By
Multiple Sclerosis Tweet-Chat: Review

We had a great Twitter conversation last Thursday on the use of big-data analytics, Revolution R Enterprise, and IBM Netezza in the search for a cure for MS. Many thanks to the other panelists: Murali Ramanathan (SUNY Buffalo), Tim Coetzee (National MS Society) and moderator Shawn Dolley (IBM) for fielding and answering questions from interested parties following #IBMDataChat. As...

Read more »

New courses from R gurus

May 14, 2012
By

Looking to learn R, or to expand your R skills for data visualization or package development? Here are some R courses presented by the experts you may be interested in: June 19-20: Visualization in R with ggplot2. This course presented by Garrett Grolemund & Dr. Winston Chang of Rice University is also a web-based course with live presentation. This...

Read more »

generalised ratio of uniforms

May 14, 2012
By
generalised ratio of uniforms

A recent arXiv posting of the paper “On the Generalized Ratio of Uniforms as a Combination of Transformed Rejection and Extended Inverse of Density Sampling” by Martino, Luengo, and Míguez from Madrid rekindled my interest in this rather peculiar simulation method. The ratio of uniforms samples uniformly on the subgraph to produce simulations from p

Read more »

Spatial Randomness Evaluation in R: Monte Carlo Test

May 14, 2012
By
Spatial Randomness Evaluation in R: Monte Carlo Test

This post is a some kind of reply to this one.So our goal is to determine whether our point process is random or not. We will use R and spatstat package in particular. Spatstat provides a very handy function for this, that uses K-function combined with...

Read more »

New Version of RStudio (v0.96)

May 14, 2012
By
New Version of RStudio (v0.96)

Today a new version of RStudio (v0.96) is available for download from our website. The main focus of this release is improved tools for authoring, reproducible research, and web publishing. This means lots of new Sweave features as well as tight integration with the knitr package (including support for creating dynamic web reports with the

Read more »

Criticism 3 of NHST: Essential Information is Lost When Transforming 2D Data into a 1D Measure

May 14, 2012
By
Criticism 3 of NHST: Essential Information is Lost When Transforming 2D Data into a 1D Measure

Introduction Continuing on with my series on the weaknesses of NHST, I’d like to focus on an issue that’s not specific to NHST, but rather one that’s relevant to all quantitative analysis: the destruction caused by an inappropriate reduction of dimensionality. In our case, we’ll be concerned with the loss of essential information caused by

Read more »

Example 9.31: Exploring multiple testing procedures

May 14, 2012
By
Example 9.31: Exploring multiple testing procedures

In example 9.30 we explored the effects of adjusting for multiple testing using the Bonferroni and Benjamini-Hochberg (or false discovery rate, FDR) procedures. At the time we claimed that it would probably be inappropriate to extract the adjusted p-values from the FDR method from their context. In this entry we attempt to explain our misgivings about...

Read more »

Source R-Script from Dropbox

May 14, 2012
By
Source R-Script from Dropbox

A quick tip on how to source R-scripts from a Dropbox-account:(1) Upload the script.. (2) Get link with the "get link" option. The link should look like "https://www.dropbox.com/s/XXXXXX/yourscript.R"..(3) Grab this part "XXXXXX/yourscript.R" and paste...

Read more »

Bias in Federal Reserve Inflation Forecasts

May 13, 2012
By

Bias in Federal Reserve Inflation Forecasts: Christopher Gandrud uses ggplot2 to visualize potential partisan bias in US Federal Reserve inflation forecasts as a PhD student at the London School of Economics.

Read more »

Text Mining to Word Cloud App with R

May 13, 2012
By
Text Mining to Word Cloud App with R

Here is a simple application to transform text into a beautiful word cloud, Text Mining to WordCloud. The purpose is to find out the highest frequency word in a certain text. It is an app built with R language, the source code is attached at the end of...

Read more »

BCEA on CRAN!

May 13, 2012
By

Finally, I got round to find some time to work out all the problems in compiling the BCEA (Bayesian Cost-Effectiveness Analysis) package.I developed it as part of the work for the book. In a nutshell, what it does is the following: first, you need to s...

Read more »

The whinny of the exponential horse

May 13, 2012
By
The whinny of the exponential horse

A Poisson process provides a good model for events that happen rarely. That's what von Bortkiewicz realized in 1898 when he modeled deaths by horse kick in Prussian cavalry; since it would be ungentlemanly to actually kill my readers, I instead represent the events in a Poisson process using a horse's whinny.

Read more »

Spurious correlations and the Lasso

May 13, 2012
By
Spurious correlations and the Lasso

Autocorrelation of a time series can be useful for prediction because the most recent observation of the prediction target contains information about future values. At the same time autocorrelation can play tricks on you because many standard statistical methods implicitely assume independence of measurements at different times. The correlation coefficient between two variable and has

Read more »

ggplot2 presentation at Victoria University of Wellington

May 13, 2012
By

Next week I’ll present a glimpse of R and ggplot2 graphics at VUW. This is a MESA seminar on ‘Data analysis and plotting with free and open source tools’ where we’ll present spreadsheet alternatives based on gnuplot, Python, an...

Read more »

gRaphics! 2012-05-12 14:07:00

May 12, 2012
By
gRaphics! 2012-05-12 14:07:00

My own version of bubble plot (part 1)During one of my projects, I found myself in need of visualizing more than 3 dimensions at once. Three-dimensional graphs are not a good solution, usually - they will need to be properly oriented, for a start, ad that's tricky.So, I started looking at bubble plots. The size of the bubble can...

Read more »

gRaphics! 2012-05-12 13:47:00

May 12, 2012
By

First Post: Welcome to this new blog!!!It's been almost one years that I've started using R as my main programming/analysis tool. I like the fact that so many beautiful graphics can be produced directly within R.Although I often just use the basic func...

Read more »

Neat demo real of d3 (js & svg powered interactive graphics…

May 12, 2012
By

Neat demo real of d3 (js & svg powered interactive graphics in the browser).  Hopefully there will be ggplot2 integration one day!

Read more »

R Videos – and More

May 12, 2012
By

Some of us learn easily from the written word, but for most of us some visualization speeds up the process and generally helps with retention as well. With that in mind I was delighted to see this nice list of free videos that demonstrate the use of R, posted on Ethan Fosse's blog, "Culture, Statistics, and...

Read more »

Criticism 2 of NHST: NHST Conflates Rare Events with Evidence Against the Null Hypothesis

May 12, 2012
By

Introduction This is my second post in a series describing the weaknesses of the NHST paradigm. In the first post, I argued that NHST is a dangerous tool for a community of researchers because p-values cannot be interpreted properly without perfect knowledge of the research practices of other scientists — knowledge that we cannot hope

Read more »

The Foreign Language of ‘Mad Men’

May 12, 2012
By

The Foreign Language of 'Mad Men': ggplot2 in the Atlantic

Read more »

useR! 2012: Call for Late-Breaking Posters; REGULAR REGISTRATION ENDS 12May

May 12, 2012
By

*** Call for Late-breaking Posters *** Abstracts may be submitted for posters presenting recent developments and late-breaking applications of R, on topics as indicated in the earlier call for abstracts: http://biostat.mc.vanderbilt.edu/UseR-2012#Call_for_Abstracts_and_Tutorial Late-breaking posters will be displayed during the poster session alongside regular posters, and they will appear in the electronically published book of abstracts for the conference. However, these...

Read more »

ASA fellows

May 12, 2012
By
ASA fellows

Being freshly elected ASA Fellow (yay!), I just received the list of 2012 ASA Fellows. Among whose, let me mention Sudipto Banerjee, University of Minnesota, Minneapolis, Minnesota, elected “For theoretical, methodological and applied research in spatiotemporal statistical modeling, especially as applied to problems in environmetrics, ecology, occupational health, agriculture and economics, for professional work at

Read more »

R – some introductory material

May 12, 2012
By

R is a statistical programming language and can be a little scary at first. I learned it during my first statistics class. While others used Stata, I decided to try if I could do the tasks in R. That was probably one of my best research-choices. My main source of knowledge was Quick-R that's an excellent resource. It...

Read more »