Update: Parameters as Population Quantities

May 16, 2012
By

Some time ago, I had an ineloquent and less-than-cordial online discussion with a commenter on this site, partially about how statisticians define the term "parameter". This post is just to quote a relevant passage from "Bootstrap Methods and Their Application", by Davison and Hinkley (1997), that better articulates a point I had made earlier. 2.1.1

Read more »

Global Homicide Rates by Government Type

May 16, 2012
By
Global Homicide Rates by Government Type

Surprising results For purposes of this article, any mention of homicide rates refers to reported homicide rates. Open vs Closed In mostly open countries (full democracies), the homicide rates are rather low when compared to other types of...

Read more »

My take on polar bar (a.k.a. consultant’s) charts

May 16, 2012
By
My take on polar bar (a.k.a. consultant’s) charts

Once upon a time, when I was working at Johnson & Johnson (pharma branch), I was surrounded by a bunch of programmers working to develop (among other things) a nifty piece of software for internal use. Part of it was later released as freeware, cal...

Read more »

RProtoBuf 0.2.4

A new release 0.2.4 of RProtoBuf is now on CRAN. RProtoBuf provides GNU R bindings for the Google Protobuf data encoding library used and released by Google. This release once again contains a number of patches kindly contributed by Murray Stokely...

Read more »

Finding Waldo, a flag on the moon and multiple choice tests, with R

May 16, 2012
By
Finding Waldo, a flag on the moon and multiple choice tests, with R

I have to admit, first, that finding Waldo has been a difficult task. And I did not succeed. Neither could I correctly spot his shirt (because actually, it was what I was looking for). You know, that red-and-white striped shirt. I guess it sho...

Read more »

Revolution Newsletter: May 2012

May 16, 2012
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full May edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. New R Training Courses Announced. Three new R courses from leading R experts are...

Read more »

And the Winner is……

May 16, 2012
By

R will overtake SAS and SPSS in 2015 - according to David Smith in his post on the Revolutions blog.I can believe that!© 2012, David E. Giles

Read more »

Stepping Outside My Open-Source Comfort Zone: A First Look at Golden Helix SVS

May 16, 2012
By
Stepping Outside My Open-Source Comfort Zone: A First Look at Golden Helix SVS

I'm a huge supporter of the Free and Open Source Software movement. I've written more about R than anything else on this blog, all the code I post here is free and open-source, and a while back I invited you to steal this blog under a cc-by-sa license....

Read more »

Quickly Visualize Your Whole Dataset

May 16, 2012
By

(This article was first published on W. Andrew Barr's Paleoecology Blog, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on his blog: W. Andrew Barr's Paleoecology Blog. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave,...

Read more »

Express Divorce in Mexico

May 16, 2012
By
Express Divorce in Mexico

On October 2008 Mexico's capital, the Federal District, approved a version of no-fault divorce locally known as "express divorce". With the new law the requesting spouse no longer had to provide a cause to dissolve the marriage and the couple no longer had to live apart before filing for divorce. Furthermore, the process of...

Read more »

Submitting a package to CRAN

May 16, 2012
By

The process is relatively smooth, I think - especially if you start off with a Linux machine. However, when I had to compile the package and documentation for BCEA, I still had a couple of blips. The first problem was in the documentation. R tries...

Read more »

Extending the sensory profiling data model

May 16, 2012
By
Extending the sensory profiling data model

In this post I extend the multiplicative Bayesian sensory profiling model with effects for rounds and sessions. Is is not a difficult extension, but it brings the need for informative priors into the model. I do believe round and session effects exist,...

Read more »

Population of Tawi-Tawi from 1903 to 2010

May 16, 2012
By
Population of Tawi-Tawi from 1903 to 2010

R Codeslibrary(ggplot2)library(colorRamps)TawiTawiPop <- c(17000, 45000, 46000, 59000, 79000, 110000, 143000, 195000, 228204, 250718, 322317, 450346, 366550)YearNames <- c("1903", "1918", "1939", "1948", "1960", "1970", "1975", "1...

Read more »

Dynamic Content with RStudio, Markdown, and Marked.

May 15, 2012
By
Dynamic Content with RStudio, Markdown, and Marked.

As Markus Gesmann recently pointed out, the new version of RStudio (0.96) has some really nice features for creating dynamic reports with Yihui Xie’s knitr. You can integrate not just R and LaTeX, but also R and Markdown (as well as some other fo...

Read more »

Using R to graph a subject trend in PubMed

May 15, 2012
By
Using R to graph a subject trend in PubMed

The traditional way to show that your topic is worth studying in front of an audience is to show the state of the field based on a literature review. This is especially true if your subject is obscure except to a handful of scientists in the world.I was confronted with this problem more than once and the last time...

Read more »

How long before R overtakes SAS and SPSS?

May 15, 2012
By

Based on an analysis of Google Scholar data on usage of statistical software, Bob Muenchen makes a forecast: R will overtake SAS and SPSS in 2015. Forecasting is extrapolation — always a tricky business — so Bob also provides these qualitative reasons why R will continue to grow at the expense of SAS and SPSS: The continued rapid growth...

Read more »

Interactive reports in R with knitr and RStudio

May 15, 2012
By
Interactive reports in R with knitr and RStudio

Last Saturday I met the guys from RStudio at the R in Finance conference in Chicago. I was curious to find out what RStudio could offer. In the past I have used mostly Emacs + ESS for editing R files. Well, and what a surprise it was. JJ, Joe and Josh ...

Read more »

Will 2015 be the Beginning of the End for SAS and SPSS?

May 15, 2012
By
Will 2015 be the Beginning of the End for SAS and SPSS?

Learning to use a data analysis tool well takes significant effort, so people tend to continue using the tool they learned in college for much of their careers. As a result, the software used by professors and their students is … Continue reading →

Read more »

Forthcoming R User Meetings

May 15, 2012
By

Two R User Group meetings are happening soon thanks to the support of Mango-solutions (one of R-bloggers’ long term sponsors).  Details below:   1)      ZurichR – Wednesday 23rd May 2012 (www.zurichr.org) ZurichR is a free networking event for R users sponsored by Mango Solutions and ETH Zurich All welcome to attend.  Please confirm attendance in advance to [email protected] Time:       6.30pm – 9.30 pm...

Read more »

Skew of Bonds

May 15, 2012
By
Skew of Bonds

As the researchpuzzler highlights in “a bad bet”, US bonds were a popular subject at the CFA Institute Annual Conference.  While US Bonds have been in an amazing 30 year run (see previous posts Lattice Explore Bonds, Bond Market as a Casino Ga...

Read more »

Improving script_002: “Monitor”

May 15, 2012
By
Improving script_002: “Monitor”

I read in an article that Ian Cowe said that what normally chemometricians do is to look to the graphics, of course interpret those graphics. So I still go on trying to develop a function can help me to understand the graphics and all the statistics th...

Read more »

RcppSMC 0.1.1

CRAN now tests packages against g++-4.7 (as this version has become the default on Debian's testing variant. This compiler switch once again triggered a set of build failures, mostly from include files now deemed missing. For RcppSMC, it came down ...

Read more »

Functions ddply and melt make plotting summary stats in R more tolerable

May 15, 2012
By
Functions ddply and melt make plotting summary stats in R more tolerable

The main reason why I have usually chosen to use excel to make my plots at work is because I had difficulty feeding the summary stats in R into a plotting function.  One thing I learned this week is how … Continue reading →

Read more »

R solvements to Project Euler — problem 1

May 15, 2012
By

Things have been going wild since I opened this blog. Tasks were piled up while I was tight on time. At present, I’m facing a major challenge in my life. However, I decide to spare some time for self-improvements. R … Continue reading →

Read more »

GitHub data analysis

May 15, 2012
By
GitHub data analysis

Few weeks ago GitHub announced, that its timeline data is available on bigquery for analysis. Moreover, it offers prizes for the best visualization of the data. Despite my art skills and minimal chances to win beauty contest, I decided to crunch GitHub data and run data analysis. After initial trial of bigquery service, I found hard

Read more »

Blog aggregators

May 15, 2012
By

A very useful way of keeping up with blogs in a particular area is to subscribe to a blog aggregator. These will syndicate posts from a large number of blogs and provide links back to the original sources. So you only need to subscribe once to get all the good stuff in that area. There are now several blog...

Read more »

Setting up StatET & Eclipse in Windows

May 15, 2012
By
Setting up StatET & Eclipse in Windows

A view of the StatET plugin in the Juno Eclipse. The environment is perfect for developing R packages and creating more complex functions. I wanted to write about creating R-packages in Windows but after trying to get StatET to work seamlessly...

Read more »

Plotting data and distribution simultaneously (with ggplot2)

May 14, 2012
By
Plotting data and distribution simultaneously (with ggplot2)

Ever wanted to see at a glance the distribution of your data across different axes? It happens often to me, and R allows to build a nice plot composition - This is my latest concoction. I used ggplot2 here, but equivalent graphics can be made...

Read more »

Multiple Sclerosis Tweet-Chat: Review

May 14, 2012
By
Multiple Sclerosis Tweet-Chat: Review

We had a great Twitter conversation last Thursday on the use of big-data analytics, Revolution R Enterprise, and IBM Netezza in the search for a cure for MS. Many thanks to the other panelists: Murali Ramanathan (SUNY Buffalo), Tim Coetzee (National MS Society) and moderator Shawn Dolley (IBM) for fielding and answering questions from interested parties following #IBMDataChat. As...

Read more »