March 2013

GSOC 2013: IID Assumptions in Performance Measurement

March 9, 2013 | Peter Carl

Google Summer of Code for 2013 has been announced and organizations such as R are beginning to assemble ideas for student projects this summer. If you’re an interested student, there’s a list of project proposals on the R wiki. If you’re considering being a mentor, post a project ... [Read more...]

Visualizing Risky Words — Part 2

March 9, 2013 | hrbrmstr

This is a follow-up to my Visualizing Risky Words post. You’ll need to read that for context if you’re just jumping in now. Full R code for the generated images (which are pretty large) is at the end. Aesthetics are the primary reason for using a word cloud, ...
[Read more...]

Analyzing SimplyStatistics visits info

March 9, 2013 | Fellgernon Bit - rstats

Recently we had to analyze the data of the number of visits per day to SimplyStatistics.org. There were two goals: Estimate the fraction of visitors retained after a spike in the number of visitors Identify (if any) any factors that influence the fraction estimated in 1. For me it was ... [Read more...]

A bit more on sample size

March 8, 2013 | John Mount

In our article What is a large enough random sample? we pointed out that if you wanted to measure a proportion to an accuracy “a” with chance of being wrong of “d” then a idea was to guarantee you had a sample size of at least: This is the central ... [Read more...]

Quandl package released to CRAN

March 8, 2013 | David Smith

In a guest post here on February 20, Tammer Kamel introduced us to Quandl, a kind of "wikipedia" of time series data. In the post, Tammer (the founder of Quandl) noted that they were working on an R package to give R users access to Quandl as a data source. That ... [Read more...]

Comparing quantiles for two samples

March 8, 2013 | arthur charpentier

Recently, for a research paper, I some samples, and I wanted to compare them. Not to compare they means (by construction, all of them were centered) but there dispersion. And not they variance, but more their quantiles. Consider the following boxplot type function, where everything here is quantile related (which ... [Read more...]

Data Visualization: Shiny Democratization

March 8, 2013 | Sean Gonzalez

In organizing Data Visualization DC we focus on three themes: The Message, The Process, The Psychology. In other words, ideas and examples of what can be communicated, the tools and know-how to get it done, and how best to communicate. … Continue reading → The post Data Visualization: Shiny Democratization appeared first ... [Read more...]

Cool GSS training video! And cumulative file 1972-2012!

March 8, 2013 | andrew

Felipe Osorio made the above video to help people use the General Social Survey and R to answer research questions in social science. Go for it! Meanwhile, Tom Smith reports: The initial release of the General Social Survey (GSS), cumulative file for 1972-2012 is now on our website. Codebooks and ... [Read more...]

Visualizing rOpenSci collaboration

March 8, 2013 | Recology - R

We (rOpenSci) have been writing code for R packages for a couple years, so it is time to take a look back at the data. What data you ask? The commits data from GitHub ~ data that records who did what and when. Using the Github commits API we can gather ... [Read more...]

ddply in action

March 7, 2013 | Mark T Patterson

Top Batting Averages Over Time Top Batting Averages Over Time reference:http://www.baseball-databank.org/ ShortI'm going to use plyr and ggplot2 to look at how top batting averages have changed over time First load the data:
options(width = 100)<br>library(ggplot2)<br>
## Warning message: package 'ggplot2' was built under R version 2.14.2<br>
library(plyr)<br><br>data(baseball)<br>head(baseball)<br>
##            id year stint team lg  g  ab  r  h X2b X3b hr rbi sb cs bb so ibb hbp sh sf gidp<br>## 4   ansonca01 1871     1  RC1    25 120 29 39  11   3  0  16  6  2  2  1  NA  NA NA NA   NA<br>## 44  forceda01 1871     1  WS3    32 162 45 45   9   4  0  29  8  0  4  0  NA  NA NA NA   NA<br>## 68  mathebo01 1871     1  FW1    19  89 15 24   3   1  0  10  2  1  2  0  NA  NA NA NA   NA<br>## 99  startjo01 1871     1  NY2    33 161 35 58   5   1  1  34  4  2  3  0  NA  NA NA NA   NA<br>## 102 suttoez01 1871     1  CL1    29 128 35 45   3   7  3  23  3  1  1  0  NA  NA NA NA   NA<br>## 106 whitede01 1871     1  CL1    29 146 40 47   6   5  1  21  2  2  4  1  NA  NA NA NA   NA<br>
It looks like we've loaded the data successfully. Next, ... [Read more...]

geom_point Legend with Custom Colors in ggplot

March 7, 2013 | Mollie

Formerly, I showed how to make line segments using ggplot.Working from that previous example, there are only a few things we need to change to add custom colors to our plot and legend in ggplot.First, we'll add the colors of our choice. I'll do th... [Read more...]

ggplot ggoldy

March 7, 2013 | Andy

One of my graduate students worked some ggplot magic and created an almost Light Bright-esqe plot of our very own Goldy Gopher. She also, thoughtfully, published a tutorial on her blog. Read and enjoy! [visit Rita's blog here] [Read more...]

Stan 1.2.0 and RStan 1.2.0

March 6, 2013 | Bob Carpenter

Stan 1.2.0 and RStan 1.2.0 are now available for download. See: http://mc-stan.org/ Here are the highlights. Full Mass Matrix Estimation during Warmup Yuanjun Gao, a first-year grad student here at Columbia (!), built a regularized mass-matrix estimator. This helps for posteriors with high correlation among parameters and varying scales. We’re ... [Read more...]
1 9 10 11 12 13 14

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)