Uncategorized

Welcome to Simply Statistics 2.0

November 18, 2012 | 0 Comments

Welcome to the re-designed, re-hosted and re-platformed Simply Statistics blog. We have moved the blog over to the WordPress platform to give us some newer features that were lacking over at tumblr. So far the transition has gone okay but … Continue reading → [Read more...]

Sunday Data/Statistics Link Roundup (11/18/12)

November 18, 2012 | 0 Comments

An interview with Brad Efron about scientific writing. I haven’t watched the whole interview, but I do know that Efron is one of my favorite writers among statisticians. Slidify, another approach for making HTML5 slides directly from R.  I love … Continue reading → [Read more...]

Logo Contest Winner

November 16, 2012 | 0 Comments

Congratulations to Bradley Saul, the winner of the Simply Statistics Logo contest! We had some great entries which made it difficult to choose between them. You can see the new logo to the right of our home page or the … Continue reading → [Read more...]

Reproducible Research: With Us or Against Us?

November 15, 2012 | 0 Comments

Last night this article by Chris Drummond of the Canadian National Research Council (Conseil national de recherches Canada) popped up in my Google Scholar alert. The title of the article, “Reproducible Research: a Dissenting Opinion” would seem to indicate that he disagrees … Continue reading → [Read more...]

Some academic thoughts on the poll aggregators

November 8, 2012 | 0 Comments

The night of the presidential elections I wrote a post celebrating the victory of data over punditry. I was motivated by the personal attacks made against Nate Silver by pundits that do not understand Statistics. The post generated a little … Continue reading → [Read more...]

Nate Silver does it again! Will pundits finally accept defeat?

November 6, 2012 | 0 Comments

My favorite statistician did it again! Just like in 2008, he predicted the presidential election results almost perfectly. For those that don’t know, Nate Silver is the statistician that runs the fivethirtyeight blog. He combines data from hundreds of polls, uses … Continue reading → [Read more...]

Sunday Data/Statistics Link Roundup (11/4/12)

November 4, 2012 | 0 Comments

Brian Caffo headlines the WaPo article about massive online open courses. He is the driving force behind our department’s involvement in offering these massive courses. I think this sums it up: `“I can’t use another word than unbelievable,” Caffo said. … Continue reading → [Read more...]

Variable probability Bernoulli outcomes – Fast and Slow

November 1, 2012 | 0 Comments

I am working on a project that requires the generation of Bernoulli outcomes. Typically, I would go about this using the built in sample() function like so: This works great and is fast, even for large n. Problem is, I want to generate each sample with its own unique probability. ... [Read more...]

Stan for Bayesian Analysis

October 23, 2012 | 0 Comments

Bayesian analysis has been growing in popularity among ecologists recently, largely due to accessible books such as Models for Ecological Data: An Introduction, Introduction to WinBUGS for Ecologists, and Bayesian Methods for Ecology. Most ecologists with limited programming background have … Continue reading → [Read more...]

Color Palettes in HCL Space

October 12, 2012 | 0 Comments

This is a quick follow-up to my previous post about Color Palettes in RGB Space. Achim Zeileis had commented that, perhaps, it would be more informative to evaluate the color palettes in HCL (polar LUV) space, as that spectrum more accurately describes how humans perceive color. Perhaps more clear trends ... [Read more...]

Where in the world is R and RStudio

October 1, 2012 | 0 Comments

Using the web logs collected when users download RStudio, we’ve prepared the following two maps showing where RStudio is being used, over the whole globe and just within the continental USA. Obviously this data is somewhat biased, as it reflects the number of downloads of RStudio, rather than the ... [Read more...]

Padding integers for use in filenames

September 29, 2012 | 0 Comments

If you’ve ever written code that generates a whole whack of files, you may have came across the following problem when processing them. Using a naming convention wherein files are numbered will  gum up any ordering which is based on string sorting (ls, for example). What you end up ... [Read more...]

Changes in optimization performance of gcc over time

September 16, 2012 | 0 Comments

The SPEC benchmarks came out a year after the first release of gcc (in fact gcc was and still is one of the programs included in the benchmark). Compiling the SPEC programs using the gcc option -O2 (sometimes -O3) has always been the way to measure gcc performance, but after 25 ... [Read more...]

Descriptive statistics of some Agile feature characteristics

September 2, 2012 | 0 Comments

The purpose of software engineering research is to figure out how software development works so that the software industry can improve its quality/timeliness (i.e., lower costs and improved customer satisfaction). Research is hampered by the fact that companies are not usually willing to make public good quality data ... [Read more...]

Processing Data from a Statistica Worksheet Using R

August 29, 2012 | 0 Comments

Context: I work with data from non-profit organizations, and so a big concern in many of my analyses is if and how much people are donating from one year to the next.  One of the  things I normally like to do … Continue reading → [Read more...]

Heatmap tables with ggplot2

August 20, 2012 | 0 Comments

I wrote before about heatmap tables as a better way of producing frequency or other tables, with a solution which works nicely in latex. It is possible to do them much more easily in ggplot2, like this library(Hmisc) library(ggplot2) library(reshape) data(HairEyeColor) P=t(HairEyeColor[,,2]) Pm=melt(... [Read more...]

Horizon plots with ggplot2, sort-of

August 19, 2012 | 0 Comments

The Timely Portfolio blog via R-bloggers has recently published some interesting entries about the value of horizon plots for visual comparison of a number of time series. Very nice it looks too. You can read more about them here. The trick to understanding them is to imagine that each row ...
[Read more...]

Using R from Inside Statistica

August 17, 2012 | 0 Comments

I’ve been spending a lot of time in the last month or so doing projects at work not statistics related, hence the lack of posts!  In the interim, I had to do some serious research on handling datasets bigger than … Continue reading → [Read more...]
1 2 3 16

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)