More reasons not to use Excel for modeling

April 17, 2013

As if the London Whale nearly bankrupting Chase Bank wasn't lesson enough, we now hear that an Excel error affected the conclusions of a major economics paper that influenced the recent austerity policies in the US, UK and elsewhere. Matt Frost says enough is enough, and implores everyone to stop using error-prone point-and-click tools, and instead use a...

Read more »

Mind Reading… What are our customers thinking?

April 17, 2013

Read more »

Bioconductor looking for Google Summer of Code Applicants

April 17, 2013

Announced: the 177 mentoring organizations accepted for 2013’s Google Summer of Code program. We’re proud that Bioconductor is one of the organizations chosen. Google Summer of Code is a global program that offers students stipends (USD$5000) to write code for open source projects. We’ve proposed three ideas. Students may also propose their own ideas. Project 1: ExperimentHub. AnnotationHub and its supporting packages are primed to support such a project. AnnotationHub provides infrastructure...

Read more »

ggplot dodged vs faceted bar chart

April 17, 2013

I've been bowling once per year at a charity event for the last few years and have kept track of the outcomes to share with my group. I used ggplot2 to create a bar chart for the scores. Below are two graphs: one is dodged, the other is faceted. There's no ...

Read more »

Version 1.2 of devtools released

April 17, 2013

We’re very pleased to announce the release of devtools 1.2. This version continues to make working with packages easier by increasing installation speed (skipping the build step unless local = FALSE), enhancing vignette handling (to support the non-Sweave vignettes available in R 3.0.0), and providing better default compiler flags for C and C++ code. Also

Read more »

Intro R training from RStudio: NYC May 13-14, SF May 20-21 (and discounts)

April 17, 2013

At RStudio, we’re hosting our Introduction to R Workshop this May in two locations. R-help subscribers get 10% off! Intro to data science with R (http://goo.gl/bplg3): May 13-14, New York City. Intro to data science with R (http://goo.gl/VCUFL): May 20-21, San Francisco Bay Area. What will you learn? Practical skills for visualizing, transforming, and modeling data in R....

Read more »

Interview with a forced convert from Matlab to R

April 17, 2013

Here is an interview with Ron Hochreiter, Assistant Professor at WU Vienna University of Economics and Business. In 25 words or less tell us what you do (using German words is cheating). I consider myself a data scientist (teaching and research) with roots in Mathematical Programming, i.e. Optimization under Uncertainty (Stochastic Programming). You were an...

Read more »

big geo-data visualisations

April 17, 2013

Spotting international conflict is very easy with the GDELT data set, combined with ggplot and R. The simple gif above shows snapshots of Russian/Soviet activity from January 1980 and January 2000. I think it also illustrates how Russia nowadays looks more to the east and the south than during the Cold War. The trend, though...

Read more »

Reinhart & Rogoff: Everyone makes coding mistakes, we need to make it easy to find them + Graphing uncertainty

April 17, 2013

You may have already seen a lot written on the replication of Reinhart & Rogoff’s (R&R) much-cited 2010 paper done by Herndon, Ash, and Pollin. If you haven’t, here is a roundup of some of what has been written: Konczal, Yglesias, Krugman, Cowen, Peng,

Read more »

R Color Reference Sheet

April 16, 2013

R has a built-in collection of 657 colors that you can use in plotting functions by using color names. There are also various facilities to select color sequences more systematically: Color palettes and ramps available in packages RColorBrewer and colorRamps. R base functions colorRamp and colorRampPalette that you can use to create your own color

Read more »

Looking Ahead: Revolution R Enterprise Release 7

April 16, 2013

By Thomas Dinsmore. Revolution R Enterprise Release 6.2 goes live next week, so naturally our development team is thinking ahead to Release 7, which we plan to release later this year. Some of the planned enhancements are hush-hush, and we can't talk about them yet. But one of the most important enhancements we've already announced: support for predictive analytics inside...

Read more »

Flotsam 11: mostly on books

April 16, 2013

‘No estaba muerto, andaba de parranda’† as the song says. Although rather than partying it mostly has been reading, taking pictures and trying to learn how to record sounds. Here are some things I’ve come across lately. I can’t remember if I’ve recommended Matloff’s The Art of R Programming before; if I haven’t, go

Read more »

Plotting data over a map with R

April 16, 2013

After searching for a few hours on the web, I’ve been able to get my R code working and plot breast cancer data on a world map. It might not be the best-looking map possible (R graphics is incredible!), but I am happy with that for now. To produce the map I used the “maps” package available through the CRAN repository....

Read more »

UseR! 2013 website at user2013.org

April 16, 2013

For reasons beyond my understanding, the useR! 2013 committee didn’t register a domain name for the website, and the official address of the conference is: http://161.67.142.97/congresos/useR-2013/. Not only is this impossible for humans to remember, but it won’t show up in search engines. So I decided to help them out and invest 8 euros to ...

Read more »

Test Driven Analysis?

April 16, 2013

At the last LondonR meeting, Francine Bennett from Mastodon C shared some of her experience and findings from an analysis of a large prescriptions data set from the UK's National Health Service (NHS). However, it was her last slide that I found the most...

Read more »

Is the size of your lm model causing you headaches?

April 15, 2013

Read more »

RStudio is reminding me of the older Macs

April 15, 2013

The only thing missing is the cryptic ID number. Well, the only bad thing is that I am trying to run a probabilistic graphical model on some real data, and having a crash like this will definitely slow things down.

Read more »

MCMSki IV, Jan. 6-8, 2014, Chamonix (news #5)

April 15, 2013

More exciting news about MCMSki IV! First things first, the 16 contributed sessions are now all set, having gotten the stamp of approval from the scientific committee! Thanks to everyone who submitted a session proposal. (There were so many proposals that we alas had to reject some, as well as every single talk proposal… Sorry people:

Read more »

How long is the average dissertation?

April 15, 2013

The best part about writing a dissertation is finding clever ways to procrastinate. The motivation for this blog comes from one of the more creative ways I’ve found to keep myself from writing. I’ve posted about data mining in the past and this post follows up on those ideas using a topic that is relevant

Read more »

Unshorten URLs in R

April 15, 2013

Well, of course, this tip comes out one week after I needed it. The author uses the RCurl package to request the header of the shortened URL and then parses the "location" field in the response. This sort of operation tends to be needed frequently, es...

Read more »

Math symbols in R charts: a cheat sheet

April 15, 2013

If you're creating a scientific graphic in the R language, there's a good chance you'll want to include some mathematical symbols somewhere on the chart. You might want to use a symbol like μ as an axis label, annotate a curve with simple math like x², or even put a complete equation in the title. You can...

Read more »

THE FINAL FOUR – Drag Race season 5, episode 11 predictions

April 15, 2013

We’re in the Final Four now, the actual final four that matters (sorry sports forecasters). Last week, Coco got the chop, which made sense statistically (she had a huge relative risk AND had been the first queen to have had to lipsync four times) and from a narrative standpoint — Alyssa got eliminated the week… Continue reading →

Read more »

Never too experienced to make a basic mistake

April 15, 2013

I was one of the 170 or so people at the Data Science hackathon in London over the weekend. As always this was well run by Carlos and his team who kept us fed, watered and connected to the Internet. One of the three challenges involved a dataset containing pairs of Twitter users, A and

Read more »

Mapping the GDELT data (and some Russian protests, too)

April 15, 2013

(This article was first published on Quantifying Memory, and kindly contributed to R-bloggers) In this post I show how to select relevant bits of the GDELT data in R and present some introductory ideas about how to visualise it as a network map. I've included all the code used to generate the illustrations. Because of this, if you here...

Read more »

Stock-picking opportunity and the ratio of variabilities

April 15, 2013

How good is the current opportunity to pick stocks relative to the past? Idea: The more stocks act differently from each other relative to how volatile they are, the more opportunity there is to benefit by selecting stocks. This post looks at a particular way of investigating that idea. Data: Daily (log) returns of 442 …

Read more »

Simulating the Gambler’s Ruin

April 14, 2013

The gambler’s ruin problem is one where a player has a probability p of winning and probability q of losing. For example, let’s take a skill game where player x can beat player y with probability 0.6 by getting closer to a target. The game play begins with player x being allotted 5 points and player y allotted 10

Read more »
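For readers who want to try the setup described in the teaser above, here is a minimal sketch in Python (the original post uses R and its code is not reproduced here). It assumes the standard form of the problem: each round, the loser hands one point to the winner, and play stops when either player has no points left. The function names are hypothetical, and the transfer rule is an assumption, since the excerpt is cut off before the rules are fully stated.

```python
import random


def simulate_ruin(p, x_points, y_points, rng):
    """Play one game; return True if player x ends up with all the points.

    Assumption: each round moves exactly one point from loser to winner.
    """
    total = x_points + y_points
    while 0 < x_points < total:
        if rng.random() < p:
            x_points += 1  # x wins the round and takes a point from y
        else:
            x_points -= 1  # x loses the round and gives a point to y
    return x_points == total


def estimate_win_probability(p, x_points, y_points, trials=20000, seed=1):
    """Monte Carlo estimate of the chance that player x ruins player y."""
    rng = random.Random(seed)
    wins = sum(simulate_ruin(p, x_points, y_points, rng) for _ in range(trials))
    return wins / trials


def exact_win_probability(p, x_points, y_points):
    """Closed-form gambler's ruin result, used to sanity-check the simulation."""
    q = 1 - p
    total = x_points + y_points
    if p == q:
        return x_points / total
    r = q / p
    return (1 - r ** x_points) / (1 - r ** total)


if __name__ == "__main__":
    print("simulated:", estimate_win_probability(0.6, 5, 10))
    print("exact:    ", exact_win_probability(0.6, 5, 10))
```

With p = 0.6, 5 points for player x and 10 for player y, the closed-form value is about 0.87, and the simulated estimate should land close to it.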

The OpenStreetMap Package Opens Up

April 14, 2013

A new version of the OpenStreetMap package is now up on CRAN, and should propagate to all the mirrors in the next few days. The primary purpose of the package is to provide high resolution map/satellite imagery for use in your R plots. The package supports base graphics and ggplot2, as well as transformations between spatial coordinate

Read more »

Checking the Goodness of Fit of the Poisson Distribution in R for Alpha Decay by Americium-241


Introduction: Today, I will discuss the alpha decay of americium-241 and use R to model the number of emissions from a real data set with the Poisson distribution. I was especially intrigued to learn about the use of Am-241 in smoke detectors, and I will elaborate on this clever application. I will then use the Pearson chi-squared

Read more »

Datasets handpicked by students

April 14, 2013

I’m often on the hunt for datasets that will not only work well with the material we’re covering in class, but will (hopefully) pique students’ interest. One sure choice is to use data collected from the students, as it is …

Read more »
