big geo-data visualisations

April 17, 2013
By
big geo-data visualisations

Spotting international conflict is very easy with the GDELT data set, combined with ggplot and R. The simple gif above shows snapshots of Russian/Soviet activity from January 1980 and January 2000. I think it also illustrates how Russia nowadays looks more to the east and the South than during the Cold War. The trend, though...

Read more »

Reinhart & Rogoff: Everyone makes coding mistakes, we need to make it easy to find them + Graphing uncertainty

April 17, 2013
By
Reinhart & Rogoff: Everyone makes coding mistakes, we need to make it easy to find them + Graphing uncertainty

You may have already seen a lot written on the replication of Reinhart & Rogoff’s (R &amp R) much cited 2010 paper done by Herndon, Ash, and Pollin. If you haven’t, here is a round up of some of some of what has been written: Konczal, Yglesias, Krugman, Cowen, Peng,

Read more »

R Color Reference Sheet

April 16, 2013
By
R Color Reference Sheet

R has a built-in collection of 657 colors that you can use in plotting functions by using color names. There are also various facilities to select color sequences more systematically: Color palettes and ramps available in packages RColorBrewer and colorRamps. R base functions colorRamp and colorRampPalette that you can use to create your own color

Read more »

Looking Ahead: Revolution R Enterprise Release 7

April 16, 2013
By

by Thomas Dinsmore Revolution R Enterprise Release 6.2 goes live next week, so naturally our development team is thinking ahead to Release 7, which we plan to release later this year. Some of those enhancements are hush-hush, and we can't talk about them yet. But one of the most important enhancements we've already announced: support for predictive analytics inside...

Read more »

Flotsam 11: mostly on books

April 16, 2013
By
Flotsam 11: mostly on books

‘No estaba muerto, andaba the parranda’† as the song says. Although rather than partying it mostly has been reading, taking pictures and trying to learn how to record sounds. Here there are some things I’ve come across lately. I can’t remember if I’ve recommended Matloff’s The Art of R Programming before; if I haven’t, go

Read more »

Plotting data over a map with R

April 16, 2013
By
Plotting data over a map with R

After searching for a few hours on the web, I’ve been able to get my R code working and plot breast cancer data on a world map. It might not the best looking map possible (R graphics is incredible!), but I am happy with that for now.To produce the map I used the “maps” package available through CRAN repository....

Read more »

UseR! 2013 website at user2013.org

April 16, 2013
By

For reasons beyond my understanding, the user 2013 committee didn’t register a domain name for the website, and the official address of the conference is: http://161.67.142.97/congresos/useR-2013/. Not only is this impossible to remember for humans, but it won’t show up in search engines. So I decided to help them out and invest 8 euro to ...

Read more »

Test Driven Analysis?

April 16, 2013
By
Test Driven Analysis?

At the last LondonR meeting Francine Bennett from Mastodon C shared some of her experience and findings from an analysis of a large prescriptions data set of the UK's national health service (NHS). However, it was her last slide, which I found the most...

Read more »

Is the size of your lm model causing you headaches?

April 15, 2013
By

Normal 0 false false false EN-US X-NONE X-NONE ...

Read more »

RStudio is reminding me of the older Macs

April 15, 2013
By
RStudio is reminding me of the older Macs

The only thing missing is the cryptic ID number.Well, the only bad thing is that I am trying to run a probabilistic graphical model on some real data, and having a crash like this will definitely slow things down.

Read more »

MCMSki IV, Jan. 6-8, 2014, Chamonix (news #5)

April 15, 2013
By
MCMSki IV, Jan. 6-8, 2014, Chamonix (news #5)

More exciting news about MCMSki IV! First thing first, the 16 contributed sessions are now all-set, having gotten the stamp of approval from the scientific committee! Thanks to everyone who submitted a session proposal. (There were so many proposals that we alas had to reject some, as well as every single talk proposal… Sorry people:

Read more »

How long is the average dissertation?

April 15, 2013
By
How long is the average dissertation?

The best part about writing a dissertation is finding clever ways to procrastinate. The motivation for this blog comes from one of the more creative ways I’ve found to keep myself from writing. I’ve posted about data mining in the past and this post follows up on those ideas using a topic that is relevant

Read more »

Unshorten URLs in R

April 15, 2013
By

Well, of course, this tip comes out one week after I needed it. The author uses the RCurl package to request the header of the shortened URL and then parse the "location" parameter on the return. This sort of operation tends to be needed frequently, es...

Read more »

Math symbols in R charts: a cheat sheet

April 15, 2013
By
Math symbols in R charts: a cheat sheet

If you're creating a scientific graphic in the R language, there's a good chance you'll be wanting to include some mathematical symbols somewhere on the chart. You might want to use a symbol like μ as an axis label, annotate a curve with simple math like x2, or even put a complete equation like: in the title. You can...

Read more »

THE FINAL FOUR – Drag Race season 5, episode 11 predictions

April 15, 2013
By
THE FINAL FOUR – Drag Race season 5, episode 11 predictions

We’re in the Final Four now, the actual final four that matters (sorry sports forecasters). Last week, Coco got the chop, which made sense statistically (she had a huge relative risk AND had been the first queen to have had to lipsync four times) and from a narrative standpoint — Alyssa got eliminated the week… Continue reading →

Read more »

Never too experienced to make a basic mistake

April 15, 2013
By

I was one of the 170 or so people at the Data Science hackathon in London over the weekend. As always this was well run by Carlos and his team who kept us fed, watered and connected to the Internet. One of the three challenges involved a dataset containing pairs of Twitter users, A and

Read more »

Mapping the GDELT data (and some Russian protests, too)

April 15, 2013
By
Mapping the GDELT data (and some Russian protests, too)

(This article was first published on Quantifying Memory, and kindly contributed to R-bloggers) In this post I show how to select relevant bits of the GDELT data in R and present some introductory ideas about how to visualise it as a network map. I've included all the code used to generate the illustrations. Because of this, if you here...

Read more »

Stock-picking opportunity and the ratio of variabilities

April 15, 2013
By
Stock-picking opportunity and the ratio of variabilities

How good is the current opportunity to pick stocks relative to the past? Idea The more stocks act differently from each other relative to how volatile they are, the more opportunity there is to benefit by selecting stocks.  This post looks at a particular way of investigating that idea. Data Daily (log) returns of 442 … Continue reading...

Read more »

Simulating the Gambler’s Ruin

April 14, 2013
By
Simulating the Gambler’s Ruin

The gambler’s ruin problem is one where a player has a probability p of winning  and probability q of losing. For example let’s take a skill game where the player x can beat player y with probability 0.6 by getting closer to target. The game play begins with player x being allotted 5 points and player y allotted 10

Read more »

The OpenStreetMap Package Opens Up

April 14, 2013
By
The OpenStreetMap Package Opens Up

A new version of the OpenStreetMap package is now up on CRAN, and should propagate to all the mirrors in the next few days. The primary purpose of the package is to provide high resolution map/satellite imagery for use in your R plots. The package supports base graphics and ggplot2, as well as transformations between spatial coordinate

Read more »

Checking the Goodness of Fit of the Poisson Distribution in R for Alpha Decay by Americium-241

Checking the Goodness of Fit of the Poisson Distribution in R for Alpha Decay by Americium-241

Introduction Today, I will discuss the alpha decay of americium-241 and use R to model the number of emissions from a real data set with the Poisson distribution.  I was especially intrigued in learning about the use of Am-241 in smoke detectors, and I will elaborate on this clever application.  I will then use the Pearson chi-squared

Read more »

Datasets handpicked by students

April 14, 2013
By

I’m often on the hunt for datasets that will not only work well with the material we’re covering in class, but will (hopefully) pique students’ interest. One sure choice is to use data collected from the students, as it is … Continue reading →

Read more »

BayesComp homepage

April 14, 2013
By
BayesComp homepage

Today, the BayesComp section of ISBA launched its website. It is organised as a wiki and members of the section are strongly incited to take part into the construction of the website. To quote from Peter Green’s introduction: This new Wikidot site aims to be a community-edited resource on all aspects of Bayesian computation, available

Read more »

Win Your Snake Draft: Calculating “Value Over Replacement” using R

April 14, 2013
By
Win Your Snake Draft: Calculating “Value Over Replacement” using R

In prior posts, I have demonstrated how to download, calculate, and compare fantasy football projections from ESPN, CBS, and NFL.com and how to calculate players’ risk levels. In this post, I will demonstrate how to win your snake The post Win Your Snake Draft: Calculating "Value Over Replacement" using R appeared first on Fantasy Football Analytics.

Read more »

Win Your Snake Draft: Calculating “Value Over Replacement” using R

April 14, 2013
By
Win Your Snake Draft: Calculating “Value Over Replacement” using R

In prior posts, I have demonstrated how to download, calculate, and compare fantasy football projections from ESPN, CBS, and NFL.com and how to calculate players' risk levels. In this post, I will demonstrate how...

Read more »

PSID data set builder for R

April 14, 2013
By

Economists frequently use public datasets. One frequently used dataset is the Panel Study of Income Dynamics, short PSID, maintained by the Institute of Social Research at the University of Michigan.I'm introducing psidR, which is a small helper packag...

Read more »

Predicting Dichotomous Outcomes I

April 14, 2013
By
Predicting Dichotomous Outcomes I

We are trying to predict a dependent dichotomous variable (male/female, yes/no, like/dislike,etc) with independent “predictor” variables. Let’s say we want to determine whether or not an employee will quit based on the percentage of their tenure spent traveling. We assemble the data from HR and erroneously employ simple linear regression to model the relationship, a

Read more »

Using R — Working with Geospatial Data

April 14, 2013
By
Using R — Working with Geospatial Data

This entry is part 6 of 12 in the series Using RGIS, an acronym that brings joy to some and strikes fear in the heart of those not interested in buying expensive software. Luckily fight or flight can be saved for another day because you …   read more ...

Read more »

Tip: Julia vs. R – introduction videos and more

April 14, 2013
By

Last years UseR 2012 conference in Nashville had an interesting discussion session titled “What other languages should R users know about?“. General consensus was that multilingualism is inevitable in modern computing, and panel members presented various languages that complement R in different ways. Some of the usual suspects included SQL, python, and of course C++, ...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series











Contact us if you wish to help support R-bloggers, and place your banner here.