MLB Pitcher Values

April 18, 2013
By
MLB Pitcher Values

Not the start Blue Jays fans were looking for. With a 6-9 record at the time of publication they already trail Boston (led by reviled ex-BJ manager, John Farrell) by 4.5 games. The batting has been particularly anemic but the pitching – particularly the starting rotation – has also been a concern I have whipped

Read more »

Using ggplot2 to recreate 2012 Best Cities Results

April 18, 2013
By
Using ggplot2 to recreate 2012 Best Cities Results

Data from: http://images.businessweek.com/slideshows/2012-09-26/americas-50-best-citiesThis time, I used ggplot2 to recreate the graphs created previously using Tableau. After all, R and ggplot2 are open source and free. Yes, I could leave a space...

Read more »

Using multilevel models to get accurate inferences for repeated measures ANOVA designs

April 18, 2013
By
Using multilevel models to get accurate inferences for repeated measures ANOVA designs

It is now increasingly common for experimental psychologists (among others) to use multilevel models (also known as linear mixed models) to analyze data that used to be shoe-horned into a repeated measures ANOVA design. Chapter 18 of Serious Stats introduces multilevel models by considering them as an extension of repeated measures ANOVA models that can

Read more »

Examples for sjPlotting functions, including correlations and proportional tables with ggplot #rstats

April 18, 2013
By
Examples for sjPlotting functions, including correlations and proportional tables with ggplot #rstats

Sometimes people ask me how the examples of my plotting functions I show here can be reproduced without having a SPSS data set (or at least, without having the data set I use because it’s not public yet). So I … Weiterlesen →

Read more »

Cut Dates Into Quarters

April 18, 2013
By

Frequently I need to recode a date column to quarters. For example, at Excelsior College we have continuous enrollment so we report new enrollments per quarter. To complicate things a bit, our fiscal year starts in July so that July, August, and September represent the first quarter, January, February, and March are actually the third quarter. But sometimes...

Read more »

CrossFit weights: gender matters less than you’d think

April 17, 2013
By
CrossFit weights: gender matters less than you’d think

Exploring Gaussian Mixture Models Exploring Gaussian Mixture Models This week in the Empirical Research Methods course, we've been talking a lot about measurement error. The idea of having some latent variable of interest, coupled with 'flawed' measures reminded me of a section of Cosma's course I really enjoyed, but haven't gotten a change to go...

Read more »

Banging on the JGBs

April 17, 2013
By

Since I have not posted in quite a while, I wanted to let everyone know that I am still alive and kicking.  The resurrection of excitement (opportunity) in the markets, quarterly reporting cycle, and the overwhelming number of unbelievable R/javascript releases have kept me from writing something good enough to justify a post.   In the markets, Japan and gold...

Read more »

More reasons not to use Excel for modeling

April 17, 2013
By
More reasons not to use Excel for modeling

As if the London Whale nearly bankrupting Chase Bank wasn't lesson enough, we now hear that an Excel error impacted the conclusions of a major economics paper that influenced the recent austerity policies in the US, UK and elsewhere. Matt Frost says enough is enough, and implores to everyone to stop using error-prone point-and-click tools, and instead use a...

Read more »

Mind Reading… What are our customers thinking?

April 17, 2013
By

Normal 0 false false false EN-US X-NONE X-NONE ...

Read more »

Bioconductor looking for Google Summer of Code Applicants

April 17, 2013
By
Bioconductor looking for Google Summer of Code Applicants

Aannounced: the 177 mentoring organizations accepted for 2013’s Google Summer of Code program. We’re proud that Bioconductor is one the organizations chosen. Google Summer of Code is a global program that offers students stipends (USD$5000) to write code for open source projects. We’ve proposed three ideas. Students may also propose their own ideas. Project 1: ExperimentHub AnnotationHub and its supporting packages are primed to support such a project. AnnotationHub provides infrastructure...

Read more »

ggplot dodged vs faceted bar chart

April 17, 2013
By
ggplot dodged vs faceted bar chart

I've been bowling once per year at a charity event for the last few years and have kept track of the outcomes to share my group. I used ggplot2 to create a bar chart for the scores. Below are two graphs, one is dodged, the other is faceted. There's no ...

Read more »

Version 1.2 of devtools released

April 17, 2013
By
Version 1.2 of devtools released

We’re very pleased to announce the release of devtools 1.2. This version continues to make working with packages easier by increasing installation speed (skipping the build step unless local = FALSE), enhancing vignette handling (to support the non-Sweave vignettes available in R 3.0.0), and providing better default compiler flags for C and C++ code. Also

Read more »

Intro R training from RStudio: NYC May 13-14, SF May 20-21 (and discounts)

April 17, 2013
By

At RStudio, we’re hosting our Introduction to R Workshop this May in two locations. As an R-help subscriber, we’re offering 10% off! Intro to data science with R (http://goo.gl/bplg3)   May 13-14 New York City Intro to data science with R (http://goo.gl/VCUFL)  May 20-21 San Francisco Bay Area What will you learn? Practical skills for visualizing, transforming, and modeling data in R....

Read more »

Interview with a forced convert from Matlab to R

April 17, 2013
By
Interview with a forced convert from Matlab to R

Here is an interview with Ron Hochreiter, Assistant Professor at WU Vienna University Economics and Business. In 25 words or less tell us what you do (using German words is cheating). I consider myself as a data scientist (teaching and research) with roots in Mathematical Programming, i.e. Optimization under Uncertainty (Stochastic Programming). You were an The post Interview...

Read more »

big geo-data visualisations

April 17, 2013
By
big geo-data visualisations

Spotting international conflict is very easy with the GDELT data set, combined with ggplot and R. The simple gif above shows snapshots of Russian/Soviet activity from January 1980 and January 2000. I think it also illustrates how Russia nowadays looks more to the east and the South than during the Cold War. The trend, though...

Read more »

Reinhart & Rogoff: Everyone makes coding mistakes, we need to make it easy to find them + Graphing uncertainty

April 17, 2013
By
Reinhart & Rogoff: Everyone makes coding mistakes, we need to make it easy to find them + Graphing uncertainty

You may have already seen a lot written on the replication of Reinhart & Rogoff’s (R &amp R) much cited 2010 paper done by Herndon, Ash, and Pollin. If you haven’t, here is a round up of some of some of what has been written: Konczal, Yglesias, Krugman, Cowen, Peng,

Read more »

R Color Reference Sheet

April 16, 2013
By
R Color Reference Sheet

R has a built-in collection of 657 colors that you can use in plotting functions by using color names. There are also various facilities to select color sequences more systematically: Color palettes and ramps available in packages RColorBrewer and colorRamps. R base functions colorRamp and colorRampPalette that you can use to create your own color

Read more »

Looking Ahead: Revolution R Enterprise Release 7

April 16, 2013
By

by Thomas Dinsmore Revolution R Enterprise Release 6.2 goes live next week, so naturally our development team is thinking ahead to Release 7, which we plan to release later this year. Some of those enhancements are hush-hush, and we can't talk about them yet. But one of the most important enhancements we've already announced: support for predictive analytics inside...

Read more »

Flotsam 11: mostly on books

April 16, 2013
By
Flotsam 11: mostly on books

‘No estaba muerto, andaba the parranda’† as the song says. Although rather than partying it mostly has been reading, taking pictures and trying to learn how to record sounds. Here there are some things I’ve come across lately. I can’t remember if I’ve recommended Matloff’s The Art of R Programming before; if I haven’t, go

Read more »

Plotting data over a map with R

April 16, 2013
By
Plotting data over a map with R

After searching for a few hours on the web, I’ve been able to get my R code working and plot breast cancer data on a world map. It might not the best looking map possible (R graphics is incredible!), but I am happy with that for now.To produce the map I used the “maps” package available through CRAN repository....

Read more »

UseR! 2013 website at user2013.org

April 16, 2013
By

For reasons beyond my understanding, the user 2013 committee didn’t register a domain name for the website, and the official address of the conference is: http://161.67.142.97/congresos/useR-2013/. Not only is this impossible to remember for humans, but it won’t show up in search engines. So I decided to help them out and invest 8 euro to ...

Read more »

Test Driven Analysis?

April 16, 2013
By
Test Driven Analysis?

At the last LondonR meeting Francine Bennett from Mastodon C shared some of her experience and findings from an analysis of a large prescriptions data set of the UK's national health service (NHS). However, it was her last slide, which I found the most...

Read more »

Is the size of your lm model causing you headaches?

April 15, 2013
By

Normal 0 false false false EN-US X-NONE X-NONE ...

Read more »

RStudio is reminding me of the older Macs

April 15, 2013
By
RStudio is reminding me of the older Macs

The only thing missing is the cryptic ID number.Well, the only bad thing is that I am trying to run a probabilistic graphical model on some real data, and having a crash like this will definitely slow things down.

Read more »

MCMSki IV, Jan. 6-8, 2014, Chamonix (news #5)

April 15, 2013
By
MCMSki IV, Jan. 6-8, 2014, Chamonix (news #5)

More exciting news about MCMSki IV! First thing first, the 16 contributed sessions are now all-set, having gotten the stamp of approval from the scientific committee! Thanks to everyone who submitted a session proposal. (There were so many proposals that we alas had to reject some, as well as every single talk proposal… Sorry people:

Read more »

How long is the average dissertation?

April 15, 2013
By
How long is the average dissertation?

The best part about writing a dissertation is finding clever ways to procrastinate. The motivation for this blog comes from one of the more creative ways I’ve found to keep myself from writing. I’ve posted about data mining in the past and this post follows up on those ideas using a topic that is relevant

Read more »

Unshorten URLs in R

April 15, 2013
By

Well, of course, this tip comes out one week after I needed it. The author uses the RCurl package to request the header of the shortened URL and then parse the "location" parameter on the return. This sort of operation tends to be needed frequently, es...

Read more »

Math symbols in R charts: a cheat sheet

April 15, 2013
By
Math symbols in R charts: a cheat sheet

If you're creating a scientific graphic in the R language, there's a good chance you'll be wanting to include some mathematical symbols somewhere on the chart. You might want to use a symbol like μ as an axis label, annotate a curve with simple math like x2, or even put a complete equation like: in the title. You can...

Read more »

THE FINAL FOUR – Drag Race season 5, episode 11 predictions

April 15, 2013
By
THE FINAL FOUR – Drag Race season 5, episode 11 predictions

We’re in the Final Four now, the actual final four that matters (sorry sports forecasters). Last week, Coco got the chop, which made sense statistically (she had a huge relative risk AND had been the first queen to have had to lipsync four times) and from a narrative standpoint — Alyssa got eliminated the week… Continue reading →

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.