GeoCoding, R, and The Rolling Stones – Part 1

March 20, 2013
By

In this article I discuss a general approach for Geocoding a location from within R, processing XML reports, and using R packages to create interactive maps. There are various ways to accomplish this, though using Google’s GeoCoding service is a good place to start. We’ll also talk a bit about the XML package that is

Read more »

On the acceptance of R

March 20, 2013
By

Some history and a prediction. Past: A discussion broke out on the R-help mailing list in January 2006 about a technical report put out by the statistical computing group at UCLA. The report in question talked mainly about SAS, SPSS and Stata. It talked briefly, and not especially positively, about R. Someone accused...

Read more »

Stan at Google this Thurs and at Berkeley this Fri noon

March 20, 2013
By

Michael Betancourt will be speaking at Google and at the University of California, Berkeley. The Google talk is closed to outsiders (but if you work at Google, you should go!); the Berkeley talk is open to all: Friday March 22, 12:10 pm, Evans Hall 1011. Title of talk: Stan: Practical Bayesian Inference with Hamiltonian Monte...

Read more »

Fifth Torino R net meeting details – and Milano R announcement

March 20, 2013
By

The fifth Torino R net meeting, on 11 Apr 2013 at Campus Luigi Einaudi, Università degli Studi di Torino, will have three presentations: Winning with R (and friends) – How data analysts affect the standings in sports championships, Massimilano Marchi, Regione Emilia-Romagna; Predictive … Continue reading →

Read more »

Decisionstats/OpenCPU interview: R, D3, security, the cloud, and snacks.

March 20, 2013
By

I had the pleasure of being interviewed by Ajay Ohri from decisionstats.com earlier this week. Ajay is a great interviewer and writer and has extensive knowledge and experience on how R fits into the BI tool kit. His book R for Business Analytics (Springer, 2012) is a good read for anyone in industry looking to ...

Read more »

Optimal Meeting Point on the Paris Metro

March 20, 2013
By

tl;dr: Play with the app here. When you live in Paris, chances are your home or workplace is very close to a metro station, so when you want to meet with some friends, you usually end up picking another metro station as a meeting point. Yet, finding the optimal place to meet can easily become a complex problem considering...

Read more »

Behavioral Economics and Beer… highly correlated

March 19, 2013
By

Short: I plot the frequency of Wikipedia searches for “Behavioral Economics” and “Beer” – who knew the correlation would be 0.7! Data reference: Data on any Wikipedia searches (back to 2007) are available at http://glimmer.rstudio.com/pssguy/wikiSearchRates/. The website allows you to download frequency hits per day as a CSV, which is what I've done here....

Read more »

Animating neural networks from the nnet package

March 19, 2013
By

My research has allowed me to implement techniques for visualizing multivariate models in R, and I wanted to share some further techniques I’ve developed beyond those in my previous post. For example, I think a primary obstacle towards developing a useful neural network model is an under-appreciation of the effects model parameters have on model

Read more »

Samsung Phone Data Analysis Project

March 19, 2013
By

Below are my findings from the second data analysis project in Dr. Jeffrey Leek’s Johns Hopkins Coursera class. Introduction: I used the “Human Activity Recognition Using Smartphones Dataset” (UCI, 2013) to build a model. This data was recorded from a Samsung prototype smartphone with a built-in accelerometer. The purpose of my model was to recognize the type

Read more »

R’s 2012 Growth in Capability Exceeds SAS’ All Time Total

March 19, 2013
By

by Robert A. Muenchen. I’m slowly gathering all the data needed to update my ongoing article, The Popularity of Data Analysis Software. The section below is the latest installment. Growth in Capability: The capability of all the software in this … Continue reading →

Read more »

knitr2wordpress and gradient_cloud Revisited

March 19, 2013
By

This post serves three functions: it allows me to revisit an old blog post; it lets me test out the new-ish knitr function knit2wp and RWordPress; it enables me to avoid the massive amount of reading I need to do and … Continue reading →

Read more »

Analyzing Local Data with a Shiny Web App

March 19, 2013
By

A great recent enhancement for the Shiny App is the ability to upload local files. Now, in addition to interacting with data provided on the host (e.g. Soccer Tables) or via the web (e.g. Wikipedia Search Rates), users can use apps to view and analyse their own data. I have knocked up

Read more »

EpiWorkshop 2013: DNA methylation analysis in R

March 19, 2013
By

Elemento Lab at Weill Cornell Medical College organized a workshop on Epigenomics. I had the opportunity to give a tutorial on DNA methylation analysis in R. The tutorial demonstrates how to analyze high-throughput bisulfite sequencing d...

Read more »

What’s New in 6.2: Open Source R 2.15.3

March 19, 2013
By

by Thomas Dinsmore. Last week, Revolution Analytics released the Limited Availability edition of Revolution R Enterprise Release 6.2. Interest in this new release is high, and we're very pleased with user response. Over the next several weeks, I will share more detailed information about the capabilities included in this new release. Revolution R Enterprise Release 6.2 supports open source...

Read more »

Learning-by-doing: my quest to master ggplot2 (part 1)

March 19, 2013
By


Read more »

Veterinary Epidemiologic Research: GLM – Evaluating Logistic Regression Models (part 3)

March 19, 2013
By

Third part on logistic regression (first here, second here). There are two steps in assessing the fit of the model: the first is to determine if the model fits, using summary measures of goodness of fit or by assessing the predictive ability of the model; the second is to determine if there are any observations that do not fit the

Read more »

Dealing with different object types in a vector in R

March 19, 2013
By

I came across a little problem while dealing with a vector in R, one with the simplest of solutions. These are, in my opinion, the most annoying problems: the ones with the simplest, most commonsensical solutions. Anyway, yet again Utkarsh comes to rescu...

Read more »

How not to reveal your MySQL DB login/password when sharing code on GitHub or BitBucket?

March 19, 2013
By

Solution: use your ~/.my.cnf. Inside your ~/.my.cnf file, define the connection parameters to your databases. For example, here I define two groups called local and toto: user = root, password = ultra_secret, host = localhost; user = capitaine_flamp...
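
The excerpt above runs the option-file lines together. As a rough sketch of the layout it describes, assuming the standard MySQL option-file syntax in which each group opens with a bracketed header, the local group would look something like the following. The [local] header is a reconstruction (no brackets appear in the excerpt), and the toto group is left out because the excerpt is cut off partway through it.

```ini
# ~/.my.cnf (sketch; the [local] header is assumed, the values come from the excerpt)
[local]
user = root
password = ultra_secret
host = localhost
```

Because the credentials live in a file in the home directory rather than in the script, code that connects by group name can be shared without exposing the password.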

Read more »

googleVis 0.4.2 with support for shiny released on CRAN

March 19, 2013
By

Version 0.4.2 of googleVis is now available via CRAN. Many thanks to all who provided feedback on version 0.4.0 and particularly to Sebastian Campbell, John Maindonald and Aonan Zhang. As usual, if you find any issues or bugs, please send us an email or add a line to our online issues log. With version...

Read more »

The evolution of EU legislation (graphed with ggplot2 and R)

March 19, 2013
By

During the last half century the European Union has adopted more than 100 000 pieces of legislation. In this presentation I look into the patterns of legislative adoption over time. I tried to create clear and engaging graphs that provide … Continue reading →

Read more »

Layman’s Random Forests

March 18, 2013
By

I’m not a fan of the Top 40 style content on Quora, but a student in Dr. Leek’s Coursera class shared this absolute gem from Edwin Chen. I have not seen a better explanation: How do random forests work in layman’s terms? Suppose you’re very indecisive, so whenever you want to watch a movie, you ask

Read more »

Review of Mathematica 9 and R-link

March 18, 2013
By

VIDEO TRANSCRIPT: Hello, this is Matt Asher from StatisticsBlog.com. I’m going to be reviewing Mathematica 9, from Wolfram Research. In particular, I’ll be focusing on using it with R and to do Monte Carlo simulations and other statistical work. You can find a full transcript of this video at my blog, including the source code

Read more »

One Pager Performance Report with knitr, R, and a Different Font

March 18, 2013
By

Although I suffer from complete ignorance of typography, with a little help from a post from Hyndsight and a post from mages' blog, I wanted to try a different font on the one-pager performance report that we created in Onepager Now with knitR. I do not think Open Sans Light is the best choice for this...

Read more »

R – Simple Recursive XML Parsing

March 18, 2013
By

This is intended for those who are starting out in R and interested in parsing an XML document recursively. It uses DT Lang's XML package. If you want to just read certain types of nodes, then XPath is great. This document by DT Lang is perfect for that...

Read more »

Baseball Statistics with R – Batting Average

March 18, 2013
By

I'm working on a new book about the R programming language. R is a language that is designed for use with statistics and data. I use it to analyze sports and social networking. I thought that it would be fun to write the book focusing on baseball statistics using data from Major League Baseball. This post...

Read more »

Replay of Revolution R Enterprise: 100% R and More

March 18, 2013
By

If you missed last week's broadcast of the webinar Revolution R Enterprise: 100% R and More, I've embedded the replay below. If you're not familiar with the power, productivity and enterprise readiness that Revolution R Enterprise brings to open source R, this is a good place to start. Slides from the webinar and a downloadable video of the replay...

Read more »

RuPaul’s Drag Race season 5 predictions: episode 8

March 18, 2013
By

Wow, last week’s Drag Race post made the rounds in the stats and Drag Race circles. It was cross-posted to Jezebel and has been getting some pretty high-profile links. A little birdy told me that Ms. Ru herself has read it. I think I can die a happy man knowing that RuPaul has visited Bad… Continue reading →

Read more »

Geometric Random graphs

March 18, 2013
By

Some days ago a friend of mine asked how much I knew about graph theory. My answer: nothing. Anyway, I was able to read a little bit on random geometric graphs, so I came up with this little function to help visualize these things: There are some pretty...

Read more »

Which political science journals will have a data policy?

March 18, 2013
By

Making available replication materials for the research you do is A Good Thing. It’s also work, and it’s quite easy to never get around to. Certainly I claim no special virtue in this department so I am always happy when there’s an institutional stick to prod my better nature in the right direction. One such institutional

Read more »
