Samsung Phone Data Analysis Project

March 19, 2013
By
Samsung Phone Data Analysis Project

Below are my findings from the second data analysis project in Dr. Jeffery Leek’s John Hopkins Coursera class. Introduction I used the  “Human Activity Recognition Using Smartphones Dataset” (UCI, 2013) to build a model. This data  was recorded from a Samsung prototype smartphone with a built-in accelerometer. The purpose of my model was to recognize the type

Read more »

R’s 2012 Growth in Capability Exceeds SAS’ All Time Total

March 19, 2013
By
R’s 2012 Growth in Capability Exceeds SAS’ All Time Total

by Robert A. Muenchen I’m slowly gathering all the data needed to update my ongoing article, The Popularity of Data Analysis Software. The section below is the latest installment. Growth in Capability The capability of all the software in this … Continue reading →

Read more »

knitr2wordpress and gradient_cloud Revisited

March 19, 2013
By
knitr2wordpress and gradient_cloud Revisited

This post serves three function: It allows me to revisit an old blogpost It let's me test out the new-ish knitr function knti2wp and RWordPress It enables me to avoid the massive ammount of reading I need to do and … Continue reading →

Read more »

Analyzing Local Data with a Shiny Web App

March 19, 2013
By

A great. recent enhancement for the Shiny App is the ability to upload local files. Now, in addition to users being able to interact with data provided on the host e.g. Soccer Tables or via the web, Wikipedia Search Rates they can use apps to view and analyse their own data I have knocked up

Read more »

EpiWorkshop 2013: DNA methylation analysis in R

March 19, 2013
By

Elemento Lab at Weill Cornell Medical College organized a workshop on Epigenomics. I had the opportunity to give a tutorial on DNA methylation analysis in R. The tutorial demonstrates how to analyze high-throughput bisulfite sequencing d...

Read more »

What’s New in 6.2: Open Source R 2.15.3

March 19, 2013
By

by Thomas Dinsmore Last week, Revolution Analytics released the Limited Availability edition of Revolution R Enterprise Release 6.2. Interest in this new release is high, and we're very pleased with user response. Over the next several weeks, I will share more detailed information about the capabilities included in this new release. Revolution R Enterprise Release 6.2 supports open source...

Read more »

Learning-by-doing: my quest to master ggplot2 (part 1)

March 19, 2013
By

Your browser does not support iframes.

Read more »

Veterinary Epidemiologic Research: GLM – Evaluating Logistic Regression Models (part 3)

March 19, 2013
By
Veterinary Epidemiologic Research: GLM – Evaluating Logistic Regression Models (part 3)

Third part on logistic regression (first here, second here). Two steps in assessing the fit of the model: first is to determine if the model fits using summary measures of goodness of fit or by assessing the predictive ability of the model; second is to deterime if there’s any observations that do not fit the

Read more »

Dealing with different object types in a vector in R

March 19, 2013
By

I came across a little problem while dealing with a vector in R which had one of the most simple solutions. These are, in my opinion, the most annoying problems with the most simple and commonsensical solution. Anyways, yet again Utkarsh comes to rescu...

Read more »

How not to reveal your MySQL DB login/password when sharing code on GitHub or BitBucket?

March 19, 2013
By

Solution: use your ~/.my/cnfInside your ~/.my.cnf file define the connection parameters to your databases. For example, here I define two groups called local and toto:user = rootpassword = ultra_secrethost = localhostuser = capitaine_flamp...

Read more »

googleVis 0.4.2 with support for shiny released on CRAN

March 19, 2013
By

The new version of googleVis 0.4.2 is now available via CRAN. Many thanks to all who provided feedback on version 0.4.0 and particularly to Sebastian Campbell, John Maindonald and Aonan Zhang. As usual, if you find any issues or bugs, please send us an email or add a line to our online issues log.With version...

Read more »

The evolution of EU legislation (graphed with ggplot2 and R)

March 19, 2013
By
The evolution of EU legislation (graphed with ggplot2 and R)

During the last half century the European Union has adopted more than 100 000 pieces of legislation. In this presentation I look into the patterns of legislative adoption over time. I tried to create clear and engaging graphs that provide … Continue reading →

Read more »

Layman’s Random Forests

March 18, 2013
By

I’m not a fan of the Top 40 style content on Quora, but a student in Dr. Leek’s Coursera class shared this absolute gem from Edwin Chen. I have not seen a better explanation: How do random forests work in layman’s terms? Suppose you’re very indecisive, so whenever you want to watch a movie, you ask

Read more »

Review of Mathematica 9 and R-link

March 18, 2013
By

VIDEO TRANSCRIPT: Hello, this is Matt Asher from StatisticsBlog.com. I’m going to be reviewing Mathematica 9, from Wolfram Research. In particular, I’ll be focusing on using it with R and to do Monte Carlo simulations and other statistical work. You can find a full transcript of this video at my blog, including the source code

Read more »

One Pager Performance Report with knitr, R, and a Different Font

March 18, 2013
By

Although I suffer from complete ignorance of typography, with a little help from a post from Hyndsight and post from mages' blog, I wanted to try a different font on the one-pager performance report that we created in Onepager Now with knitR. I do not think Open Sans Light is the best choice for this...

Read more »

R – Simple Recursive XML Parsing

March 18, 2013
By

This is intended for those who are starting out in R and interested in parsing an XML document recursively. It uses DT Lang's XML package.If you want to just read certain types of nodes, then XPATH is great. This document by DT Lang is perfect for that...

Read more »

Baseball Statistics with R – Batting Average

March 18, 2013
By
Baseball Statistics with R – Batting Average

I'm working on a new book about the R programming language. R is a language that is designed for use with statistics and data. I use it to analyze sports and social networking. I thought that it would be fun to write the book focusing on baseball statistics using data from Major League Baseball. This post...

Read more »

Replay of Revolution R Enterprise: 100% R and More

March 18, 2013
By

If you missed last week's broadcast of the webinar Revolution R Enterprise: 100% R and More, I've embedded the replay below. If you're not familiar with the power, productivity and enterprise readiness that Revolution R Enterprise brings to open source R, this is a good place to start. Slides from the webinar and a downloadable video of the replay...

Read more »

RuPaul’s Drag Race season 5 predictions: episode 8

March 18, 2013
By
RuPaul’s Drag Race season 5 predictions: episode 8

Wow, last week’s Drag Race post made the rounds in the stats and Drag Race circles. It was cross-posted to Jezebel and has been getting some pretty high-profile links. A little birdy told me that Ms. Ru herself has read it. I think I can die a happy man knowing that RuPaul has visited Bad… Continue reading →

Read more »

Geometric Random graphs

March 18, 2013
By

Some days ago a friend of mine asked how much i knew about graph-theory. My answer: nothing. Anyway, i was able to read a little bit on Random Geometric graphs, so i came with this little function to help visualize these things: There are some pretty...

Read more »

Which political science journals will have a data policy?

March 18, 2013
By
Which political science journals will have a data policy?

Making available replication materials for the research you do is A Good Thing. It’s also work, and it’s quite easy to never get around to. Certainly I claim no special virtue in this department so I am always happy when there’s an institutional stick to prod my better nature in the right direction. One such institutional

Read more »

Callback functions for GUI widgets

March 18, 2013
By
Callback functions for GUI widgets

Of all the things I dislike about R, one of the biggest is the fact that you can declare a function within the list of arguments to another function. I’ve gotten over it for very minor operations needed by things like lapply, but it can drive me bonkers elsewhere. One such instance is writing an

Read more »

column-store R or: how i learned to stop worrying and love monetdb

March 18, 2013
By

"Combining R's sophisticated calculations and MonetDB's excellent data access performance is a no-brainer. One gets the best of two (open source) worlds with minimal hassle." - Dr. Hannes Mühleisen"oh wow that was fast like a cheetah with a jetpack or something" - anthony damicowhy try monetdb + ra speed test of four analysis commands on sixty-seven million...

Read more »

Veterinary Epidemiologic Research: GLM – Logistic Regression (part 2)

March 17, 2013
By
Veterinary Epidemiologic Research: GLM – Logistic Regression (part 2)

Second part on logistic regression (first one here). We used in the previous post a likelihood ratio test to compare a full and null model. The same can be done to compare a full and nested model to test the contribution of any subset of parameters: Interpretation of coefficients Note: Dohoo do not report the

Read more »

R, where should I start?

March 17, 2013
By

This is a dynamic post which I will continue to update whenever I find something new. Hope you will find the following links useful.Online Courses for Learning the R languageTry R from Code Schoole-Books for Learning the R LanguageR for Beginners ...

Read more »

Comparing ESPN’s, CBS’s, and NFL.com’s Fantasy Football Projections using R

March 17, 2013
By
Comparing ESPN’s, CBS’s, and NFL.com’s Fantasy Football Projections using R

In the future, we will determine how to select the best possible team by maximizing your team's projected points and minimizing its downside risk.  But in order to do this, we will have to rely on our best guess of how many points each player will score.  We will use 2012 projections from ESPN, CBS, and NFL.com and actual...

Read more »

Extracting Information From Objects Using Names()

March 17, 2013
By
Extracting Information From Objects Using Names()

One of the big differences between a language like Stata compared to R is the ability in R to handle many different types of objects at once, and combine them together or pull them apart.  I had a post about objects last year, but I thought I'd sh...

Read more »

Mumbai, Mar 2013 – Portfolio Tutorial

March 17, 2013
By

(This article was first published on Rmetrics blogs, and kindly contributed to R-bloggers) To leave a comment for the author, please follow the link and comment on their blog: Rmetrics blogs. R-bloggers.com offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave,...

Read more »

Variability of garch predictions

March 17, 2013
By
Variability of garch predictions

How variable are garch predictions? Previously There have been several posts on garch, in particular: A practical introduction to garch modeling The components garch model in the rugarch package Both of these posts speak about the two common prediction targets: prediction (of volatility) at the individual times (usually days) term structure prediction — the average … Continue reading...

Read more »

Sponsors