January 2013

Webinar Jan 24: Using R with Hadoop

January 10, 2013 | David Smith

In two weeks (on January 24), Think Big Analytics' Jeffrey Breen will present a new webinar on using R with Hadoop. Here's the webinar description: R and Hadoop are changing the way organizations manage and utilize big data. Think Big Analytics and Revolution Analytics are helping clients plan, build, test and ... [Read more...]

My Personal Intro to F1 Race Statistics

January 10, 2013 | Tony Hirst

One of the many things I keep avoiding is statistics. I’ve never really been convinced about the 5% significance level thing; as far as I can tell, hardly anything that’s interesting normally distributes; all the counting that’s involved just confuses me; and I never really got to grips ... [Read more...]

Formulae in R: ANOVA and other models, mixed and fixed

January 10, 2013 | Will

R’s formula interface is sweet but sometimes confusing. ANOVA is seldom sweet and almost always confusing. And random (a.k.a. mixed) versus fixed effects decisions seem to hurt peoples’ heads too. So, let’s dive into the intersection of these three. I’m aware that there are lots ... [Read more...]

R for actuarial science

January 10, 2013 | arthur charpentier

As mentioned in the Appendix of Modern Actuarial Risk Theory, “R (and S) is the ‘lingua franca’ of data analysis and statistical computing, used in academia, climate research, computer science, bioinformatics, pharmaceutical industry, customer analytics, data mining, finance and by some insurers. Apart from being stable, fast, always up-to-date and ... [Read more...]

Stacked Bar Charts in R

January 10, 2013 | Mollie

Reshape Wide to LongLet's use the Loblolly dataset from the datasets package. These data track the growth of some loblolly pine trees.__ Loblolly[1:10,]   height age Seed1    4.51   3  30115  10.89   ... [Read more...]

Install R in Ubuntu 12.04 Precise Pangolin

January 10, 2013 | Hidden Treasures

R is a free software environment for statistical computing and graphics. It compiles and runs on a wide variety of UNIX platforms, Windows and MacOS. One of my main motivations to install R is Sweave. The Sweave is a literate programming language which integrates LaTeX and R code. The main ... [Read more...]

Optimizing parameters for an oscillator – Video

January 10, 2013 | FelixS

Here’s a video how the modFit function from the FME package optimizes parameters for an oscillation. A Nelder-Mead-optimizer (R function optim) finds the best fitting parameters for an undampened oscillator. Minimum was found after 72 iterations, true parameter eta was -.05: Evolution of parameters in optimization process from Felix Schö... [Read more...]

Reading Codebook Files in R

January 10, 2013 | Jason Bryer

One issue I continuously encounter when starting to work with a new dataset is that of the codebook. In general, I prefer to load a codebook into R like any other data source, specifically as a data frame. And ideally, one data frame to provides the variable names with descriptions ... [Read more...]

Elements of Statistical Learning: free book download

January 9, 2013 | David Smith

The go-to bible for this data scientist and many others is The Elements of Statistical Learning: Data Mining, Inference, and Prediction by Trevor Hastie, Robert Tibshirani, and Jerome Friedman. Each of the authors is an expert in machine learning / prediction, and in some cases invented the techniques we turn to ... [Read more...]

Every NFL punt since 2002

January 9, 2013 | dan

The site reddit told us about data on every single NFL (U.S. National Football League) play since 2002. We read it in and did an analysis of punting. The results are beautiful. The post Every NFL punt since 2002 appeared first on Decision Science News. [Read more...]

Getting Access data into R

January 9, 2013 | Sandy Muspratt

1. IntroductionThese notes give the steps to configure a Windows machine so that R can communicate with Microsoft Access databases. It turns out that the same mechanism can be used to connect with Microsoft Excel workbooks, so the notes include R to Excel communication as well. In R, there are two ... [Read more...]

Revisiting Cronbach 1951 via Simulation with Shiny

January 9, 2013 | jackstat

At the time of the creation of this blog, Cronbach’s 1951 piece on coefficient alpha has 18,132 citations according to google scholar.  The main use of coefficient alpha is to assess internal consistency reliability of a test or survey.   Although it may have been forgotten, the proof Cronbach demonstrated established that ... [Read more...]

Factor Analysis of Baseball’s Hall of Fame Voters

January 9, 2013 | Andrew Landgraf

Factor Analysis of Baseball's Hall of Fame VotersRecently, Nate Silver wrote a post which analyzed how voters who voted for and against Barry Bonds for Baseball's Hall of Fame differed. Not surprisingly, those who voted for Bonds were more likely to vote for other suspected steroids users (like Roger Clemens). ... [Read more...]

WordPress Stats in R

January 9, 2013 | Tony Hirst

A trackback from Martin Hawksey’s recent post on Analysing WordPress post velocity and momentum stats with Google Sheets (Spreadsheet), which demonstrates how to pull WordPress stats into a Google Spreadsheet and generates charts and reports therein, reminded me of the WordPress stats API. So here’s a quick function ... [Read more...]
1 10 11 12 13 14 17

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)