What your choice of statistics software says about you

January 3, 2013
By

Sean Taylor, a PhD candidate in Information Systems at NYU’s Stern School of Business, describes the "Statistics Software Signal" and his observation that some software packages are correlated with bad science. While, I don't agree with all of his points (some fine analyses have been done with Stata, for example), I thought this was an interesting take on the...

Read more »

Accessing environments

January 3, 2013
By

Extending R with C++ code by using Rcpp typically involves function calls by leveraging the existing .Call() interface of the R API. Passing values back and forth is then done in manner similar to programming with functions. However, on occassion it i...

Read more »

The GARCH-DCC Model and 2-stage DCC(MVT) estimation.

January 2, 2013
By
The GARCH-DCC Model and 2-stage DCC(MVT) estimation.

This short demonstration illustrates the use of the DCC model and its methods using the rmgarch package, and in particular an alternative method for 2-stage DCC estimation in the presence of the MVT distribution shape (nuisance) parameter. The theoretical background and representation of the model is detailed in the package’s vignette. The dataset and period

Read more »

Genetic Algorithms with gaoptim package

January 2, 2013
By

Two days ago i just submitted my first R package: gaoptim. For my surprise, the next day it was already living on CRAN. In this post i want to show you how to use gaoptim to perform a simple function maximization. This same task could be accomplished ...

Read more »

Data-driven science is a failure of imagination

January 2, 2013
By
Data-driven science is a failure of imagination

Professor Hans Rosling certainly is a remarkable figure. I recommend watching his performances. Especially the BBC's "Joy of Stats" is exemplary. Rosling sells passion for data, visual clarity and great deal of comedy. He represents the data-driven paradigm in science. What…Read more →

Read more »

Let’s Rapplicate!

January 2, 2013
By
Let’s Rapplicate!

It's been a while since you last heard from Rapporter, and we came up with a (hopefully) good excuse for our absence from the blogosphere: Rapplications. To demystify: we developed an API that allows you to create dynamic reports by using the R templates and datasets available on Rapporter. All you need is an account on Rapporter (you can...

Read more »

(Semi-)automating the R markdown to blogger workflow

January 2, 2013
By

In his recent post 100 most read R posts for 2012 (stats from R-bloggers) – big data, visualization, data manipulation, and other languages Tal Galili - the guy behind R-Bloggers - presents his wishlist for 2013. Among other things he states &ldquo...

Read more »

100 most read R posts in 2012 (stats from R-bloggers) – big data, visualization, data manipulation, and other languages

January 2, 2013
By
100 most read R posts in 2012 (stats from R-bloggers) – big data, visualization, data manipulation, and other languages

R-bloggers.com is now three years young. The site is an (unofficial) online journal of the R statistical programming environment, written by bloggers who agreed to contribute their R articles to the site. Last year, I posted on the top 24...

Read more »

R version 3 scheduled for April

January 2, 2013
By

Ringing in the New Year, Peter Dalgaard announced yesterday on behalf of the entire R Core Team that the R language will graduate to Version 3 around April 1. This is only the third time that R has incremented its primary version number. Version 1.0.0 (released on February 29, 2000) was the first version deemed stable for production use....

Read more »

Just another R blog

January 2, 2013
By

New year, new resolutions. This year, as a personal challenge, I decided to create a blog where I could share (and also receive) some tricks and tips about R programming language. The main motivation behind this blog is to learn how to use Knitr (http://yihui.name/knitr/). While I'm very concerned about the importance of...

Read more »

The (near) Future of Data Analysis – A Review

January 2, 2013
By
The (near) Future of Data Analysis – A Review

Sean Murphy co-organizes Data Business DC, among many other things. Hadley Wickham, having just taught workshops in DC for RStudio, shared with the DC R Meetup his view on the future, or at least the near future of Data Analysis. … Continue reading → The post The (near) Future of Data Analysis – A Review appeared first on...

Read more »

The Unravelling of Structured Investment Vehicles or Birthdays

January 2, 2013
By

The best way for me to achieve deep understanding of a theorem is not through lengthy proofs alone, but through practical application/implementation or as they said in the Marine Corps Pract-App. One of the many reasons I love R is the ease to write functions and test results. The 2008 financial crisis was the topic of a recent dinner

Read more »

You can’t spell loss reserving without R

January 2, 2013
By
You can’t spell loss reserving without R

Last year, I spent a morning trying to return to first principles when modeling loss reserves. (Brief aside to non-actuaries: a loss reserve is the financial provision set aside to pay for claims which have either not yet settled, or have not yet been reported. If that doesn’t sound fascinating, this will likely be a

Read more »

Computing for Data Analysis, and Other Free Courses

January 2, 2013
By

Coursera's free Computing for Data Analysis course starts today. It's a four week long course, requiring about 3-5 hours/week. A bit about the course:In this course you will learn how to program in R and how to use R for effective data analysis. Y...

Read more »

R code and data for book “R and Data Mining: Examples and Case Studies”

January 2, 2013
By
R code and data for book “R and Data Mining: Examples and Case Studies”

R code and data for book “R and Data Mining: Examples and Case Studies” are now available at http://www.rdatamining.com/books/rdm/code. An online PDF version of the book (the first 11  chapters only) can also be downloaded at http://www.rdatamining.com/docs. Below are its … Continue reading →

Read more »

NFL Code on Github

January 2, 2013
By
NFL Code on Github

I’ve made some revisions and simplifications to the code to compile NFL data. It’s now all out on Github for anyone to play with in advance of the Superbowl. In the meantime, here’s a lovely picture comparing every team’s offense- as measured by total offensive yards- against their defenders. Note the anemic Chicago offense. https://github.com/PirateGrunt/NFL

Read more »

Packages v. Libraries in R

January 2, 2013
By
Packages v. Libraries in R

In the past I've used the terms "R library" and "R package" synonymously (e.g. this blog post and this paper), but a careful reader has called me out. Mark Sharp notes that there are differences between libraries and packages. Chapter one of the R Manual Writing R Extensions gives the details: A package is a directory of files which I encourage you...

Read more »

The (near) Future of Data Analysis – A Review

January 2, 2013
By
The (near) Future of Data Analysis – A Review

Sean Murphy co-organizes Data Business DC, among many other things. Hadley Wickham, having just taught workshops in DC for RStudio, shared with the DC R Meetup his view on the future, or at least the near future of Data Analysis. Herein lies my notes for this talk, spiffed up into semi-comprehensible language. Please...

Read more »

Producing animated GIFs and Videos

January 2, 2013
By
Producing animated GIFs and Videos

It took me a while to figure out how to use the animation package on my Windows OS. In making an animated GIF, the problem seems to have been quite simple in the end (and I should have been more patient in reading the instructions!) - Following installation of the program ImageMagick, one has...

Read more »

Clone all your gists locally with R

January 2, 2013
By

I really like gists as a quick way to include more lengthly code snippets into my blog posts. However, I am not a git user as such, and so I was quite concerned when I noticed that all my gists on this blog had vanished after Christmas. I suppose this was a result of Github's downtime...

Read more »

Armadillo subsetting

January 2, 2013
By

A StackOverflow question asked how convert from arma::umat to arma::mat. The former is format used for find and other logical indexing. For the particular example at hand, a call to the conv_to converter provided the solution. We rewrite the answer he...

Read more »

Happy International Year of Statistics

January 2, 2013
By
Happy International Year of Statistics

2013 promises to be a great year for all statistics aficionado as today is the first day of the International Year of Statistics. More than 1400 organizations from 108 countries — professional

Read more »

Multiple Classification and Authorship of the Hebrew Bible

January 1, 2013
By
Multiple Classification and Authorship of the Hebrew Bible

Sitting in my synagogue this past Saturday, I started thinking about the authorship analysis that I did using function word counts from texts authored by Shakespeare, Austen, etc.  I started to wonder if I could do something similar with the … Continue reading →

Read more »

Efficiecy of Extracting Rows from A Data Frame in R

January 1, 2013
By
Efficiecy of Extracting Rows from A Data Frame in R

In the example below, 552 rows are extracted from a data frame with 10 million rows using six different methods. Results show a significant disparity between the least and the most efficient methods in terms of CPU time. Similar to the finding in my previous post, the method with data.table package is the most efficient

Read more »

Polarisation and Mobilisation indicators

January 1, 2013
By
Polarisation and Mobilisation indicators

This blog post makes available a set of indicators discussed in a forthcoming edition of Digital Icons. In brief, the script takes a text input and calculates polarisation and mobilisation indexes based on the number of pronouns featured.The hypothesised relationship between pronouns and polarisation is one discussed extensively by critical discourse analysts, social...

Read more »

Standard Normal Variate (SNV: Other way)

January 1, 2013
By

This is another way to pre-treat aspectra set with the SNV math-treatment (Standard Normal Variate). You can see the other one in the post : "Standard Normal Variate (SNV)".In this post, I use the R function "sweep".library(ChemometricsWithR)#in a...

Read more »

Unicode in R packages (not)

January 1, 2013
By

Perhaps you are trying to add your nice new object as data for an R package. But wait. It has  foreign letters in its dimnames, so ’R CMD check’ will certainly complain. What you need is something to turn R’s natural Unicode-processing goodness into a relic from the early days of computing without inadvertently aliasing any words

Read more »

Sugar Functions head and tail

January 1, 2013
By

The R functions head and tail return the first (last) n elements of the input vector. With Rcpp sugar, the functions head and tail work the same way as they do in R. Here we use std::sort from the STL and then tail to return the top n items (items wit...

Read more »

STL for_each and generalized iteration

January 1, 2013
By

The STL contains a very general looping or sweeping construct in the for_each algorith. It can be used with function objects (such as the simple square function used here) but also with custom class which can be used to keep to keep state. #include <...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Dommino data lab

Quantide: statistical consulting and training



http://www.eoda.de







ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.