R, D3.js and SNA Course

I took the SNA course by Lada Adamic on Coursera. It's a super interesting course. In fact, I had been using networks only as a visualization tool, which is a little embarrassing, because there is more to them, a lot more. You can detec...

Read more »

R/Finance 2013 Is Coming Quickly…

May 5, 2013

There are about two weeks remaining until R/Finance 2013, being held on May 17th and 18th at UIC in Chicago. Make sure you register beforehand to ensure you have a spot, and yes, you do want to come to the conference dinner on Friday. I am particularly excited about the lineup of keynotes

Read more »

Simulation shows gain of clmm over ANOVA is small

May 5, 2013

After setting up a simulation in the last post, it is now time to look at how the models compare. To my disappointment, with my simple simulations of assessors' behavior the gain is minimal. Unfortunately, the simulation took much more time than I ...

Read more »

Volatility Regimes: Part 2

Adam Duncan from January, 2013. Also available on R-bloggers.com. Strategy Implications: In this part of the volatility regimes analysis, we’ll use the regime identification framework established in part 1 to draw conclusions about which strategies work best in each regime. That should prove useful to us and goes a long way to answering the question, “What strategies should I be...

Read more »

Quandl Package – 5,000,000 free datasets at the tip of your fingers!

May 5, 2013

Yes, you read that correctly, and no, Quandl (http://www.quandl.com/) did not pay me anything. Quandl is a new database management tool which seeks to become the place to find datasets. They boast of having over 5x10^6 data sets available t...

Read more »

AIC & BIC vs. Crossvalidation

May 4, 2013

Model selection is a process of seeking the model in a set of candidate models that gives the best balance between model fit and complexity (Burnham & Anderson 2002). I have always used AIC for that. But you can also…

Read more »

A Prototype of Monotonic Binning Algorithm with R

May 4, 2013

I’ve been asked many times if I have a piece of R code implementing the monotonic binning algorithm, similar to the one that I developed with SAS (http://statcompute.wordpress.com/2012/06/10/a-sas-macro-implementing-monotonic-woe-transformation-in-scorecard-development) and with Python (http://statcompute.wordpress.com/2012/12/08/monotonic-binning-with-python). Today, I finally had time to draft a quick prototype with 20 lines of R code, which is, however, barely usable without the

Read more »

Backporting R 3.0.0 to Quantal, Precise, and Lucid

May 4, 2013

Today (May 4, 2013) I will begin the process of backporting R 3.0.0 to Quantal, Precise, and Lucid. This will include all the recommended packages and the packages for R found in the universe repository for Ubuntu. Things to keep in mind: If you do...

Read more »

LaTeX in R graphs

May 3, 2013

A nice post was recently published on the rsnippets blog, about the tikzDevice R package. This package is indeed awesome, even though it has been removed from the CRAN website. Of course, it can be downloaded from the archive folder, on http://cran.r-project.org/…, but also (for a more recent version) on http://download.r-forge.r-project.org/…. But first, it is necessary to install...

Read more »

Animation, from R to LaTeX

May 3, 2013

Just a short post, to share some code used to generate animated graphs with R. Assume that we would like to illustrate the law of large numbers, and the convergence of the average value from a binomial sample. We can generate samples using
> n=200
> k=1000
> set.seed(1)
> X=matrix(sample(0:1,size=n*k,replace=TRUE),n,k)
Each row will be a trajectory of heads and...

Read more »

Old Post with New d3 Life–GARCH and MA Performance

May 3, 2013

Parallel coordinates become much more useful when they are interactive, so I recreated one of my favorite blog posts, "Trend is Not Your Friend" Applied to 48 Industries, and converted the chart to a living, breathing d3 parallel coordinates chart courtesy ...

Read more »

Extending RevoScaleR for Mining Big Data – Naive Bayes

May 3, 2013

by Derek McCrae Norton, Senior Sales Engineer. In this third installment (following part 1 and part 2) of Extending RevoScaleR for Mining Big Data, we look at how to use the building blocks provided by RevoScaleR to create a Naive Bayes model. Motivation: Fit a Naive Bayes model to big data. Naive Bayes is a simple probabilistic classifier based...

Read more »

All About Spherically Distributed Regression Errors

May 2, 2013

This post is based on a handout that I use for one of my courses, and it relates to the usual linear regression model, y = Xβ + ε. In our list of standard assumptions about the error term in this linear multiple regression...

Read more »

Improved R Profiling Summaries

May 2, 2013

In my last post I mentioned that I had improved on R’s summaryRprof() function with a custom function called proftable(). I’ve updated proftable() to take advantage of R 3.0.0’s ability to record line numbers while profiling. I’ve put it on github – you can get it there or below. proftable reads in a file generated by...

Read more »

…learning LaTeX, from scratch!

May 2, 2013

“LaTeX is a high-quality typesetting system; it includes features designed for the production of technical and scientific documentation, and is the de facto standard for the communication and publication of scientific documents.” It is also… Free and Open. Specially …

Read more »

How R Grows

May 2, 2013

by Joseph Rickert. Saturday morning I was drinking my coffee wondering how much effort goes into R worldwide. (It’s my job.) I noticed that there were 4469 packages on CRAN, and it occurred to me that tabulating the packages by publication date would give some indication of how much effort is being expended to improve packages and keep them...

Read more »

Changing The Presidential Election with R in the Browser

May 2, 2013

After I finished with the tutorial post d3 <- R with rCharts and slidify and then saw R creates d3/javascript charts in Ipython Style Notebook, a light clicked.  I could finally answer the lingering question I have had ever since I saw the NYT ...

Read more »

Do Torontonians Want a New Casino? Survey Analysis Part 1

May 2, 2013

Toronto City Council is in the midst of a very lengthy process of considering whether or not to allow the OLG to build a new casino in Toronto, and where. The process started in November of 2012, and set …

Read more »

Why Blog?

May 2, 2013

The Blog Review Process: A series of events in my life has led me to reconsider the value of blogging. The Back Story. Short story: I got fired. Long story: Recently I was hired to write occasional blog posts for Quandl. They probably figured that due to m...

Read more »

Writing from R to Excel with xlsx

May 1, 2013

Paul Teetor, who is doing yeoman’s duty as one of the organizers of the Chicago R User Group (CRUG), asked recently if I would do a short presentation about a “favorite package”. I picked xlsx, one of the many packages that provide a bridge between spreadsheets and R. Here are the slides from my presentation

Read more »

NYT uses R to investigate NFL draft picks

May 1, 2013

Last week, the New York Times published online an interactive tool to explore NFL draft picks, revealing that there's not much relationship between an early pick and the star performers in the season. Kevin Quealy, graphics editor at the NYT, detailed the process behind creating this graphic on his chartsnthings blog. He and others on the graphics...

Read more »

TV shows rated by episode as a Shiny App

May 1, 2013

A few days ago there was an interesting R-based article by diffuseprior on the decline and fall in the quality of The Simpsons. The author scraped results from GEOS, an online survey of TV programs, and applied the R package changepoint to offer an analysis of the show over time. This seemed a candidate

Read more »

R for dummies

May 1, 2013

I already mentioned R for dummies a while ago on the ‘Og and never got around to reading it from cover to cover. Now that I am reduced to a dummy state with too much free time, I can produce a full review of the book. R for dummies was written by two Belgian statistics

Read more »

…start using Sweave, from scratch!

May 1, 2013

INTRODUCTION: Sweave is nothing more, nothing less than the best way R can connect with a text editor, in this case LaTeX. So you don’t know anything about LaTeX? Neither did I 8 months ago… The hyperlinks in this post …

Read more »

R creates d3/javascript charts in Ipython Style Notebook

May 1, 2013

I am not sure I have ever done a post like this, but I was so blown away I had to do this post simply to embed this amazing YouTube video from the author of the R packages rCharts and slidify. Watch this screencast as he creates d3/raphael charts...

Read more »

Book Review: The R Book, Second Edition (2013)

May 1, 2013

The first edition of The R Book by Michael J. Crawley was an ambitious work, but managed to be slightly rubbish due to the atrocious typographical layout of the original book. The good news is that the new 2nd edition, released in 2013, has a substanti...

Read more »

A Crash Course in R

May 1, 2013

This code has been kindly contributed by Robin Edwards

Read more »

Color analysis of Flickr images

May 1, 2013

Ever since I saw this beautiful color wheel visualizing the colors of Flickr images, I’ve been fascinated with large scale automated image analysis. At the German Market Research association’s conference in late April, I presented some analyses that went in the same direction. On the image above you can see the color

Read more »

A pathological glm() problem that doesn’t issue a warning

May 1, 2013

I know I have already written a lot about technicalities in logistic regression (see for example: How robust is logistic regression? and Newton-Raphson can compute an average). But I just ran into a simple case where R’s glm() implementation of logistic regression seems to fail without issuing a warning message. Yes the data is a

Read more »
