## odfweave setup and counting logicals

July 12, 2014
By

Two short items in this blogpost. Since it was not obvious how to run odfWeave() in my particular setup, the call I am using. Then there were several people crosstabulating logical vectors, so I wanted to play along, 80 times faster than table().odfWeaveMy particular setup consists of R, 7-zip, libreoffice. Somehow they don't 100% play along when using odfWeave....

## R Notes: vectors

July 12, 2014
By

R is different from C family languages. It has a C syntax, but a Lisp semantics. Programmers from C/C++/Java world would find many usages in R adhoc and need to memorize special cases. This is because they use R from a C's perspective. R is a very...

## Le Monde puzzle [#875]

July 11, 2014
By

I learned something in R today thanks to Le Monde mathematical puzzle: A two-player game consists in A picking a number n between 1 and 10 and B and A successively choosing and applying one of three transforms to the current value of n n=n+1, n=3n, n=4n, starting with B, until n is larger than

## Sometimes I feel (some) need for speed

July 11, 2014
By

I’m the first to acknowledge that most of my code could run faster. The truth of the matter is that, in essence, I write ‘quickies’: code that will run once or twice, so there is no incentive to spend days or hours in shaving seconds of a computation. Most analyses of research data fall in

## IEEE ranks R #9 amongst all languages

July 11, 2014
By

IEEE — the world's largest professional association for the language of technology — recently published its ranking of the popularity of programming languages. The R language comes in at number 9 in the list. The ranking is based on 12 weighted factors, including Google search rankings and trends, social media chatter, aggregator posts (Reddit and Hacker news), social programming...

## Bayesian Wizardry for Muggles

July 11, 2014
By

Monday, I will be giving the closing talk of the R in Insurance Conference, in London, on Bayesian Computations for Actuaries, as to be more specific, Getting into Bayesian Wizardry… (with the eyes of a muggle actuary). The animated version of the slides (since we will spend some time on MCMC algorithm, I thought that animated graphs could be...

## ABC in Cancún

July 10, 2014
By

Here are our slides for the ABC short course Jean-Michel and I give at ISBA 2014 in Cancún next Monday (if your browser can manage Slideshare…) Although I may switch the pictures from Iceland to Mexico, on Sunday, there will be not much change on those slides we both have previously used in previous

## Item Response Theory and Item Information Exploration

July 10, 2014
By

In the following post I will map out some item information functions for item response theory (IRT) models using the common 3 parameter logistic model for binary responses. The model takes three parameters (obviously) which relate to the item features ...

## R 3.1.1 is released (and how to quickly update it on Windows OS)

July 10, 2014
By

R 3.1.1 (codename “Sock it to Me“) was released today! You can get the latest binaries version from here. (or the .tar.gz source code from here). The full list of new features and bug fixes is provided below. Upgrading to R 3.1.1 on Windows If you are using Windows you can easily upgrade to the latest version of R using the

## R 3.1.1 is released

July 10, 2014
By

As announced by Peter Dalgaard for the R Core Team today, R 3.1.1 has been released. Codenamed "Sock it to Me", this is a patch release for R 3.1, and mostly includes minor bug fixes. It also includes some small improvements, including easier access to package help files, improved accuracy when importing data with very large integers, and some...

## Mathematical functions on non-numbers?!

July 10, 2014
By
$Mathematical functions on non-numbers?!$

A question came up about regarding Q1.4.9 in Cherney, Denton, Waldron. Here is a snippet of the original question: Consider …Continue reading →

## How Much Can We Learn from Top Rankings using Nonnegative Matrix Factorization?

July 10, 2014
By

Purchases are choices from available alternatives. Post-purchase, we know what is the most preferred, but all the other options score the same. Regardless of differences in appeal, all the remaining items received the same score of not chosen. A second...

## Syrian Refugee Density in Lebanon

July 10, 2014
By

I've done a few posts on Syria and have used data provided by the UNHCR for different analysis or visualization.  There are several links on their Syrian refugee data portal that communicate the breadth of this crisis numerically and vis...

## Tips & Tricks 4: Reading In Data Files

July 10, 2014
By

Today's exercise is another nice and simple one, and allows you to get used to manipulating datasets in R.Exercise 4 - How to read a file of coordinate data into R and make sure it is numeric.Reading your data files into R  for anal...

## Reflections on John Chambers’ UserR! 2014 Keynote Address

July 10, 2014
By

by Joseph Rickert John Chambers opened UseR! 2014 by describing how the R language grew out of early efforts to give statisticians easier access to high quality statistical software. In 1976 computational statistics was a very active field, but most algorithms were compiled as Fortran subroutines. Building models with this software was not a trivial process. First you had...

## Including GitHub and Bioconductor on Rdocumentation: Technical Details

July 10, 2014
By

In our last blog post we announced the addition of GitHub and  Bioconductor R packages to Rdocumentation. For the more technical amongst you, I’ll give a short, high-level description of what’s under the hood at Rdocumentation. Along with that I’ll zoom in on some of the challenges encountered while adding GitHub and Bioconductor repositories. Rdocumentation

## In case you missed it: June 2014 Roundup

July 9, 2014
By

In case you missed them, here are some articles from June of particular interest to R users: The useR! 2014 conference in Los Angeles opened with 16 tutorials. DataInformed published an article by David Smith on how various companies use R. Joe Rickert reviews the new book "Applied Predictive Modeling" by Max Kuhn and Kjell Johnson, which is rich...

## FRAMA Part IV: Continuing the Long/Short Filter Search

July 9, 2014
By

This post examines an n-day median filter for two desirable properties: robustness to outliers and an inherent trend-confirming lag. While … Continue reading →

## Buster – a new R package for bagging hierarchical clustering

July 9, 2014
By
$Buster – a new R package for bagging hierarchical clustering$

I recently found myself a bit stuck. I needed to cluster some data. The distances between the data points were not representable in Euclidean space so I had to use hierarchical clustering. But then I wanted stable clusters that would retain their shape as I updated the data set with new observations. This I could … Continue reading...

## Recording of OpenCPU talk at #useR2014

July 9, 2014
By

A recording of the useR! 2014 prentation about OpenCPU is now available on Youtube. This talk gives a brief (20 minute) motivation and introduction to some of the high level concepts of the OpenCPU system. The video contains mostly screen recor...

## Can Rcpp fuse ?

July 8, 2014
By

One of the features of Rcpp11 people seemed to like during useR is the fuse function. fuse is somewhat similar to the c function in R. For the purpose of this post, let's simplify what fuse does, and just say that it takes several compatible vector...

## Are Consumer Preferences Deep or Shallow?

July 8, 2014
By

John Hauser, because no one questions his expertise, is an excellent spokesperson for the viewpoint that consumer preferences are real, as presented in his article "Self-Reflection and Articulated Consumer Preferences." Simply stated, preferences are e...

## Speed Tests for Rolling/Running Functions

July 8, 2014
By

I use rolling and running functions almost daily with financial time series. In my post A Whole New World with Chains and Pipes, I made this statement I have noticed that rolling analysis with xts can sometimes be slow. as.matrix is my favorite way to ...

## R Day at Strata NYC

July 8, 2014
By

RStudio will teach the new essentials for doing data science in R at this year’s Strata NYC conference, Oct 15 2014. R Day at Strata is a full day of tutorials that will cover some of the most useful topics in R. You’ll learn how to manipulate and visualize data with R, as well as

## DSC 2014, Day 2

July 8, 2014
By

This will be somewhat shorter summary of the second day of DSC 2014. There was a wider variety of presentations today, but I am only going to discuss those that touched on the main theme of the day, which was … Continue reading →

## Dependencies of popular R packages

July 8, 2014
By

With the growing popularity of R, there is an associated increase in the popularity of online forums to ask questions. One of the most popular sites is StackOverflow, where more than 60 thousand questions have been asked and tagged to be related to R. On the same page, you can also find related tags. Among the top 15 tags...

## meteoForecast, a package to obtain NWP-WRF forecasts in R

The Weather Research and Forecasting (WRF) Model is a numerical weather prediction (NWP) system. NWP refers to the simulation and …Sigue leyendo →

July 8, 2014
By

Recently we released googleVis 0.5.3 on CRAN. The package provides an interface between R and Google Charts, allowing you to create interactive web charts from R.Screen shot of some of the Google ChartsAlthough this is mainly a maintenance release, I'd...