Object Orientation in R – Notes from a novice

January 25, 2013
By
Object Orientation in R – Notes from a novice

Having posted some code to Git a few days ago and having been wholly dissatisfied with it, I began to do what I often do with code I don’t like. I started re-writing it bigger and weirder and more philosophically pure. Part of this search for Platonic code lead me to explore object oriented programming

Read more »

Tensor Algebra: Efficient Operations on Multidimensional Arrays with R

January 25, 2013
By

 Multidimensional arrays are ubiquitous. Any complex problem having multivariate observables would easily generate a need to represent corresponding data in multidimensional arrays. Most of the practitioners would choose to apply operations on the...

Read more »

Resolving species names when you have a lot of them

January 25, 2013
By

taxize use case: Resolving species names when you have a lot of them Species names can be a pain in the ass, especially if you are an ecologist. We ecologists aren't trained in taxonomy, yet we often end up with huge species lists. Of course we want to correct any spelling errors in the names, and get the newest...

Read more »

Resolving species names when you have a lot of them

January 25, 2013
By

taxize use case: Resolving species names when you have a lot of them Species names can be a pain in the ass, especially if you are an ecologist. We ecologists aren't trained in taxonomy, yet we often end up with huge species lists. Of course we want to correct any spelling errors in the names, and get the newest...

Read more »

Time series cross-validation 5

January 24, 2013
By
Time series cross-validation 5

The caret package for R now supports time series cross-validation!  (Look for version 5.15-052 in the news file).  You can use the createTimeSlices function to do time-series cross-validation with a fixed window, as well as a growin...

Read more »

local package delays

January 24, 2013
By
local package delays

When Jean-Michel and I left Berlin, a month ago, I really thought we were that close to sending the new edition of Bayesian Core. Alas, we are not done yet for a series of reasons: leaving for India did not give me enough time to complete the help manual, some codes from the original version

Read more »

R PMML Support: Data Transformations

January 24, 2013
By
R PMML Support: Data Transformations

R and PMML Export   R is becoming the tool of choice for many data scientists. It is no wonder that many commercial and open-source statistical tools are also embracing R.Predictive ModelsA set of robust predictive analytic techniques is...

Read more »

Visualizing threaded conversation volume and intensity

January 24, 2013
By
Visualizing threaded conversation volume and intensity

As a researcher interested in information flows in digital environments I’m often interested in finding patterns in social trace data. For this discussion we can think of digital social trace data as the text that people post into threaded topics on forums, like on Reddit or a Wiki Talk page on Wikipedia. One way to

Read more »

Storing a Function in a Separate File in R

January 24, 2013
By

If you're going to be using a function across several different R files, you might want to store the function in its own file.If you want to name the function in its own fileThis is probably the best option in general, if only because you may want to p...

Read more »

Votamatic predicted the election with R

January 24, 2013
By
Votamatic predicted the election with R

While Nate Silver got a lot of the attention for correctly forecasting the US presidential election, other forecasters were just as succesful. Drew Linzer used the R language to build the statistical model behind votamatic.org, and was able to predict the outcome of the election months before most pundits. Drew's model initially relied mostly on fundamental quantities: the president’s...

Read more »

No more ascii-art

January 24, 2013
By
No more ascii-art

At least fourfive R packages will turn your regression models into pretty latex tables: texreg, xtable, apsrtable, memisc, and stargazer.  This is very nice if you happen to be a latex document or its final reader, but it’s not so great if you’re making those models to start with. What if you wanted to see

Read more »

Writing Better Statistical Programs in R

January 24, 2013
By
Writing Better Statistical Programs in R

A while back a friend asked me for advice about speeding up some R code that they’d written. Because they were running an extensive Monte Carlo simulation of a model they’d been developing, the poor performance of their code had become an impediment to their work. After I looked through their code, it was clear

Read more »

Animated graphs, another alternative for Data Visualization

January 24, 2013
By
Animated graphs, another alternative for Data Visualization

The world of Data Visualization offers infinite variants to display our Data. However, there is still some reluctance in exploiting all the possibilities that computers give us nowadays in this field, probably not because of a rejection of novelties but … Continue reading →

Read more »

A corner on convenient data analysis

January 24, 2013
By

Many people are of the opinion that R has a corner on convenient data analysis.  That may or may not be true. But now R literally has a corner that makes data analysis more convenient.  If you have a data frame or a matrix with a few columns, then you can use head and/or tail The post A...

Read more »

Shiny, deSolve and ggplot play nicely together

January 24, 2013
By
Shiny, deSolve and ggplot play nicely together

This is just a quick post (after a long absence) to quickly share my first Shiny attempt. The aim is to make available mechanistic models to coworkers through a web-based interface. I had planned to use RApache and RGTK+, but the arrival of Shiny has ...

Read more »

Working with R2MLwiN Part 1

January 24, 2013
By
Working with R2MLwiN Part 1

Getting started with the R2MLwiN packageWith the release of the R2MLwiN package late 2012, R users have access to another software package for running Bayesian models using Markov chain Monte Carlo (MCMC) methods. R2MLwiN is an R command interface to MLwiN, allowing users to fit multilevel models using MLwiN from within the R environment.To use R2MLwiN, MLwiN needs to...

Read more »

Comparing two data frames with different number of rows

January 24, 2013
By

I posted a question over on StackOverflow on an efficient way of comparing two data frames with the same column structure, but with different rows. What I would like to end up with is an n x m logical matrix where n and m are the number of rows in the first and second data frames, respectively; and...

Read more »

Maps in R: choropleth maps

January 23, 2013
By
Maps in R: choropleth maps

This is the third article of the Maps in R series. After having shown how to draw a map without placing data on it and how to plot point data on a map, in this installment the creation of a … Continue reading →

Read more »

Make plot panels fit the distribution of your data

January 23, 2013
By

I am a big fan of lattice/latticeExtra. In fact, nearly all visualisations I have produced so far make use of this great package. The possibilities for customisation are endless and the amount of flexibility it provides is especially valuable for produ...

Read more »

[R-bloggers]RcmdrPlugin.KMggplot2_0.2-0 is on CRAN now

January 23, 2013
By
[R-bloggers]RcmdrPlugin.KMggplot2_0.2-0 is on CRAN now

I posted a new version of the ”RcmdrPlugin.KMggplot2” package, which is an Rcmdr plug-in for a ”ggplot2” GUI front-end. This package assists you to make ”ggplot2” graphics. RcmdrPlugin.KMggplot2 (CRAN) NEWS Changes in version 0.2-0 (2013-01-23) Added the

Read more »

Rcpp reaches 100 dependents on CRAN

January 23, 2013
By

With the arrival earlier today of the stochvol package onto the CRAN network for R, our Rcpp project reached a new milestone: 100 packages have either a Depends:, Imports: or LinkingTo: statement on it. The full list will always be at the bottom of ...

Read more »

Revolution Newsletter: January 2013

January 23, 2013
By

The most recent edition of the Revolution Newsletter is out. The news section is below, and you can read the full January edition (with highlights from this blog and community events) online. You can subscribe to the Revolution Newsletter to get it monthly via email. Top Innovator, Big Data Technologies. Revolution Analytics is the proud recipient of the Top...

Read more »

7 Functions to do Metric Multidimensional Scaling in R

January 23, 2013
By
7 Functions to do Metric Multidimensional Scaling in R

In this post we will talk about 7 different ways to perform a metric multidimensional scaling in R. Multidimensional Scaling Multidimensional Scaling (MDS), is a set of multivariate data analysis methods that are used to analyze similarities or dissimilarities in data. One of the nice features of MDS is that it allows us to represent … Continue reading...

Read more »

Management of Research Data – a Shell+Python+Excel+R Approach

January 23, 2013
By
Management of Research Data – a Shell+Python+Excel+R Approach

I am a computer science researcher, usually working on both Windows and Linux system. Windows is the place where I do the document work, like reading paper, browsing the internet, writing papers with LaTex… Linux is where I run and generate experimental results. The Chaos After years of messy data management and recent data chaos,

Read more »

Going Beyond Florence Nightingale’s Data Diagram: Did Flo Blow It with Wedges?

January 23, 2013
By
Going Beyond Florence Nightingale’s Data Diagram: Did Flo Blow It with Wedges?

In 2010, I wrote a short blog item about Florence Nightingale the statistician, solely because of its novelty value. I didn't even bother to look closely at the associated graphic she designed, but that's what I intend to do here. In this first installment, I reflect on her famous data visualization by reconstructing it...

Read more »

Linear models summary sheet

January 23, 2013
By

As part of my long slog towards statistical understanding, I started making notes on the very specific topic of linear models. The details are tricky and hard to keep in mind, and it is difficult to go back and forth between books and notes to try to r...

Read more »

SPARQL with R in less than 5 minutes

January 23, 2013
By
SPARQL with R in less than 5 minutes

In this article we’ll get up and running on the Semantic Web in less than 5 minutes using SPARQL with R. We’ll begin with a brief introduction to the Semantic Web then cover some simple steps for downloading and analyzing government data via a SPARQL query with the SPARQL R package. What is the Semantic Web? To newcomers, the...

Read more »

SPARQL with R in less than 5 minutes

January 23, 2013
By
SPARQL with R in less than 5 minutes

In this article we’ll get up and running on the Semantic Web in less than 5 minutes using SPARQL with R. We’ll begin with a brief introduction to the Semantic Web then cover some simple steps for downloading and analyzing government data via a SPARQL query with the SPARQL R package. What is the Semantic Web? To newcomers, the...

Read more »

SPARQL with R in less than 5 minutes

January 23, 2013
By
SPARQL with R in less than 5 minutes

In this article we’ll get up and running on the Semantic Web in less than 5 minutes using SPARQL with R. We’ll begin with a brief introduction to the Semantic Web then cover some simple steps for downloading and analyzing government data via a SPARQL query with the SPARQL R package. What is the Semantic The post SPARQL...

Read more »

Sponsors