jsonlite 0.9.12: now even lighter and faster

September 28, 2014

The jsonlite package implements a robust, high-performance JSON parser and generator for R, optimized for statistical data and the web. This week version 0.9.12 appeared on CRAN, with a completely rewritten JSON parser and more optimized C code for JSON generation. The new parser is based on yajl...

future of computational statistics

September 28, 2014

I am currently preparing a survey paper on the present state of computational statistics, reflecting on the massive evolution of the field since my early Monte Carlo simulations on an Apple //e, which would take a few days to return a curve of approximate expected squared error losses… It seems to me that MCMC is

Row Search in Parallel

September 28, 2014

I’ve always wondered whether row search can be made more efficient if the whole data.frame is split into chunks and the search is then conducted within each chunk in parallel. In the R code below, a comparison is done between the standard row search and the parallel row search with the FOREACH

Back to square one – R and RStudio installation

September 28, 2014

I remember my first experience installing R. Basic installation can be humbling for someone not familiar with mirror networks or file binaries. I remember not knowing the difference between base and contrib… which one to select? The concept of CRAN and mirrors was also new to me. Which location do I choose and are they

Deep Down Below – Using in-database analytics from within Tableau (with MADlib)

September 28, 2014

Introduction: Using Tableau for visualizing all kinds of data is quite a joy, but it’s not that strong on built-in analytics or predictive features. Tableau’s integration of R was a huge step in the right direction (and I love it very much - see here, here and here) but still has some limitations (e.g. no RAWSQL...

Updated dplyr Examples

September 28, 2014

Over the summer I made two posts about using the dplyr package. The first was an example of the dplyr verbs applied to fish data. The second was an example of modifications that I had made to lencat() to work better …

Stage abundances, eigenvector of population matrix

September 28, 2014

The previous article introduced the seasonal matrices and the population growth rate λ of an imaginary annual plant. This article focuses first on the meaning of the eigenvector, and then …

Bayesian models in R

September 28, 2014

There are many ways to run general Bayesian calculations in or from R. The best known are JAGS, OpenBUGS and Stan. Then some time ago Rasmus Bååth had a post, “Three ways to run Bayesian models in R”, in which he mentioned LaplacesDemon (not on CRAN) on top of those. A check of the Bayes task view...

Exploring Mangalyaan tweets with R

September 27, 2014

Mangalyaan is the spacecraft of the Indian Space Research Organisation’s Mars Orbiter Mission, which entered the orbit of Mars last week. There were several tweets on Twitter with the hashtag #Mangalyaan about it last week. I wanted to use R to explore …

Recognizing Patterns in the Purchase Process by Following the Pathways Marked By Others

September 27, 2014

Herbert Simon's "ant on the beach" does not search for food in a straight line because the environment is not uniform: it has pebbles, pools and rough terrain. At least the ant's decision making is confined to the 3-dimensional space defining the beach. C...

A book about some important bits of R

September 27, 2014

I see that Hadley Wickham’s new book, “Advanced R”, is being published in dead tree form and will be available in a month or so. Hadley has generously made the material available online; I quickly skimmed the material a few months ago when I first heard about it and had another skim this afternoon. The main

Canned Regular Expressions: qdapRegex 0.1.2 on CRAN

September 27, 2014

We’re pleased to announce the first CRAN release of qdapRegex! You can read about qdapRegex or skip right to the examples. qdapRegex is a collection of regular expression tools associated with the qdap package that may be useful outside of the context …

Error propagation based on interval arithmetics

September 27, 2014

I added an interval function to my ‘propagate’ package (now on CRAN) that conducts error propagation based on interval arithmetics. It calculates the uncertainty of a model by using interval arithmetics based on (what I call) a “combinatorial sequence grid evaluation” approach, thereby avoiding the classical dependency problem that often inflates the result interval. This

Gender Analysis of Facebook Post Likes

September 27, 2014

A lot of people showed a huge interest in analyzing Facebook data with R. So I decided to write some more tutorials about the possibilities you have with the Rfacebook package created by Pablo Barbera...

FIFA 15 Analysis with R

September 26, 2014

Several months ago, I used R to analyze professional soccer players based on their attributes from the video game, FIFA14. Now that FIFA15 is upon us, let's take a similar look. FIFA 15 is a video game by EA Sports that mimics the experience of managing and playing for a soccer team. The game uses the likenesses and attributes...

Make a KML-File from an OpenStreetMap Trail

September 26, 2014

Ever wished you could use a trail from OSM on your GPS or smartphone? With this neat little R script this can easily be done. You'll just need to search OpenStreetMap for the ID of the trail (way), pass it as an argument to osmar::get_osm, convert to KML and you're good to go! # get OSM...

Packrat presentation at useR! 2014

September 26, 2014

There comes a time in a software toolchain’s lifecycle where the focus shifts from developer...

Police militarization in the US, over time

September 26, 2014

The militarization of local police departments here in the US has been much in the news lately, and in June the New York Times published an in-depth article on how materiel from wars has ended up in the hands of US counties. Besides the traditional reporting, it's a fantastic piece of data journalism: the Times submitted a freedom-of-information request...

Simple R Debugging GUI for Bio7

September 26, 2014

For the next release of Bio7 I implemented a first simple debugging GUI (Graphical User Interface) for R scripts. For the debugging process a change from Rserve to an available Java R console connection in Bio7 is necessary (with Rserve alone a debugging interface wouldn’t be possible). Both connections run in the same process

List of R programmers

September 26, 2014

Hello R people. In December of 2013 I posted a cheap-o wiki-editable (thank you GitHub) contact list which recruiters can use to find you, if they’re looking for R programmers. In what I consider a resounding success, within a few weeks it got onto the first page of Google (thank you GitHub), and...

Overcoming D3 Cartographic Envy With R + ggplot

September 25, 2014

When I used one of the Scotland TopoJSON files for a recent post, it really hit me just how much D3 cartography envy I had/have as an R user. Don’t get me wrong, I can conjure up D3 maps pretty well and the utility of an interactive map visualization goes without saying, but

R and Docker

September 25, 2014

Earlier this evening I gave a short talk about R and Docker at the September Meetup of the Docker Chicago group. Thanks to Karl Grzeszczak for setting the meeting, and for providing a pretty thorough intro talk regarding CoreOS and Docker. My slides...

Google location data — Where I’ve been.

September 25, 2014

A friend who was looking into their Google location data emailed me to ask whether I had ever used a JSON file in R. I said I had not, but I knew there were packages to do such things. The things I sent were things he had already tried,...

Installing dplyr 0.3 on Mac OS X (Mavericks)

September 25, 2014

UPDATE: Per the author, a devtools::install_github("hadley/devtools") should take care of everything you need prior to installing the latest dplyr (though I did not have postgres libs installed and suspect that might still be needed). The R dplyr package just turned 0.3, and to get it working in my development environment (OS X Mavericks) I had to do the following: brew install postgresql...

How to draw venn pie-agram (multi-layer pie chart) in R?

September 25, 2014

I was wondering how to draw a Venn-diagram-like pie chart in R, to show the distribution of my RNA-seq reads mapped onto different annotation regions (e.g. intergenic, intron, exon, etc.). A Google search returns several options, including the nice one...

Top open R jobs (for September 25th 2014)

September 25, 2014

This is the bimonthly R Jobs post (for 2014-09-25), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). After almost 8 months, this is the first time that two weeks have passed without a single new job to share. As compensation, I...

Brazilian Presidential Election

September 25, 2014

Three major polling houses published their polls this week: MDA, Ibope, and Vox Populi. The following numbers incorporate these data. With current data, a runoff between Dilma and Marina seems to be inevitable (.87), though its certainty has decreased from the previous week as the following chart indicates. How to understand the following plots: The …

Regular expressions for everyone else

September 25, 2014

Regular expressions are an amazing tool for working with character data, but they are also painful to read and write. Even after years of working with them, I struggle to remember the syntax for negative lookahead, or which way round the start and end anchor symbols go. Consequently, I’ve created the regex package for human

Estimating Generalization Error with the PRESS statistic

September 25, 2014

As we’ve mentioned on previous occasions, one of the defining characteristics of data science is the emphasis on the availability of “large” data sets, which we define as “enough data that statistical efficiency is not a concern” (note that a “large” data set need not be “big data,” however you choose to define it). In