dplyr: A gamechanger for data manipulation in R

August 19, 2014
By

I demonstrate how to use dplyr for data manipulation in R (R code and data on GitHub). I had heard of the package before and finally gave it a try after attending Hadley Wickham's presentation at useR! in LA a couple of months ago. dplyr will change y...

Read more »

GBMs are awesome: Part I

August 19, 2014
By

GBMs have become my favorite type of model over the last two years. In this tutorial, I demonstrate how to use a GBM for binary classification in R (predicting...

Read more »

Recent Articles

August 19, 2014
By

I have uploaded a few papers I have written and presented at some national conferences over the past several years. Currently, all the articles relate to election research.

Read more »

Hijacking R Functions: Changing Default Arguments

August 19, 2014
By

I am working on a package to collect common regular expressions into a canned collection that users can easily use without having to know regexes. The package, qdapRegex, has...

Read more »

Visualize pre-post comparison of intervention #rstats

August 19, 2014
By

My sjPlot-package was just updated on CRAN, introducing a new function called sjp.emm.int to plot estimated marginal means (least-squares means) of linear models with interaction terms. Or: plotting adjusted...

Read more »

Introducing Rfiglet: ASCII logos from the comfort of R

August 19, 2014
By

The Rfiglet Package: For those who don't know what figlet is, it's a command-line utility for creating ASCII logos. Rfiglet, therefore, is a set of R bindings for...

Read more »

Transform point shapefile to SpatStat object

August 19, 2014
By

Today I wanted to do some point pattern analysis in R using the fantastic package spatstat. The problem was that I only had a point shapefile, so I googled a...

Read more »

googleVis 0.5.5 released

August 19, 2014
By

Earlier this week we released googleVis 0.5.5 on CRAN. The package provides an interface between R and Google Charts, allowing you to create interactive web charts from R. This...

Read more »

analyze the programme for the international assessment of adult competencies (piaac) with r

August 19, 2014
By

heaven knows we've all been there: you're in a heated argument with some patriotic zealot who thinks (insert country here) has the best labor force on earth.  you know...

Read more »

Data Cleaning is a critical part of the Data Science process

August 18, 2014
By

A New York Times article yesterday discovers the 80-20 rule: that 80% of a typical data science project is sourcing, cleaning, and preparing the data, while the remaining 20%...

Read more »

A Conversation with Tal Galili at useR! 2014

August 18, 2014
By

“One can acquire everything in solitude except character.” ― Stendhal The Interview Tal Galili is,...

Read more »

Announcing the DSLA Podcast!

August 18, 2014
By

You’ve asked and we’ve listened. The audio content from our DataScience.LA interviews will now be...

Read more »

What are the Odds of an Independent Scotland?

August 18, 2014
By

“For things to remain the same, everything must change.” (Gattopardo by Giuseppe Tomasi di Lampedusa) In less than a month, Scots will decide if they want Scotland tied or...

Read more »

Example 2014.10: Panel by a continuous variable

August 18, 2014
By

In Example 8.40, side-by-side histograms, we showed how to generate histograms for some continuous variable, for each level of a categorical variable in a data set. An anonymous...

Read more »

A Hammer Trading System — Demonstrating Custom Indicator-Based Limit Orders in Quantstrat

August 18, 2014
By

So several weeks ago, I decided to listen in on a webinar (and I myself will be giving one on using quantstrat …

Read more »

Goodbye static graphs, hello shiny, ggvis, rmarkdown (vs JS solutions)

August 18, 2014
By

One of the very exciting and promising developments from RStudio is the rmarkdown/shiny/ggvis combination of tools. We’re on the verge of static graphs and presentations being as old-fashioned as...

Read more »

GEFCom 2014 energy forecasting competition is underway

August 17, 2014
By

GEFCom 2014 is the most advanced energy forecasting competition ever organized, both in terms of the data involved, and in terms of the way the forecasts will be evaluated....

Read more »

Hayward/San Leandro Housing Prices

I’ve done a previous post about the salaries of data scientists, but now I’m going to look at one of the negative sides of all the high salaries generated by the...

Read more »

Changes to FSA — Estimating Abundance

August 17, 2014
By

I mentioned previously that I have been updating the Mark-Recapture vignettes. That has morphed into a document that is an update of the Mark-Recapture Closed and Open vignettes and...

Read more »

21 R navigation tools

August 17, 2014
By

Navigation gets you from where you are to where you want to be. Speaking of navigation, you can jump to selected sections of this post: Navigation; R-bloggers; Task views;...

Read more »

Quicksort speed, just in time compiling and vectorizing

August 17, 2014
By

I was reading the Julia documentation the other day. They do speed comparisons to other languages. Obviously R does not come out very well. The R code for quicksort...

Read more »

A Look at Random Seeds in R… Or: “85, why can’t you be more like 548?”

August 17, 2014
By

Have you ever wondered whether the set.seed() function in R has any quirkiness? This analysis was inspired by a Stack Overflow posting by Wolfgang and I incorporate...

Read more »

A Matrix Powers Package, and Some General Edifying Material on R

August 16, 2014
By

Here I will introduce matpow, a package to flexibly and conveniently compute matrix powers.  But even if you are not interested in matrices, I think many of you will...

Read more »

Search for CRAN, GitHub and BioConductor packages at Rdocumentation.org

August 15, 2014
By

If you're looking for just the right package to solve your R problem, you could always browse through the list of available packages on CRAN. But with almost 6000...

Read more »

Reasonable Inheritance of Cluster Identities in Repetitive Clustering

August 15, 2014
By

… or Inferring Identity from Observations Let’s assume the following application: A conservation organisation starts a project to geographically catalogue the remaining representatives of an endangered plant species. For...

Read more »

Update to resolv (0.1.2) + valgrind and R + Parallel DNS Requests with Revolution R’s ‘foreach’ and `doParallel`

August 15, 2014
By

Thanks to a blog comment by @arj, I finally ran at least one of the new Rcpp-based packages through valgrind (resolv) and, sure enough, there were a...

Read more »

sort.data.frame

August 15, 2014
By

I came across this post on SO, where several solutions to sorting data.frames are presented. It must have been solved a million times, but here's a solution I like...

Read more »

rOpenSci at NESCent Open Tree of Life Hackathon

August 15, 2014
By

The Open Tree of Life project aims to synthesize our combined knowledge of how organisms relate to each other, and make the results available to anyone who wants...

Read more »

Exploring Hotel Review Data from Trip Advisor with R

August 14, 2014
By

I wanted to use R to explore hotel review data. I chose to explore reviews for 3 hotels from Trip Advisor. First, I had to scrape the review data....

Read more »