Combating Multicollinearity by Asking the Right Questions and Uncovering Latent Features

October 26, 2014
By
Combating Multicollinearity  by Asking the Right Questions and Uncovering Latent Features

Overview. When responding to questions about brand perceptions or product feature satisfaction, consumers construct a rating  by relying on their overall satisfaction with the brand or product plus some general category knowledge of how diffi...

Read more »

Quarterback Completion Heatmap Using dplyr

October 26, 2014
By
Quarterback Completion Heatmap Using dplyr

Several months ago, I found Bryan Povlinkski's (really nicely cleaned) dataset with 2013 NFL play-by-play information, based on data released by Brian Burke at Advanced Football Analytics. I decided to browse QB completion rates based on Pass Location (Left, Middle, Right), Pass Distance (Short or Deep), and Down....

Read more »

ALUES: Agricultural Land Use Evaluation System, R package

October 26, 2014
By

Authors:Arnold R. Salvacion                                                                       [email protected] Analysis and Visualization using R (blog)                                          Al-Ahmadgaid B. Asaad (maintainer)[email protected] Land Use Evaluation System (ALUES) is an R package that evaluates land suitability for different crop production. The package is based on the Food and Agriculture Organization (FAO) and the International Rice Research Institute (IRRI) methodology for land evaluation. Development...

Read more »

Visualizing (generalized) linear mixed effects models with ggplot #rstats #lme4

October 26, 2014
By
Visualizing (generalized) linear mixed effects models with ggplot #rstats #lme4

In the past week, colleagues of mine and me started using the lme4-package to compute multi level models. This inspired me doing two new functions for visualizing random effects (as retrieved by ranef()) and fixed effects (as retrieved by fixed()) of (generalized) linear mixed effect models. The upcoming version of my sjPlot package will contain

Read more »

Tuning Laplaces Demon III

October 26, 2014
By
Tuning Laplaces Demon III

This is the third post with LaplacesDemon tuning. same problem, different algorithms. For introduction and other code see this post. The current post takes algorithms Independence Metropolis to Reflective Slice Sampler.Independence MetropolisIndependen...

Read more »

Model Segmentation with Recursive Partitioning

October 26, 2014
By
Model Segmentation with Recursive Partitioning

Read more »

Brazilian latest polls and house effects

October 25, 2014
By
Brazilian latest polls and house effects

The latest polls just released tonight are suggesting a numerical tie between Dilma Rousseff (PT) and Aecio Neves (PSDB) considering the limit of the margin of error. Actually, these polls fired up a possible game-changing for the opposition over the government as some of the polls did capture any impact stimulated by the televised debate … Read More...

Read more »

Call for participation: AusDM 2014, Brisbane, 27-28 November

October 25, 2014
By
Call for participation: AusDM 2014, Brisbane, 27-28 November

********************************************************* 12th Australasian Data Mining Conference (AusDM 2014) Brisbane, Australia 27-28 November 2014 http://ausdm14.ausdm.org/ ********************************************************* The Australasian Data Mining Conference has established itself as the premier Australasian meeting for both practitioners and researchers in data mining. Since AusDM’02 the conference … Continue reading →

Read more »

What are the chances of Dilma tomorrow?

October 25, 2014
By
What are the chances of Dilma tomorrow?

Although polling data are the most common source in an electoral campaign, there are also models that use prediction markets data (trade contracts flow) as the source of information about who is going to win the election. What is the best way of predicting an election is up to debate, but models based on the … Read More...

Read more »

RSQLite 1.0.0

October 25, 2014
By
RSQLite 1.0.0

I’m very pleased to announce a new version of RSQLite 1.0.0. RSQLite is the easiest way to use SQL database from R: library(DBI) # Create an ephemeral in-memory RSQLite database con <- dbConnect(RSQLite::SQLite(), ":memory:") # Copy in the buit-in mtcars data frame dbWriteTable(con, "mtcars", mtcars, row.names = FALSE) #> TRUE # Fetch all results

Read more »

Hungarian RUG guest speakers: Romain Francois and Matt Dowle

October 25, 2014
By
Hungarian RUG guest speakers: Romain Francois and Matt Dowle

The Budapest Users of R Network, which is the only one active Hungarian R User Group so far, was founded a bit more than a year ago, and I was really happy to see more than 200 registrations before its first anniversary. The RUG proved to be active and popular in Hungary, which makes me feel extremely...

Read more »

jsonlite 0.9.13: high performance number formatting

October 24, 2014
By
jsonlite 0.9.13: high performance number formatting

The jsonlite package implements a robust, high performance JSON parser and generator for R, optimized for statistical data and the web. This week version 0.9.13 appeared on CRAN which is the third release in a relatively short period focusing on performance optimization. Fast number formatting Version 0.9.11 and 0.9.12 had already introduced majors...

Read more »

Rocker: Docker containers for R

October 24, 2014
By

If you haven't heard the buzz about Docker but you often need to spin up Linux-based VM's for testing, simulations, etc. then you should check it out. In short, Docker rocks: we use it for testing our Linux-based distros of Revolution R Open. If you want to use R and Docker together, Dirk Eddelbuettel and Carl Boettiger have made...

Read more »

Simple Probs

October 24, 2014
By
Simple Probs

Somebody said me that it'd be really nice to see a posting with simple simulations for the runoff this weekend. Answering such a call, this is the best I could come up with. The following is a highly simplified simulation that does not account for time trends nor for house effects. But it's still theoretically … Read More...

Read more »

R and RStudio incompatibility with Yosemite Mac OS X 10.10

October 23, 2014
By

There is currently a bug (or feature?) in the current version of Yosemite (OS X 10.10) that messes with the passing of environmental variables to programs launched from Finder (as pointed out by Adam Maxwell). Notably, this means that PATH variables are not passed properly to R.app or RStudio.app. You may end up seeing errors

Read more »

How to Download and Run R Scripts from this Site

October 23, 2014
By

This post outlines how to download and run R scripts from this website.  We have many Fantasy Football scripts that show how to download and calculate fantasy projections, determine the riskiness of a player, identify sleepers, The post How to Download and Run R Scripts from this Site appeared first on Fantasy Football Analytics.

Read more »

‘Open sourcing’ microsimulation with R

October 23, 2014
By
‘Open sourcing’ microsimulation with R

These are the slides from a presentation today at the European conference of the IMA, held in Maastricht, 23rd to 24th October, 2014. Microsimulation, as its name suggests, is about modelling things at the individual-level. In practice, this usually means estimating the characteristics of people using statistical or econometric techniques. Microsimulation, as represented by the International Microsimulation Association...

Read more »

Generating secure random numbers with openssl

October 23, 2014
By
Generating secure random numbers with openssl

I started working on a new R package with bindings for OpenSSL. The initial release is now available from CRAN. To install the package on Linux you need libssl-dev (Debian/Ubuntu) or openssl-devel (Fedora, RHEL, CentOS). For Mac and Windows, pr...

Read more »

Feller’s shoes and Rasmus’ socks [well, Karl's actually...]

October 23, 2014
By
Feller’s shoes and Rasmus’ socks [well, Karl's actually...]

Yesterday, Rasmus Bååth posted a very nice blog using ABC to derive the posterior distribution of the total number of socks in the laundry when only pulling out orphan socks and no pair at all in the first eleven draws. Maybe not the most pressing issue for Bayesian inference in the era

Read more »

sjPlot 1.6 – major revisions, anyone for beta testing? #rstats

October 23, 2014
By
sjPlot 1.6 – major revisions, anyone for beta testing? #rstats

In the last couple of weeks I have rewritten some core parts of my sjPlot-package and also revised the package- and online documentation. Most notably are the changes that affect theming and appearance of plots and figures. There’s a new function called sjp.setTheme which now sets theme-options for all sjp-functions, which means you only need

Read more »

Happening just now… 6th Conference of the R Spanish User Community

October 23, 2014
By
Happening just now… 6th Conference of the R Spanish User Community

The R-Spain Conferences have been taking place since 2009 as an expression of the growing interest that R elicits in many fileds. The organisers are the Comunidad R Hispano (R-es). The community supports many groups and initiatives aimed to develop … Sigue leyendo →

Read more »

Introducing Rocker: Docker for R

October 23, 2014
By

You only know two things about Docker. First, it uses Linuxcontainers. Second, the Internet won't shut up about it. -- attributed to Solomon Hykes, Docker CEO So what is Docker? Docker is a relatively new open source application and service, which is seeing interest across a number of areas. It uses recent Linux kernel features (containers, namespaces) to shield processes....

Read more »

A first look at Distributed R

October 23, 2014
By
A first look at Distributed R

by Joseph Rickert One of the most interesting R related presentations at last week’s Strata Hadoop World Conference in New York City was the session on Distributed R by Sunil Venkayala and Indrajit Roy, both of HP Labs. In short, Distributed R is an open source project with the end goal of running R code in parallel on data...

Read more »

Making an R Package to use the HERE geocode API

October 23, 2014
By

HERE is a product by Nokia, formerly called Nokia maps and before that, Ovi maps. It's the result of the acquisition of NAVTEQ in 2007 combined with Plazes and Metacarta, among others. It has a geocoding API, mapping tiles, routing services, and other things. I'm focused on the geocoding service. Under the “Base” license,...

Read more »

Extending methylKit : Extract promoters with differentially methylated CpGs

October 23, 2014
By
Extending methylKit : Extract promoters with differentially methylated CpGs

In my previous post, I wrote about the features of methylKit. Here, I will discuss how to extend bisulfite sequencing data analysis beyond methylKit.Annotation is an important feature of genomic analyses. Coming to bisulfite sequencing analyses such as...

Read more »

Leveraging R for Econ Job Market

October 23, 2014
By
Leveraging R for Econ Job Market

I wanted to describe a little helper I am using to help refine the places I want to apply at since I am going to be on the Economics Job Market this year. The two main websites were job openings are advertised are: EconJobMarket Job Openings for Economists Now JOE has a really nice feature

Read more »

Introducing Rocker: Docker for R

October 23, 2014
By
Introducing Rocker: Docker for R

You only know two things about Docker. First, it uses Linux containers. Second, the Internet won't shut up about it. -- attributed to Solomon Hykes, Docker CEO So what is Docker? Docker is a relatively new open source application and service, which is seeing interest across a number of areas. It uses recent Linux kernel features (containers, namespaces) to shield processes. While its...

Read more »

Why is my OS X Yosemite install taking so long?: an analysis

Why is my OS X Yosemite install taking so long?: an analysis

Why? Since the latest Mac OS X update, 10.10 "Yosemite", was released last Thursday, there have been complaints springing up online of the progress bar woefully underestimating the actual time to complete installation. More specifically, it appeared as if, for a certain group of people (myself included), the installer would stall out at "two minutes »more

Read more »

Sampling Importance Resampling (SIR) and social revolution.

October 22, 2014
By
Sampling Importance Resampling (SIR) and social revolution.

Motivation The purpose of this gallery post is several fold: to demonstrate the use of the new and improved C++-level implementation of R’s sample() function (see here) to demonstrate the Gallery’s new support for images in contributed posts to demonstrate the usefulness of SIR for updating posterior beliefs given a sample from an arbitrary prior distribution Application: Foreign Threats and Social Revolution The...

Read more »