RcppAnnoy 0.0.6

A few days ago, Erik released a new version of his Annoy library -- a small, fast, and lightweight C++ template header library for approximate nearest neighbours -- which now no longer requires Boost. While I don't mind Boost (actually, quite the op...

take those hats off [from R]!

May 4, 2015
By

This is presumably obvious to most if not all R programmers, but I became aware today of a hugely (?) delaying tactic in my R codes. I was working...

Working with “large” datasets, with dplyr and data.table

May 4, 2015
By

A few months ago, I was doing some training on data science for actuaries, and I started to get interesting puzzeling questions. For instance, Fleur was working on telematic...

Call R and Python from base SAS

May 4, 2015
By

Since 2009, it has been possible to call R from SAS programs. However, this integration requires IML, an add-on matrix-object language for SAS which isn't available with all SAS...

using GOSemSim to rank proteins obtained by co-IP

May 4, 2015
By

Co-IP is usually used to identified interactions among specific proteins. It is widely used in detecting protein complex. Unfortunately, an identified protein may not be an interactor, and sometimes...

Geomorph beta in development (2.1.5)

May 3, 2015
By

Dear geomorph users,We've been busy adding some new functions to the forthcoming v.2.1.5, currently in beta stage and available on gitHub (installed using: devtools::install_github("EmSherratt/geomorph",ref = "Develop")). Users be aware that ...

dplyr Tutorial: verbs + split-apply

May 3, 2015
By

At a recent Saint Louis R users meeting I had the pleasure of giving a basic introduction to the awesome dplyr R package. For me, data analysis ubiquitously involves...

Cohort Analysis with Heatmap

Previously I shared the data visualization approach for descriptive analysis of progress of cohorts with the “layer-cake” chart (part I and part II). In this post, I want to share...

Introducing Radiant: A shiny interface for R

May 3, 2015
By

Radiant is a platform-independent browser-based interface for business analytics in R, based on the Shiny package. Key features Explore: Quickly and easily summarize, visualize, and analyze your data ...

Survival Analysis With Generalized Additive Models : Part IV (the survival function)

May 2, 2015
By
$Survival Analysis With Generalized Additive Models : Part IV (the survival function)$

The ability of PGAMs to estimate the log-baseline hazard rate, endows them with the capability to be used as smooth alternatives to the Kaplan Meier curve. If we assume...

Update to Introduction to programming econometrics with R

May 2, 2015
By

This semester I taught a course on applied econometrics with the R programming language. For this, I created a document that I gave to my students and shared online....

Survival Analysis With Generalized Additive Models : Part III (the baseline hazard)

May 2, 2015
By
$Survival Analysis With Generalized Additive Models : Part III (the baseline hazard)$

In the third part of the series on survival analysis with GAMs we will review the use of the baseline hazard estimates provided by this regression model. In contrast...

Survival Analysis With Generalized Models: Part II (time discretization, hazard rate integration and calculation of hazard ratios)

May 2, 2015
By
$Survival Analysis With Generalized Models: Part II (time discretization, hazard rate integration and calculation of hazard ratios)$

In the second part of the series we will consider the time discretization that makes the Poisson GAM approach to survival analysis possible. Consider a set of s individual...

Rcpp 0.11.6

The new release 0.11.5 of Rcpp arrived on the CRAN network for GNU R yesterday; the corresponding Debian package has also been uploaded. Rcpp has become the most popular...

A new minor release 5.100.1 of Armadillo was released by Conrad yesterday. Armadillo is a powerful and expressive C++ template library for linear algebra aiming towards a...

Should I use premium Diesel? Result: No

May 2, 2015
By

A while ago I had a post: 'Should I use premium Diesel? Setup. Since that time the data has been acquired. This post describes the results.DataData is registered by me...

Introducing Radiant: A shiny interface for R

May 1, 2015
By

Radiant is a platform-independent browser-based interface for business analytics in R, based on the Shiny package. Key features Explore: Quickly and easily summarize, visualize, and analyze your data ...

Revolution R Open 8.0.3 now available

May 1, 2015
By

Revolution R Open 8.0.3 is now available for download for Windows, OS X, Red Hat, Ubuntu and OpenSUSE. This release includes seveal new features: it upgrades RRO to the...

RStudio v0.99 Preview: Graphviz and DiagrammeR

May 1, 2015
By

Soon after the announcement of htmlwidgets, Rich Iannone released the DiagrammeR package, which makes it easy to generate graph and flowchart diagrams using text in a Markdown-like syntax. The package...

Survival Analysis With Generalized Additive Models : Part I (background and rationale)

May 1, 2015
By

After a really long break, I’d will resume my blogging activity. It is actually a full circle for me, since one of the first posts that kick started this blog,...

Shiny: Officer Involved Shootings

May 1, 2015
By

US Officer Involved Shootings Mar-Apr 2015 with Shiny Now everyone can be a data analyst with RStudio’s Shiny package. Fellow R programmer and Las Vegas import, Steve Wells, has created...

rstanmulticore: A cross-platform R package to automatically run RStan MCMC chains in parallel

May 1, 2015
By

*** This work has been supported by a grant from the Spencer Foundation (#201400002). The views expressed are those of the author and do not necessarily reflect those of...

How large vectors in R might be stored compactly

April 30, 2015
By

Vectors in R can currently have elements of two sizes — 8-byte double-precision floating-point elements for `numeric’ vectors, or 4-byte elements for `integer’ or `logical’ vectors.  You can also have vectors whose...

Upcoming talks about jsonlite and mongolite

April 30, 2015
By

This summer I will be giving an invited talk at the annual French R Meeting in Grenoble as well as a shorter talk...

Dockerizing a Shiny App

April 30, 2015
By

After a long pause of more than four months, I am finally back to post here. Unfortunately, many commitments prevented me keep posting, but coming back, i changed the...

Le Monde puzzle [#909]

April 30, 2015
By

Another of those “drop-a-digit” Le Monde mathematical puzzle: Find all integers n with 3 or 4 digits an single interior zero digit, such that removing that zero digit produces...

Introduction to Applied Econometrics With R

April 30, 2015
By

I came across a January post from David Smith at Revolution Analytics, in his Revolutions blog. It's titled, An Introduction to Applied Econometrics With...

The First NY R Conference

April 30, 2015
By

by Joseph Rickert Last Friday and Saturday the NY R Conference briefly lit up Manhattan's Union Square neighborhood as the center of the R world. You may have caught...