Accidental aRt

July 30, 2014
By
Accidental aRt

R is a very powerful language for creating custom data visualizations, but during the development process sometimes you make a mistake and things go horribly wrong. But sometime serendipity intervenes, and the (unintended) result can be quite beautiful. Accidental aRt, if you will. Curated by Kara Woo and Erika Mudrak, this fantastic Tumblr captures beautiful but unintended examples from...

Read more »

Refresh Old rCharts+flickr post with httr and pipeR

July 30, 2014
By

The R world keeps moving, and I noticed this old post didn’t work anymore, so I have rewritten it to use Hadley Wickham's httr instead of Rflickr for two reasons: Rflickr is not working for me anymore httr is a very helpful package for navigating the "what was scary to me" world of http and oauth In...

Read more »

ideal point graphics, via d3

July 30, 2014
By
ideal point graphics, via d3

I’ve updated some of the graphical displays of the ideal point estimates I serve up here. I’ve rendered some of these in d3, with some rollover lah-de-dah: (1) 113th House ideal points in a long “caterpillar” format; (2) scatterplot of ideal point against Obama 2012 vote in district. Screenshot of the scatterplot appears below. My

Read more »

drinkR: Estimate your Blood Alcohol Concentration using R and Shiny.

July 30, 2014
By
drinkR: Estimate your Blood Alcohol Concentration using R and Shiny.

Inspired by events that took place at UseR 2014 last month I decided to implement an app that estimates one’s blood alcohol concentration (BAC). Today I present to you drinkR, implemented using R and Shiny, Rstudio’s framework for building web apps using R. So, say that I had a good dinner, drinking a couple of glasses...

Read more »

writing #rstats bindings for bwa-mem, my notebook.

July 30, 2014
By

I wanted to learn how to bind a C library to R, so I've created the following bindings for BWA. My code is available on github at :https://github.com/lindenb/rbwaMost of the C code was inspired from Heng Li's code https://github.com/lh3/bwa/blob/master...

Read more »

Building Interactive Graphs with ggplot2 and Shiny

July 30, 2014
By
Building Interactive Graphs with ggplot2 and Shiny

Some time ago, I was contacted from guys at Packt Publishing. Their just published the Building Interactive Graphs with ggplot2 and Shiny online course and they ask me my (humble) opinion. I am proud of their request, and I will review shortly … Continue reading →

Read more »

“F-bombs” in GitHub Commits (warning: contains profanity)

July 30, 2014
By
“F-bombs” in GitHub Commits (warning: contains profanity)

Warning: this post contains profanity...arguably excessive amounts of it.  If you are a humorless no-fun, you are recommended to proceed no further. Seriously though, the title is quite descriptive of the content of this post.  If you are offended by the use of such language, or if your boss is likely to come peering over your shoulder soon, I...

Read more »

Format Markdown Documents in R

July 29, 2014
By
Format Markdown Documents in R

Have you ever used a markdown file to create an html file? Have you ever wanted to quickly format the subsequent html file to add some color or other aspects? If your answer is yes to both of those questions, this package may be of interest to you. ...

Read more »

Variable Selection in Market Segmentation: Clustering or Biclustering?

July 29, 2014
By
Variable Selection in Market Segmentation: Clustering or Biclustering?

Will you have that segmentation with one or two modes?The data matrix for market segmentation comes to us with two modes, the rows are consumers and the columns are variables. Clustering uses all the columns to transform the two-mode data matrix (row a...

Read more »

Packages vs ProjectTemplate

Packages vs ProjectTemplate tl;dr Imposing a different structure than R packages for distributing R code is a bad idea, especially now that R package tools have gotten to the point where managing a package has become much easier. ProjectTemplate ?? My last two posts (1, 2) provided an argument and an example of why one should use R packages to contain...

Read more »

Revisiting package dependencies

July 29, 2014
By
Revisiting package dependencies

by Andrie de Vries In my previous post I wrote about how to identify and visualize package dependencies. Within hours, Duncan Murdoch (member of R-core) identified some discrepancies between my list of dependencies and the visualisation. Since then, I fixed the dispecrancies. In this blog post I attempt to clarify the issues involved in listing package dependencies. In miniCRAN...

Read more »

2 new R jobs (for July 29th 2014)

July 29, 2014
By
r_jobs

This is the bimonthly R Jobs post (for 2014-07-29), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). If you are a job seekers, please follow the links below to learn more and apply for your job of interest (or visit previous...

Read more »

Fast-track publishing using the new R markdown – a tutorial and a quick look behind the scenes

July 29, 2014
By
Fast-track publishing using the new R markdown – a tutorial and a quick look behind the scenes

The new R Markdown (rmarkdown-package) introduced in Rstudio 0.98.978 provides some neat features by combining the awesome knitr-package and the pandoc-system. The system allows for some neat simplifications of the fast-track-publishing (ftp) idea using so called formats. I've created a new package, the Grmd-package, with an extension to the html_document format, called the docx_document. The formatter allows an almost...

Read more »

Hit and run. Think Bayes!

July 29, 2014
By
Hit and run. Think Bayes!

At the R in Insurance conference Arthur Charpentier gave a great keynote talk on Bayesian modelling in R. Bayes' theorem on conditional probabilities is strikingly simple, yet incredibly thought provoking. Here is an example from Daniel Kahneman to test your intuition. But first I have to start with Bayes' theorem.Bayes' theoremBayes' theorem states that given...

Read more »

analyze the fda adverse event reporting system (faers) with r

July 29, 2014
By

doctors prescribe medications for patients all the time.  all the time.  sometimes the results are beneficial, other times the drug has no discernible effect, but occasionally those substances actually cause harm.  since the drug is alre...

Read more »

Creating an analysis as a package and vignette

Creating an analysis as a package and vignette Following from my last post, I am going to go step by step through the process I use to generate an analysis as a package vignette. This will be an analysis of the tweets from the 2012 and 2014 ISMB conference (thanks to Neil and Stephen for...

Read more »

Volatility Histeresis: A First Attempt

July 28, 2014
By
Volatility Histeresis: A First Attempt

So the last time that a FRAMA strategy was tried with price crossovers, the problem was that due to counter-trending … Continue reading →

Read more »

Chain Operations: An Interesting Feature in dplyr Package

July 28, 2014
By
Chain Operations: An Interesting Feature in dplyr Package

Read more »

Data Caching

July 28, 2014
By

Data caching is not new. It is often necessary to save intermediate data files when the process of loading and/or manipulating data takes a considerable amount of time. This problem is further complicated when working with dynamic data that changes regularly. In these situations it often sufficient to use data that is current with in some time frame (e.g....

Read more »

Clipping spatial data in R

July 28, 2014
By
Clipping spatial data in R

This miniature vignette shows how to clip spatial data based on different spatial objects in R and a ‘bounding box’. Spatial overlays are common in GIS applications and R users are fortunate that the clipping and spatial subsetting functions are mature and fairly fast. We’ll also write a new function called gClip(), that will make clipping by bounding boxes...

Read more »

Comment on Sustainability and innovation in staple crop production in the US Midwest

July 28, 2014
By
Comment on Sustainability and innovation in staple crop production in the US Midwest

After writing a blog post about the paper “Sustainability and innovation in staple crop production in the US Midwest” I decided to submit a formal comment to the International Journal of Agricultural Sustainability in July 2013, which was published today. As far as I know, Heinemann et al. provided a rebuttal to my comments, which

Read more »

jpmml and R (Free Webinar)

July 28, 2014
By
jpmml and R (Free Webinar)

This free, global webinar will provide an introduction to jpmml, the world’s leading open-source PMML scoring engine currently being utilized by companies such as Airbnb to rapidly deploy predictive models into production. Webinar Format: – What is PMML? – Building … Continue reading →

Read more »

The Pay-for-Performance Myth

July 28, 2014
By
The Pay-for-Performance Myth

Last week, Eric Chemi and Ariana Giorgi published an interesting article on “The Pay-for-Performance Myth” With all the public chatter about exorbitant executive compensation and income inequality, it’s useful to look at the relationship between chief executive officer pay and corporate performance. Typically, when the subject of their big pay packages arises, CEOs—usually through their spokespeople—say they are paid for...

Read more »

EARL and other upcoming events

July 28, 2014
By
EARL and other upcoming events

Highlighted EARL As in “Effective Applications of the R Language”. 2014 September 15-17, London. Somehow they gave higher billing to Ben Goldacre than to Pat Burns.  If Obama were coming, they’d probably bill him above me too — and what does he know about R?  In spite of that little glitch, I’m sure it will … Continue reading...

Read more »

Pie Charts. Are they worth the Fight?

July 28, 2014
By
Pie Charts. Are they worth the Fight?

Like Rob, I recently got back from ICOTS. What a great conference. Kudos to everyone who worked hard to organize and pull it off. In one of the sessions I was at, Amelia McNamara (@AmeliaMN) gave a nice presentation about how … Continue reading →

Read more »

Reuters uses R to report on surge in coastal flooding in US

July 28, 2014
By
Reuters uses R to report on surge in coastal flooding in US

As global warming causes sea levels to rise, the risk of flooding for coastal settlements also rises over time. A recent analysis by Reuters find that incidents of coastal flooding along the Eastern seaboard of the United States have surged in recent years as the sea level steadily rises. Flood levels have been exceeded in six eastern cities by...

Read more »

Analyses as Packages

Analyses as Packages TL;DR Instead of writing an analysis as a single or set of R scripts, use a package and include the analysis as a vignette of the package. Read below for the why, the how is in the next post. Analyses and Reports As data science or statistical researchers, we tend to do a lot of analyses, whether for our...

Read more »

a:class <- b

July 28, 2014
By

Usually in strongly typed languages, like C++, the type of a variable comes before the variable, e.g.: int x = 23 ; So x is declared of type int and assignd the value 23. Now, some languages do things differently, e.g. in julia: x::Int8 = 10...

Read more »

Who wants to disrupt R training and R education?

July 28, 2014
By
Who wants to disrupt R training and R education?

Using EdTech applications as an academic, R trainer or training company is no longer a unique selling proposition, but a must-have commodity. In this post, we introduce the new DataCamp course creation tools for academics, trainers and enterprises, and make a call to those interested in using these tools. More information via [email protected] Everyone involved in academic teaching

Read more »