the most patronizing start to an answer I have ever received

April 29, 2015

Another occurrence of a question on X validated where the originator (primitivus petitor) was trying to get an explanation without the proper background in either Bayesian statistics or simulation. The introductory sentence to the question was about “trying to understand how the choice of priors affects a Bayesian model estimated using MCMC”.

Visualizing fits, inference, implications of (G)LMMs with Jaime Ashander

April 29, 2015

A couple of weeks ago at the Davis R Users’ Group, Jaime Ashander gave a presentation on visualizing and diagnosing (G)LMMs in R. Here’s the video: Jaime also wrote up the notes from his talk, including all the code, on his blog here (with the raw R Markdown file on GitHub here). The material...

Turning Data Into Awesome With sqldf and pandasql

April 29, 2015

Both R and Python possess libraries for using SQL statements to interact with data frames. While both languages have native facilities for manipulating data, the sqldf and pandasql libraries provide a simple and elegant interface for conducting tasks using an intuitive framework that’s widely used by analysts. R and sqldf sqldf("SELECT COUNT(*) FROM df2 WHERE

Benchmarks of RRO on OSX and Ubuntu

April 29, 2015

Bay Area engineer Vineet Abraham recently ran some benchmarks for Revolution R Open (RRO) running on Mac OS X and on Ubuntu. Thanks to the multi-threaded processing capabilities of RRO, several operations ran much faster than R downloaded from CRAN, without having to change any code: For the most part, RRO performs significantly faster than standard R both locally...

See R in action at the BUILD conference

April 29, 2015

Build 2015, the Microsoft conference which brings around 5,000 developers to the Moscone Center in San Francisco, begins tomorrow. The conference is sold out, but you can livestream the keynote presentations from buildwindows.com to catch all the big announcements. You can also follow along on Twitter at the #Build2015 hashtag. There will be a major keynote presentation featuring CEO...

Finance-YahooQuote 0.25 hotfix

April 29, 2015

A hotfix release for the Finance-YahooQuote Perl module on CPAN is now available. Yahoo! Finance decided to change the base URL. My thanks to Nicola Chiapolini, who not only noticed but also sent me the one-line patch fixing this: --- YahooQuote.pm~ 2010-03-27 01:44:10.000000000 +0100 +++ YahooQuote.pm ...

Twelve Graphs & Dashboards You Should See On Climate Change, Science, & Public Opinion

April 28, 2015

Plotly has teamed up with The White House on President Obama’s Climate Data Initiative to explore and explain climate trends. This post is our first contribution. You’ll see interactive graphs about: temperature and CO2 (4), climate change & environmental impact (4), attitudes about global warming (3), and a population graph. If you like this post, please share...

choroplethrZip v1.3.0: easier demographics, national maps

April 28, 2015

Introduction choroplethr v3.0 is now available on github. You can get it by typing # install.packages("devtools") library(devtools) install_github('arilamstein/[email protected]') Version 1.3.0 has two new features: Data frame df_zip_demographics contains eight demographic statistics about each ZIP Code Tabulated Area (ZCTA) in the US. Data comes from the 2013 5-year American Community Survey (ACS). Function ?get_zip_demographics will return

RStudio v0.99 Preview: Code Diagnostics

April 28, 2015

In RStudio v0.99 we’ve made a major investment in R source code analysis. This work resulted in significant improvements in code completion and, in the latest preview release, enables a new inline code diagnostics feature that highlights various issues in your R code as you edit. For example, here we’re getting a diagnostic that notes that there is an extra

Winning streaks in baseball

April 28, 2015

How rare are winning streaks in baseball? The post Winning streaks in baseball appeared first on Decision Science News.

Situational Baseball: Analyzing Runs Potential Statistics

April 28, 2015

by Mark Malter. After reading the book Analyzing Baseball with R, by Max Marchi and Jim Albert, I decided to expand on some of their ideas relating to runs created and put them into an R Shiny app. The Server and UI code are linked at the bottom of the Introduction tab. I downloaded the Retrosheet play-by-play data...

RcppTOML 0.0.3: A New Approach to Configuration Files

April 28, 2015

A small project I worked on during the last few weeks has now come together in a new package, RcppTOML, which arrived on CRAN yesterday. It provides R with a reader for TOML files. TOML stands for Tom's Obvious, Minimal Language. And before you roll your eyes, glance at the TOML site. It really is different, and...

Downloading and Visualizing Seismic Events from USGS

April 28, 2015

The unlucky events that took place in Nepal have flooded the web with visualizations of the earthquakes from USGS. They normally visualize earthquakes with a colour scale that depends on the age of the event and a marker size that depends on magnitude. I remembered that some time ago I tested ways for downloading and visualizing data from USG...

R in Insurance 2015 Conference Programme

April 28, 2015

The programme for the 3rd R in Insurance conference is on-line. The event will take place on 29 June 2015 at the University of Amsterdam. Time to register now. Special thanks to our sponsors, without whom the conference wouldn't be possible: CYBAEA, RS...

Some basics of biomaRt

April 27, 2015

One of the commonest bioinformatics questions, at Biostars and elsewhere, takes the form: “I have a list of identifiers (X); I want to relate them to a second set of identifiers (Y)”. HGNC gene symbols to Ensembl Gene IDs, for example. When this occurs I have been known to tweet “the answer is BioMart” (there

I Fought the (distribution) Law (and the Law did not win)

April 27, 2015

A few days ago, I was asked if we should spend a lot of time to choose the distribution we use, in GLMs, for (actuarial) ratemaking. On that topic, I usually claim that the family is not the most important parameter in the regression model. Consider the following dataset > db <- data.frame(x=c(1,2,3,4,5),y=c(1,2,4,2,6)) > plot(db,xlim=c(0,6),ylim=c(-1,8),pch=19) To visualize a regression...
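The claim about the family can be explored with a minimal sketch in base R. This is not the post's own code beyond the `db` data frame quoted in the excerpt; the choice of a common log link and of the Poisson, Gaussian, and Gamma families is an assumption made purely for illustration.

```r
# Data frame quoted in the excerpt above.
db <- data.frame(x = c(1, 2, 3, 4, 5), y = c(1, 2, 4, 2, 6))

# Assumption: compare three families under a shared log link (illustrative choice).
families <- list(
  poisson  = poisson(link = "log"),
  gaussian = gaussian(link = "log"),
  gamma    = Gamma(link = "log")
)

# Fit one GLM per family and collect the fitted means at the observed x values.
fits <- lapply(families, function(fam) glm(y ~ x, family = fam, data = db))
fitted_means <- sapply(fits, fitted)
rownames(fitted_means) <- paste0("x=", db$x)

# One row per observation, one column per family.
print(round(fitted_means, 2))
```

If the columns turn out close to one another, that is consistent with the excerpt's point that the family matters less than other modelling choices; how close they are for a given data set is something to check rather than assume.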

Awesome-R: A curated list of the best add-ons for R

April 27, 2015

One of the great things about R is that there's so much available to use with it: there are several interfaces to choose from, thousands of add-on packages to extend its capabilities, hundreds of books and on-line tutorials. It is an abundance of riches to improve your R experience. But with that abundance comes a problem: how to find the...

8 new R jobs (2015-04-27)

April 27, 2015

This is the bimonthly post (for 2015-04-13) for new R Jobs from R-users.com. Employers: visit this link to post a new R job to the R community (it’s free and quick). Job seekers: please follow the links below to learn more and apply for your job of interest (or visit previous R jobs posts). Full-Time: Senior Data Analyst – Online Advertising (at Booking.com), Booking.com – Posted by Booking.com, Amsterdam, Noord-Holland, Netherlands, 23 Apr 2015...

Oracle R, Hash Table Results, And VIM To The Rescue

April 27, 2015

Downloaded and installed Solaris 11.2 on my laptop, and WOW! That was a throwback to the late 90’s! Old version of GNOME, no truetype fonts so the whole visual experience was very pixelly. Firefox was installed but every website I visited yelled,...

Exploration of Functional Diversity indices using Shiny

April 27, 2015

Biological diversity (or biodiversity) is a complex concept with many different aspects, like species richness, evenness or functional redundancy. My field of research focuses on understanding the effect of changing plant diversity on higher trophic level communities but also ecosystem function. Even if the founding papers of this area of research already hypothesized

EARL2015 Conference, London – Presenters Announced

April 27, 2015

We are delighted to announce the impressive line-up of speakers for September’s London EARL Conference. The speakers represent industries including Energy, Leisure, Insurance, FMCG, Finance, Market Research, Healthcare and Sport and offer real world examples of the usage and application of ...

Randomly Sample Twitter Followers in R

April 27, 2015

So yesterday, I set up an #AmazonGiveaway for my new R book at https://giveaway.amazon.com/p/ea32d421d8d7672d, but I had my 10-year-old input the number that will determine every nth person who gets the printed

Comparing Tree-Based Classification Methods via the Kaggle Otto Competition

April 27, 2015

In this post, I’m going to be looking at the progressive performance of different tree-based classification methods in R, using the Kaggle Otto Group Product Classification Challenge as an example. This competition challenges participants to correctly classify products into 1 of 9 classes based on data in 93 features. I’ll start with basic decision trees and ...

How to install rNOMADS with GRIB file support on Windows

April 26, 2015

Two years ago, I wrote a software package for R called “rNOMADS” that interfaces with online weather and sea ice model repositories to gather data in real time, for free. The data are delivered in two ways: a simple, pure R, cross-platform interface using GrADS-DODS, and binary files in GRIB format. The one issue

Keeping Track of an Evolving “Top N” Cutoff Threshold Value

April 26, 2015

In a previous post (Charts are for Reading), I noted how it was difficult to keep track of which times in an F1 qualifying session had made the cutoff time as a qualifying session evolved. The problem can be stated as follows: in the first session, with 20 drivers competing, the 15 drivers with the
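One way to read the problem is sketched below in base R. The excerpt is cut off before it defines the cutoff, so this assumes the cutoff after each lap is the n-th fastest of the drivers' personal best times so far. The function name `evolving_cutoff` and the column names `driver` and `time` are hypothetical, not taken from the post.

```r
# Hypothetical helper: track the "top n" cutoff as lap times arrive in order.
# Assumption: the cutoff after each lap is the n-th fastest of the drivers'
# personal best times so far (NA until n drivers have set a time).
evolving_cutoff <- function(laps, n = 15) {
  best <- numeric(0)                     # named vector: best time per driver
  cutoff <- rep(NA_real_, nrow(laps))
  for (i in seq_len(nrow(laps))) {
    driver <- as.character(laps$driver[i])
    lap_time <- laps$time[i]
    # Record the lap if it is the driver's first or an improvement.
    if (is.na(best[driver]) || lap_time < best[driver]) {
      best[driver] <- lap_time
    }
    # Once at least n drivers have a time, the n-th best is the cutoff.
    if (length(best) >= n) {
      cutoff[i] <- unname(sort(best)[n])
    }
  }
  cutoff
}

# Toy session with three drivers and a "top 2" cutoff (made-up lap times).
laps <- data.frame(
  driver = c("A", "B", "C", "A", "C"),
  time   = c(90, 88, 91, 87, 86)
)
cutoff <- evolving_cutoff(laps, n = 2)
print(cbind(laps, cutoff))
```

The cutoff can only stay level or drop as the session evolves, because each driver's best time never gets worse.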

Doing quantitative archaeology with open source software

April 25, 2015

(This is a guest post by Ben Marwick, originally published on ATOR blog) This short post is written for archaeologists who frequently perform common data analysis and visualization tasks in Excel, SPSS or similar commercial packages. It was motivated by my recent observations at the Society for American Archaeology meeting in San Francisco – the largest annual meeting of archaeologists in the...

Unemployment of Europe in 2014 by NUTS 2 region

April 25, 2015

During the Christmas break I worked on some code to show unemployment by NUTS 2 region. At that point no 2014 data was available. When I noticed the 2014 data was available I dug up the code and plotted again. Data and Code: As written, the code was made beginn...

Random Data Sets Quickly

April 24, 2015

This post will discuss a recent GitHub package I’m working on, wakefield, which generates random data sets. The post is broken into the following sections: Demo, 1.1 Random Variable Functions, 1.2 Random Data Frames, 1.3 Missing Values, 1.4 Default Data ...

Stochastic SIR Epidemiological Compartment Model

April 24, 2015

Introduction This post is a simple introduction to Rcpp for disease ecologists, epidemiologists, or dynamical systems modelers - the sorts of folks who will benefit from a simple but fully-working example. My intent is to provide a complete, self-contained introduction to modeling with Rcpp. My hope is that this model can be easily modified to run any dynamical simulation that has dependence on the...
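For orientation, a stochastic SIR model can be sketched in a few lines of plain R before any C++ is involved. The sketch below is not the post's model: it assumes a discrete-time chain-binomial formulation, and the function name `simulate_sir` and its parameters (`beta`, `gamma`, `dt`) are hypothetical choices for illustration.

```r
# Hypothetical sketch: discrete-time, chain-binomial stochastic SIR in base R.
# Assumptions: closed population, frequency-dependent transmission with rate
# `beta`, recovery rate `gamma`, fixed time step `dt`.
simulate_sir <- function(S0, I0, beta, gamma, steps, dt = 1) {
  N <- S0 + I0
  S <- I <- R <- numeric(steps + 1)
  S[1] <- S0
  I[1] <- I0
  R[1] <- 0
  for (k in seq_len(steps)) {
    # Per-individual probabilities of infection and recovery during one step.
    p_inf <- 1 - exp(-beta * I[k] / N * dt)
    p_rec <- 1 - exp(-gamma * dt)
    # Random numbers of new infections and recoveries.
    new_inf <- rbinom(1, S[k], p_inf)
    new_rec <- rbinom(1, I[k], p_rec)
    S[k + 1] <- S[k] - new_inf
    I[k + 1] <- I[k] + new_inf - new_rec
    R[k + 1] <- R[k] + new_rec
  }
  data.frame(time = (0:steps) * dt, S = S, I = I, R = R)
}

set.seed(42)  # fixed seed so the run is reproducible
epidemic <- simulate_sir(S0 = 990, I0 = 10, beta = 0.4, gamma = 0.1, steps = 100)
print(head(epidemic))
```

The loop body is the part that would typically be moved into C++ when porting a simulation like this to Rcpp, since it is executed once per time step.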
