Announcing the Publication of R Machine Learning Essentials

November 10, 2014

R Machine Learning Essentials will be published soon. T

Update all user installed R packages – again

November 10, 2014

And I had to do it again: I am using R installed from Homebrew, and after the upgrade from Mavericks to Yosemite, I had to re-install all packages – or was it a GCC upgrade? I don’t know – but I had to do it again. I still had the link to Randy Zwitch’s solution
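
A minimal sketch of one way to find the packages that need reinstalling. It is not necessarily the approach taken in the linked solution. It assumes that base and recommended packages carry a non-missing `Priority` in `installed.packages()`, so everything else can be treated as user-installed. The helper name `user_packages` is made up for illustration.

```r
# Hypothetical helper: names of packages that are neither "base" nor
# "recommended" according to the Priority column of installed.packages().
user_packages <- function(ip = installed.packages()) {
  is_user <- is.na(ip[, "Priority"])
  unname(ip[is_user, "Package"])
}

# Reinstalling needs a CRAN mirror and network access, so it is left commented:
# install.packages(user_packages())
```

Saving the result of `user_packages()` before an OS or compiler upgrade gives a list that can be fed back to `install.packages()` afterwards.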

Growth of open data in biology

November 10, 2014

Why open data growth: At rOpenSci we try to make it easier for people to use open data and contribute open data to the community. The question often arises: How much open data do we have? Another angle on this topic is: How fast is open data growing? We provide access to dozens of data repositories through our various packages. We...

solar navigation

November 9, 2014

Introduction: Solar altitude is a function of time, longitude and latitude, so it is possible to infer location by measuring altitude as a function of time. This form of solar navigation can be based on sunrise and sunset times, at least on non-equinox days. I have explored this for a school-based project I call “SkyView” involving light sensors and Arduino microcontrollers,...
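
To make the dependence on time, longitude and latitude concrete, here is a rough sketch using the standard textbook relation sin(altitude) = sin(lat) sin(dec) + cos(lat) cos(dec) cos(hour angle). This is not the code from the post. The function names are invented for illustration, the declination formula is a common approximation, and local solar time is assumed to be UTC plus longitude / 15, ignoring the equation of time.

```r
# Approximate solar declination (degrees) for a day of the year (1-365).
solar_declination <- function(day_of_year) {
  -23.44 * cos(2 * pi / 365 * (day_of_year + 10))
}

# Solar altitude (degrees) from latitude and longitude (degrees, east positive),
# UTC time in decimal hours, and declination in degrees.
# Assumption: local solar time = UTC + longitude / 15 (equation of time ignored).
solar_altitude <- function(lat, lon, utc_hour, dec) {
  rad <- pi / 180
  hour_angle <- 15 * (utc_hour + lon / 15 - 12)
  s <- sin(lat * rad) * sin(dec * rad) +
    cos(lat * rad) * cos(dec * rad) * cos(hour_angle * rad)
  # Clamp to [-1, 1] so rounding error cannot push asin() out of its domain.
  asin(pmin(pmax(s, -1), 1)) / rad
}
```

In this simplified model, longitude shifts the altitude curve in time while latitude changes its shape, which is what makes it possible to work backwards from the times at which the altitude crosses zero.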

OpenCPU release 1.4.5: configurable webhooks

November 9, 2014

OpenCPU 1.4.5 is a patch release that improves performance by taking advantage of the latest versions of jsonlite, devtools, knitr, openssl, etc. Also new in this release is the option to pass build parameters for deploying on ocpu.io (or your own ...

Basic mapping and attribute joins in R

November 9, 2014

This post is based on the free and open source Creating-maps-in-R teaching resource for introducing R as a command-line GIS. R is well known as a language ideally suited for data processing, statistics and modelling. R has a number of spatial packages, allowing analyses that would require hundreds of lines of code in other languages to be implemented with...
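
As a rough illustration of what an attribute join does, the sketch below uses plain data frames rather than the spatial objects from the tutorial. The tables, column names and values are all made up. The lookup uses base R's `match()`, which keeps the rows of the first table in their original order. That matters when the rows correspond to geometries.

```r
# Hypothetical data: 'zones' stands in for the attribute table of a spatial
# object, 'stats' for a non-spatial table sharing the key column 'name'.
zones <- data.frame(name = c("A", "B", "C"), area = c(10, 20, 30),
                    stringsAsFactors = FALSE)
stats <- data.frame(name = c("C", "A"), pop = c(300, 100),
                    stringsAsFactors = FALSE)

# match() looks up each zone's row in 'stats', giving NA where none exists
# and leaving the row order of 'zones' untouched.
zones$pop <- stats$pop[match(zones$name, stats$name)]
```

Zone "B" has no entry in `stats`, so its `pop` value is `NA` after the join.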

R is now the #12 most popular programming language

November 9, 2014

According to the most recent TIOBE index, R is the #12 most popular programming language.

Using Markdown and Pandoc for Publication

November 9, 2014

The other day I was involved in an editing job, in which I was supposed to edit 18 articles written in Microsoft Word (doc/docx format) and convert them into PDF format (for printing into a book) and HTML format (for web publishing). Manuscripts written by people not proficient in doc(x) format are notorious...

The completeness of online gun shooting victim counts

November 9, 2014

There are a number of online efforts to register victims of shootings. Shootingtracker tries to register all mass shootings, those with four or more victims. Slate had the gun death tally (GDT), gun deaths starting at Newtown, running thro...

Spatial data extraction around buffered points in R

November 8, 2014

Quantifying spatial data (e.g. land cover) around points can be done in a variety of ways, some of which require considerable amounts of patience, clicking around, and/or cash for a license. Here’s a bit of code that I cobbled together to quickly extract land cover data from the National Land Cover Database for buffered regions around points (e.g....
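
The idea of summarising land cover inside a buffer can be sketched in base R on a toy grid. This is a simplified stand-in, not the post's code: it uses a plain character matrix instead of the National Land Cover Database raster, measures distance in cell units, and the function name and values are invented for illustration.

```r
# Proportion of each class among grid cells whose centres lie within 'radius'
# (in cell units) of the cell at row r0, column c0.
buffer_proportions <- function(grid, r0, c0, radius) {
  inside <- (row(grid) - r0)^2 + (col(grid) - c0)^2 <= radius^2
  prop.table(table(grid[inside]))
}

# Toy land-cover grid (made-up values): all forest except one water cell.
land <- matrix("forest", nrow = 5, ncol = 5)
land[3, 3] <- "water"
props <- buffer_proportions(land, r0 = 3, c0 = 3, radius = 1)
```

With a radius of one cell the buffer covers the centre cell and its four direct neighbours, so `props` reports one fifth water and four fifths forest.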

SBS documentary “The Age of Big Data”

November 8, 2014

by Yanchang Zhao, RDataMining.com “Data is becoming a powerful and most valuable commodity in 21st century. It is leading to scientific insights and new ways of understanding human behaviour. Data can also make you rich. Very rich.” — SBS documentary …

Another take on building a multi-lingual shiny app

November 8, 2014

I was reading this interesting post about how to build a multi-lingual Shiny app. I’m also building a multi-lingual Shiny app and came up with a slightly different take on it. First, I don’t use a function for finding the translation, …

F1 Championship Race, 2014 – Winning Combinations…

November 8, 2014

As we come up to the final two races of the 2014 Formula One season, the double points mechanism for the final race means that two drivers are still in with a shot at the Drivers’ Championship: Lewis Hamilton and Nico Rosberg. As James Allen describes in Hamilton closes in on world title: maths favour

Zacks Data on Quandl

November 8, 2014

Data from Zacks Research have just been made available on Quandl. Registered Quandl users have free preview access to these data, which cover the following: Earnings Estimates: forward-looking consensus forecasts; Earnings Surprises: estimated future and actual historical earnings; Earnings Announcements: predictions for earnings announcement dates, parameters, and supplementary data; Sales Estimates: analogous to earnings estimates,

6 new R jobs (for November 7th 2014)

November 7, 2014

This is the bimonthly R Jobs post (for 2014-11-07), based on the R-bloggers’ sister website: R-users.com. If you are an employer who is looking to hire people from the R community, please visit this link to post a new R job (it’s free, and registration takes less than 10 seconds). If you are a job seeker, please follow the links below to learn more and apply for your job of interest (or visit previous...

Learn about Revolution R Open in live webinar, November 12

November 7, 2014

On Wednesday next week, I'll be presenting a live webinar to introduce Revolution R Open and several other open source projects from Revolution Analytics. In the webinar I'll describe: the enhancements included in Revolution R Open; the Reproducible R Toolkit and the checkpoint package; how to call R from other applications with DeployR Open; how to run R in...

Predicting High Yield with SPY–a Two Part Post

November 7, 2014

This post will cover ideas from two individuals: David Varadi of CSS Analytics with whom I am currently collaborating on some …

When to fly to get there on time? Six million flights analyzed.

November 6, 2014

We analyzed over six million flights to help you decide on the best time to travel to avoid delays. The post When to fly to get there on time? Six million flights analyzed. appeared first on Decision Science News.

RcppRedis 0.1.2

November 6, 2014

A new release of RcppRedis is now on CRAN. It contains additional commands for hashes and sets, all contributed by John Laing and Whit Armstrong. Changes in version 0.1.2 (2014-11-06): new commands execv, hset, hget, sadd, srem, and smembers...

Evaluating BreakoutDetection

November 6, 2014

A couple of weeks ago, Twitter open-sourced their BreakoutDetection package for R, a package designed to determine shifts in time-series data. The Twitter announcement does a great job of explaining the main technique for detection (E-Divisive with Medians), so I won’t rehash that material here. Rather, I wanted to see how this package works relative

Seasonal periods

November 6, 2014

I get questions about this almost every week. Here is an example from a recent comment on this blog: I have two large time series data. One is separated by seconds intervals and the other by minutes. The length of each time series is 180 days. I’m using R (3.1.1) for forecasting the data. I’d
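
For readers wondering what a seasonal period means for data sampled every second or every minute, the arithmetic can be sketched as follows. The period is the number of observations in one seasonal cycle. Which cycle actually matters for the commenter's data is not stated in the excerpt, so the daily and weekly cycles below are assumptions, and the series itself is made up.

```r
# Number of observations per seasonal cycle = cycle length / sampling interval.
seasonal_period <- function(cycle_seconds, interval_seconds) {
  cycle_seconds / interval_seconds
}

day <- 24 * 60 * 60
per_minute_daily  <- seasonal_period(day, 60)      # minute data, daily cycle
per_second_daily  <- seasonal_period(day, 1)       # second data, daily cycle
per_minute_weekly <- seasonal_period(7 * day, 60)  # minute data, weekly cycle

# Made-up minute-level series covering three days, declared with a daily period.
x <- ts(sin(2 * pi * seq_len(3 * per_minute_daily) / per_minute_daily),
        frequency = per_minute_daily)
```

The `frequency` argument of `ts()` is where base R records the chosen period. A series with more than one relevant cycle needs a decision about which one to declare there.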

geocodeHERE 0.1 is on CRAN

November 6, 2014

In my previous blog post, I detailed how I created my first R package called geocodeHERE. This package is a convenient wrapper for Nokia's HERE geocoding API. The cool thing about this API is that it allows for bulk geocoding. So, instead of doing n API calls to geocode n addresses, you can do it with...

Introduction to Data Science with R video workshop

November 6, 2014

RStudio has teamed up with O’Reilly Media to create a new way to learn R! The Introduction to Data Science with R video course is a comprehensive introduction to the R language. It’s ideal for non-programmers with no data science experience or for data scientists switching to R from Excel, SAS or other software. Join

Faster, easier, and more reliable character string processing with stringi 0.3-1

November 6, 2014

A new release of the stringi package is available on CRAN (please wait a few days for Windows and OS X binary builds).

    # install.packages("stringi") or update.packages()
    library("stringi")

stringi is an R package providing (but definitely not limited to) equivalents…

Looking into a very messy data set

November 6, 2014

by Joseph Rickert I recently had the opportunity to look at the data used for the 2009 KDD Cup competition. There are actually two sets of files that are still available from this competition. The "large" file is a series of five .csv files that when concatenated form a data set with 50,000 rows and 15,000 columns. The "small"...

Improving R Data Visualisations Through Design

November 6, 2014

When I start an R class, one of my opening lines is nea

Visualising stranded RNA-seq data with Gviz/Bioconductor

November 6, 2014

Gviz is a really great package for visualising genomics data in R. Recently I have been looking at stranded RNA-seq data, which provides the ability to differentiate sense and antisense expression from a genomic locus thanks to the way in …

The reddit Front Page is Not a Meritocracy

November 6, 2014

I was pleasantly surprised when somebody shared my traveling salesman animation to reddit and the post made it all the way to reddit's default front page (i.e. the top 25). The gif racked up over 1.3 million pageviews on Imgur, a testament to reddit's traffic-generating prowess. Before the post made it to the front page, though, it was...

Excel (and French people) are such a pain in the…

November 6, 2014

A few days ago, I published a post entitled extracting datasets from excel files in a zipped folder, because I wanted to use datasets that were online, in some (zipped) Excel format. The first difficult part was the folder with a non-standard character (the French é). Because next week I should be using those datasets in a crash course...
