A quick introduction to machine learning in R with caret

April 6, 2016
By
A quick introduction to machine learning in R with caret

If you’ve been using R for a while, and you’ve been working with basic data visualization and data exploration techniques, the next logical step is to start learning some machine learning. To help you begin learning about machine learning in R, I’m going to introduce you to an R package: the caret package. We’ll build The post

Read more »

An Analysis of Traffic Violation Data with SQL Server and R

April 6, 2016
By
An Analysis of Traffic Violation Data with SQL Server and R

By Srini Kumar, Director of Data Science at Microsoft Who does not hate being stopped and given a traffic ticket? Invariably, we think that something is not fair that we got it and everyone else did not. I am no different, and living in the SF Bay Area, I have often wondered if I could get the data about...

Read more »

Perform co-operations with the coop package

April 6, 2016
By

About The coop package does co-operations: covariance, correlation, and cosine, and it does them quickly. The package is available on CRAN and GitHub, and has two vignettes: Introducing coop: Fast Covariance, Correlation, and Cosine Operations Algorithms and Benchmarks for the coop Package Incidentally, the vignettes don't render correctly on CRAN's end for some reason; if any of you rmarkdown...

Read more »

3D plotting exercises

April 6, 2016
By
3D plotting exercises

In this set of exercises we will introduce the concept of 3D plotting. Specifically, we will use these commands:image(), contour() and persp(). For these exercises, you need to have a basic understanding of R objects and functions, in particular some knowledge about matrix . This set is the fourth set of exercises is a series

Read more »

RcppAPT 0.0.2

April 5, 2016
By

A new version of RcppAPT -- our interface from R to the C++ library behind the awesome apt, apt-get, apt-cache, ... commands and their cache -- is now on CRAN. It adds three new commands to the package. Two are relatively simple: showSrc() and dumpPa...

Read more »

I Went to ROpenSci Unconference and All I Got Were These Lousy Hex Stickers

April 5, 2016
By

Just kidding; it was amazing. Last week, I traveled to San Francisco to participate in an unconference/hackathon organized and hosted by ROpenSci. This was my first R conference or meeting, and it was a such a great experience. I am still feeling a bit at a loss for words about what a tremendous time I had, actually, but...

Read more »

R pkg Easter Eggs — Revenge of Pacman!

April 5, 2016
By
R pkg Easter Eggs — Revenge of Pacman!

In my last post, I praised the virtues of pacman. I also wanted to indulge of one of the main benefits of working with one of pacman’s devs: Tyler Rinker. Tyler is a geek’s geek, in the best possible way. And one of the best officemates one could ask for to boot! As evidence of that,...

Read more »

Computational Actuarial Science, with R, in Barcelona

April 5, 2016
By

This Wednesday, I will give a graduate crash course on computational actuarial science, with R, which will be the second part of the lecture of Tuesday. Slides are now available,

Read more »

Statistical rethinking [book review]

April 5, 2016
By
Statistical rethinking [book review]

Statistical Rethinking: A Bayesian Course with Examples in R and Stan is a new book by Richard McElreath that CRC Press sent me for review in CHANCE. While the book was already discussed on Andrew’s blog three months ago, and enthusiastically recommended by Rasmus Bååth on Amazon, here are the reasons why I

Read more »

Blow Out Tide in the Delaware Estuary

April 5, 2016
By
Blow Out Tide in the Delaware Estuary

High winds in the Delaware Estuary region caused a "blowout" tide in early April 2016, where observed water surface elevations were much lower than those predicted via harmonic constituents.  Extreme low blowout tides can hamper navigation due ins...

Read more »

The Pirate Plot (2.0) – The RDI plotting choice of R pirates

April 5, 2016
By
The Pirate Plot (2.0) – The RDI plotting choice of R pirates

  Plain vanilla barplots are as uninformative (and ugly) as they are popular. And boy, are they popular. From the floors of congress, to our latest scientific articles, barplots surround us. The reason why barplots are so popular is because they are so simple and easy to understand. However, that simplicity also carries costs — namely, ...

Read more »

Plotting App for ggplot2

April 5, 2016
By
2016-04-06 18_27_39-Clipboard

Through this post, I would like to share an update to my RTutoR package. The first version of this package included an R Basics Tutorial App which I published earlier at DataScience+ The updated version of this package, which is now on CRAN, includes a plotting app. This app provides an automated interface for generating Plotting App for...

Read more »

AirbnB uses R to scale data science

April 5, 2016
By
AirbnB uses R to scale data science

Airbnb, the property-rental marketplace that helps you find a place to stay when you're travelling, uses R to scale data science. Airbnb is a famously data-driven company, and has recently gone through a period of rapid growth. To accommodate the influx of data scientists (80% of whom are proficient in R, and 64% use R as their primary data...

Read more »

Travis CI: “You Have Too Many Tests LOLZ!”

April 5, 2016
By

No output has been received in the last 10m0s, this potentially indicates a stalled build or something wrong with the build itself. As part of getting RSiteCatalyst 1.4.8 ready for CRAN, I’ve managed to accumulate hundreds of testthat tests across 63 test files. Each of these tests runs on Travis CI against an authenticated API, and the

Read more »

R Quick Tip: Collapse a lists of data.frames with data.table

April 5, 2016
By

With my HIBPwned package, I consume the HaveIBeenPwned API and return back a list object with an element for each email address. Each element holds a data.frame of breach data or a stub response with a single column data.frame containing NA. Elements are named with the email addresses they relate to. I had a list The post

Read more »

Aggregation with dplyr: summarise and summarise_each

April 5, 2016
By

How to apply one or many functions to one or many variables using dplyr: a practical guide to the use of summarise() and summarise_each() The post Aggregation with dplyr: summarise and summarise_each appeared first on MilanoR.

Read more »

Le Monde puzzle [#956]

April 4, 2016
By
Le Monde puzzle [#956]

A Le Monde mathematical puzzle with little need of R programming: Does there exist a function f from N to N such that (i) f is (strictly) increasing, (ii) f(n)≥n, and (iii) f²(n)=f(f(n))=3n? Indeed, the constraints imply (i) f²(0)=0, hence that that f(0)=0, (ii) f(1)=2 as it can be neither 1 (else f²(1) would be equal

Read more »

iptools 0.4.0 released into the wild (i.e. is hitting the CRAN mirrors today)

April 4, 2016
By

The iptools package—a toolkit for manipulating, validating and testing IP addresses and ranges, along with datasets relating to IP addresses—is flying through the internets and hitting a CRAN mirror near you, soon. What’s fixed? Tim Smith fixed a bug in ip_in_range() that occurred when the netmask was /32 (thanks, Tim!). What’s new? The range_boundaries() function

Read more »

Help improve treatment for brain injuries using machine learning and R

April 4, 2016
By
Help improve treatment for brain injuries using machine learning and R

The field of neuroscience -- the study of brains and the nervous system -- has taken some major leaps in recent years. Scientists can now gather real-time electrical activity from the brain during actions and thoughts, which is helping to pinpoint the exact location of brain lesions caused by strokes, and is leading to promising treatments for epilepsy and...

Read more »

Workshops announced for EARL 2016

April 4, 2016
By

EARL2016 will feature a day of workshops on 13th September preceding the full conference days. These will be interactive workshops on a variety of R related topics, from introductory to advanced levels. Due to the interactive nature of the workshops, … Continue reading →

Read more »

10 new R jobs from around the world (2016-04-04)

April 4, 2016
By
10 new R jobs from around the world (2016-04-04)

R-users Resumes R-users.com features a Resume section with CVs from over 180 R users. You can submit your resume for free by registering as a “job seeker” (you may also browse the resumes through a paid subscription). Here are the new R Jobs for 2016-04-04. To post your R job on the next post Just visit this link and post a new (free) R job to the R community. To get added...

Read more »

Geographic data to service the needs of a remote employee – part1

April 4, 2016
By
Geographic data to service the needs of a remote employee – part1

By Ava Yang Background Ever since I started working remotely, some friends suggested “Why not work as usual as you travel across China, instead of staying in Shanghai?” Many cool people have practiced the lifestyle, but most of the time … Continue reading →

Read more »

A Tale of Two Bugs

April 4, 2016
By
A Tale of Two Bugs

Today I am happy to announce that a new version of choroplethr is now on CRAN. To get it type the following: The new version is 3.5.2. This version fixes two fairly pernicious bugs that were introduced in the last update to choroplethr. For reference, he is a detailed description of the bugs and their The post

Read more »

Which are More Accurate: Fantasy Football Rankings or Projections?

April 4, 2016
By

When deciding how to compare players in fantasy football, managers have an important decision: whether to compare players with rankings or projections.  In this article, we compare the accuracy of The post Which are More Accurate: Fantasy Football Rankings or Projections? appeared first on Fantasy Football Analytics.

Read more »

RSiteCatalyst Version 1.4.8 Release Notes

April 4, 2016
By
RSiteCatalyst Version 1.4.8 Release Notes

For being in RSiteCatalyst retirement, I’m ending up working on more functionality lately ¯_(ツ)_/¯. Here are the changes for RSiteCatalyst 1.4.8, which should be available on CRAN shortly: Segment Stacking RSiteCatalyst now has the ability to take multiple values in the segment.id keyword for the Queue* functions. This functionality was graciously provided by Adam Gitzes, closing an issue

Read more »

Auto-deploying documentation: better change tracking of artefacts

April 4, 2016
By

As part of my never-ending quest to deploy documentation better, I’ve made yet another tweak to my scripts that deploy R vignettes or Rmarkdown documents to the gh-pages branch of my github repositories via Travis-CI. The script from Robert Flight that’s provided the basis for most of this work does something specific to update the The post

Read more »

The Coaster Maker by Shiny

April 4, 2016
By
The Coaster Maker by Shiny

The word you invented is well formed and could be used in the Italian language (The Accademia della Crusca regarding to the word “Petaloso”, recently invented by an eight-year-old boy) Are you tired of your old coasters? Do you like to make things by your own? Do you have a PC and a printer at … Continue reading...

Read more »

Where People Live, part 2

April 4, 2016
By
Where People Live, part 2

Following my previous post, I wanted to use another dataset to visualize where people live, on Earth. The dataset is coming from sedac.ciesin.columbia.edu. We you register, you can download the database > base=read.table("glp00ag15.asc",skip=6) The database is a ‘big’ 1440×572 matrix, in each cell (latitude and longitude) we have the population > X=t(as.matrix(base,ncol=1440)) > dim(X) 1440 572 The dataset...

Read more »

List of useful RStudio addins made by useRs (a github repo)

April 4, 2016
By

  Dean Attali started a GitHub repo listing RStudio addins, with the hope that other useRs will add to the list and make it a place to discover addins. Feel free to browse for useful addins, or make a pull request to add your own addin to the list. Here are the current items on the list: Name Description Package On...

Read more »

Sponsors