10 Million Dots: Mapping European Population

March 23, 2017
By
10 Million Dots: Mapping European Population

Creating a giant dot density map using R.

Read more »

Mean trophic levels of a genera from FishBase

March 22, 2017
By
Mean trophic levels of a genera from FishBase

Mean trophic levels of a genera from FishBase How would you selectively aggregate observations using R? For instance, say you have a table of trophic level estimates by fish species, but many species are missing values. For those species missing a val...

Read more »

New mlr Logo

March 22, 2017
By

We at mlr are currently deciding on a new logo, and in the spirit of open-source, we would like to involve the community in the voting process! You can vote for your favorite logo on GitHub by reacting to the logo with a +1. Thanks to Hannah Atkin fo...

Read more »

Euler Problem 17: Number Letter Counts

March 22, 2017
By
Euler Problem 17: Number Letter Counts

A solution in the R language to Euler Problem 17: If all the numbers from 1 to 1000 inclusive were written out in words, how many letters would be used? Continue reading → The post Euler Problem 17: Number Letter Counts appeared first on The Devil is in the Data.

Read more »

Data Visualization – Part 2

March 22, 2017
By
Data Visualization – Part 2

A Quick Overview of the ggplot2 Package in R While it will be important to focus on theory, I want to explain the ggplot2 package because I will be using it throughout the rest of this series. Knowing how it works will keep the focus on the results rather than the code. It’s an incredibly

Read more »

Datashader is a big deal

March 22, 2017
By
Datashader is a big deal

I recently got back from Strata West 2017 (where I ran a very well received workshop on R and Spark). One thing that really stood out for me at the exhibition hall was Bokeh plus datashader from Continuum Analytics. I had the privilege of having Peter Wang himself demonstrate datashader for me and answer a … Continue...

Read more »

Running your R code on Azure with mrsdeploy

March 22, 2017
By
Running your R code on Azure with mrsdeploy

by John-Mark Agosta, data scientist manager at Microsoft Let’s say you’ve built a model in R that is larger than you can conveniently run locally, and you want to take advantage of Azure’s resources simply to run it on a larger machine. This blog explains how to provision and run an Azure virtual machine (VM) for this, using the...

Read more »

February 2017 New Package Picks

March 22, 2017
By
February 2017 New Package Picks

by Joseph Rickert One hundred and forty-five new packages were added to CRAN in February. Here are 47 interesting packages organized into five categories; Biostatistics, Data, Data Science, Statistics and Utilities. Biostatistics BaTFLED3D v0.1.7: Implements a machine learning algorithm to make predictions and determine interactions in data that varies along three independent modes. It was

Read more »

The Hitchhiker’s Guide to Ggplot2 in R

March 22, 2017
By
The Hitchhiker’s Guide to Ggplot2 in R

Published: 2016-11-30 Updated: 2017-03-23 "Any bleeder knows that books are never finished, only abandoned." Why Information Grows About the book You can find the book here. This is a book that may look complete but changes in R package are al...

Read more »

The Hitchhiker’s Guide to Ggplot2 in R

March 22, 2017
By
The Hitchhiker’s Guide to Ggplot2 in R

Published: 2016-11-30 Updated: 2017-03-23 "Any bleeder knows that books are never finished, only abandoned." Why Information Grows About the book You can find the book here. This is a book that may look complete but changes in R package are al...

Read more »

Suggests != Depends

March 22, 2017
By

A number of packages on CRAN use Suggests: casually. They list other packages as "not required" in Suggests: -- as opposed to absolutely required via Imports: or the older Depends: -- yet do not test for their use in either examples or, more commonly, unit tests. So e.g. the unit tests are bound to fail because, well, Suggests != Depends. This has...

Read more »

San Francisco EARL: First round of speakers announced

March 22, 2017
By
San Francisco EARL: First round of speakers announced

We’re excited to announced the first round of gReat speakers for San Francisco EARL. Alongside our keynote speakers, Hilary Parker and Ricardo Bion, R Users from a range of industries will share their R stories. Take a look at our … Continue reading →

Read more »

Simulating Unown encounter rates in Pokémon Go

March 21, 2017
By
Simulating Unown encounter rates in Pokémon Go

Pokémon Go is an augmented reality game where people with smartphones walk around and catch Pokémon. As in the classic games, players are Pokémon “trainers” who have to travel around and collect creatures. Some types are rarer than others, som...

Read more »

anytime 0.2.2

March 21, 2017
By

A bugfix release of the anytime package arrived at CRAN earlier today. This is tenth release since the inaugural version late last summer, and the second (bugfix / feature) release this year. anytime is a very focused package aiming to do just one th...

Read more »

February 2017 New Package Picks

March 21, 2017
By
February 2017 New Package Picks

One hundred and forty-five new packages were added to CRAN in February. Here are 47 interesting packages organized into five categories; Biostatistics, Data, Data Science, Statistics and Utilities. Biostatistics BaTFLED3D v0.1.7: Implements a machine learning algorithm to make predictions and determine interactions in data that varies along three independent modes. It was developed to predict the growth of...

Read more »

February 2017 New Package Picks

March 21, 2017
By
February 2017 New Package Picks

One hundred and forty-five new packages were added to CRAN in February. Here are 47 interesting packages organized into five categories; Biostatistics, Data, Data Science, Statistics and Utilities. Biostatistics BaTFLED3D v0.1.7: Implements a machine...

Read more »

Parallel benchmarking with OpenML and mlr

March 21, 2017
By
Parallel benchmarking with OpenML and mlr

With this post I want to show you how to benchmark several learners (or learners with different parameter settings) using several data sets in a structured and parallelized fashion. For this we want to use batchtools. The data that we will use here is stored on the open machine learning platform openml.org and we can download it together...

Read more »

Use mlrMBO to optimize via command line

March 21, 2017
By

Many people who want to apply Bayesian optimization want to use it to optimize an algorithm that is not implemented in R but runs on the command line as a shell script or an executable. We recently published mlrMBO on CRAN. As a normal package it normally operates inside of R, but with this post I want to demonstrate how...

Read more »

Data Analytics for Societal Good

March 21, 2017
By
Data Analytics for Societal Good

At my workplace, employees celebrate a month of Data Analysis for Societal good, every year. During this time, we try to help NPOs (Non-Profit Organisations) in gaining insights from their data, for free. Whilst we are engaged in this practice at workp...

Read more »

Financial time series forecasting – an easy approach

March 21, 2017
By
Financial time series forecasting – an easy approach

Financial time series analysis and their forecasting have an history of remarkable contributions. It is then quite hard for the beginner to get oriented and capitalize from reading such scientific literature as it requires a solid understanding of basic statistics, a detailed study of the ground basis of time series analysis tools and the knowledge Related Post

Read more »

The Next Era of Research Communication

March 21, 2017
By
The Next Era of Research Communication

From the days of actual research papers (before the digital age), to now where research papers are posted online first, not much has really changed in the way we communicate. We still use static images, formulas and a bunch of text to show what we have...

Read more »

Give a talk about an application of R at EARL

March 21, 2017
By
Give a talk about an application of R at EARL

The EARL (Enterprise Applications of R) conference is one of my favourite events to go to. As the name of the conference suggests, the focus of the conference is where the rubber of the R language meets the road of it being used to solve real-world problems. Prior conferences have included presentations on how Maersk uses R to optimize...

Read more »

The one thing you need to master data science

March 21, 2017
By
The one thing you need to master data science

The most important factor for mastering data science is ... The post The one thing you need to master data science appeared first on SHARP SIGHT LABS.

Read more »

camsRad, satellite-based time series of solar irradiation

March 21, 2017
By
camsRad, satellite-based time series of solar irradiation

camsRad is a lightweight R client for the CAMS Radiation Service, that provides satellite-based time series of solar irradiation for the actual weather conditions as well as for clear-sky conditions. Satellite-based solar irradiation data have been around roughly as long our modern era satellites. But the price tag has been very high, in the range of several...

Read more »

Simultaneous intervals for derivatives of smooths revisited

March 21, 2017
By
Simultaneous intervals for derivatives of smooths revisited

Eighteen months ago I screwed up! I’d written a post in which I described the use of simulation from the posterior distribution of a fitted GAM to derive simultaneous confidence intervals for the derivatives of a penalized spline. It was a nice post that attracted some interest. It was also wrong. In December I corrected the first...

Read more »

Is it possible to use RevoScaleR package in Power BI?

March 20, 2017
By
Is it possible to use RevoScaleR package in Power BI?

I was invited to deliver a session for Belgium User Group on SQL Server and R integration. After the session – which we did online using web based Citrix  – I got an interesting question: “Is it possible to use RevoScaleR performance computational functions within Power BI?“. My first answer was,  a sceptical yes. But … Continue...

Read more »

Sentiment Analysis of Warren Buffett’s Letters to Shareholders

March 20, 2017
By
Sentiment Analysis of Warren Buffett’s Letters to Shareholders

Last week, I was reading through Warren Buffett's most recent letter to Berkshire Hathaway shareholders. Every year, he writes a letter that he makes publicly available on the Berkshire Hathaway website. In the letters he talks about the performance of...

Read more »

Alteryx integrates with Microsoft R

March 20, 2017
By
Alteryx integrates with Microsoft R

You can now use Alteryx Designer, the data science workflow tool from Alteryx, as a drag-and-drop interface for many of the big-data statistical modeling tools included with Microsoft R. Alteryx v11.0 includes expanded support for Microsoft SQL Server 2016, Microsoft R Server, Azure SQL Data Warehouse, and Microsoft Analytics Platform System (APS), with new workflow tools to access functionality...

Read more »

What’s in the words? Comparing artists and lyrics with R.

March 20, 2017
By
What’s in the words? Comparing artists and lyrics with R.

It's been a while since I had the opportunity to post something on music. Let's get back to that.I got my hands on some song lyrics by a range of artists. (I have an R script to download all lyrics for a given artist from a lyrics website. Since these lyrics are protected by copyright law, I...

Read more »

Sponsors

Mango solutions









Zero Inflated Models and Generalized Linear Mixed Models with R

r-brain.io



www.ama.org/events-training

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de





CRC R books series







Six Sigma Online Training





omictools

Contact us if you wish to help support R-bloggers, and place your banner here.