Update on Our ‘revisit’ Package

August 16, 2017

On May 31, I made a post here about our R package revisit, which is designed to help remedy the reproducibility crisis in science. The intended user audience includes reviewers of research manuscripts submitted for publication, scientists who wish to confirm the results in a published paper and explore alternate analyses, and members of the …

Visualising Water Consumption using a Geographic Bubble Chart

August 16, 2017

A geographic bubble chart is a straightforward method to visualise quantitative information with a geospatial relationship. Last week I was in Vietnam helping the Phú Thọ Water Supply Joint...

Use the LENGTH statement to pre-set the lengths of character variables in SAS – with a comparison to R

I often create character variables (i.e. variables with strings of text as their values) in SAS, and they sometimes don’t render as expected.  Here is an example involving the...

How to build an image recognizer in R using just a few images

August 16, 2017

Microsoft Cognitive Services provides several APIs for image recognition, but if you want to build your own recognizer (or create one that works offline), you can use the new...

Thank You For The Very Nice Comment

August 16, 2017

Somebody nice reached out and gave us this wonderful feedback on our new Supervised Learning in R: Regression (paid) video course. Thanks for a wonderful course on DataCamp on...

Data wrangling : Cleansing – Regular expressions (1/3)

August 16, 2017

Data wrangling is the process of importing, cleaning and transforming raw data into actionable information for analysis. It is a time-consuming process which is estimated to take about 60-80%...

Understanding overfitting: an inaccurate meme in supervised learning

August 16, 2017

Preamble: There is a lot of confusion among practitioners regarding the concept of overfitting. It seems that a kind of urban legend or meme, a piece of folklore, is circulating in data...

MODIStsp 1.3.3 is out – Speeding things up and squashing some bugs!

August 16, 2017

A new version of MODIStsp (1.3.3) is on CRAN as of today! Below, you can find a short description of the main improvements. Processing speed improvements: Processing of...

Master the tidyverse

August 15, 2017

If you’ve read the book “R for Data Science” or plan to, now you can dive deeper with co-author and RStudio Master Instructor Garrett Grolemund, winner of the Excellence...

RStudio 1.1 Preview – Data Connections

August 15, 2017

Today, we’re continuing our blog series on new features in RStudio 1.1. If you’d like to try these features out for yourself, you can download a preview release of...

When A Fire Starts to Burn – Fiery 1.0 released

August 15, 2017

I’m pleased to announce that fiery has been updated to version 1.0 and is now available on CRAN. As the version bump suggests, this is a rather major update to...

Buzzfeed trains an AI to find spy planes

August 15, 2017

Last year, Buzzfeed broke the story that US law enforcement agencies were using small aircraft to observe points of interest in US cities, thanks to analysis of public flight-records...

Working with air quality and meteorological data Exercises (Part-1)

August 15, 2017

Atmospheric air pollution is one of the most important environmental concerns in many countries around the world, and it is strongly affected by meteorological conditions. Accordingly, in this set...

Simple practice: basic maps with the Tidyverse

August 15, 2017

To master data science, you need to practice. This sounds easy enough, but in reality, many people have no idea how to practice. Ultimately, you need to ...

Magick 1.0: 🎩 ✨🐇 Advanced Graphics and Image Processing in R

August 15, 2017

Last week, version 1.0 of the magick package appeared on CRAN: an ambitious effort to modernize and simplify high quality image processing in R. This R package builds upon...

set_na_where(): a nonstandard evaluation use case

August 14, 2017

In this post, I describe a recent case where I used rlang’s tidy evaluation system to do some data-cleaning. This example is not particularly involved, but it demonstrates a basic...

#9: Compacting your Shared Libraries

August 14, 2017

Welcome to the ninth post in the recognisably rancid R randomness series, or R4 for short. Following on the heels of last week's post, we aim to look into...

Shiny 1.0.4

August 14, 2017

Shiny 1.0.4 is now available on CRAN. To install it, run install.packages("shiny"). For most Shiny users, the most exciting news is that file inputs now support dragging and dropping. It is now...

rstudio::conf(2018): Contributed talks, e-posters, and diversity scholarships

August 14, 2017

rstudio::conf, the conference on all things R and RStudio, will take place February 2 and 3, 2018 in San Diego, California, preceded by Training Days on January 31 and...

Reproducibility: A cautionary tale from data journalism

August 14, 2017

Timo Grossenbacher, data journalist with Swiss Radio and TV in Zurich, had a bit of a surprise when he attempted to recreate the results of one of the R...

acs v2.1.1 is now on CRAN

August 14, 2017

A new version of the acs package is now on CRAN. I recommend that all users of choroplethr update to this version. Here is how to...

Big Data Solutions: A/B t test

August 14, 2017

@drsimonj here to share my code for using Welch’s t-test to compare group means using summary statistics. Motivation: I’ve just started working with A/B tests that use big data. Where once...

Sending Emails from R Exercises

August 14, 2017

When monitoring a data source, model, or other automated process, it’s convenient to have a method for easily delivering performance metrics and notifying you whenever something is amiss. One option...

A Stan case study, sort of: The probability my son will be stung by a bumblebee

August 14, 2017

The Stan project for statistical computation has a great collection of curated case studies which anybody can contribute to, maybe even me, I was thinking. But I don’t have...

Treating your data: The old school vs tidyverse modern tools

August 14, 2017

By Gabriel Vasconcelos

When I first started using R there was no such thing as the tidyverse. Although some of the tidyverse packages were available independently, I learned to...

Shinydashboards from right to left (localizing a shinydashboard to Hebrew)

August 14, 2017

Post by Adi Sarid (Sarid Institute for Research Services LTD.)

Lately I’ve been working a lot with the shinydashboard library. Like shiny, it allows any R programmer to harness the power...

Parse an Online Table into an R Dataframe – Westgard’s Biological Variation Database

Background: From time to time I have wanted to bring an online table into an R dataframe. While, in principle, the data can be cut and pasted into Excel,...

Supervised Learning in R: Regression

August 13, 2017

We are very excited to announce a new (paid) Win-Vector LLC video training course: Supervised Learning in R: Regression, now available on DataCamp. The course is primarily authored by...

End-to-end visualization using ggplot2

August 13, 2017

ggplot2 is kind of a household word for R users. I’ve ended up using it for complex data munging and wrangling work, where I needed to get clarity on...
