Use the LENGTH statement to pre-set the lengths of character variables in SAS – with a comparison to R

I often create character variables (i.e. variables with strings of text as their values) in SAS, and they sometimes don’t render as expected. Here is an example involving the built-in data set SASHELP.CLASS. Here is the code: data c1; set sashelp.class; * define a new character variable to classify someone as tall or...

Data wrangling : Cleansing – Regular expressions (1/3)

August 16, 2017

Data wrangling is the process of importing, cleaning and transforming raw data into actionable information for analysis. It is a time-consuming process that is estimated to take about 60-80%...

Understanding overfitting: an inaccurate meme in supervised learning

August 16, 2017

Preamble: There is a lot of confusion among practitioners regarding the concept of overfitting. It seems that a kind of urban legend or meme, a piece of folklore, is circulating in data...

MODIStsp 1.3.3 is out – Speeding things up and squashing some bugs !

August 16, 2017

A new version of MODIStsp (1.3.3) is on CRAN as of today! Below, you can find a short description of the main improvements. Processing speed improvements: Processing of...

RStudio 1.1 Preview – Data Connections

August 15, 2017

Today, we’re continuing our blog series on new features in RStudio 1.1. If you’d like to try these features out for yourself, you can download a preview release of...

When A Fire Starts to Burn – Fiery 1.0 released

August 15, 2017

I’m pleased to announce that fiery has been updated to version 1.0 and is now available on CRAN. As the version bump suggests, this is a rather major update to...

Buzzfeed trains an AI to find spy planes

August 15, 2017

Last year, Buzzfeed broke the story that US law enforcement agencies were using small aircraft to observe points of interest in US cities, thanks to analysis of public flight-records...

Working with air quality and meteorological data Exercises (Part-1)

August 15, 2017

Atmospheric air pollution is one of the most important environmental concerns in many countries around the world, and it is strongly affected by meteorological conditions. Accordingly, in this set...

Simple practice: basic maps with the Tidyverse

August 15, 2017

To master data science, you need to practice. This sounds easy enough, but in reality, many people have no idea how to practice. Ultimately, you need to ...

Magick 1.0: 🎩 ✨🐇 Advanced Graphics and Image Processing in R

August 15, 2017

Last week, version 1.0 of the magick package appeared on CRAN: an ambitious effort to modernize and simplify high quality image processing in R. This R package builds upon...

set_na_where(): a nonstandard evaluation use case

August 14, 2017

In this post, I describe a recent case where I used rlang’s tidy evaluation system to do some data-cleaning. This example is not particularly involved, but it demonstrates a basic...

#9: Compacting your Shared Libraries

August 14, 2017

Welcome to the ninth post in the recognisably rancid R randomness series, or R⁴ for short. Following on the heels of last week's post, we aim to look into...

rstudio::conf(2018): Contributed talks, e-posters, and diversity scholarships

August 14, 2017

rstudio::conf, the conference on all things R and RStudio, will take place February 2 and 3, 2018 in San Diego, California, preceded by Training Days on January 31 and...

Reproducibility: A cautionary tale from data journalism

August 14, 2017

Timo Grossenbacher, data journalist with Swiss Radio and TV in Zurich, had a bit of a surprise when he attempted to recreate the results of one of the R...

acs v2.1.1 is now on CRAN

August 14, 2017

A new version of the acs package is now on CRAN. I recommend that all users of choroplethr update to this version. Here is how to...

Big Data Solutions: A/B t test

August 14, 2017

@drsimonj here to share my code for using Welch’s t-test to compare group means using summary statistics. Motivation: I’ve just started working with A/B tests that use big data. Where once...

Sending Emails from R Exercises

August 14, 2017

When monitoring a data source, model, or other automated process, it’s convenient to have a method for easily delivering performance metrics and notifying you whenever something is amiss. One option...

A Stan case study, sort of: The probability my son will be stung by a bumblebee

August 14, 2017

The Stan project for statistical computation has a great collection of curated case studies which anybody can contribute to, maybe even me, I was thinking. But I don’t have...

Treating your data: The old school vs tidyverse modern tools

August 14, 2017

By Gabriel Vasconcelos. When I first started using R there was no such thing as the tidyverse. Although some of the tidyverse packages were available independently, I learned to...

Shinydashboards from right to left (localizing a shinydashboard to Hebrew)

August 14, 2017

Post by Adi Sarid (Sarid Institute for Research Services LTD.) Lately I’ve been working a lot with the shinydashboard library. Like shiny, it allows any R programmer to harness the power...

Parse an Online Table into an R Dataframe – Westgard’s Biological Variation Database

Background: From time to time I have wanted to bring an online table into an R dataframe. While in principle the data can be cut and pasted into Excel,...

Supervised Learning in R: Regression

August 13, 2017

We are very excited to announce a new (paid) Win-Vector LLC video training course: Supervised Learning in R: Regression, now available on DataCamp. The course is primarily authored by...

End-to-end visualization using ggplot2

August 13, 2017

ggplot2 is kind of a household word for R users. I’ve ended up using it for complex data munging and wrangling work, where I needed to get clarity on...

RProtoBuf 0.4.10

August 13, 2017

RProtoBuf provides R bindings for the Google Protocol Buffers ("ProtoBuf") data encoding and serialization library used and released by Google, and deployed fairly widely in numerous projects as a...

Hacking statistics or: How I Learned to Stop Worrying About Calculus and Love Stats Exercises (Part-6)

August 13, 2017

Statistics are often taught in school by and for people who like Mathematics. As a consequence, in those classes emphasis is put on learning equations, solving calculus problems and...

The Hitchhiker’s Guide to Ggplot2 in R

August 13, 2017

Published: 2016-11-30 Updated: 2017-08-14 "Any bleeder knows that books are never finished, only abandoned." César A. Hidalgo About the book You can find the book here. In the last update we changed...

R⁶ — Exploring macOS Applications with codesign, Gatekeeper & R

August 13, 2017

(General reminder abt “R⁶” posts in that they are heavy on code-examples, minimal on expository. I try to design them with 2-3 “nuggets” embedded for those who take the...

Introducing reqres

August 12, 2017

I’m very happy to announce that reqres has been released on CRAN. reqres is a new (in R context) approach to working with HTTP messages, that is, the requests you...

Hurricane Irene at the Delaware Estuary Revisited

August 12, 2017

Back in August 2011, Hurricane Irene struck the mid-Atlantic coast. This animated graph shows how the storm surge from Irene and the terrestrial flooding from Irene and Tropical Storm...
