Check Data Quality with padr

June 26, 2017

The padr package was designed to prepare datetime data for analysis. That is, to take raw, timestamped data, and quickly convert it into a tidy format that can be analyzed with all the tidyverse tools. Recently, a colleague and I discovered a second use for the package that I had not anticipated: checking data quality. Every analysis should contain...

Data Munging With R Preview — Storing Values (Assigning)

June 26, 2017

Since about October last year, I’ve been writing an introduction to R book. It’s been quite the experience. I’ve finally started making time to document some of the interesting...

pixel art of ggplot2 faceting using geofacet

June 26, 2017

I just discovered an interesting ggplot2 extension, geofacet, that supports arranging facet panels in a layout that mimics geographic topology. After playing with it, I realized that it is not only for visualizing...

H2O Benchmark for CSV Import

June 25, 2017

The importFile() function in H2O is extremely efficient because it reads files in parallel. The benchmark comparison below shows that it is comparable to read.df() in SparkR and significantly...

Hex stickers for the forecast package

June 25, 2017
I’ve caved in to the hex sticker craze, and produced some hex stickers for the forecast package for R. If you attend a workshop I teach, I’ll give you...

Data visuals notes for my talks in 2017

June 25, 2017

Supplementary notes for CJ Brown’s talks on dataviz in 2017 for Griffith University’s honours students and the UQ Winterschool in Bioinformatics. Visualising...

Data Visualization with googleVis exercises part 4

June 25, 2017

Adding Features to your Charts. We saw in the previous charts some basic and well-known types of charts that googleVis offers to users. Before continuing with other, more sophisticated...

R Weekly Bulletin Vol – XII

June 25, 2017

This week’s R bulletin will cover topics on how to resolve some common errors in R. Hope you like this R weekly bulletin. Enjoy reading! Shortcut Keys 1. Find...

Using Tweedie Parameter to Identify Distributions

June 24, 2017

In the development of operational loss models, it is important to identify which distribution should be used to model operational risk measures, e.g. frequency and severity. For instance, why...

Using tidycensus and leaflet to map Census data

June 23, 2017
Recently, I have been following the development and release of Kyle Walker’s tidycensus package. I have been filled with amazement, delight, and well, perhaps another feeling…

Track changes in data with the lumberjack %>>%

June 23, 2017

So you are using this pipeline to have data treated by different functions in R. For example, you may be imputing some missing values using the simputation package. Let...

The R community is one of R’s best features

June 23, 2017

R is incredible software for statistics and data science. But while the bits and bytes of software are an essential component of its usefulness, software needs a community to...

Logarithmic Scale Explained with U.S. Trade Balance

June 23, 2017

Skewed data prevail in real life. Unless you observe trivial or near-constant processes, data is skewed one way or another due to outliers, long tails, errors or something...

Working With SPSS© Data in R

I needed to import SPSS© data for work. There are several options; I have used both the foreign and haven R packages. I prefer haven because it integrates...

State-space modelling of the Australian 2007 federal election

June 23, 2017

Pooling the polls with Bayesian statistics. In an important 2005 article in the Australian Journal of Political Science, Simon Jackman set out a statistically-based approach to pooling polls in an...

Operations Research with R

June 23, 2017
By Stefan Feuerriegel

This blog entry concerns our course on “Operations Research with R” that we teach as part of our study program. We hope that the materials are of...

Hey! You there! You are welcome here

June 23, 2017

What's that? You've heard of R? You use R? You develop in R? You know someone else who's mentioned R? Oh, you're breathing? Well, in that case, welcome! Come...

Face Recognition in R

June 22, 2017

OpenCV is an incredibly powerful tool to have in your toolbox. I have had a lot of success using it in Python but very little...

May New Package Picks

June 22, 2017

Two hundred and twenty-nine new packages were submitted to CRAN in May. Here are my picks for the “Top 40”, organized...

Set Theory Arbitrary Union and Intersection Operations with R

June 22, 2017
Part 3 of 3 in the series Set Theory. The union and intersection set operations were introduced in a previous post using two sets. These set operations can...

RTutor: Emission Certificates and Green Innovation

Which policy instruments should we use to cost-effectively reduce greenhouse gas emissions? For a given technological level there are many economic arguments in favour of tradeable emission certificates or...

Interactive R visuals in Power BI

June 22, 2017

Power BI has long had the capability to include custom R charts in dashboards and reports. But in sharp contrast to standard Power BI visuals, these R charts were...

Two years as a Data Scientist at Stack Overflow

June 22, 2017

Last Friday marked my two year anniversary working as a data scientist at Stack Overflow. At the end of my first year I wrote a blog post about...

Online portfolio allocation with a very simple algorithm

June 22, 2017
By Yuri Resende

Today we will use an online convex optimization technique to build a very simple algorithm for portfolio allocation. Of course this is just an illustrative...

Data wrangling : Reshaping

June 22, 2017

Data wrangling is a task of great importance in data analysis. It is the process of importing, cleaning and transforming raw data into actionable information for analysis. It...

nanotime 0.2.0

June 22, 2017
A new version of the nanotime package for working with nanosecond timestamps just arrived on CRAN. nanotime uses the RcppCCTZ package for (efficient) high(er) resolution time...

Can we predict flu deaths with Machine Learning and R?

June 22, 2017

Among the many R packages, there is the outbreaks package. It contains datasets on epidemics, one of which is from the 2013 outbreak of influenza A H7N9 in China,...

Introducing Community Tutorials

June 22, 2017

Today we’re introducing Datazar Community Tutorials. At Datazar, we love writing tutorials and how-tos on R, Python, D3, research and science best practices in general. So starting today, we’re...

All the fake data that’s fit to print

June 22, 2017
charlatan makes fake data. Excited to announce a new package called charlatan. While perusing packages from other programming languages, I saw a neat Python library called faker. charlatan is inspired by and ports...
