Extracting data on shadow economy from PDF tables

December 25, 2016
By
Extracting data on shadow economy from PDF tables

Data on the shadow economy? I’m reading Kenneth Rogoff’s The Curse of Cash. It was one of Bloomberg’s Best Books of 2016 and the Financial Times’ Best Economics Books of 2016, and I recommend it. It’s an excellent and convincing book, makin...

Read more »

Christmas Tree with ggplot

December 25, 2016
By
Christmas Tree with ggplot

# create data x <- c(8,7,6,7,6,5,6,5,4,5,4,3,4,3,2,3,2,1,0.5,0.1) dat1 <- data.frame(x1 = 1:length(x), x2 = x) dat2 <- data.frame(x1 = 1:length(x), x2 = -x) dat1$xvar <- dat2$xvar <- NA dat1$yvar <- dat2$yvar <- NA dat1$siz <- dat2$siz <- NA dat1$col <- dat2$col dec_threshold){ dat1$xvar <- row #sample(1:dat1$x1,1) dat1$yvar <- sample(1:dat1$x2-1,1) dat1$siz <- runif(1,0.5,1.5) dat1$col dec_threshold){ dat2$xvar <-

Read more »

Computing Sample Size for Variance Estimation

December 24, 2016
By
Computing Sample Size for Variance Estimation

The R package samplesize4surveys contains functions that allow to calculate sample sizes for estimating proportions, means, difference of proportions and even difference of two means. It also permits the calculation of sample error and power level for ...

Read more »

Distributional Semantics in R: Part 1 {tm} classes + read/write

December 24, 2016
By
Distributional Semantics in R: Part 1 {tm} classes + read/write

The R code for this tutorial on Methods of Distributional Semantics in R is found in the respective GitHub repository. Following my Methods of Distributional Semantics in R BelgradeR Meetup with Data Science Serbia, organized in Startit Center, Belgrade, 11/30/2016, several people asked me for the R code used for the analysis of William Shakespeare’s...

Read more »

New Release of ggguitar available on CRAN

December 24, 2016
By
New Release of ggguitar available on CRAN

Based on feedback ggguitar has been updated and released on CRAN.  The updated vignette includes more examples of how the package can be used.  Example below as well: Update - Blogger was reformatting the R code - so made it available in this gist instead.Merry Christmas.

Read more »

anytime 0.2.0: Feature, fixes and tests!

December 24, 2016
By

A brand new anytime package just arrived at CRAN. This is release number eight, evenly spread with over two per month, since the initial release in September. Needless to say I have been told off not to make this many releases. As they say, no good deed goes unpunished. anytime is a very focused package aiming to...

Read more »

Does replyr::let work with data.table?

December 24, 2016
By
Does replyr::let work with data.table?

I’ve been asked if the adapter “let” from our R package replyr works with data.table. My answer is: it does work. I am not a data.table user so I am not the one to ask if data.table benefits a from a non-standard evaluation to standard evaluation adapter such as replyr::let. Using replyr::let with data.table looks … Continue...

Read more »

Functional programming and unit testing for data munging with R available on Leanpub

December 23, 2016
By

The book I’ve been working on these pasts months (you can read about it here, and read it for free here) is now available on Leanpub! You can grab a copy and read it on your ebook reader or on your computer, and what’s even better is that it is av...

Read more »

Classification with Linear Discriminant Analysis

December 23, 2016
By

Classification with linear discriminant analysis is a common approach to predicting class membership of observations. A previous post explored the descriptive aspect of linear discriminant analysis with data collected on two groups of beetles. In this post, we will use the discriminant functions found in the first post to classify... The post Classification with Linear Discriminant Analysis appeared...

Read more »

Get Ready for RStudio::Conf

December 23, 2016
By

by Joseph Rickert The 2017 R Conference season will get off to an early start on January 13th and 14th with RStudio::Conf 2017 in Orlando, Florida. The schedule promises an intense but collegial experience with plenty of hands-on practice working with R and the RStudio tool chain of packages and products. To prepare for the

Read more »

Merry ChRistmas!

December 23, 2016
By
Merry ChRistmas!

Christmas day is soon upon us, so here's a greeting made with R: Each frame is a Voronoi Tesselation: about 1,000 points are chosen across the plane, which each generate a polygon comprising the region closer to it than any other selected point. These process is repeated for three designs (a heart, the word "Merry", and the word "Xmas"),...

Read more »

Did you say SQL Server? Yes I did….

December 23, 2016
By
Did you say SQL Server? Yes I did….

Introduction My last blog post in 2016 on SQL Server 2016….. Some years ago, I have heard predictions from ‘experts‘ that within a few years Hadoop / Spark systems would take over traditional RDBMS’s like SQL Server. I don’t think … Continue reading →

Read more »

forecastHybrid 0.3.0 on CRAN

December 23, 2016
By
forecastHybrid 0.3.0 on CRAN

Make it easy to make ensemble time series forecast forecastHybrid is an R package to make it easier to use the average predictions of ‘ensembles’ (or ‘combinations’) of time series models from Rob Hyndman’s forecast package. It looks after t...

Read more »

Price Volatility – Basic Brownian Motion

December 23, 2016
By
Price Volatility – Basic Brownian Motion

The Situation You are a consultant who has been hired by a business that sells one commodity product. On December 31st the price is $100 per unit. The business owner wants to know what to expect by the end of January. Your client gave you the message: Prices are based off the the sales the

Read more »

finch – parse Darwin Core files

December 23, 2016
By

finch has just been released to CRAN (binaries should be up soon). finch is a package to parse Darwin Core files. Darwin Core is: a body of standards. It includes a glossary of terms (in other contexts these might be called properties, elements, fields, columns, attributes, or concepts) intended to facilitate the sharing of information about biological diversity by...

Read more »

Merry Christmas 2016 (with R)

December 22, 2016
By
Merry Christmas 2016 (with R)

I'd like to wish all my readers a Merry Christmas 2016- R style! Behold my 3d Christmas tree created using the plot3D R package: While this might seem like yet another Christmas decoration done in R, it is unique in that the tree is rendered in 3d perspective. I myself wrote...

Read more »

Ordering Categories within ggplot2 Facets (followup)

December 22, 2016
By
Ordering Categories within ggplot2 Facets (followup)

I saw Simon Jackson’s recent blog post regarding ordering categories within facets. He proposed a way of dealing with the problem of ordering variables shared across facets within facets. This problem becomes apparent in text analysis where words are shared … Continue reading →

Read more »

Price Volatility – Basic Brownian Motion

December 22, 2016
By
Price Volatility – Basic Brownian Motion

You are a consultant who has been hired by a business that sells one commodity. The business owner wants to know what to expect by the end of January.

Read more »

Kindle Clippings

December 22, 2016
By

I highlight a lot of junk on my Kindle. Well, it’s not all junk! 💩 There’s usually some good stuff buried deep within my clippings.txt file. But it’s hard to manually parse through the file (and the junk). In the past I’ve relied on online ...

Read more »

Goodreads API

December 22, 2016
By
Goodreads API

It’s December 23rd and I’ve only read 49 books. Whoops. There’s still time, but it’s definitely getting dicey. I’m about halfway through three books right now so I think I’ll be able to pull it off. Fingers crossed. Of course, last year I ...

Read more »

suRprise! – Classifying Kinder Eggs by Boosting

December 22, 2016
By
suRprise! – Classifying Kinder Eggs by Boosting

Abstract Carrying the Danish tradition of Juleforsøg to the realm of statistics, we use R to classify the figure content of Kinder Eggs using boosted regression trees for the egg's weight and possible rattling noises. This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. The markdown+Rknitr source code...

Read more »

Start me up

December 22, 2016
By

The startup package makes it easy to control your R startup processes and to share part of your startup settings with others (e.g. as a public Git repository) while keeping secret parts to yourself. Instead of having long and windy .Renviron and .Rpro...

Read more »

Pipes (%>%) Everywhere

December 22, 2016
By

An R user asked a question regarding whether it’s possible to have the RStudio pipe (%>%) shortcut (Cmd-Shift-M) available in other macOS applications. If you’re using Alfred then you can use this workflow for said task (IIRC this requires an Alfred license which is reasonably cheap). When you add it to Alfred you must edit... Continue reading...

Read more »

Take a Test Drive of the Linux Data Science Virtual Machine

December 22, 2016
By
Take a Test Drive of the Linux Data Science Virtual Machine

If you've been thinking about trying out the Data Science Virtual Machine on Linux, but don't yet have an Azure account, you can now take a free test drive -- no credit card required! Just visit the Linux DSVM Marketplace page and click the blue button: The Linux Data Science Virtual Machine includes all of the tools a modern...

Read more »

Comparative examples using replyr:let

December 22, 2016
By
Comparative examples using replyr:let

Consider the problem of “parametric programming” in R. That is: simply writing correct code before knowing some details, such as the names of the columns your procedure will have to be applied to in the future. Our latest version of replyr::let makes such programming easier. Archie’s Mechanics #2 (1954) copyright Archie Publications (edit: great news! … Continue...

Read more »

Model Evaluation 2

December 22, 2016
By
Model Evaluation 2

We are committed to bringing you 100% authentic exercise sets. We even try to include as different datasets as possible to give you an understanding of different problems. No more classifying Titanic dataset. R has tons of datasets in its library. This is to encourage you to try as many datasets as possible. We will

Read more »

DataCamp’s 2017 Conference Guide

December 22, 2016
By
DataCamp’s 2017 Conference Guide

2017 is bound to be an exciting year in Data Science. Here's DataCamp's list of conferences that we're most excited about in the new year. Whether you're an R user, a Python hacker, or just a general data science fan - you're sure to find a great confe...

Read more »

Euler Problem 4: Largest Palindromic Product

December 22, 2016
By
Euler Problem 4: Largest Palindromic Product

Solution to Euler Problem 4: Find the largest palindrome made from the product of two 3-digit numbers. Continue reading → The post Euler Problem 4: Largest Palindromic Product appeared first on The Devil is in the Data.

Read more »

Exploring the European Social Survey (ESS) – pipe-friendly workflow with sjmisc, part 2 #rstats #tidyverse

December 22, 2016
By
Exploring the European Social Survey (ESS) – pipe-friendly workflow with sjmisc, part 2 #rstats #tidyverse

This is another post of my series about how my packages integrate into a pipe-friendly workflow. The post focusses on my sjmisc-package, which was just updated on CRAN, and highlights some of the new features. Examples are based on data from the European Social Survey, which are freely available. Steps of the data analysis process

Read more »

Sponsors

Mango solutions







Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de







CRC R books series







Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.