## Track changes in data with the lumberjack %>>%

June 23, 2017
So you are using this pipeline to have data treated by different functions in R. For example, you may be imputing some missing values using the simputation package. Let us first load the only realistic dataset in R > data(retailers, …

## The R community is one of R’s best features

June 23, 2017
R is incredible software for statistics and data science. But while the bits and bytes of software are an essential component of its usefulness, software needs a community to...

## Logarithmic Scale Explained with U.S. Trade Balance

June 23, 2017
Skewed data prevail in real life. Unless you observe trivial or near-constant processes, data is skewed one way or another due to outliers, long tails, errors or something...

## Hey! You there! You are welcome here

June 23, 2017
What's that? You've heard of R? You use R? You develop in R? You know someone else who's mentioned R? Oh, you're breathing? Well, in that case, welcome! Come...

## Face Recognition in R

June 22, 2017
OpenCV is an incredibly powerful tool to have in your toolbox. I have had a lot of success using it in Python but very little...

## May New Package Picks

June 22, 2017
Two hundred and twenty-nine new packages were submitted to CRAN in May. Here are my picks for the “Top 40”, organized...

## Set Theory Arbitrary Union and Intersection Operations with R

June 22, 2017
Part 3 of 3 in the series Set Theory. The union and intersection set operations were introduced in a previous post using two sets, and . These set operations can...

## RTutor: Emission Certificates and Green Innovation

Which policy instruments should we use to cost-effectively reduce greenhouse gas emissions? For a given technological level there are many economic arguments in favour of tradeable emission certificates or...

## Interactive R visuals in Power BI

June 22, 2017
Power BI has long had the capability to include custom R charts in dashboards and reports. But in sharp contrast to standard Power BI visuals, these R charts were...

## Two years as a Data Scientist at Stack Overflow

June 22, 2017
Last Friday marked my two year anniversary working as a data scientist at Stack Overflow. At the end of my first year I wrote a blog post about...

## Online portfolio allocation with a very simple algorithm

June 22, 2017
By Yuri Resende. Today we will use an online convex optimization technique to build a very simple algorithm for portfolio allocation. Of course this is just an illustrative...

## Data wrangling : Reshaping

June 22, 2017
Data wrangling is a task of great importance in data analysis. Data wrangling is the process of importing, cleaning and transforming raw data into actionable information for analysis. It...

## nanotime 0.2.0

June 22, 2017
A new version of the nanotime package for working with nanosecond timestamps just arrived on CRAN. nanotime uses the RcppCCTZ package for (efficient) high(er) resolution time...

## Can we predict flu deaths with Machine Learning and R?

June 22, 2017
Among the many R packages, there is the outbreaks package. It contains datasets on epidemics, one of which is from the 2013 outbreak of influenza A H7N9 in China,...

## Introducing Community Tutorials

June 22, 2017
Today we’re introducing Datazar Community Tutorials. At Datazar, we love writing tutorials and how-tos on R, Python, D3, research and science best practices in general. So starting today, we’re...

## All the fake data that’s fit to print

June 22, 2017
charlatan makes fake data. Excited to announce a new package called charlatan. While perusing packages from other programming languages, I saw a neat Python library called faker. charlatan is inspired by and ports...

## Plotting partial pooling in mixed-effects models

June 21, 2017
In this post, I demonstrate a few techniques for plotting information from a relatively simple mixed-effects model fit in R. These plots can help us develop intuitions about what these...

## RcppCCTZ 0.2.3 (and 0.2.2)

June 21, 2017
A new minor version 0.2.3 of RcppCCTZ is now on CRAN. RcppCCTZ uses Rcpp to bring CCTZ to R. CCTZ is a C++ library for translating between absolute and...

## Large integers in R: Fibonacci number with 1000 digits, Euler Problem 25

June 21, 2017
Solution to Euler Problem 25 in the R language. What is the index of the first term in the Fibonacci sequence to contain 1000 digits?

## Confidence Intervals without Your Collaborator’s Tears

June 21, 2017
Abstract: We provide an interpretation for the confidence interval for a binomial proportion hidden as the transcript of a hypothetical statistical consulting session. ...

## Updated Data Science Virtual Machine for Windows: GPU-enabled with Docker support

June 21, 2017
The Windows edition of the Data Science Virtual Machine (DSVM), the all-in-one virtual machine image with a wide collection of open-source and Microsoft data science tools, has been updated to...

## Neural networks Exercises (Part-3)

June 21, 2017
Neural networks have become a cornerstone of machine learning in the last decade. Created in the late 1940s with the intention to create computer programs that mimic the...

## Call for Help: R/Shiny Developer

June 21, 2017
Dear Fantasy Football Analytics Community, Four years ago, we released web apps to help people make better decisions in fantasy football based on the wisdom of the crowd.  Over the...

## Importing and Managing Financial Data

June 21, 2017
I'm excited to announce my DataCamp course on importing and managing financial data in R! I'm also honored that...

## Analytics Administration for R

June 20, 2017
Analytic administrator is a role that data scientists assume when they onboard new tools, deploy solutions, support existing standards, or train...

## Smoothing a time-series with a Bayesian model

June 20, 2017
Recently I looked at fitting a smoother to a time-series using Bayesian modelling. Now I will look at how you can control the smoothness by using...

## R leads, Python gains in 2017 Burtch Works Survey

June 20, 2017
For the past four years, recruiting firm Burtch Works has conducted a simple survey of data scientists with just one question: "Which do you prefer to use — SAS,...

## An Out of Sample Update on DDN’s Volatility Momentum Trading Strategy and Beta Convexity

June 20, 2017
The first part of this post is a quick update on Tony Cooper of Double Digit Numerics’s volatility ETN momentum …

