How Much Have People Made on Bitcoin?

April 2, 2018
By
How Much Have People Made on Bitcoin?

Intro: How much has bitcoin made Society? This post will calculate the realized gains of bitcoin (up to year 2013) as a measure of how much society has made off of bitcoin. How are realized gains different from market capitalization? Realized gains measure the amount of money people can claim to have made off of bitcoin. This can be markedly different from...

Read more »

Inkognito – Sequential Bayesian Identity Disclosure

April 2, 2018
By
Inkognito – Sequential Bayesian Identity Disclosure

Abstract We provide Bayesian decision support for revealing the identity of opponents in the board game Inkognito. This includes the use of combinatorics to deduce the likelihood of observing a particular configuration and a sequential Bayesian belief updating scheme to infer opponent's identity. From a R point of view we use base R where it's best: manipulating matrices and supplement...

Read more »

How to Visualize Data With Highcharter

April 2, 2018
By
How to Visualize Data With Highcharter

INTRODUCTION Highcharter is a R wrapper for Highcharts javascript libray and its modules. Highcharts is very mature and flexible javascript charting library and it has a great and powerful API1. The main features of this package are: Various chart type with the same style: scatters, bubble, line, time series, heatmaps, treemap, bar charts, networks. Chart Related exercise sets:Vector exercises...

Read more »

Digital Marketplace. Six months later.

April 2, 2018
By
Digital Marketplace. Six months later.

Six months ago we considered the question: Is the Government realising its ambition for SMEs on G-Cloud? This article reviews progress six months later, additionally considering the Digital Outcomes & Specialists (DOS) framework. The post Digital Marketplace. Six months later. appeared first on thinkr.

Read more »

Visualizing geo-spatial data with sf and plotly

April 2, 2018
By
Visualizing geo-spatial data with sf and plotly

Need help with R, data viz, and/or stats? Work with me or attend my 2 day workshop! In my last post, we explored interactive visualizations of simple features (i.e., interactive maps) via ggplot2’s geom_sf() and plotly’s ggplotly(). This time we’ll make similar visualizations using plotly’s “non-ggplot2” mapping interfaces (namely plot_ly(), plot_geo(), and plot_mapbox()). Here’s a

Read more »

Learning from and improving upon ggplotly conversions

April 2, 2018
By
Learning from and improving upon ggplotly conversions

Need help with R, data visualization, and/or statistics? Hire me or attend my 2 day workshop! As the maintainer of the R package plotly, I’m certainly aware that ggplotly() is not perfect.1 And, even when conversion from ggplot2 to plotly ‘works’ it can leave some things to be desired. For example, it might take a

Read more »

R Tip: Think in Terms of Values

April 2, 2018
By

R tip: first organize your tasks in terms of data, values, and desired transformation of values, not initially in terms of concrete functions or code. I know I write a lot about coding in R. But it is in the service of supporting statistics, analysis, predictive analytics, and data science. R without data is like … Continue reading R...

Read more »

Assessing Shooting Performance in NBA and NCAA Basketball

April 2, 2018
By
Assessing Shooting Performance in NBA and NCAA Basketball

I wrote an open-source app called NBA Shots DB that uses the NBA Stats API to populate a database with all 4.5 million shots attempted in NBA games since 1996. The app also processes a dataset provided by Sportradar of over 1 million NCAA men’s shot attempts since 2013 into a format that can be merged with the NBA...

Read more »

Surviving Shelter: Analysis of Time Spent and Outcome in Dallas Animal Shelters

April 1, 2018
By
Surviving Shelter: Analysis of Time Spent and Outcome in Dallas Animal Shelters

In previous post we discovered Dallas Animal Services data sources (available on Dallas Open Data) and successfully analyzed how animals get admitted to and discharged from the city shelters. We loaded actual shelter records and looked at the types of admittance, different outcomes and their relationships. In this post we continue this analysis by focusing on the time animals spend...

Read more »

More Options For Querying DNS From R with 1.1.1.1

April 1, 2018
By
More Options For Querying DNS From R with 1.1.1.1

You have to have been living under a rock to not know about Cloudflare’s new 1.1.1.1 DNS offering. I won’t go into “privacy”, “security” or “speed” concepts in this post since that’s a pretty huge topic to distill for folks given the, now, plethora of confusing (and pretty technical) options that exist to support one... Continue reading →

Read more »

Approximations of Pi: A Random Walk though the Beauty of Pi

April 1, 2018
By

Various methods to approximate and visualise the digits of pi using the R computing language for statistics. The apparent randomness of the decimal expression of Pi is a source for beautiful visualisations. Continue reading → The post Approximations of Pi: A Random Walk though the Beauty of Pi appeared first on The Devil is in the Data.

Read more »

Writing functions for dplyr and ggplot2 – April 2, 2018

April 1, 2018
By
Writing functions for dplyr and ggplot2 – April 2, 2018

In my last two posts I have been writing about the task of using R to “drive” MS Excel. The first post focused on just the basic mechanics of getting my colleague what she needed. The second post picked up with some ugly inefficient code and made it better using lapply and a for loop, just good old fashioned...

Read more »

Performance: Avoid Coercing Indices To Doubles

April 1, 2018
By
Performance: Avoid Coercing Indices To Doubles

x or x? That is the question. Assume that we have a vector $x$ of $n = 100,000$ random values, e.g. __ n x idxs y y typeof(idxs) "integer" __ typeof(idxs + 1) "double" __ typeof(1) "double" Note also that doubles (aka “numerics” in R) take up twice the amount of memory: __ object.size(idxs) 400040 bytes __...

Read more »

RStudio GPU Workstations in the Cloud with Paperspace

April 1, 2018
By
RStudio GPU Workstations in the Cloud with Paperspace

We are very pleased to announce the availability of an RStudio TensorFlow template for the Paperspace cloud desktop service. If you don’t have local access to a modern NVIDIA GPU, your best bet is typically to run GPU intensive training jobs...

Read more »

Pimp my RMD: a few tips for R Markdown

April 1, 2018
By

Read more »

backed by data

April 1, 2018
By
backed by data

Minmising risk of delayed departure - This post is an attempt to answer this question: https://community.rstudio.com/t/how-to-answer-a-question-without-statistical-tests-but-is-backed-by-data/3711 It was a pretty interesting question, and I...

Read more »

oneliner – a new style guide for styler

I am happy to introduce oneliner, a package that implements the one-line-style as a third-party style guide ready to use with styler. Hence, after the tidyverse style guide, this is the first third-party style guide for styler I am aware of - and a particularly useful one.

Read more »

Compute/Visualize Drive Space Consumption of Your Installed R Packages

April 1, 2018
By
Compute/Visualize Drive Space Consumption of Your Installed R Packages

The fs package makes it super quick and easy to find out just how much “package hoarding” you’ve been doing: While you can modify the above and peruse the list of packages/directories in tabular format or programmatically, you can also do a bit more work to get a visual overview of package size (click/tap the... Continue reading →

Read more »

Tensorflow – Neural Network Training: Exercises

April 1, 2018
By
Tensorflow – Neural Network Training: Exercises

Deep learning is under active development. Papers with new approaches are being published every day. In this set of exercises we will go through some of the newer methods that boost the neural network’s performance. By the end of this post, you will be able to train neural networks with adaptive learning rates and apply Related exercise sets:Density-Based Clustering...

Read more »

Bump Chart

April 1, 2018
By
Bump Chart

Track performance over time - A Bump Chart is a special form of a line plot. This kind of plot is designed for exploring changes in rank over time. The focus here is usually on comparing the position or performance of multiple observation...

Read more »

Fastai Collaborative Filtering with R and Reticulate

March 31, 2018
By

Jeremy Howard and Rachel Thomas are founders of fast.ai whose aim is to make deep learning accessible to all. They offer a course called Practical Deep Learning for Coders (Part 1). The last session, taught by Jeremy, was in Fall 2017 and the videos were released early January 2018. Their approach is top down by showing different applications first...

Read more »

Le Monde puzzle [#1048]

March 31, 2018
By
Le Monde puzzle [#1048]

An arithmetic Le Monde mathematical puzzle: A magical integer m is such that the remainder of the division of any prime number p by m is either a prime number or 1. What is the unique magical integer between 25 and 100? And is there any less than 25? The question is dead easy to

Read more »

Package Paths in R

March 31, 2018
By
Package Paths in R

Recently, while working on the Azure Data Lake R extension, I had to figure out a good way to create a zip file containing a package together with all its dependencies. This came down to understanding where does R store and search for packages. Despite the documentation, it did require additional reading and experimentation. First, The post Package Paths...

Read more »

The 10 Data Science Crack Commandments

March 31, 2018
By
The 10 Data Science Crack Commandments

 It’s the ten crack commandments, what? homie can’t tell me nothing about this code Can’t tell me nothing about these #rstats Number 1, make a function from a script. Everyone knows we’re to busy to be copy/pasting shit http://adv-r.had.co.nz/Functions.html Number 2, never let ’em know your data manipulation moves. Don’t you know Bad Boys move in silence and

Read more »

Deaths per firearm violence event by @ellis2013nz

March 31, 2018
By
Deaths per firearm violence event by @ellis2013nz

Caution - this blog post discusses homicide and suicide statistics and could be upsetting. If you feel disturbed or unhappy then here is a list of ways to get some emergency counselling help around the world. A grim research question In my last post...

Read more »

Advanced Techniques With Raster Data – Part 3: Exercises

March 31, 2018
By
Advanced Techniques With Raster Data – Part 3: Exercises

Background In this post, the ninth of the geospatial processing series with raster data, I will focus on interpolating and modeling air surface temperature data recorded at weather stations. For this purpose I will explore regression-kriging (RK), a spatial prediction technique commonly used in geostatistics that combines a regression of the dependent variable (air temperature Related exercise sets:Spatial Data...

Read more »

Loops in R – Exercises

March 30, 2018
By
Loops in R – Exercises

Using loops is generally discouraged in R when it is possible to avoid them using vectorized alternatives. Vectorized solution are be both faster to write, read and execute – except sometimes they aren’t and the definition of vectorization isn’t always straightforward. In any event, solutions using loops can be: The fastest to prototype The easiest Related exercise sets:Scripting Loops...

Read more »

Use Python functions and modules in R with the "reticulate" package

March 30, 2018
By
Use Python functions and modules in R with the "reticulate" package

Since its inception over 40 years ago, when S (R's predecessor) was just a sketch on John Chambers' wall at Bell Labs, R has always been a language for providing interfaces. I was reminded of this during Dirk Eddelbuettel's presentation at the Chicago R User Group meetup last night, where he enumerated Chambers' three principles behind its design (from...

Read more »

Collapsing Categories or Values

March 29, 2018
By

Introduction I have received a few queries recently that can be categorized as “How do I collapse a list of categories or values into a shorter list of category or values?” For example, one user wanted to collapse species of fish into ...

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



Quantide: statistical consulting and training

ODSC2 west

ODSC1_jobs

datasociety

http://www.eoda.de

max kuhn

CRC R books series







Six Sigma Online Training



mljar.com

datazar.com



Contact us if you wish to help support R-bloggers, and place your banner here.