Constant Expiry VIX Futures (Using Public Data)

May 17, 2017
By
Constant Expiry VIX Futures (Using Public Data)

This post will be about creating constant expiry (E.G. a rolling 30-day contract) using VIX settlement data from the CBOE … Continue reading →

Read more »

Easily add images to a correspondence analysis plot in R

May 17, 2017
By
Easily add images to a correspondence analysis plot in R

You can take your correspondence analysis plots to the next level by including images. Better still, you don’t need to paste in the images after the analysis is complete – you can include them right...

Read more »

Phylomoji with ggtree and emojifont

May 16, 2017
By
Phylomoji with ggtree and emojifont

With ggtree (Yu et al. 2017), it is very easy to create phylomoji. Emoji is internally supported by ggtree. Use emoji in taxa labels library(ggtree) tree_text <- "(((((cow, (whale, dolphin)), (pig2, boar)), camel), fish), seedling);" x <- read.tree(text=tree_text) ggtree(x, linetype="dashed", color='firebrick') + xlim(NA, 7) + ylim(NA, 8.5) + geom_tiplab(aes(color=label), parse='emoji', size=14, vjust=0.25) + labs(title="phylomoji",...

Read more »

Phylomoji with ggtree and emojifont

May 16, 2017
By
Phylomoji with ggtree and emojifont

With ggtree (Yu et al. 2017), it is very easy to create phylomoji. Emoji is internally supported by ggtree. Use emoji in taxa labels library(ggtree) tree_text <- "(((((cow, (whale, dolphin)), (pig2, boar)), camel), fish), seedling);" x <- read.tree(text=tree_text) ggtree(x, linetype="dashed", color='firebrick') + xlim(NA, 7) + ylim(NA, 8.5) + geom_tiplab(aes(color=label), parse='emoji', size=14, vjust=0.25) + labs(title="phylomoji",...

Read more »

Upcoming Rcpp Talks

Very excited about the next few weeks which will cover a number of R conferences, workshops or classes with talks, mostly around Rcpp and one notable exception: May 19: Rcpp: From Simple Examples to Machine learning, pre-conference workshop at our R...

Read more »

When in Doubt, Upgrade

When in Doubt, Upgrade

I have said this many times in several Github issues and it is also the very first FAQ of knitr. It is a little exaggerative and certainly not a universal rule, but it helps a lot if you could try to upgrade everything before you report a software issue. It is even more helpful if you could also...

Read more »

timekit: New Documentation, Function Improvements, Forecasting Vignette

timekit: New Documentation, Function Improvements, Forecasting Vignette

We’ve just released timekit v0.3.0 to CRAN. The package updates include changes that help with making an accurate future time series with tk_make_future_timeseries() and we’ve added a few features to tk_get_timeseries_signature(). Most important ar...

Read more »

Databases using R

May 16, 2017
By
Databases using R

Current State Using databases is unavoidable for those who analyze data as part of their jobs. As R developers, our first instinct may be to approach databases the same way we do regular files. We may attempt to read the data either all at once or as few times as possible. The aim is to reduce the number of times...

Read more »

R in Financial Services: Challenges and Opportunities

May 16, 2017
By

At the New York R Conference earlier this year, my colleague Lixun Zhang gave a presentation on the challenges and opportunites financial services companies encounter when using R. In the talk, he shares some lessons learned while working with an couple of international banks that have been using SAS, but are transitioning workloads to R. The talk also includes...

Read more »

Here is the new padr

May 16, 2017
By
Here is the new padr

I am very happy to announce v0.3.0 of the padr package, which was introduced in January. As requested by many, you are now able to use intervals of which the unit is different from 1. In earlier version the eight interval values only allowed for a single unit (e.g. year, day, hour). Now you can use any time period...

Read more »

Dueling Data Science Surveys: KDnuggets & Rexer Go Live

May 16, 2017
By

What tools do we use most for data science, machine learning, or analytics? Python, R, SAS, KNIME, RapidMiner,…? How do we use them? We are about to find out as the two most popular surveys on data science tools have … Continue reading →

Read more »

R⁶ — Using R With Amazon Athena & AWS Temporary Security Credentials

May 16, 2017
By

Most of the examples of working with most of the AWS services show basic username & password authentication. That’s all well-and-good, but many shops use the AWS Security Token Service to provide temporary credentials and session tokens to limit exposure and provide more uniform multi-factor authentication. At my workplace, Frank Mitchell created a nice electron... Continue reading...

Read more »

What We Learned From Teaching 1M People Data Science

May 16, 2017
By
What We Learned From Teaching 1M People Data Science

Learning Data Science By Doing DataCamp’s goal is to build the best learning platform for data science. Our philosophy is that you learn data science by doing: our high-quality videos, in-browser coding, and gamification provide learners with an eng...

Read more »

It can be easy to explore data generating mechanisms with the simstudy package

May 16, 2017
By
It can be easy to explore data generating mechanisms with the simstudy package

I learned statistics and probability by simulating data. Sure, I battled my way through proofs, but I never believed the results until I saw it in a simulation. I guess I have it backwards, it worked for me. And now that I do this for a living, I continue to use simulation to understand models, … Continue...

Read more »

Spinning Globes With R

May 16, 2017
By
Spinning Globes With R

It has been a long held dream of mine to create a spinning globe using nothing but R (I wish I was joking, but I’m not). Thanks to the brilliant mapmate package created by Matt Leonawicz and shed loads of computing power, today that dream became a reality. The globe below took 19 hours and 30

Read more »

I don’t know Fisher’s exact test, but I know Stan

May 15, 2017
By
I don’t know Fisher’s exact test, but I know Stan

A few days ago, I watched a terrific lecture by Bob Carpenter on Bayesian models. He started with a Bayesian approach to Fisher’s exact test. I had never heard of this classical procedure, so I was curious to play with the example. In this post, I use the same data that he used in the lecture and in an earlier, pre-Stan...

Read more »

shinyHeatmaply – a shiny app for creating interactive cluster heatmaps

May 15, 2017
By
shinyHeatmaply – a shiny app for creating interactive cluster heatmaps

My friend Jonathan Sidi and I (Tal Galili) are pleased to announce the release of shinyHeatmaply (0.1.0): a new Shiny application (and Shiny gadget) for creating interactive cluster heatmaps. shinyHeatmaply is based on the heatmaply R package which strives to make it easy as possible to create interactive cluster heatmaps. The app introduces a functionality that saves to disk a self … Continue...

Read more »

A Note on missing()

I had misunderstood the function missing() for several years. Originally I thought it only applies to an argument that does not have a default or user-specified value. For example, this is fairly easy to understand: f = function(x) { missing(x) } f() # should be TRUE One day I was surprised to find that this also returned TRUE: f =...

Read more »

shinyMlr

May 15, 2017
By

shinyMlr is a web application, built with the R-package “shiny” that provides a user interface for mlr. By wrapping the main functionalities of mlr into our app, as well as implementing additional features for data visualisation and data preprocessing, we built a widely usable application for your day to day machine learning tasks, which we would like to present to you today.

Read more »

R and Python support now built in to Visual Studio 2017

May 15, 2017
By
R and Python support now built in to Visual Studio 2017

The new Visual Studio 2017 has built-in support for programming in R and Python. For older versions of Visual Studio, support for these languages has been available via the RTVS and PTVS add-ins, but the new Data Science Workloads in Visual Studio 2017 make them available without a separate add-in. Just choose the "Data Science and analytical applications" option...

Read more »

Pretty scatter plots with ggplot2

May 15, 2017
By
Pretty scatter plots with ggplot2

@drsimonj here to make pretty scatter plots of correlated variables with ggplot2! We’ll learn how to create plots that look like this:  Data In a data.frame d, we’ll simulate two correlated variables a and b of length n: set.seed(170513) n <- 200 d <- data.frame(a = rnorm(n)) d$b <- .4 * (d$a + rnorm(n)) head(d) #>...

Read more »

Instrumental Variables in R exercises (Part-1)

May 15, 2017
By
Instrumental Variables in R exercises (Part-1)

One of the most frequently encountered issues in econometrics is endogeneity. Consider the simple Ordinary Least Squares (OLS) regression setting in which we model wages as a function of years of schooling (education): One of the main assumption of OLS is that the independent variables are not correlated with the error term. However, this is Related exercise sets:

Read more »

Three-Way Analysis of Variance: Simple Second-Order Interaction Effects and Simple Main Effects

May 15, 2017
By

In this article we will show how to run a three-way analysis of variance when both the third-order interaction effect and the second-order interaction effects are statistically significant. This type of analysis can become pretty tedious, especially when our factors have many levels, so we will try to explain it here as clearly as possible. … Continue...

Read more »

R⁶ — Tracking WannaCry Bitcoin Wallet Payments with R

May 15, 2017
By
R⁶ — Tracking WannaCry Bitcoin Wallet Payments with R

If you follow me on Twitter or monitor @Rapid7’s Community Blog you know I’ve been involved a bit in the WannaCry ransomworm triage. One thing I’ve been doing is making charts of the hourly contribution to the Bitcoin addresses that the current/main attackers are using to accept ransom payments (which you really shouldn’t pay, now,... Continue reading...

Read more »

Looking forward to RFinance – Chicago

May 15, 2017
By

- I’m looking forward to attending R in Finance conference in Chicago, next friday (2017-05-09). The program looks great! I am really happy, and a bit surprised, to see so many presentations related to market microstructure in...

Read more »

Scraping Fiverr To Analyze Freelancing Market Trends

Scraping Fiverr To Analyze Freelancing Market Trends

Fiverr When deciding on a website for my web scraping project, I thought of Fiverr's captivating subway ads. Fiverr is a digital marketplace for buying and The post Scraping Fiverr To Analyze Freelancing Market Trends appeared first on NYC Data Science Academy Blog.

Read more »

Building a Medicare Shiny App – Part 1

May 15, 2017
By
Building a Medicare Shiny App – Part 1

Hello R community. if you’re up for some fun tinkering with a Shiny App please join me on a new project. I would love to see some collaboration in designing a Shiny Application which will help people make a decision about a healthcare provider. I have only just begun on this project but would to

Read more »

Predicting the Baseball Hall of Fame

May 14, 2017
By
Predicting the Baseball Hall of Fame

Intro The Great Bambino. The Big Unit. Joltin' Joe. Henry Rowengartner. If you're familiar with the sport of baseball, you might recognize some of these names The post Predicting the Baseball Hall of Fame appeared first on NYC Data Science Academy Blog.

Read more »

Network analysis of Game of Thrones family ties

May 14, 2017
By
Network analysis of Game of Thrones family ties

In this post, I am exploring network analysis techniques in a family network of major characters from Game of Thrones. Not surprisingly, we learn that House Stark (specifically Ned and Sansa) and House Lannister (especially Tyrion) are the most import...

Read more »

Sponsors

Mango solutions







Zero Inflated Models and Generalized Linear Mixed Models with R

r-brain.io



Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de





CRC R books series







Six Sigma Online Training





omictools

Contact us if you wish to help support R-bloggers, and place your banner here.