Hacking the Data Science Radar with Data Science

Hacking the Data Science Radar with Data Science

This post was first published by original author Duncan Garmonsway and reproduced with his kind permission. This post reverse-engineers the Mango Solutions Data Science Radar using Programming (R) Visualisation (ggplot2) Data wrangling (dpylr/tidyr/etc.) Modelling (lm) Technology (embedded V8 javascript) Communication … Continue reading →

Read more »

Macroeconomic charts by the Fed using R and Plotly

July 4, 2016
By

In this post we’ll try to replicate some of the charts created by the Federal Reserve which visualize some well known macroeconomic indicators. We’ll also showcase the new Plotly 4.0 syntax. Key Macroeconomic Indicators Monitory Policy Transmission Nominal and Real Fed Funds Rate Some other examples:

Read more »

The ggthemr package – Theme and colour your ggplot figures

July 4, 2016
By
The ggthemr package – Theme and colour your ggplot figures

Want better colours for ggplot2? "Ggthemr" is an R package that provides new colour themes and also the specification of your own colour palettes. Change the look and feel of your ggplot2 plots in R with two quick commands! Beautiful figures await!

Read more »

R for Publication by Page Piccinini: Lesson 6, Part 2 – Linear Mixed Effects Models

July 4, 2016
By
R for Publication by Page Piccinini: Lesson 6, Part 2 – Linear Mixed Effects Models

In today’s lesson we’ll continue to learn about linear mixed effects models (LMEM), which give us the power to account for multiple types of effects in a single model. This is Part 2 of a two part lesson. I’ll be taking for granted that you’ve completed Lesson 6, Part 1, so if you haven’t done that Lesson 6, Part...

Read more »

correlation matrices on copulas

July 3, 2016
By
correlation matrices on copulas

Following my post of yesterday about the missing condition in Lynch’s R code, Gérard Letac sent me a paper he recently wrote with Luc Devroye on correlation matrices and copulas. Paper written for the memorial volume in honour of Marc Yor. It considers the neat problem of the existence of a copula (on x…x) associated

Read more »

A Future for R: Slides from useR 2016

July 2, 2016
By
A Future for R: Slides from useR 2016

Unless you count DSC 2003 in Vienna, last week's useR conference at Stanford was my very first time at useR. It was a great event, it was awesome to meet our lovely and vibrant R community in real life, which we otherwise only get know from online interactions, and of course it was very nice to meet...

Read more »

A look at AirBnB demographics

July 2, 2016
By
A look at AirBnB demographics

Once in a while I use AirBnB. There are a couple of features that I (intuitively) use to judge if an apartment is save to book; ratings, images of the flat and the user avatar. Apparently, these avatars play an important part in the overall service and usage of AirBnB. A recent study finds that “Attractive Airbnb hosts are...

Read more »

Forest Plot (with Horizontal Bands)

July 2, 2016
By
Forest Plot (with Horizontal Bands)

Forest plots are often used in clinical trial reports to show differences in the estimated treatment effect(s) across various patient subgroups. See, for example a review. The page on Clinical Trials Safety Graphics includes a SAS code for a forest plot that depicts the hazard ratios for various patient subgroups (this web page has links to

Read more »

Combine ArcGIS and R – Clustering Toolbox

July 2, 2016
By
Combine ArcGIS and R – Clustering Toolbox

Last year at the ESRI User Conference in San Diego, there was an announcement of an initiative to bridge ArcGIS and R. This became reality I think early this year with R-Bridge.Basically, ESRI has created an R library that is able to communicate and ex...

Read more »

the curious incident of the inverse of the mean

July 1, 2016
By

A s I figured out while working with astronomer colleagues last week, a strange if understandable difficulty proceeds from the simplest and most studied statistical model, namely the Normal model x~N(θ,1) Indeed, if one reparametrises this model as x~N(υ⁻¹,1) with υ>0, a single observation x brings very little information about υ! (This is not a

Read more »

Run compiled R packages in AzureML

July 1, 2016
By
Run compiled R packages in AzureML

We've shown a few times here how you can run R code on data in the cloud with Azure ML Studio, and even how to enable that code as a web service to be called from other applications. But what if you want to run code in a compiled language, like C++? Fortunately, you can take advantage of R's...

Read more »

Animated world inequality map

Animated world inequality map

In my last post I had a first look (for me) at Estimated Household Income Inequality data from the University of Texas Inequality Project. These data came to my attention when Professor James K. Galbraith used them in his keynote presentation to the 2016 New Zealand Association of Economists conference. Some of the slides...

Read more »

Bio7 2.4 for Windows and Mac Released

July 1, 2016
By
Bio7 2.4 for Windows and Mac Released

A new release of Bio7 is available for Windows and MacOSX. The Linux version will be released soon, too. This release comes with a plethora of new functions, especially for the R editor. Download at: http://bio7.org Release notes: General Updated Bio7 to use Eclipse 4.6 (Neon) as RCP basis Updated the embedded Java Runtime Environment

Read more »

Euro 2016 analytics: Who’s playing the toughest game?

July 1, 2016
By
Euro 2016 analytics: Who’s playing the toughest game?

I am really enjoying Uefa Euro 2016 Footbal Competition, even because our national team has done pretty well so far. That’s why after  browsing for a while statistics section of official EURO 2016 website I decided to do some analysis on the data they share Just to be clear from the beginning: we are not talking Related Post

Read more »

RMarkdown and Metropolis/Mtheme

June 30, 2016
By

Nick Tierney asked on Twitter about rmarkdown and metropolis about whether folks had used RMarkdown-driven LaTeX Beamer presentations. And the answer is a firm hell yeah. I have been using mtheme (and/or a local variant I called 'm2') as well as the ...

Read more »

How to write good tests in R

June 30, 2016
By
How to write good tests in R

Testing is an often overlooked yet critical component of any software system. In some ways this is more true of …Continue reading →

Read more »

The useR! 2016 Tutorials

June 30, 2016
By

by Joseph Rickert Over the years I have seen several excellent tutorials at useR!conferences that were not only very satisfying "you had to be there" experiences but were also backed up with meticulously prepared materials of lasting value. This year, quite a few useR!20i6 tutorials measure up to this level of quality. My take on why things turned out...

Read more »

Join us at rstudio::conf 2017!

June 30, 2016
By
Join us at rstudio::conf 2017!

Following our initial and very gratifying Shiny Developer Conference this past January, which sold out in a few days, RStudio is very excited to announce a new and bigger conference today! rstudio::conf, the conference about all things R and RStudio, will take place January 13 and 14, 2017 in Orlando, Florida. The conference will feature

Read more »

Case Study: Customized R Training and a “Day 1” Curriculum

June 30, 2016
By
Case Study: Customized R Training and a “Day 1” Curriculum

Earlier this year I had the honor of training the research division of a financial services firm in R. I’ve been meaning to write a case study on this project for a while, but have put it off due to the size and complexity of the engagement. In this post I’ll limit myself to talking about The post

Read more »

3D density plot in R with Plotly

June 30, 2016
By
3D density plot in R with Plotly

In Bayesian nonparametrics, many models address the problem of density regression, including covariate dependent processes. These were settled by the pioneering works by  MacEachern (1999) who introduced the general class of dependent Dirichlet processes. The literature on dependent processes was developed in numerous models, such as nonparametric regression, time series data, meta-analysis, to cite but

Read more »

Boost Your Data Munging with R

June 29, 2016
By
Boost Your Data Munging with R

This article was first published on the toptal.com blog. Additionally be noticed that my blog is migrating to new host due to GitHub Pages drops support for RDiscount, Redcarpet, and RedCloth (Textile) markup engines. Old host will be still available but new posts will be published on jangorecki.gitlab.io, drop-in replacement after changing from github.io to...

Read more »

Boost Your Data Munging with R

June 29, 2016
By
Boost Your Data Munging with R

This article was first published on the toptal.com blog. Additionally be noticed that my blog is migrating to new host due to GitHub Pages drops support for RDiscount, Redcarpet, and RedCloth (Textile) markup engines. Old host will be still available but new posts will be published on jangorecki.gitlab.io, drop-in replacement after changing from github.io to...

Read more »

Computerworld’s advanced beginner’s guide to R

June 29, 2016
By
Computerworld’s advanced beginner’s guide to R

Many newcomers to R got their start learning the language with Computerworld's Beginner's Guide to R, a 6-part introduction to the basics of the language. Now, budding R users who want to take their skills to the next level have a new guide to help them: Computerword's Advanced Beginner's Guide to R. Written by Sharon Machlis, author of the...

Read more »

International Household Income Inequality data

June 29, 2016
By
International Household Income Inequality data

I’m at the New Zealand Association of Economists annual conference in Auckland. The opening keynote speech was from James K. Galbraith on a global view of inequality. He showed a variety of results from the University of Texas Inequality Project’s Estimated Household Income Inequality dataset, which I hadn’t realised existed before. It’s the result of...

Read more »

Simulation and power analysis of generalized linear mixed models

June 28, 2016
By
Simulation and power analysis of generalized linear mixed models

Simulation and power analysis of generalized linear mixed models Brandon LeBeau University of Iowa Overview (G)LMMs Power simglm package Demo Shiny App! Linear Mixed Model (LMM) Power Power is the ability to statistica...

Read more »

Express Intro to dplyr

June 28, 2016
By
Express Intro to dplyr

Working The Data Like a Boss ! I recently introduced the data.table package which provides a nice way to manage and aggregate large data sources using the standard bracket notation that is commonly employed when manipulating data frames in R. As data sources grow larger one must be prepared with a variety of approaches to

Read more »

Making “Time Rivers” in R

June 28, 2016
By
Making “Time Rivers” in R

Once again, @albertocairo notices an interesting chart and spurs pondering in the visualization community with his post covering an unusual “vertical time series” chart produced for the print version of the NYTimes: I’m actually less concerned about the vertical time series chart component here since I agree with TAVE* Cairo that folks are smart enough... Continue reading →

Read more »

Short course on Bayesian data analysis and Stan 18-20 July in NYC!

June 28, 2016
By
Short course on Bayesian data analysis and Stan 18-20 July in NYC!

Jonah Gabry, Vince Dorie, and I are giving a 3-day short course in two weeks. Before class everyone should install R, RStudio and RStan on their computers. (If you already have these, please update to the latest version of R and the latest version of Stan, which is 2.10.) If problems occur please join the The post

Read more »

The Datathon and how to make the most of it

June 28, 2016
By
The Datathon and how to make the most of it

There are hundreds of definitions for the term Data Scientist… Fancy Venn Diagrams (like this one or this one), radial charts with a ridiculous number of skills (like the one here), multi-dimensional highly complicated charts...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Dommino data lab

Quantide: statistical consulting and training



http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training





Contact us if you wish to help support R-bloggers, and place your banner here.