R in Insurance 2017 Programme online

May 11, 2017
By
R in Insurance 2017 Programme online

The programme for the 2017 R in Insurance conference in Paris has been published. Talks will discuss new ideas and research with the applications in life and general insurance, from network analysis, reserving, pricing to catastrophe modelling, followe...

Read more »

tidyquant: New Tools for Performing Financial Analysis within the Tidy Ecosystem

tidyquant: New Tools for Performing Financial Analysis within the Tidy Ecosystem

In advance of upcoming Business Science talks on tidyquant at R/Finance and EARL San Francisco, we are releasing a technical paper entitled “New Tools For Performing Financial Analysis within the ‘Tidy’ Ecosystem”. The technical paper covers an...

Read more »

Stack Overflow Trends

May 10, 2017
By
Stack Overflow Trends

Developer Q&A site Stack Overflow recently introduced Stack Overflow Trends, a useful tool for tracking the growth and decline in the rate of questions asked on various topics (by their Stack Overflow tag). For example, you can see that activity around both R and Python has been increasing over the past 8 years: As you'd expect from a general...

Read more »

Accessing and Manipulating Biological Databases Exercises (Part-3)

May 10, 2017
By
Accessing and Manipulating Biological Databases  Exercises (Part-3)

In the exercises below we cover how we can Manipulate Biological Data using Seqinr packages Install Packages seqinr Answers to the exercises are available here. If you obtained a different (correct) answer than those listed on the solutions page, please feel free to post your answer as a comment on that page. Exercise 1 Read Related exercise sets:

Read more »

Which linear model is best?

May 10, 2017
By
Which linear model is best?

Recently I have been working on a Kaggle competition where participants are tasked with predicting Russian housing prices. In developing a model for the challenge, I came across a few methods for selecting the best regression model for a given dataset. Let’s load up some data and take a look. ## 47 6 ## … Continue...

Read more »

Text Analysis with R for Students of Literature

Text Analysis with R for Students of Literature

About the book I obtained a copy of this book by Matthew Jockers throughout Universities' access from Springer. You can also get a copy from Amazon. This book is short and to the point. I would actually strongly recommend it to anyone interested ...

Read more »

Pretty histograms with ggplot2

May 10, 2017
By
Pretty histograms with ggplot2

@drsimonj here to make pretty histograms with ggplot2! In this post you’ll learn how to create histograms like this:  The data Let’s simulate data for a continuous variable x in a data frame d: set.seed(070510) d <- data.frame(x = rnorm(2000)) head(d) #> x #> 1 ...

Read more »

Euler Problem 20: Large Integer Factorials

May 10, 2017
By
Euler Problem 20: Large Integer Factorials

A proposed solution in the R language to Euler Problem 20: Find the sum of the digits in the faculty of 100: 100 × 99 × ... × 3 × 2 × 1 Continue reading → The post Euler Problem 20: Large Integer Factorials appeared first on The Devil is in the Data.

Read more »

Everyone knows that loops in R are to be avoided, but vectorization is not always possible

May 9, 2017
By

It goes without saying that there are always many ways to solve a problem in R, but clearly some ways are better (for example, faster) than others. Recently, I found myself in a situation where I could not find a way to avoid using a loop, and I was immediately concerned, knowing that I would want this code to...

Read more »

Mapping Quandl Macroeconomic Data

May 9, 2017
By

In previous posts, we built a map to access global ETFs and a simple Shiny app to import and forecast commodities data from Quandl. Today, we will begin a project that combines those previous apps. Our end goal is to build an interactive map to access macroeconomic data via Quandl, allowing the user to choose an economic...

Read more »

niceOverPlot, or when the number of dimensions does matter

niceOverPlot, or when the number of dimensions does matter

  Hi there!    Over the last few months, my lab-mate Irene Villa (see more of her work here!) and I, have been discussing ecological niche overlap. The niche concept dates back to ideas first proposed by ornithologist J. Grinnell (1917). Later on, G.E. Hutchinson (1957) defined the ecological niche of a species as the...

Read more »

Predicting Hospital Length of Stay using SQL Server R Services

May 9, 2017
By
Predicting Hospital Length of Stay using SQL Server R Services

Last week, my Microsoft colleagues Bharath Sankaranarayan and Carl Saroufim presented a live webinar showing how you can predict a patient's length of stay at a hospital using SQL Server R Services. The recorded webinar is available for on-demand viewing now. (Registration is required to view.) The webinar is based on the Machine Learning Solution Template Predicting Length of...

Read more »

Better block sampling in MCMC with the Automated Factor Slice Sampler

May 9, 2017
By
Better block sampling in MCMC with the Automated Factor Slice Sampler

One nice feature of NIMBLE’s MCMC system is that a user can easily write new samplers from R, combine them with NIMBLE’s samplers, and have them automatically compiled to C++ via the NIMBLE compiler. We’ve observed that block sampling using a simple adaptive multivariate random walk Metropolis-Hastings sampler doesn’t always work well in practice, so

Read more »

Looking for a Programming or Statistics High Level Course? MIT Open Course Ware.

Looking for a Programming or Statistics High Level Course? MIT Open Course Ware.

Although MIT OCW has been operating for more than 15 years, I consider it important to do this post as there are still many people who do not know about its existence.MIT OpenCourseWare (OCW) is a web-based publication of virtually all MIT course co...

Read more »

Shiny Applications Layouts Exercises (Part-6)

May 9, 2017
By
Shiny Applications Layouts Exercises (Part-6)

Shiny Applications Layouts – Absolutely-positioned panel In the sixth part of our journey through Shiny App Layouts we will meet the absolutely-positioned panels. These are panels that you can drag and drop or not wherever you want in the interface. Moreover you can put anything in them, including inputs and outputs. This part can be Related exercise sets:

Read more »

Load a Python/pandas data frame from an HDF5 file into R

May 9, 2017
By

The title is self-descriptive, so I will not dwell on the issue at length before showing the code. Just a small note: to my knowledge, there is only one public snippet out there that addresses this particular problem. It uses the Bioc package rhdf5 and you can find it here. The main problem is that it only works when… Continuar leyendo...

Read more »

datasauRus now on CRAN

May 9, 2017
By
datasauRus now on CRAN

datasauRus is a package storing the datasets from the paper Same Stats, Different Graphs: Generating Datasets with Varied Appearance and Identical Statistics through Simulated Annealing. It’s a useful package for: Having a dinosaur dataset Showing a dinosaur related variant of The post datasauRus now on CRAN appeared first on Locke Data. Locke Data are a data...

Read more »

Reports or Newspapers – The Two Sides of Healthcare Priorities

May 9, 2017
By
Reports or Newspapers – The Two Sides of Healthcare Priorities

  Both the World Health Organization‘s statistical profile of Qatar and the much more detailed Annual Health Report of the Department of Epidemiology and Medical Statistics of the Sate of Qatar show beyond the shadow of a doubt that cardiovascular diseases, diabetes, hypertension, obesity and other metabolic/noncommunicable diseases are the Read More ...

Read more »

Studying CRAN package names

May 9, 2017
By
Studying CRAN package names

- Setting a name for a CRAN package is an intimate process. Out of an infinite range of possibilities, an idea comes for a package and you spend at least a couple of days writing up and testing your code...

Read more »

Travis-CI Flaw Exposed Some ‘Secure’ Environment Variable Contents

May 8, 2017
By

Tagging this as #rstats-related since many R coders use Travis-CI to automate package builds (and other things). Security researcher Ivan Vyshnevskyi did some ++gd responsible disclosure to the Travis-CI folks letting them know they were leaking the contents of “secure” environment variables in the build logs. The TL;DR on “secure” environment variables is that they... Continue reading...

Read more »

Video Introduction to Bayesian Data Analysis, Part 3: How to do Bayes?

May 8, 2017
By
Video Introduction to Bayesian Data Analysis, Part 3: How to do Bayes?

This is the last video of a three part introduction to Bayesian data analysis aimed at you who isn’t necessarily that well-versed in probability theory but that do know a little bit of programming. If you haven’t watched the other parts yet, I re...

Read more »

Jobs for R users – from all over the world (2017-05-08)

May 8, 2017
By
Jobs for R users – from all over the world (2017-05-08)

To post your R job on the next post Just visit this link and post a new R job to the R community. You can post a job for free (and there are also “featured job” options available for extra exposure). Current R jobs Job seekers: please follow the links below to learn more and apply for your R job of interest: Featured Jobs Full-Time Data Analyst,...

Read more »

Machine Learning Pipelines for R

May 8, 2017
By
Machine Learning Pipelines for R

Building machine learning and statistical models often requires pre- and post-transformation of the input and/or response variables, prior to training (or fitting) the models. For example, a model may require training on the logarithm of the response and input variables. As a consequence, fitting and then generating predictions from these models requires repeated application of … Continue...

Read more »

From Points to (Messy) Lines

May 8, 2017
By
From Points to (Messy) Lines

A week or so ago, I came up with a new chart type – race concordance charts – for looking at a motor circuit race from the on-track perspective of a particular driver. Here are a couple of examples from the 2017 F1 Grand Prix: The gap is the time to the car on track

Read more »

Machine Learning. Regression Trees and Model Trees (Predicting Wine Quality)

We will develop a forecasting example using model trees and regression trees algorithms. The exercise was originally published in "Machine Learning in R" by Brett Lantz, PACKT publishing 2015 (open source community experience destilled).The example we will develop is about predicting...

Read more »

Installing Packages without Internet

May 8, 2017
By

Graham Parsons At Mango we’re often giving R training in locations where a reliable WiFi connection is not always guaranteed, so if we need trainees to download packages from CRAN it can be a show-stopper. Here are a couple of … Continue reading →

Read more »

Graphical Presentation of Missing Data; VIM Package

May 8, 2017
By
Graphical Presentation of Missing Data; VIM Package

Missing data is a problem that challenge data analysis methodologically and computationally in medical research. Patients of the clinical trials and cohort studies may drop out of the study, and therefore, generate missing data. The missing data could be at random when participants who drop out of study are not different from those who remained Related Post

Read more »

Installing packages without the internet

May 8, 2017
By

Graham Parsons At Mango we’re often giving R training in locations where a reliable WiFi connection is not always guaranteed, so if we need trainees to download packages from CRAN it can be a show-stopper. Here are a couple of code snippets that are useful to download packages from CRAN onto a USB stick when you have a good connection...

Read more »

Trading Strategy: 52-Weeks High Effect in Stocks

May 8, 2017
By
Trading Strategy: 52-Weeks High Effect in Stocks

By Milind Paradkar In today’s algorithmic trading having a trading edge is one of the most critical elements. It’s plain simple. If you don’t have an edge, don’t trade! Hence, as a quant, one is always on a look out for good trading ideas. One of the good resources for trading strategies that have been... The post Trading...

Read more »

Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R

r-brain.io



Quantide: statistical consulting and training

ODSC1

ODSC2

datasociety

http://www.eoda.de





CRC R books series







Six Sigma Online Training





omictools

Contact us if you wish to help support R-bloggers, and place your banner here.