styler 1.2.0

November 5, 2019
By

We are pleased to announce that styler 1.2.0 is now available on CRAN. All the below features were added after styler 1.1.0, except the ones listed under Other changes were added somewhere between 1.0.0 and 1.1.0. Let’s get started: install.packages("styler") library(styler) Alignment detection styler can finally detect aligned code and keep it aligned! For example, the following code won’t be modified by styler: call( some_arg = 123, ...

Read more »

rOpenSci Announces a New Award From The Gordon and Betty Moore Foundation to Improve the Scientific Package Ecosystem for R

Today we are pleased to announce that we have received new funding from the Gordon and Betty Moore Foundation. The $894k grant will help us improve infrastructure for R packages and enable us to move towards a science first package ecosystem for the R community. You may have already noticed some developments on this front when we announced our...

Read more »

renv: Project Environments for R

November 5, 2019
By

We’re excited to announce that renv is now available on CRAN! You can install renv with: install.packages("renv") renv is an R dependency manager. Use renv to make your projects more: Isolated: Each project gets its own library of R packages, so you can feel free to upgrade and change package versions in one project without worrying about breaking your other projects. Portable: Because...

Read more »

KGC Climate Classification and Solar Irradiance through R Packages

November 5, 2019
By

I obviously haven't been blogging lately, but that doesn't mean that I haven't been thinking about what ought to be my next blog post. Fortunately, I've had the chance to get to know two particularly impressive R packages which are available to the scientific community through CRAN. Together, the two packages can be used to...

Read more »

Implicit Tax Rates on Consumption and Labor in Europe

November 5, 2019
By
Implicit Tax Rates on Consumption and Labor in Europe

The aim of this blog post is to compute the implicit tax rates (ITR) on consumption, labour and corporate income for France, Italy, Spain, Germany and the Euro Area since 1995. We use as reference the report on Taxation trends in the European Union (2019) from the European Commission, and the previous reports since 2014. This database is built and updated...

Read more »

Data Science on Rails: Analyzing Customer Churn

November 5, 2019
By
Data Science on Rails: Analyzing Customer Churn

Customer Relationship Management (CRM) is not only about acquiring new customers but especially about retaining existing ones. That is because acquisition is often much more expensive than retention. In this post, we learn how to analyze the reasons of customer churn (i.e. customers leaving the company). We do this with a very convenient point-and-click interface … Continue reading "Data...

Read more »

A First Look at Confidence Distributions

November 4, 2019
By

Using a probability distribution to characterize uncertainty is at the core of statistical inference. So, it seems natural to try to summarize the information about the parameters in statistical models with probability distributions. R. A. Fisher thought so. In fact, he expended a great deal of effort over more than thirty years, and put his professional reputation on the...

Read more »

RSiteCatalyst Version 1.4.16 Release Notes

November 4, 2019
By

It’s been a while since the last update, but RSiteCatalyst is still going strong! Thanks to Wen for submitting a fix/enhancement to enable the ability to use multiple columns from a Classification within the QueueDataWarehouse function. No other bug ...

Read more »

Spatial Data Analysis with INLA

November 4, 2019
By
Spatial Data Analysis with INLA

by Virgilio Gómez Rubio Introduction In this session I will focus on Bayesian inference using the integrated nested Laplace approximation (INLA) method. As described in Rue et al. (2009), INLA can be used to estimate the posterior marginal distribution of Bayesian hierarchical models. This method is implemented in the INLA package available for the R programming language....

Read more »

tidync: scientific array data from NetCDF in R

tidync: scientific array data from NetCDF in R

In May 2019 version 0.2.0 of tidync was approved by rOpenSci and accepted to CRAN. Here we provide a quick overview of the typical workflow with some pseudo-code for the main functions in tidync. This overview is enough to read if you just want to try out the package on your own data. The tidync package is focussed...

Read more »

Data Scientist or Data Engineer – what’s the difference?

November 4, 2019
By

When it was floated that I should write this article, I approached it with trepidation. There is no better way to start an argument in the world of data than by trying to define what a Data Scientist is or isn’t – by adding in the complication of the relatively newly appearing role of Data engineer, there is no...

Read more »

November Thanksgiving – Data Science Style!

November 4, 2019
By
November Thanksgiving – Data Science Style!

Hello All, November is the month of Thanksgiving, and vacations and of course deals galore! As part of saying thanks to my loyal readers, here are some deals specific to data science professionals and students, that you should definitely not miss on. Book deals: If you are exploring Data Science careers or preparing for interviews The post November Thanksgiving...

Read more »

Why empathy is key for Data Science initiatives

November 4, 2019
By

When we think of empathy in a career, we perhaps think of a nurse with a good bedside manner, or perhaps a particularly astute manager or HR professional. Data science is probably one of the last disciplines where empathy would seem to be important. However, this misconception is one that frequently leads to the failure of data science projects – a...

Read more »

Installing Prophet on CentOS

November 3, 2019
By

I’ve been struggling to get the {prophet} package to install on two RHEL 7 systems. Since I only have intermittent access to those systems I opted to use a local install of CentOS as a proxy environment. These instructions work for CentOS and should generalise to RHEL too. Setting Up the Environment The first thing that I needed to do was...

Read more »

{tvthemes 1.1.0} is on CRAN: Creating a {pkgdown} website, Gravity Falls palette, and more!

November 3, 2019
By
{tvthemes 1.1.0} is on CRAN: Creating a {pkgdown} website, Gravity Falls palette, and more!

The newest version of {tvthemes} is now on CRAN! v1.1.0 features one new palette (Gravity Falls), vignettes, renamed functions, and a nice pkgdown website. install.packages("tvthemes") # v1.1.0 library(tvthemes) Gravity Falls ...

Read more »

Exploratory analysis of a banana

November 3, 2019
By
Exploratory analysis of a banana

This post is just me amusing myself by exploring a tiny data set I have lying around. The dataset and the code is on Github. In 2014 (I think), I was teaching the introductory cell biology labs (pictures in the linked post) in Linköping. We were doing a series of simple preparations to look at

Read more »

R Owl of Athena

November 2, 2019
By

RBloggers|RBloggers-feedburner Intro: After developing the package RAthena, I stumbled quite accidentally into the R SDK for AWS paws. As RAthena utilises Python’s SDK boto3 I thought the development of another AWS Athena package couldn’t hurt. As mentioned in my previous blog the paws syntax is very similar to boto3 so alot of my RAthena code was very portable and this gave...

Read more »

Re-creating survey microdata from marginal totals by @ellis2013nz

November 2, 2019
By
Re-creating survey microdata from marginal totals by @ellis2013nz

I recently did some pro bono work for Gun Control NZ reviewing the analysis by a market research firm of the survey that led to this media release: “Most New Zealanders back stronger gun laws”. The analysis all checked out ok. The task at that time...

Read more »

binb 0.0.5: More improvements

November 2, 2019
By
binb 0.0.5: More improvements

The fifth release of the binb package just arrived on CRAN. binb regroups four rather nice themes for writing LaTeX Beamer presentations much more easily in (R)Markdown. As a teaser, a quick demo combining all four themes follows; documentation and e...

Read more »

Multiple data imputation and explainability

Multiple data imputation and explainability

Introduction Imputing missing values is quite an important task, but in my experience, very often, it is performed using very simplistic approaches. The basic approach is to impute missing values for numerical features using the average of each feature, or using the mode for categorical features. There are better ways of imputing missing values, for instance by predicting the values using a regression model,...

Read more »

AzureRMR 2.3.0 now on CRAN

November 1, 2019
By

This post is to announce that the latest update to AzureRMR is now available on CRAN. Version 2.3.0 brings several changes to make life easier when managing resources in Azure. New in this version is a facility for parallelising connections to Azure, using a pool of background processes. Some operations, such as downloading many small files or interacting with...

Read more »

Command Centre amplification with predictive analytics and machine learning

November 1, 2019
By

Recently, our team at Draper and Dash have been busy creating an NHS operational command centre. This command centre is different, as it uses a collection and ensemble of cutting edge predictive and machine learning techniques. To read the blog you can access this below: We have really enjoyed the process and we are in...

Read more »

#FunDataFriday – ATX GIS Day

November 1, 2019
By
#FunDataFriday – ATX GIS Day

ATX GIS day is being held in Austin, TX on Nov 13 to celebrate the use of GIS (Geographic Information Systems) data!

Read more »

LongCART – Regression tree for longitudinal data

November 1, 2019
By

Longitudinal changes in a population of interest are often heterogeneous and may be influenced by a combination of baseline factors. The longitudinal tree (that is, regression tree with longitudinal data) can be very helpful to identify and characterize the sub-groups with distinct longitudinal profile in a heterogenous population. This blog presents the capabilities of the … Continue reading LongCART...

Read more »

Visualize urban growth

October 31, 2019
By
Visualize urban growth

The General Directorate for the Cadastre of Spain has spatial information of the all buildings except for the Basque Country and Navarra. This data set is part of the implementation of INSPIRE, the Space Information Infrastructure in Europe. More information can be found here. We will use the links (urls) in ATOM format, which is an RSS type for...

Read more »

New package: simfinR

October 31, 2019
By
New package: simfinR

Introduction In my latest post I wrote about package GetEdgarData, which downloaded structured data from the SEC. I’ve been working on this project and soon realized that the available data at the SEC/DERA section is not complete. For example, all Q4 statements are missing. This seems to be the way all exchanges release the financial documents. I’ve found the same...

Read more »

Gold-Mining Week 9 (2019)

October 30, 2019
By

Week 9 Gold Mining and Fantasy Football Projection Roundup now available. The post Gold-Mining Week 9 (2019) appeared first on Fantasy Football Analytics.

Read more »

Offensive Programming in action (part III)

October 30, 2019
By
Offensive Programming in action (part III)

This is the third post on offensive programming, dedicated to using offensive programming (OP). You may refer to easy introduction to offensive programming to acquire some basic knowledge about offensive programming, and to discover offensive progra...

Read more »

A brief primer on Variational Inference

October 30, 2019
By
A brief primer on Variational Inference

Bayesian inference using Markov chain Monte Carlo methods can be notoriously slow. In this blog post, we reframe Bayesian inference as an optimization problem using variational inference, markedly speeding up computation. We derive the variational objective function, implement coordinate ascent mean-field variational inference for a simple linear regression example in R, and compare our results to results obtained via...

Read more »

Search R-bloggers

Sponsors