Dollar Signs and Percentages- 3 Different Ways to Convert Data Types in R

August 6, 2019
By

Working with percentages in R can be a little tricky, but it’s easy to change it to an integer, or numeric, and run the right statistics on it. Such as quartiles and mean and not frequencies. Essentially you are using the sub function and substituting the “%” for a blank. You don’t lose any decimals … Continue reading "Dollar...

Read more »

Beginners Cookbook for Interactive Visualization in R with highcharter

August 6, 2019
By

Philosophy This Post is purely aimed at helping beginners with cookbook-style code for Interactive Visualizations using highcharter package in R. About highcharter highcharter by Joshua Kunst R package is a wrapper for the ‘Highcharts’ library including shortcut functions to plot R objects. ‘Highcharts’ http://www.highcharts.com/ is a charting library offering numerous chart types with a simple configuration syntax. Libraries library(tidyverse) #data Manipulation library(highcharter) #interactive visualization Reading Input...

Read more »

Tech Dividends, Part 1

August 6, 2019
By
Tech Dividends, Part 1

In a previous post, we explored the dividend history of stocks included in the SP500. Today, we’ll extend that analysis to cover the Nasdaq because, well, because in the previous post I said I would do that. We’ll also explore a different source for dividend data, do some string cleaning and check out ways to customize a tooltip in...

Read more »

Causal inference with DAGs in R

August 6, 2019
By
Causal inference with DAGs in R

Directed cyclical graphs (DAGs) are a powerful tool to understand and deal with causal inference. The book “Causal inference in statistics: a primer” is a useful reference to start, authored from Pearl, Glymour, and Jewell. Directed cyclical graphs (DAGs) are a powerful tool to understand and deal with causal inference. Causal inference in statistics: a … Continua a leggere...

Read more »

How to make Square (Pie) Charts for Infographics in R

August 6, 2019
By
How to make Square (Pie) Charts for Infographics in R

Are you looking for some unique way of visualizing your numbers instead of simply using bar charts - which sometimes could be boring the audience - if used, slide after slide? Here’s Square Pie / Waffle Chart for you. Waffle Chart or as it goes technically, Square Pie Chart is just is just a pie chart that use squares instead...

Read more »

#23: Debugging with Docker and Rocker – A Concrete Example helping on macOS

August 5, 2019
By

Welcome to the 23nd post in the rationally reasonable R rants series, or R4 for short. Today’s post was motivated by an exchange on the r-devel list earlier in the day, and a few subsequent off-list emails. Roger Koenker posted a question: how to b...

Read more »

Insurance data science : use and value of unusual data #1

August 5, 2019
By
Insurance data science : use and value of unusual data #1

Next week, with , I will be at the Summer School of the Swiss Association of Actuaries, in Lausanne, with Jean-Philippe Boucher (UQAM) and Ewen Gallic (AMSE). There will be some hands-on applications, on R. I will share some codes in the slides.

Read more »

Arguments of stats::density()

August 5, 2019
By
Arguments of stats::density()

Animated illustrations of how arguments affect output of `stats::density()`. Prologue In R, one of the “go to” functions for kernel density estimation is density() from base R package ‘stats’. Given numeric sample, it returns a set of x-y pairs on estimated density curve. It is also a main “workhorse” for estimating...

Read more »

Keeping up with Tidyverse Functions using Tidy Tuesday Screencasts

August 5, 2019
By
Keeping up with Tidyverse Functions using Tidy Tuesday Screencasts

David Robinson has done several screencasts where he analyzes a Tidy Tuesday dataset live. I have listened to a few of them and found them very interesting and instructive. As I don’t use R on a daily basis, I have not kept up with what the latest is in Tidyverse. So when I listened to his screencasts, I learnt...

Read more »

mlr-2.15.0

Changes to benchmark() Changes to Filters New ensemble filters New return structure for filter values Learners References We just released mlr v2.15.0 to CRAN. This version includes some breaking changes and the usual bug fixes from the last three months. We made good progress on the goal of cleaning up the Github repo. We processed nearly all open pull requests (around 40). In the next months we will focus...

Read more »

Check your (Mixed) Model for Multicollinearity with ‘performance’

August 5, 2019
By
Check your (Mixed) Model for Multicollinearity with ‘performance’

The goal of performance is to provide lightweight tools to assess and check the quality of your model. It includes functions such as r2() for many models (including logistic, mixed and Bayesian models), icc() or helpers to check_convergence(), chec...

Read more »

How to create unigrams, bigrams and n-grams of App Reviews

August 5, 2019
By
How to create unigrams, bigrams and n-grams of App Reviews

This is one of the frequent questions I’ve heard from the first timer NLP / Text Analytics - programmers (or as the world likes it to be called “Data Scientists”). Prerequisite For simplicity, this post assumes that you already know how to install a package and so you’ve got tidytext installed on your R machine. install.packages("tidytext") Loading the Library Let’s start with loading the...

Read more »

Bayes models for estimation in stepped-wedge trials with non-trivial ICC patterns

August 5, 2019
By
Bayes models for estimation in stepped-wedge trials with non-trivial ICC patterns

Continuing a series of posts discussing the structure of intra-cluster correlations (ICC’s) in the context of a stepped-wedge trial, this latest edition is primarily interested in fitting Bayesian hierarchical models for more complex cases (though I do talk a bit more about the linear mixed effects models). The first two posts in the series focused on generating data to...

Read more »

Working With Vectors

August 5, 2019
By
Working With Vectors

In the previous exercise set we practised vectors as a data structure. As I noted at the beginning of that set, perhaps you were already familiar with data in a vector-like structure in other applications such as Microsoft Excel or SPSS. If so, perhaps you also used those data to carry out calculations. In this Related exercise sets: Spatial Data...

Read more »

Clustering Frankenstein

August 5, 2019
By
Clustering Frankenstein

Necesito para estar sentado, un arbolito en este descampado (Desarraigo, Extremoduro) From time to time I come back to experiment with this stunning photograph of Boris Karloff as Frankenstein’s monster. I have done several of them previously: from decomposing it into Voronoi regions, to draw it as a single line portrait using an algorithm to … Continue reading Clustering...

Read more »

RTutor: Improving Interactive Problem Sets by Analyzing Submissions

August 5, 2019
By
RTutor: Improving Interactive Problem Sets by Analyzing Submissions

At Ulm University, we currently use RTutor for several elective courses where students solve interactive problem sets at home. They can test their solutions and get automatic hints, and then submit their solution. Starting next year, we plan to use RTutor in a new compulsory data science project course in our business and economics bachelor. For a compulsory course,...

Read more »

Stock Market Data Scenario Set Generation – S&P 100

August 5, 2019
By

I just love to create portfolio optimization models based on Optimization theory and such models require a well-defined return scenario set which is nothing more than a matrix where we have a joint possible set of returns of all our assets under consideration. The easiest way is to use historical data for this purpose. While it is dangerous to...

Read more »

Finding duplicates in data frame across columns and replacing them with unique values using R

August 5, 2019
By
Finding duplicates in data frame across columns and replacing them with unique values using R

Suppose you have a dataset with many variables, and you want to check: if there are any duplicated for each of the observation replace duplicates with random value from pool of existing values.   In this manner, let’s create a…Read more ›

Read more »

The Shiny Developer Series

August 4, 2019
By
The Shiny Developer Series

Shiny is one of the best ways to build interactive documents, dashboards, and data science applications. But advancing your skills with Shiny does not come without challenges. Shiny developers often have a stronger background in applied statistics than in areas useful for optimizing an application, like programming, web development, and user-interface design. Though there are many packages and tools that...

Read more »

Likert Scale Survey: from googleform to #rstats graph

August 4, 2019
By
Likert Scale Survey: from googleform to #rstats graph

Many Biology students are interested in science communication or the public understanding of science and undertake projects in these areas. They often conduct surveys which include Likert-scale questions. This workflow will teach you how to set up a Google Forms survey with Likert scale questions, read the responses in to R and report on the results. It… Continue reading Likert Scale...

Read more »

Upcoming R courses with Jumping Rivers

August 4, 2019
By

You'll be pleased to know that Jumping rivers are running R training courses up and down the UK, in London, Newcastle, Belfast and Edinburgh. I've put together a quick summary of the courses available through til the end of the year. They are sorted by place then date. You can find the booking links and The post Upcoming R courses...

Read more »

Interactive Visualization in R with apexcharter

August 3, 2019
By

Interactive Visualizations are powerful these days because those are all made for web. Web - simply a combination of html,css and javascript which build interactive visualizations. Thus, paving way for a lot of javascript charting libraries like highcharts.js, apexcharts.js. Thanks to htmlwidgets of R, many R developers have started porting those javascript charting libraries to R and dreamRs is one...

Read more »

Creating Excel Workbooks with multiple sheets in R

August 3, 2019
By

Create Excel Workbooks Generally, when doing anything in R I typically work with .csv files, their fast and straightforward to use. However, I find times, where... The post Creating Excel Workbooks with multiple sheets in R appeared first on FinderDing.

Read more »

RcppCCTZ 0.2.6

August 3, 2019
By

A shiny new release 0.2.6 of RcppCCTZ is now at CRAN. RcppCCTZ uses Rcpp to bring CCTZ to R. CCTZ is a C++ library for translating between absolute and civil times using the rules of a time zone. In fact, it is two libraries. One for dealing with civ...

Read more »

R Markdown Workshop

August 2, 2019
By
R Markdown Workshop

Background This is an unusual post for me, I have avoided writing about R Markdown because there are so many resources already available on the topic (e.g., here, here, and here). However, recently I ran a session on using RMarkdown for my colleagues in the Centre for Social Issues Research. The aim of this was to demonstrate the usefulness of...

Read more »

Programmatically extract TIOBE Index Ratings

August 2, 2019
By
Programmatically extract TIOBE Index Ratings

TIOBE Index is an index (ranking) that claims to represent the popularity of programming languages. Yihui (The creator of blogdown package), recently wrote a blogpost titled “On TIOBE Index and the era of decision fatigue” and I strongly recommend you to go through that before continuing with this post. So the Disclaimer goes like this: This post/author doesn’t believe that...

Read more »

GC4XMN7 RR138 – 404 Found

August 2, 2019
By
GC4XMN7 RR138 – 404 Found

It’s been a few weeks since I found a puzzle worth programming in R, so I turned back to the RR series in the east of England to see if there was another, and there is. The cache page shows a quite unilluminating image, mostly white with a couple of small characters. Opening the image

Read more »

Machine Learning Training – Draper and Dash Healthcare Predictive Analytics – Summer Discount

August 2, 2019
By

There is an offer on with my company Draper & Dash to get a discount on ML training for your organisation. Contact the sales team to find out more about this training opportunity. Plus you get to meet our great data science team at Draper and Dash. The below is me and colleagues in action...

Read more »

simmer 4.3.0 + JSS publication

August 2, 2019
By

The 4.3.0 release of simmer, the Discrete-Event Simulator for R, is on CRAN. Along with this update, we are very glad to announce that our homonymous paper finally appeared in the Journal of Statistical Software. Please, use the following reference for citations (see citation("simmer")): Ucar I, Smeets B, Azcorra A (2019). “simmer: Discrete-Event Simulation for R.” Journal of Statistical… Continuar leyendo simmer 4.3.0...

Read more »

Search R-bloggers

Sponsors