Use Pseudo-Aggregators to Add Safety Checks to Your Data-Wrangling Workflow

October 30, 2018
By

One of the concepts we teach in both Practical Data Science with R and in our theory of data shaping is the importance of identifying the roles of columns in your data. For example, to think in terms of multi-row records it helps to identify: Which columns are keys (together identify rows or records). Which … Continue reading Use...

Read more »

R-bloggers weekly – most loved R posts from last week (2018-10-21 till 2018-10-27)

October 30, 2018
By

What are the best R posts to read from last week? To help you decide, here is a list of the top R posts, sorted based on the number of likes they got on twitter, enjoy : Drawing beautiful maps programmatically with R, sf and ggplot2 — Part 1: Basics (219 likes) Drawing beautiful maps programmatically with R, sf...

Read more »

Data + Art STEAM Project: Final Results

October 30, 2018
By
Data + Art STEAM Project: Final Results

The final results of the Data + Art STEAM project with Forest Hill Junior and Senior Public School in Toronto!

Read more »

Site Migration

October 30, 2018
By

- Its been almost two years since I first hosted this blog as an alternative outlet of my writings. Everything is currently being built using jekyll and hosted in github. It works fine but I want to have...

Read more »

Are petrol prices in Australia fair?

October 30, 2018
By
Are petrol prices in Australia fair?

Petrol is a product that used by most of Australians. So people are pretty sensitive to price changes, especially when the fuel become more expensive. With prices reaching $1.6 for unleaded the debates are becoming more and more hot. Are greedy petrol traders ripping off Australians or the price changes… The post Are petrol prices in Australia fair? appeared...

Read more »

Management accounting and controlling in R

October 29, 2018
By
Management accounting and controlling in R

CategoriesData Management Tags Data Visualisation R Programming Tips & Tricks In this article you learn how to make visualizations & tables for management accounting & controlling in R. Read packages into R library First we need to read the packages into our R library. The main package used for management accounting & controlling is finstr: # Management accounting & controlling library(finstr) # Data tables...

Read more »

Machine Learning Basics – Random Forest

October 29, 2018
By
Machine Learning Basics – Random Forest

A few colleagues of mine and I from codecentric.ai are currently working on developing a free online course about machine learning and deep learning. As part of this course, I am developing a series of videos about machine learning basics - the first v...

Read more »

Arnaub Chatterjee discusses artificial intelligence (AI) and machine learning (ML) in healthcare.

October 29, 2018
By
Arnaub Chatterjee discusses artificial intelligence (AI) and machine learning (ML) in healthcare.

Hugo Bowne-Anderson, the host of DataFramed, the DataCamp podcast, recently interviewed Arnaub Chatterjee, Senior Expert and Associate Partner in the Pharmaceutical and Medical Products group at McKinsey & Company. Here is the podcast link. Introducing Arnaub Chatterjee Hugo: Arnaub, I'm really excited to have you here today to talk about the role of AI, data science, and machine learning...

Read more »

Bootstrap Testing with MCHT

October 29, 2018
By
Bootstrap Testing with MCHT

Introduction Now that we’ve seen MCHT basics, how to make MCHTest() objects self-contained, and maximized Monte Carlo (MMC) testing with MCHT, let’s now talk about bootstrap testing. Not much is different when we’re doing bootstrap testing; the main difference is that the replicates used to generate test statistics depend on the data we feed to…Read more Bootstrap Testing with MCHT

Read more »

Data Science With R Course Series – Week 7

Data Science With R Course Series – Week 7

After week 7, you will be able to communicate confidently which model features are the most important. Interpretability is a very important topic in machine learning. The automated machine learning tool, H2O, makes a data scientist’s life easier, ho...

Read more »

About a Curious Feature and Interpretation of Linear Regressions

October 29, 2018
By
About a Curious Feature and Interpretation of Linear Regressions

A small blog post with a riddle, simulation, theory and a concluding rhyme. Consider a fictitious example in which we have collected a sample of somewhat overweight persons for which we measured weight in kg as $y$ and height in cm as $x$. We estimate the following simple linear regression: One early message in our Economics 101 course is that for...

Read more »

crfsuite for natural language processing

October 29, 2018
By
crfsuite for natural language processing

A new R package called crfsuite supported by BNOSAC landed safely on CRAN last week. The crfsuite package (https://github.com/bnosac/crfsuite) is an R package specific to Natural Language Processing and allows you to easily build and apply models for named entity recognition text chunking part of speech tagging intent recognition or classification of any category you have in mind The focus of the...

Read more »

Real estate value analysis

October 28, 2018
By
Real estate value analysis

French government provides from time to time some data sets that are worth the analysis. A friend, gave me this real estate data set, for analysis.

Read more »

carbonate

October 28, 2018
By
carbonate

carbon.js is the easiest way to create beautiful images of your source code, and carbonate creates an api in R to interact with it. The package gives a new simple way to share your script with the rest of the R community on posters, slides, Shiny apps and social media! This little snippet combines SO MANY of my favorite #rstats...

Read more »

RStudio IDE Custom Theme Support

October 28, 2018
By
RStudio IDE Custom Theme Support

We’re excited to announce that RStudio v1.2 has added support for custom editor themes. Custom editor themes will allow you to adjust the background color of the editor and syntax highlighting of code in RStudio to better suit your own personal style. New editor themes can be added to RStudio by importing a tmTheme or sharing an existing rstheme file....

Read more »

Reproducible Finance, the book! And a discount for our readers

October 28, 2018
By
Reproducible Finance, the book! And a discount for our readers

I’m thrilled to announce the release of my new book Reproducible Finance with R: Code Flows and Shiny Apps for Portfolio Analysis, which originated as a series of R Views posts in this space. The first post was written way back in November of 2016 - thanks to all the readers who have supported us along the way! If you...

Read more »

Is the answer to everything Gaussian?

October 28, 2018
By
Is the answer to everything Gaussian?

Abstract: As an applied statistician you get in touch with many challenging problems in need of a statistical solution. Often, your client/colleague already has a working solution and just wants to clarify a small statistical detail with you. Equally o...

Read more »

Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

October 28, 2018
By
Conway’s Game of Life in R: Or On the Importance of Vectorizing Your R Code

R is an interpreted programming language with vectorized data structures. This means a single R command can ask for very many arithmetic operations to be performed. This also means R computation can be fast. We will show an example of this using Conway’s Game of Life. Conway’s Game of Life is one of the most … Continue reading Conway’s...

Read more »

How quickly do stock market valuations revert back to their means?

October 28, 2018
By
How quickly do stock market valuations revert back to their means?

Mean reversion is the assumption that things tend to revert back to their means in the long run. This is especially true for valuations and certain macroeconomic variables, but not so much for stock prices themselves. In this post we'll look at the mean reversion of different valuation measures by forming equal sized baskets from each valuation decile and...

Read more »

Introducing cricpy:A python package to analyze performances of cricketers

October 28, 2018
By
Introducing cricpy:A python package to analyze performances of cricketers

Full many a gem of purest ray serene, The dark unfathomed caves of ocean bear; Full many a flower is born to blush unseen, And waste its sweetness on the desert air. Thomas Gray, An Elegy Written In A Country Churchyard Introduction It is finally here! cricpy, the python avatar , of my R package cricketr is now … Continue reading Introducing...

Read more »

Scatterplot matrices (pair plots) with cdata and ggplot2

October 27, 2018
By
Scatterplot matrices (pair plots) with cdata and ggplot2

In my previous post, I showed how to use cdata package along with ggplot2‘s faceting facility to compactly plot two related graphs from the same data. This got me thinking: can I use cdata to produce a ggplot2 version of a scatterplot matrix, or pairs plot? A pairs plot compactly plots every (numeric) variable in … Continue reading Scatterplot...

Read more »

Celebrate Halloween with Creepy Computer Games in R

October 27, 2018
By
Celebrate Halloween with Creepy Computer Games in R

In the 1980s I spent my time writing code on my 8-bit ZX81 and Atari computers. I learnt everything I know about programming from copying and modifying printed code listings from books with computer games. The games in these books are mostly simple text-based games, but the authors gave them enticing names, often imaginatively illustrated … Continue reading "Celebrate...

Read more »

Phyllotaxis Sprial and Prime Numbers – Experiment

October 27, 2018
By
Phyllotaxis Sprial and Prime Numbers – Experiment

I recently tried out Data Camp’s project called “Phyllotaxis: Draw flowers using mathematics”. Now I’m hooked on drawing spirals using golden angle. Also the mathematical art on instructor’s blog, Fronkostin is just amazing! Separately on twitter, I’ve gotten message on art with prime number, and that got me thinking to experiment with prime number & phyllotaxis flowers. I couldn’t figure...

Read more »

RcppRedis 0.1.9

October 27, 2018
By

A new minor release of RcppRedis arrived on CRAN earlier today. RcppRedis is one of several packages to connect R to the fabulous Redis in-memory datastructure store (and much more). RcppRedis does not pretend to be feature complete, but it may do so...

Read more »

Time series visualizations with wind turbine energy data in R

October 27, 2018
By
Time series visualizations with wind turbine energy data in R

CategoriesVisualizing Data Tags Data Visualisation R Programming Time Series One of the sectors with a huge demand for data science/analysis is the energy sector. A branch of this sector where demand is high is the green wind energy turbine sector. In this analysis, you will learn to do a time series wind turbine analysis in R. Read packages into R library First we need...

Read more »

How to perform merges (joins) on two or more data frames with base R, tidyverse and data.table

October 27, 2018
By
How to perform merges (joins) on two or more data frames with base R, tidyverse and data.table

Introduction In this post in the R:case4base series we will look at one of the most common operations on multiple data frames - merge, also known as JOIN in SQL terms. We will learn how to do the 4 basic types of join - inner, left, right and full join with base R and show how to perform the same with...

Read more »

Visualizing The Catholic Lectionary – Part 1

October 26, 2018
By
Visualizing The Catholic Lectionary – Part 1

What’s a Lectionary? A lectionary, according to Wikipedia, is a listing of scripture readings for Christian or Judaic worship on a given day. The Roman Catholic Lectionary will contain a list of readings for a specific day that are on a 3-year cyc...

Read more »

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

Maps with pie charts on top of each administrative division: an example with Luxembourg’s elections data

Abstract You can find the data used in this blog post here: https://github.com/b-rodrigues/elections_lux This is a follow up to a previous blog post where I extracted data of the 2018 Luxembourguish elections from Excel Workbooks. Now that I have the data, I will create a map of Luxembourg by commune, with pie charts of the results on top of each commune! To do this,...

Read more »

Gold-Mining Week 8 (2018)

October 26, 2018
By

Week 8 Gold Mining and Fantasy Football Projection Roundup now available. Go get that free agent gold! The post Gold-Mining Week 8 (2018) appeared first on Fantasy Football Analytics.

Read more »

Search R-bloggers


Sponsors

Mango solutions





Zero Inflated Models and Generalized Linear Mixed Models with R



datasciencego.com

Quantide: statistical consulting and training

ODSC west

ODSC2 west

datasociety

http://www.eoda.de









Six Sigma Online Training

mljar.com

Our ads respect your privacy. Read our Privacy Policy page to learn more.

Contact us if you wish to help support R-bloggers, and place your banner here.