Articles by Ian Johnson

R XGBoost Regression

November 29, 2020 | Ian Johnson

In the previous posts, I used popular machine learning algorithms to fit models to best predict MPG using the cars_19 dataset.  It was discovered that support vector machine produced the lowest RMSE.  In this post I am going to use XGBoost to...
[Read more...]

R XGBoost Regression

November 29, 2020 | Ian Johnson

In the previous posts, I used popular machine learning algorithms to fit models to best predict MPG using the cars_19 dataset.  It was discovered that support vector machine produced the lowest RMSE.  In this post I am going to use XGBoost to...
[Read more...]

R Robustreg Package Downloads

February 24, 2020 | Ian Johnson

I built robustreg in 2006 and at the time the major stat packages did not have a robust regression available.  Below are graphs of weekly and cumulative downloads from just the RStudio mirror.  I would estimate total downloads at over 150,000. The median_rcpp() function is written in C++ and is multiple ...
[Read more...]

R TensorFlow Deep Neural Network

September 22, 2019 | Ian Johnson

In the previous post I fitted a neural network to the cars_19 dataset using the neuralnet package.  In this post I am going to use TensorFlow to fit a deep neural network using the same data.The main difference between the neuralnet package and TensorFlow is TensorFlow uses the adagrad ...
[Read more...]

R Neural Network

September 9, 2019 | Ian Johnson

In the previous four posts I have used multiple linear regression, decision trees, random forest, gradient boosting, and support vector machine to predict MPG for 2019 vehicles.   It was determined that svm produced the best model.  In this post I am going to use the neuralnet package to fit a neural ...
[Read more...]

R Tensorflow Multiple Linear Regression

August 27, 2019 | Ian Johnson

In the previous three posts I used multiple linear regression, decision trees, gradient boosting, and support vector machine to predict miles per gallon for 2019 vehicles.  It was determined that svm produced the best model.  In this post, I am going to run TensorFlow through R and fit a multiple linear ...
[Read more...]

Using SVM to Predict MPG for 2019 Vehicles

August 11, 2019 | Ian Johnson

Continuing on the below post, I am going to use a support vector machine (SVM) to predict combined miles per gallon for all 2019 motor vehicles. Part 1: Using Decision Trees and Random Forest to Predict MPG for 2019 Vehicles Part 2: Using Gradient Boosted Machine to Predict MPG for 2019 Vehicles The raw data ...
[Read more...]

R: SVM to Predict MPG for 2019 Vehicles

August 11, 2019 | Ian Johnson

Continuing on the below post, I am going to use a support vector machine (SVM) to predict combined miles per gallon for all 2019 motor vehicles. Part 1: Using Decision Trees and Random Forest to Predict MPG for 2019 Vehicles Part 2: Using Gradient Boosted Machine to Predict MPG for 2019 Vehicles The raw data ...
[Read more...]

Predicting MPG for 2019 Vehicles using R

June 12, 2019 | Ian Johnson

I am going to use regression, decision trees, and the random forest algorithm to predict combined miles per gallon for all 2019 motor vehicles.  The raw data is located on the EPA government siteAfter preliminary diagnostics, exploration and cleaning I am going to start with a multiple linear regression model.The ...
[Read more...]

R: Birthday Problem

March 25, 2019 | Ian Johnson

An interesting and classic probability question is the birthday problem.The birthday problem asks how many individuals are required to be in one location so there is a probability of 50% that at least two individuals in the group have the same birthday...
[Read more...]

R: K-Means Clustering MLB Data

June 21, 2017 | Ian Johnson

k-means clustering is a useful unsupervised learning data mining tool for assigning n observations into k groups which allows a practitioner to segment a dataset. I play in a fantasy baseball league and using five offensive variables (R, AVG, HR, RBI, SB) I am going to use k-means clustering to: 1) ...
[Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)