2071 search results for "Regression"

A First Look at rxDForest()

January 30, 2014
By
A First Look at rxDForest()

by Joseph RIckert Last July, I blogged about rxDTree() the RevoScaleR function for building classification and regression trees on very large data sets. As I explaned then, this function is an implementation of the algorithm introduced by Ben-Haim and Yom-Tov in their 2010 paper that builds trees on histograms of data and not on the raw data itself. This...

Read more »

Comparing multiple (g)lm in one graph #rstats

January 29, 2014
By
Comparing multiple (g)lm in one graph #rstats

It’s been a while since a user of my plotting-functions asked whether it would be possible to compare multiple (generalized) linear models in one graph (see comment). While it is already possible to compare multiple models as table output, I now managed to build a function that plots several (g)lm-objects in a single ggplot-graph. The

Read more »

Inference for AR(p) Time Series

January 28, 2014
By
Inference for AR(p) Time Series

Consider a (stationary) autoregressive process, say of order 2, for some white noise with variance . Here is a code to generate such a process, > phi1=.25 > phi2=.7 > n=1000 > set.seed(1) > e=rnorm(n) > Z=rep(0,n) > for(t in 3:n) Z=phi1*Z+phi2*Z+e > Z=Z > n=length(Z) > plot(Z,type="l") Here, we have to estimate two sets of parameters: the autoregressive...

Read more »

How to convert odds ratios to relative risks

January 27, 2014
By
How to convert odds ratios to relative risks

My short paper on this came out on Friday in the British Medical Journal. The aim is to help both authors and readers of research make sense of this rather confusing but unavoidable statistic, the odds ratio (OR). The fundamental … Continue reading →

Read more »

New in forecast 5.0

January 26, 2014
By
New in forecast 5.0

Last week, version 5.0 of the forecast package for R was released. There are a few new functions and changes made to the package, which is why I increased the version number to 5.0. Thanks to Earo Wang for helping with this new version. Handling missing values and outliers Data cleaning is often the first step that data scientists...

Read more »

Tuning optim with parscale

January 26, 2014
By

I often get questions what is the use of parscale parameter in optim procedure in GNU R. Therefore I have decided to write a simple example showing its usage and importance. The function I test is a simplified version of estimation problem I had to sol...

Read more »

Accurate imputation and valid statistical inference with ensemble

January 25, 2014
By
Accurate imputation and valid statistical inference with ensemble

Imputation is predictive inference and not causal inference! I have met many people, who consider the two are equivalent. Their reasoning is based on the belief that if you can produce a model which replicate the data generating...

Read more »

Online class on Statistical Learning

January 24, 2014
By
Online class on Statistical Learning

Trevor Hastie and Robert Tibshirani are teaching an online class on Statistical Learning starting this week. The first week is introduction and overview, so it's not too late to join up. They've also published a new book, An Introduction to Statistical Learning, as a more accessible companion to their widely revered The Elements of Statistical...

Read more »

Rob Hyndman on Forecasting

January 24, 2014
By
Rob Hyndman on Forecasting

If you have an interest in forecasting, especially economic forecasting, the Rob Hyndman's name will be familiar to you. Hailing from my old stamping ground - Monash University - Rob is one of the world's top forecasting experts.  Without going into all of the details, Rob is very widely published, and also...

Read more »

Coursera Specializations: Data Science, Systems Biology, Python Programming

January 22, 2014
By

I first mentioned Coursera about a year ago, when I hired a new analyst in my core. This new hire came in as a very competent Python programmer with a molecular biology and microbial ecology background, but with very little experience in statistics. I got him to take Roger Peng's Computing for Data Analysis course...

Read more »

Sponsors

Mango solutions



plotly webpage

dominolab webpage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)