2042 search results for "regression"

Call them what you will

October 28, 2013
By

I’ve been playing around with the R package texreg for creating combined regression tables for multiple models. It’s not the only package to do that – see here for a review – but it’s often handy to be able to generate both ascii art, latex, and html versions of the same table using almost identical

Read more »

Let’s have a "party" and tear this place "rpart"!

October 27, 2013
By
Let’s have a "party" and tear this place "rpart"!

For many problems, classification and regression trees can be a simple and elegant solution, assuming you know their well-documented strengths and weaknesses.  I first explored their use several years ago with JMP, which is easy to use.  If y...

Read more »

The Basics of Encoding Categorical Data for Predictive Models

October 23, 2013
By
The Basics of Encoding Categorical Data for Predictive Models

Thomas Yokota asked a very straight-forward question about encodings for categorical predictors: "Is it bad to feed it non-numerical data such as factors?" As usual, I will try to make my answer as complex as possible. (I've heard the old wives tale that eskimos have 180 different words in their language for snow. I'm starting to think that statisticians have...

Read more »

New R package: scholar

October 23, 2013
By
New R package: scholar

My new R package, scholar, has just been posted on CRAN. The scholar package provides functions to extract citation data from Google Scholar. In addition to retrieving basic information about a single scholar, the package also allows you to compare multiple scholars and predict future h-index values. There’s a full guide on Github (along

Read more »

GLM, non-linearity and heteroscedasticity

October 22, 2013
By
GLM, non-linearity and heteroscedasticity

Last week in non-life insurance course, we’ve seen the theory of the Generalized Linear Models, emphasizing the two important components the link function (which is actually the key component in predictive modeling) the distribution, or the variance function Just to illustrate, consider my favorite dataset ­lin.mod = lm(dist~speed,data=cars) A linear model means here where the residuals are assumed to be...

Read more »

An introduction to Econometrics, using R

October 22, 2013
By

If your econometrics is a bit rusty and you're also looking to learn the R language, you can kill two birds with one stone with Introductory Econometrics using Quandl and R. The first three parts of this seven-part tutorial introduces the basics of regression analysis, while the remaining sections provide R code you can try yourself to reproduce econometric...

Read more »

machine learning [book review, part 2]

October 21, 2013
By
machine learning [book review, part 2]

The chapter (Chap. 3) on Bayesian updating or learning (a most appropriate term) for discrete data is well-done in Machine Learning, a probabilistic perspective if a bit stretched (which is easy with 1000 pages left!). I like the remark (Section 3.5.3) about the log-sum-exp trick. While lengthy, the chapter (Chap. 4) on Gaussian models has

Read more »

How Do You Write Your Model Definitions?

October 20, 2013
By
How Do You Write Your Model Definitions?

I’m often irritated by that when a statistical method is explained, such as linear regression, it is often characterized by how it can be calculated rather than by what model is assumed and fitted. A typical example of this is that linear regression is often described as a method that uses ordinary least squares to calculate the best...

Read more »

Optimizing a multivariable function parameters using a random method, genetic algorithm and simulated annealing in R

October 19, 2013
By
Optimizing a multivariable function parameters using a random method, genetic algorithm and simulated annealing in R

Say that you are implementing a non-linear regression analysis, which is shortly described by wikipedia as:"In statistics, nonlinear regression is a form of regression analysis in which observational data are modeled by a function which is a nonlinear ...

Read more »

Introduction to Feature selection for bioinformaticians using R, correlation matrix filters, PCA & backward selection

October 17, 2013
By
Introduction to Feature selection for bioinformaticians using R, correlation matrix filters, PCA & backward selection

Bioinformatics is becoming more and more a Data Mining field. Every passing day, Genomics and Proteomics yield bucketloads of multivariate data (genes, proteins, DNA, identified peptides, structures), and every one of these biological data units are described by a number of features: length, physicochemical properties, scores, etc. Careful consideration of which features to select when trying...

Read more »

Sponsors

Mango solutions



RStudio homepage



Zero Inflated Models and Generalized Linear Mixed Models with R

Quantide: statistical consulting and training

datasociety

http://www.eoda.de





ODSC

ODSC

CRC R books series





Six Sigma Online Training









Contact us if you wish to help support R-bloggers, and place your banner here.

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)