1772 search results for "regression"

Predictive analysis on Web Analytics tool data

July 3, 2013

In our previous webinar, we discussed predictive analytics and the basics needed to perform predictive analysis. We also discussed an eCommerce problem and how it can be solved using predictive analysis. In this post, I will explain the R script that I used to perform predictive analysis during the webinar. Before I explain the R script, ...

Read more »

Learning Time Series with R

June 27, 2013

by Joseph Rickert. Late last Saturday afternoon I was reading in my usual spot at the Dana Street Coffee House in Mt. View. A stranger walking by my table noticed my copy of Madsen’s Time Series Analysis (sitting there untouched again), said he needed to learn something about time series, and asked if I could recommend a book. He...

Read more »

Fun with Fremont Bridge Bicyclists

June 27, 2013

Given the title of this post and its proximity to the Solstice, you will be disappointed to know that I am not writing about naked bicyclists. I apologize for any false hope I may have instilled in you. On October 11th, 2012, the city of Seattle, WA beg...

Read more »

Natural language processing tutorial

June 25, 2013

Introduction This will serve as an introduction to natural language processing. I adapted it from slides for a recent talk at Boston Python. We will go from tokenization to feature extraction to creating a model using a machine learning algorithm. The goal is to provide a reasonable baseline on top of which more complex natural language processing can be...

Read more »

Natural Language Processing Tutorial

June 25, 2013

Introduction This will serve as an introduction to natural language processing. I adapted it from slides for a recent talk at Boston Python. We will go from tokenization to feature extraction to creating a model using a machine learning algorithm. The goal is to provide a reasonable baseline on top of which more complex natural language processing can be done, and...

Read more »

GRNN and PNN

June 23, 2013

From a technical perspective, people usually choose a GRNN (general regression neural network) for function approximation with a continuous response variable and a PNN (probabilistic neural network) for pattern recognition / classification problems with categorical outcomes. However, from a practical standpoint, it is often not necessary to draw a fine line between GRNN...

Read more »
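
To make the GRNN / PNN distinction in the excerpt above concrete, here is a minimal sketch in Python (the post itself works in R; this code is not taken from it). It assumes the textbook formulations: a GRNN predicts a Gaussian-kernel-weighted average of the training responses, and a PNN picks the class with the largest average kernel density. The function names, the `sigma` smoothing parameter, and the toy data are hypothetical choices for illustration.

```python
import math


def gaussian_kernel(x, center, sigma):
    """Gaussian kernel weight between two points given as sequences of floats."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, center))
    return math.exp(-sq_dist / (2.0 * sigma ** 2))


def grnn_predict(x_train, y_train, x_new, sigma=1.0):
    """GRNN sketch: kernel-weighted average of continuous training responses."""
    weights = [gaussian_kernel(x_new, x, sigma) for x in x_train]
    total = sum(weights)
    # Note: total can underflow to 0.0 if x_new is far from every training
    # point relative to sigma; a real implementation would guard against that.
    return sum(w * y for w, y in zip(weights, y_train)) / total


def pnn_classify(x_train, labels, x_new, sigma=1.0):
    """PNN sketch: choose the class with the highest average kernel density."""
    sums, counts = {}, {}
    for x, label in zip(x_train, labels):
        sums[label] = sums.get(label, 0.0) + gaussian_kernel(x_new, x, sigma)
        counts[label] = counts.get(label, 0) + 1
    scores = {label: sums[label] / counts[label] for label in sums}
    return max(scores, key=scores.get)


if __name__ == "__main__":
    # Toy regression data (hypothetical): y = x at three points.
    print(grnn_predict([[0.0], [1.0], [2.0]], [0.0, 1.0, 2.0], [1.0]))
    # Toy classification data (hypothetical): two well-separated clusters.
    print(pnn_classify([[0.0], [0.2], [2.0], [2.2]], ["a", "a", "b", "b"], [0.1]))
```

Both functions share the same kernel layer and differ only in how the kernel weights are combined, which is one way to read the remark that the line between the two is not always worth drawing.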

Measuring Associations

June 20, 2013

In Chapter 18, we discuss a relatively new method for measuring predictor importance called the maximal information coefficient (MIC). The original paper is by Reshef et al. (2011). Summaries of the initial reactions to the MIC include those by Speed and Tibshirani (others can be found here). My (minor) beef with it is the lack...

Read more »

Bayesian Modeling of Anscombe’s Quartet

June 20, 2013

Anscombe’s quartet is a collection of four datasets that look radically different yet result in the same regression line when using ordinary least squares regression. The graph below shows Anscombe’s quartet with imposed regression lines (taken from the Wikipedia article). While least squares regression is a good choice for dataset 1 (upper left plot), it...

Read more »
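
The claim in the excerpt above, that all four datasets yield the same ordinary least squares line, can be checked directly. The following is a minimal Python sketch (the post itself uses R and goes on to Bayesian models, which are not shown here). The data values are the published Anscombe (1973) quartet; the helper name `ols_fit` and the `QUARTET` and `FITS` variables are hypothetical names chosen for this illustration.

```python
# Anscombe's quartet: four x/y datasets of 11 points each.
# Datasets 1-3 share the same x values.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    1: (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96, 7.24, 4.26, 10.84, 4.82, 5.68]),
    2: (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10, 6.13, 3.10, 9.13, 7.26, 4.74]),
    3: (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84, 6.08, 5.39, 8.15, 6.42, 5.73]),
    4: ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
        [6.58, 5.76, 7.71, 8.84, 8.47, 7.04, 5.25, 12.50, 5.56, 7.91, 6.89]),
}


def ols_fit(xs, ys):
    """Closed-form simple linear regression; returns (intercept, slope)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Fit each dataset separately.
FITS = {name: ols_fit(xs, ys) for name, (xs, ys) in QUARTET.items()}

if __name__ == "__main__":
    for name, (intercept, slope) in FITS.items():
        print(f"dataset {name}: y = {intercept:.2f} + {slope:.3f} * x")
```

Each fit comes out at roughly y = 3.00 + 0.500x, even though dataset 2 is curved, dataset 3 has a single outlier, and dataset 4 has one high-leverage point. That is why the fitted line alone cannot tell you whether least squares is an appropriate model.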

Data Science Labs: Predictive Models to Improve Vaccine Quality and Production

June 20, 2013

The age of "blockbuster drugs" is coming to an end, as personalized medicine becomes a reality. Data science will be a major driver of innovation in these and other areas of the pharmaceutical industry. This was demonstrated during a project the Data Science Labs team executed with a major pharmaceutical company.

Read more »