Articles by Wesley

JSM 2013 – Wednesday

August 8, 2013 | Wesley

I was able to attend a continuing education short course at the JSM conference that proved to be quite insightful.  The course covered data mining and was titled “Applied Data Mining Analysis: A Step-by-Step Introduction Using Real-World Data Sets”.  The presentation was given by Dan Steinberg and the ... [Read more...]

JSM 2013 – Tuesday

August 7, 2013 | Wesley

The Joint Statistical Meetings in Montreal have proven to be very good.  Here are a few highlights from Tuesday’s sessions.  There is one major problem: there are too many good sessions to attend.  During one time block I had six sessions that I wanted ... [Read more...]

JSM 2013 – Monday

August 6, 2013 | Wesley

I am currently attending the 2013 Joint Statistical Meetings in Montreal. I will try to share a few of the things that I take away each day. Last night (Monday) I attended the JSM keynote by Nate Silver and it proved to be a very interesting discussion.  Silver is best ... [Read more...]

Imputing Missing Data With Expectation – Maximization

July 14, 2013 | Wesley

It is fairly common to find missing values in a dataset. Having only a few missing values generally isn’t a problem, and those records can be deleted listwise. In other words, the entire record is simply removed from the analysis. The problem is even with a limited amount ... [Read more...]

Using R to Produce Scalable Vector Graphics for the Web

June 30, 2013 | Wesley

Statistical software is normally used during the analysis stage of a project, and a cleaned-up static graphic is created for the presentation.  If the presentation is in web format, then some additional considerations are needed. The trick is to find ways to implement those graphs in that ... [Read more...]

Software Packages for Graphs and Charts

June 17, 2013 | Wesley

Graphs can be an important feature of analysis. A well-designed graph can make summary statistics much more readable and increase interpretability. It also makes reports and articles look more professional. There are many software packages that are available to design great graphs ... [Read more...]

Latent Class Modeling Election Data

June 14, 2013 | Wesley

Latent class analysis is a useful tool for identifying groups within multivariate categorical data.  Likert-scale responses are one example of such data. In categorical language these groups are known as latent classes. As a simple comparison, this is similar to the k-means multivariate cluster analysis. There ... [Read more...]

Hey, I Just did a Significance Test!

June 7, 2013 | Wesley

I’ve seen it happen quite often. The sig test. Somebody simply needs to know the p-value, and that one number will provide all of the information about the study that they need to know. The dataset is presented and the client/boss/colleague/etc. invariably asks the question “is ... [Read more...]

Will Mu Go Out With Median

May 28, 2013 | Wesley

True story (no really, this did actually happen).  While in grad school one of the other teaching assistants was approached by one of the students and was asked “will mu go out with median?”  The teaching assistant thought the play on words was pretty funny, laughed, and then cluelessly walked ... [Read more...]

A Brief Tour of the Trees and Forests

April 29, 2013 | Wesley

Tree methods such as CART (classification and regression trees) can be used as alternatives to logistic regression. They provide a way to show the probability of being in any hierarchical group. The following is a compilation of many of the key R packages that cover trees ... [Read more...]

Free e-Copy of Bayesian Computation with R (Use R)

April 24, 2013 | Wesley

Amazon is currently making the first edition of Bayesian Computation with R (Use R) by Jim Albert available for free on Kindle. I own a copy of the book and there is a lot of good content and R examples on how one can do general Bayesian statistics.  The R ... [Read more...]

Amazon AWS Summit 2013

April 18, 2013 | Wesley

I was fortunate enough to have been able to attend the Amazon AWS Summit in NYC and to listen to Werner Vogels give the keynote.  I will share a few of my thoughts on the AWS 2013 Summit and some of my take-aways.  I attended sessions that focused on two products ... [Read more...]

Simulating the Gambler’s Ruin

April 14, 2013 | Wesley

The gambler’s ruin problem is one where a player has a probability p of winning and probability q of losing. For example, let’s take a skill game where player x can beat player y with probability 0.6 by getting closer to a target. The game play begins with player ... [Read more...]

Finding the Distribution Parameters

April 9, 2013 | Wesley

This is a brief description of one way to determine the distribution of given data. There are several ways to accomplish this in R, especially if one is trying to determine if the data comes from a normal distribution. Rather than focusing on hypothesis testing and determining if a distribution ... [Read more...]

Dirichlet Process, Infinite Mixture Models, and Clustering

April 7, 2013 | Wesley

The Dirichlet process provides a very interesting approach to understanding group assignments and models for clustering effects.  Often we encounter the k-means approach.  However, k-means requires a fixed number of clusters.  Often we encounter situations where we don’t know how many fixed clusters we need.  ... [Read more...]
