# 183 search results for "iris"

## Data Sets for Data Science

March 20, 2014
By

by Joseph Rickert Recently, I had the opportunity to be a member of a job panel for Mathematics, Economics and Statistics students at my alma mater, CSUEB (California State University East Bay). In the context of preparing for a career in data science a student at the event asked: “Where can I find good data sets?”. This triggered a...

## The #rBlocks Experiments

March 19, 2014
By

What's this ? Where should I start? OK, the story goes like this ... rBlocks: A port of #ipythonblocks to #rstats http://t.co/iMMRPCxQIN @simplystats @gvwilson @hadleywickham...

## Brief introduction on Sweave and Knitr for reproducible research

February 24, 2014
By
$Brief introduction on Sweave and Knitr for reproducible research$

A few weeks ago I gave a presentation on using Sweave and Knitr under the guise of promoting reproducible research. I humbly offer this presentation to the blog with full knowledge that there are already loads of tutorials available online. This presentation is specific and slightly biased towards Windows OS, so it probably has limited

## ggplot2: Cheatsheet for Visualizing Distributions

February 18, 2014
By

In the third and last of the ggplot series, this post will go over interesting ways to visualize the distribution of your data.

## Using the LaTeX listings package to style R PDF reports with knitr and pandoc

February 15, 2014
By

knitr is a an R package that allows you to include R code in markdown or LaTeX source files, and have the code and/or its output included in the resulting html or pdf files. RStudio provides good support for this, so if you want to try it out that’s a good place to start. This

## R framework with Object-Oriented Programming

February 13, 2014
By

Data analysis deals with different kinds of data. For instance we can have supermarket sales with - a transactional table, with customer ID, item ID, date of purchase - an item table, with the item ID and its price - … Continue reading →

## Self-Organising Maps for Customer Segmentation using R

February 3, 2014
By

Self-Organising Maps (SOMs) are an unsupervised data visualisation technique that can be used to visualise high-dimensional data sets in lower (typically 2) dimensional representations. In this post, we examine the use of R to create a SOM for customer segmentation. The figures shown here used use the 2011 Irish Census information for the greater Dublin

## Topological Data Analysis with R

January 16, 2014
By

by Joseph Rickert When I was in graduate school in the mid '70s Mathematics departments were still under the spell of abstraction for its own sake. At that time, Algebraic Topology which uses concepts from Abstract Algebra to study topological spaces was a major gateway to the realm of abstraction. On my first visit, it was not at all...

## Computing and visualizing LDA in R

January 15, 2014
By
$Computing and visualizing LDA in R$

As I have described before, Linear Discriminant Analysis (LDA) can be seen from two different angles. The first classify a given sample of predictors to the class with highest posterior probability . It minimizes the total probability of misclassification. To compute it uses Bayes’ rule and assume that follows a Gaussian distribution with class-specific mean