50 search results for "ecdf"

Using Tables for Statistics on Large Vectors

March 1, 2015
By
Using Tables for Statistics on Large Vectors

This is the first post I’ve written in a while. I have been somewhat radio silent on social media, but I’m jumping back in. Now, I work with brain images, which can have millions of elements (referred to as voxels). Many of these elements are zero (for background). We want to calculate basic statistics on

Read more »

A Machine Learning Result

February 5, 2015
By
A Machine Learning Result

by Joseph Rickert Learning to effectively use any of the dozens of popular machine learning algorithms requires mastering many details and dealing with all kinds of practical issues. With all of this to consider, it might not be apparent to a person coming to machine learning from a background other than computer science or applied math that there are...

Read more »

Exploratory Data Analysis – All Blog Posts on The Chemical Statistician

Exploratory Data Analysis – All Blog Posts on The Chemical Statistician

This series of posts introduced various methods of exploratory data analysis, providing theoretical backgrounds and practical examples.  Fully commented and readily usable R scripts are available for all topics for you to copy and paste for your own analysis!  Most of these posts involve data visualization and plotting, and I include a lot of detail and

Read more »

Get Some Class

October 13, 2014
By
Get Some Class

By Chris Campbell – Senior Consultant, UK. R is a classy language. Look: x <- matrix(1:9, nrow = 3) x ## ## 1 4 7 ## 2 5 8 ## 3 6 9 class(x) ## … Continue reading →

Read more »

The odds of a cluster of airplane accidents

August 2, 2014
By
The odds of a cluster of airplane accidents

Recently, there have been a lot of airplane accidents. July, 17th 2014, Hrabove, Ukraine, Malaysia Airlines, Boeing 777, fatalities 298 (/298) July, 23rd 2014, Magong, Taiwan, TransAsia Airways, ATR 72-500, fatalities 47 (/58) July, 24th 2014, Aguelhok, Mali, Air Algerie, Mc Donnell Douglas MD-83, fatalities 116 (/116) It is simple to find a lot of datasets about airplane crashes....

Read more »

Gender gap and visualisation challenge @ useR!2014

June 20, 2014
By
Gender gap and visualisation challenge @ useR!2014

7 days to go for submissions in the DataVis contest at useR!2014 (see contest webpage). Note that the contest is open for all R users, not only conference participants. Submit your solution soon! PISA dataset allows to challenge some ,,common opinions”, like are boys or girls better in math / reading. But, how to compare

Read more »

Take a look, it’s in a book: distribution of kindle e-book highlights

Take a look, it’s in a book: distribution of kindle e-book highlights

If you've ever started a book and not finished it, it may comfort you to know that you are not alone. It's hard to get accurate estimates of the percentage books that are discontinued, but the rise of e-reading (and resulting circumvention of privacy) affords us the opportunity to answer related questions. The kindle e-reading »more

Read more »

Author inflation in academic literature

April 6, 2014
By
Author inflation in academic literature

There seems to be a general consensus that author lists in academic articles are growing. Wikipedia says so, and I’ve also come across a published letter and short Nature article which accept this is the case and discuss ways of … Continue reading →

Read more »

Regression with multiple predictors

February 18, 2014
By
Regression with multiple predictors

(This article was first published on Digithead's Lab Notebook, and kindly contributed to R-bloggers) Now that I'm ridiculously behind in the Stanford Online Statistical Learning class, I thought it would be fun to try to reproduce the figure on page 36 of the slides from chapter 3 or page 81 of the book. The result is a curvaceous surface...

Read more »

ggplot2: Cheatsheet for Visualizing Distributions

February 18, 2014
By
ggplot2: Cheatsheet for Visualizing Distributions

In the third and last of the ggplot series, this post will go over interesting ways to visualize the distribution of your data.

Read more »