1954 search results for "Time series"

Exploratory Data Analysis: Quantile-Quantile Plots for New York’s Ozone Pollution Data

Exploratory Data Analysis: Quantile-Quantile Plots for New York’s Ozone Pollution Data

Introduction Continuing my recent series on exploratory data analysis, today’s post focuses on quantile-quantile (Q-Q) plots, which are very useful plots for assessing how closely a data set fits a particular distribution.  I will discuss how Q-Q plots are constructed and use Q-Q plots to assess the distribution of the “Ozone” data from the built-in

Read more »

Truncate by Delimiter in R

September 19, 2013
By

Sometimes, you only need to analyze part of the data stored as a vector. In this example, there is a list of patents. Each patent has been assigned to one or more patent classes. Let's say that we want to analyze the dataset based on only the first pat...

Read more »

Patterns in the Ivy: The Small World of Metal

September 18, 2013
By
Patterns in the Ivy: The Small World of Metal

A few months ago I started listening to Tomahawk, a band described on Wikipedia as “an experimental alternative metal/alternative rock supergroup.” Beyond the quality of their music, I found myself intrigued by the musical background of their members. In addition to Tomahawk, their other bands include acclaimed groups such as Faith No More, Helmet, the Melvins,… Continue reading →

Read more »

R tips for moderately large data

September 16, 2013
By
R tips for moderately large data

Some useful tips recently featured on r-bloggers and originally posted at Mollie’s Research Blog are worth reading. I say moderately large because I don’t really believe there is such a thing as big data (and it looks like Mollie doesn’t … Continue reading →

Read more »

BCEA in UseR!

September 13, 2013
By
BCEA in UseR!

In a recent post, I had hinted at big news for BCEA $-$ I thought it was pretty much a done deal, but because it wasn't yet set in stone, I didn't want to jinx it...But now I've sorted all the details with Springer, who have asked me to write a book on the...

Read more »

Using Arial in R figures destined for PLOS ONE

September 9, 2013
By

Despite the refreshing change that the journal PLOS ONE represents in terms of open access and an refreshing change to the stupidity that is quality/novelty selection by the two or three people that review a paper, it’s submission requirements are far less progressive. Yes they make you jump through a lot of hoops getting your figures and...

Read more »

Maximum Likelihood Estimation and the Origin of Life

September 8, 2013
By
Maximum Likelihood Estimation and the Origin of Life

# Maximum likelihood Estimation (MLE) is a powerful tool in econometrics which allows for the consistent and asymptotically efficient estimation of parameters given a correct identification (in terms of distribution) of the random variable. # It i...

Read more »

Using colClasses to Load Data More Quickly in R

September 5, 2013
By

Specifying a colClasses argument to read.table or read.csv can save time on importing data, while also saving steps to specify classes for each variable later.For example, loading a 893 MB took 441 seconds to load when not using colClasses, b...

Read more »

Fair weather fans, redux

September 1, 2013
By
Fair weather fans, redux

Fair weather fans, redux Or, A little larger small sample On August 11 the Victoria HarbourCats closed out their 2013 West Coast League season with a 4-3 win over the Bellingham Bells. In an earlier...

Read more »

MLB Rankings Using the Bradley-Terry Model

August 31, 2013
By
MLB Rankings Using the Bradley-Terry Model

Today, I take my first shots at ranking Major League Baseball (MLB) teams. I see my efforts at prediction and ranking an ongoing process so that my models improve, the data I incorporate are more meaningful, and ultimately my predictions are largely accurate. For the first attempt, let’s rank MLB teams using the Bradley-Terry (BT) model. Before we discuss the rankings, we need...

Read more »