# 2659 search results for "ggplot2"

## The Geometry of Classifiers

December 18, 2014
As John mentioned in his last post, we have been quite interested in the recent study by Fernandez-Delgado et al., “Do we Need Hundreds of Classifiers to Solve Real World Classification Problems?” (the “DWN study” for short), which evaluated 179 popular implementations of common classification algorithms over 120 or so data sets, mostly from the UCI …

## Sketching Scatterplots to Demonstrate Different Correlations

December 17, 2014
Looking just now for an openly licensed graphic showing a set of scatterplots that demonstrate different correlations between X and Y values, I couldn’t find one. So here’s a quick R script for constructing one, based on a Cross Validated question/answer (Generate two variables with precise pre-specified correlation): And here’s an example of the result:

## How to analyze a new dataset (or, analyzing ‘supercar’ data, part 1)

December 16, 2014
I love cars. The way they sound. The engineering. The craftsmanship. And let’s be honest: fast cars are just fun. Given my love of cars, I frequently watch Top Gear clips on YouTube. A couple of weeks ago, I stumbled across this: Watching the video, I’m thinking, “253 miles per hour? You’ve got to …

December 15, 2014
A technique succeeds in mathematical physics, not by a clever trick, or a happy accident, but because it expresses some aspect of physical truth (O. G. Sutton). Imagine three unbalanced coins. Coin 1: probability of head = 0.495 and probability of tail = 0.505. Coin 2: probability of head = 0.745 and probability of tail = 0.255. Coin 3: probability of head = 0.095 and …

## QQ-plots in R vs. SPSS – A look at the differences

December 15, 2014
We teach two software packages, R and SPSS, in Quantitative Methods 101 for psychology freshmen at Bremen University (Germany). Sometimes confusion arises when the software packages produce different results. This may be due to specifics in the implementation of a method or, as in most cases, to different default settings. One of these situations occurs

## Are high-reputation users quitting Stack Overflow?

December 14, 2014
I spend a good amount of time on the programming Q&A site Stack Overflow (and a smaller amount of time on its statistics sister site, Cross Validated). Recently this question on Meta Stack Overflow (the website’s discussion forum) caught my attention, raising the question of whether Stack Overflow had become “more negative” recently. It wasn’t the first...

## Monthly Weather in Netherlands

December 14, 2014
When I downloaded the KNMI meteorological data, the intention was to do something which takes more than just the computer’s memory. While it is clearly not big data, at the very least 100 years of daily data is not small either. So I took along a load o...

## FOMC Dates – Price Data Exploration

December 14, 2014
As a first step in visualizing/exploring the data from my last post, FOMC Dates - Scraping Data From Web Pages, I’ll plot the FOMC announcement dates along with the following price series: 2-Year and 10-Year US Treasury yields, S&P 500 ETF (SPY) and USD Index ETF (UUP). I’ll use the quantmod R package to download the price data from...

## ggRandomForests: Visually Exploring random forests. V1.1.1 release.

December 12, 2014
Release early and often. http://cran.r-project.org/web/packages/ggRandomForests/index.html I may have been aggressive numbering the first CRAN release at v1.0, but there’s no going back now. The design of the feature set is complete even if the code has some catching up to…

## Meetup: DataVis with Plotly on December 16th

December 12, 2014
Plotly is a web-based platform for making graphs and analyzing data. Plotly’s APIs and web app...