The New Consumer Requires an Updated Market Segmentation

September 18, 2014
By
The New Consumer Requires an Updated Market Segmentation

The new consumer is the old consumer with more options and fewer prohibitions. Douglas Holt calls it the postmodern market defined by differentiation: "consumer identities are being fragmented, proliferated, recombined, and turned into salabl...

Read more »

“Do You Want to Steal a Snowman?” – A Look (with R) At TorrentFreak’s Top 10 PiRated Movies List #TLAPD

September 18, 2014
By
“Do You Want to Steal a Snowman?” – A Look (with R) At TorrentFreak’s Top 10 PiRated Movies List #TLAPD

We leave the Jolly Roger behind this year and turn our piRate spyglass towards the digital seas and take a look at piRated movies as seen...

Read more »

Interactive Visualizations from R using rCharts

September 18, 2014
By

At useR! 2014 Ramnath Vaidyanathan gave a tutorial and a presentation on one of his...

Read more »

Comparing machine learning models in R

September 18, 2014
By
Comparing machine learning models in R

by Joseph Rickert While preparing for the DataWeek R Bootcamp that I conducted this week I came across the following gem. This code, based directly on a Max Kuhn...

Read more »

Stay on track: Plotting GPS tracks with R

September 18, 2014
By
Stay on track: Plotting GPS tracks with R

Many GPS devices and apps have the capability to track your current position via GPS. If you go walking, running, cycling, flying or driving, you can take a look...

Read more »

Space Invaders

September 17, 2014
By
Space Invaders

I burned through all of my extra lives in a matter of minutes, and my two least-favorite words appeared on the screen: GAME OVER (Ernest Cline, Ready Player One)...

Read more »

Fun with .Rprofile and customizing R startup

Fun with .Rprofile and customizing R startup

Over the years, I've meticulously compiled–and version controlled–massive and extensive configuration files for virtually all of my most used utilities, most notably vim, tmux, and zsh. In fact, one...

Read more »

Animated choropleths to visualize mortality rates of children under 5 and gender differences using rMaps

September 17, 2014
By

This post displays two animated choropleths. One for global mortality rates for children under 5 (per 1000 live births) and the second for the difference in global mortality rates...

Read more »

BCEA 2.1

September 17, 2014
By
BCEA 2.1

We're about to release the new version of BCEA, which will contain some major changes.A couple of changes in the basic code that should improve the computational speed. In...

Read more »

Applications of R presentations at Dataweek

September 17, 2014
By

I'm speaking at the DataWeek conference in San Francisco today. My talk follows Skylar Lyon from Accenture — I'm really looking forward to hearing how he uses Revolution R...

Read more »

Migrating Table-oriented Web Scraping Code to rvest w/XPath & CSS Selector Examples

September 17, 2014
By

I was offline much of the day Tuesday and completely missed Hadley Wickham’s tweet about the new rvest package: Are you an #rstats user who misses python's...

Read more »

Bayes says “don’t worry” about Scotland’s Referendum

September 17, 2014
By
Bayes says “don’t worry” about Scotland’s Referendum

Just few hours before Scots head to the polls, there is not an overwhelming advantage of the anti-independence vote. Actually, the margin is shorter than last time I looked...

Read more »

Using great circles and ggplot2 to map arrival/departure of 2014 US Open Tennis Players

September 17, 2014
By
Using great circles and ggplot2 to map arrival/departure of 2014 US Open Tennis Players

Please click on the image for information on how to use R and ggplot2 to generate this plot. 

Read more »

Maximal Information Coefficient (Part II)

September 17, 2014
By
Maximal Information Coefficient (Part II)

A while back, I wrote a post simply announcing a recent paper that described a new statistic called the "Maximal Information Coefficient" (MIC),...

Read more »

Changes to FSA — Size Structure

September 16, 2014
By
Changes to FSA — Size Structure

I have added a (very rough) first draft to the Size Structure chapter of the forthcoming Introductory Fisheries Science with R book on the book’s fishR webpage.  Accompanying this...

Read more »

PerformanceAnalytics update released to CRAN

September 16, 2014
By
PerformanceAnalytics update released to CRAN

Version number 1.4.3541 of PerformanceAnalytics was released on CRAN today. If you’ve been following along, you’ll note that we’re altering our version numbering system.  From here on out, we’ll...

Read more »

New members for R-core and R Foundation

September 16, 2014
By

The R Foundation for Statistical Computing, the Vienna-based non-profit organization that oversees the R Project, has just added several new "ordinary members". (Ordinary members participate in R Foundation meetings...

Read more »

R package to convert statistical analysis objects to tidy data frames

September 16, 2014
By

I talked a little bit about tidy data my recent post about dplyr, but you should really go check out Hadley’s paper on the subject. R expects inputs...

Read more »

3D Sine Wave

September 16, 2014
By
3D Sine Wave

Had a headache last night, so decided to take things easy and...

Read more »

Notes from the Kölner R meeting, 12 September 2014

September 16, 2014
By
Notes from the Kölner R meeting, 12 September 2014

Last Friday we had guests from Belgium and the Netherlands joining us in Cologne. Maarten-Jan Kallen from BeDataDriven came from The Hague to introduce us to Renjin, and...

Read more »

Using SQLite in R

September 16, 2014
By
Using SQLite in R

Working on big data requires a clean and robust approach on storing and accessing the data. SQLite is an all inclusive server-less database system in a single file. This...

Read more »

Nuts and Bolts of Quantstrat, Part II

September 16, 2014
By
Nuts and Bolts of Quantstrat, Part II

Last week, I covered the boilerplate code in quantstrat. This post will cover parameters and adding indicators to strategies in … Continue reading →

Read more »

how to provide a variance calculation on your public-use survey data file without disclosing sampling clusters or violating respondent confidentiality

September 16, 2014
By

this post and accompanying syntax would not have been possible without dan oberski.  read more, find out why.  thanks dan.dear survey administrator: someone sent you this link because you...

Read more »

Why Are We Still Teaching t-Tests?

September 15, 2014
By
Why Are We Still Teaching t-Tests?

My posting about the statistics profession losing ground to computer science drew many comments, not only here in Mad (Data) Scientist, but also in the co-posting at Revolution Analytics,...

Read more »

Interview with Romain Francois at useR! 2014

September 15, 2014
By

At the useR! 2014 conference, without a doubt one of the overriding themes was R’s...

Read more »

If the typing monkeys have met Mr Markov: probabilities of spelling "omglolbbq" after the digitial monkeys have read Dracula

September 15, 2014
By
If the typing monkeys have met Mr Markov: probabilities of spelling "omglolbbq" after the digitial monkeys have read Dracula

On the weekend, randomly after watching Catching Fire, I remember the problem of the typing monkeys (Infinite monkey theorem) in which basically could be defined as (Thanks to Wiki):#...

Read more »

Using Reddit’s JSON API to analyze post popularity

September 15, 2014
By
Using Reddit’s JSON API to analyze post popularity

Graduate student Clay McLeod decided to find out what makes a post on the social-sharing site Reddit popular. These are the questions he seeks to answer: What’s in a...

Read more »

Creating a map showing land covered by rising sea levels

September 15, 2014
By

I joined the Geekli.st climate Hackathon this weekend at the Hub Westminster (my favorite venue for Hackathons). While the organizers had lots of enthusiasm they had very little in...

Read more »

Mapping every IPv4 address

September 15, 2014
By
Mapping every IPv4 address

During July I was working with a commercial data source that provides extra data around IP addresses and it dawned on me: rather than pinging billions of IP addresses and...

Read more »