Blog Archives

"I don’t wanna grow up": Age / value relationships for football players

February 1, 2013

Let's get back to the age-value relationship from my last post. I did some more plotting to see for which position this inverted U-shaped relationship is strongest. Please note that I use a dataframe called eu.players throughout this post, which holds football player information downloaded from transfermarkt.de. But first, let us get back to the original graph.
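
One way to compare an inverted-U relationship across positions is to fit a quadratic in age per position and look at where each curve peaks. The following is only a sketch under assumptions, not the code from the post: the column names `age`, `value` and `position`, the helper `peak_age_by_position`, and the toy data are all made up for illustration.

```r
# Hypothetical helper: fit value ~ age + age^2 separately for each position
# and return the age at which the fitted parabola peaks.
# Assumes columns named `age`, `value` and `position` (not confirmed by the post).
peak_age_by_position <- function(players) {
  sapply(split(players, players$position), function(d) {
    b <- coef(lm(value ~ age + I(age^2), data = d))
    b1 <- b[["age"]]
    b2 <- b[["I(age^2)"]]
    # The vertex -b1 / (2 * b2) is a maximum only if the quadratic term is negative.
    if (isTRUE(b2 < 0)) -b1 / (2 * b2) else NA_real_
  })
}

# Toy data, invented purely to exercise the function (not transfermarkt.de data).
toy <- data.frame(
  position = rep(c("Goalkeeper", "Striker"), each = 5),
  age      = rep(c(20, 24, 28, 32, 36), times = 2)
)
toy$value <- ifelse(toy$position == "Goalkeeper",
                    100 - (toy$age - 29)^2,
                    100 - (toy$age - 26)^2)

peaks <- peak_age_by_position(toy)
print(peaks)
```

A more negative quadratic coefficient means a sharper peak, so comparing that coefficient across positions is one rough way to judge where the relationship is strongest.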

Read more »

The "golden age" of a football player

January 28, 2013

It's been some time since my last post on football, and we're talking about European soccer here. I finally managed to write some functions that let me extract player stats from www.transfermarkt.de. The site tracks lots of stats in the world of soccer. For each player, there is information about the dominant foot, height, age, the estimated...

Read more »

R-bloggers

January 22, 2013

Until I find the time to post my newest adventuRes, why don't you check out the great collection of other R blogs on the web: www.r-bloggers.com. Have fun!

Read more »

"The Dude" takes the Tarantino threshold

January 15, 2013

Just a quick reply to a friend of mine who suggested testing the swearing capabilities of The Dude. As you can see, "The Big Lebowski" (2.79% swear words) easily clears the Tarantino threshold (0.98%), but it's no match against "Res...

Read more »

Fun stuff with subtitles or "The Tarantino Threshold"

January 13, 2013

Fortunately, there is a page called www.opensubtitles.org, where you can get subtitle (.SRT) files for virtually every movie. Now let's see what we can do with these. SRT files are in plain text format (human readable) and can thus be read quite easily...
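
Because SRT files are plain text, pulling out the spoken words takes only a few lines of base R. The sketch below is an illustration, not the code from the post: the function names `read_srt_text`, `srt_words` and `word_share` are hypothetical, and it assumes the usual SRT layout of blank-line-separated blocks, each with an index line, a timing line and one or more text lines.

```r
# Keep only the dialogue lines of an .srt file.
# Note: a dialogue line consisting solely of digits would also be dropped.
read_srt_text <- function(path) {
  lines <- trimws(readLines(path, warn = FALSE, encoding = "UTF-8"))
  is_index  <- grepl("^[0-9]+$", lines)            # block counter, e.g. "12"
  is_timing <- grepl("-->", lines, fixed = TRUE)   # "00:00:01,000 --> 00:00:03,000"
  is_blank  <- lines == ""
  lines[!(is_index | is_timing | is_blank)]
}

# Split dialogue lines into lower-case words.
srt_words <- function(text_lines) {
  text <- tolower(paste(text_lines, collapse = " "))
  text <- gsub("<[^>]+>", " ", text)               # drop formatting tags such as <i>
  words <- unlist(strsplit(text, "[^a-z']+"))
  words[words != ""]
}

# Percentage of words that appear in a given word list.
word_share <- function(words, word_list) {
  100 * mean(words %in% tolower(word_list))
}

# Tiny made-up subtitle file to show the functions in use.
demo <- tempfile(fileext = ".srt")
writeLines(c("1", "00:00:01,000 --> 00:00:03,000", "Hello there.", "",
             "2", "00:00:04,000 --> 00:00:06,000", "<i>Hello</i> again, world."),
           demo)
words <- srt_words(read_srt_text(demo))
share <- word_share(words, "hello")
print(share)
```

With a word list of your choice, `word_share` gives the kind of percentage a word-frequency threshold can be built on.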

Read more »

Creating PDFs and websites with the "knitr" package

November 27, 2012

Just a quick note: I came across the R package "knitr", which lets you generate PDF files by mixing LaTeX and R code in one document. The result looks very nice and is great for creating documentation, manuals and so on. I find knitr much easier to u...

Read more »

Josh vs. himself (or: Firefly > all)

October 22, 2012

For Jan... I've got no data for "S.H.I.E.L.D." :( Maybe, but just maybe, "Firefly" gets the way-too-early-cancelled bonus from the voting community.

Read more »

Going to the Movies…

October 22, 2012

Today, let us have a look at movies. The Internet Movie Database (IMDb) has some data dumps available on their website. It's a subset of the information available on the IMDb site, but it's more than enough. I will spare you my code to convert these da...

Read more »

Soccer is all about money (?) – Part 3: More plots & analyses

October 19, 2012

Let's play around a bit more with the dataset we built in Part 1 of this series. Now we are going to compare data from more championships in Europe. Let's check out the first divisions from the following countries:
- Germany (1. Bundesliga)
- England (Premier League)
- Spain (Primera División)
- Italy (Serie A)
- France (Ligue 1)
If you want to replicate the...

Read more »

Soccer is all about money (?) – Part 2: Simple analyses

October 18, 2012

Alright, now we have all the data we need in one dataframe. To make this code work, I assume you ran the code from Part 1, because we need the dataframe big.tab. All the data presented here is based on the data from October 18, 2012. You can run an analysis with the current data or I can do it at...

Read more »