131 search results for "web scraping"

Friday Function: setInternet2

April 15, 2011
By
Friday Function: setInternet2

Corporate IT networks are a pain for programmers. Ideally, when programming, you want the freedom to download, install and run any software that you want. Unfortunately, in the interests of security, many programmers find themselves a little restricted at the office. (I’m sure that many network admins will protest that the situation works both ways

Read more »

Find NHL Players with 30 Goals and 100 PIM using R

April 2, 2011
By
Find NHL Players with 30 Goals and 100 PIM using R

Last week Jack Edwards raised the fact that Milan Lucic was the first Bruin player to join the 30 Goal / 100 Penalty Minute club in a few years.  It got me thinking about the other players who have accomplished … Continue reading →

Read more »

NBA Analysis: Coming Soon!

March 21, 2011
By
NBA Analysis:  Coming Soon!

I decided to spend a few hours this weekend writing the R code to scrape the individual statistics of NBA players (2010-11 only).  I originally planned to write up a few NBA-related analyses, but a friend was visiting from out … Continue reading →

Read more »

Clustering NHL Skaters

February 6, 2011
By
Clustering NHL Skaters

I have been sitting on this post for some time now and wanted to get it out there.  The goal is to simply show how easy it is to pull live data from the web into R, massage it, and perform some analytics on it.  I am not sure how useful this analysis really is

Read more »

Dial-a-statistic! Featuring R and Estonia

January 16, 2011
By
Dial-a-statistic! Featuring R and Estonia

Did you wake up this morning hoping that you would be able to listen to telephone beeps inspired by Estonian web site metrics? I knew you did! First things first: I came up with the slightly crazy idea of using the bleepy sounds that telephones make, called “dual-tone multifrequency” (DTMF) tones, as a tool in

Read more »

How to buy a used car with R (part 1)

October 31, 2010
By
How to buy a used car with R (part 1)

I’m in the process of buying a used car. Since I enjoy making these decisions as complicated as possible, I’ve written some R code to scrape relevant websites for informative data. I’ve written this up as a blog entry because I think it’s a decent example of how one might use the XML...

Read more »

How to buy a used car with R (part 1)

October 31, 2010
By
How to buy a used car with R (part 1)

nStrict Standards: Non-static method StringParser_Node::destroyNode() should not be called statically, assuming $this from incompatible context in /afs/ir.stanford.edu/users/k/n/knoepfle/cgi-bin/flatpress/fp-plugins/bbcode/inc/stringparser.class.php on line 358I’m in the process of buying a used car. Since I enjoy making these decisions as complicated as possible, I’ve written some R code to scrape relevant websites for informative data. I’ve written this up as a...

Read more »

Using XML package vs. BeautifulSoup

August 31, 2010
By
Using XML package vs. BeautifulSoup

A while back I posted something about scraping a webpage using the BeautifulSoup module in Python.  One of the comments to that post was by Larry — a blogger over at IEORTools — suggesting that I take a look at … Continue reading →

Read more »

Are MLB Games Getting Longer?

August 5, 2010
By
Are MLB Games Getting Longer?

On July 29, 2010, I had a flight from Denver to Cincinnati.  About an hour before boarding, I went to ESPN’s website and found a new article by Bill Simmons, a.k.a The Sports Guy (@sportsguy33 on Twitter).  The basic premise of this article is that a core group of fans is losing interest in Red

Read more »

Analyze Gold Demand and Investments using R

June 29, 2010
By
Analyze Gold Demand and Investments using R

After the recent foray into stock analysis using quantmod, I thought it worthwhile to mention that the library can be used to analyze a wide variety of investments, including precious metals.  It is also worthwhile to mention that there are other ...

Read more »