136 search results for "Web Scraping"

Second NYC R classes(announcement and teaching experience)

January 20, 2014
By
Second NYC R classes(announcement and teaching experience)

(The photo was from our first offering of R classes) We are going to offer our Data Science by R (beginner level) course again in February. The goal of this class is to get students to a point where they are self-sufficient in R, are proficient at analyzing data and can take these skills back to their full-time jobs....

Read more »

Calling Python from R with rPython

January 13, 2014
By
Calling Python from R with rPython

Python has generated a good bit of buzz over the past year as an alternative to R. Personal biases aside, an expert makes the best use of the available tools, and sometimes Python is better suited to a task. As a case in point, I recently wanted to pull data via the Reddit API. There The post Calling...

Read more »

Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis

January 13, 2014
By

Many articles have been written on why R is better than Excel for data analysis.  In this post, I will summarize the reasons why R is advantageous in most data The post Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis appeared first on Fantasy Football Analytics.

Read more »

College Basketball: Presence in the NBA over Time

November 7, 2013
By
College Basketball: Presence in the NBA over Time

Interested in practicing a bit of web-scraping, I decided to make use of a nice dataset provided by Databasebasketball.com in order to examine the representation of various college programs in the NBA/ABA over time. This dataset only includes retired players, and ends in 2010, so I decided to...

Read more »

Creating your personal, portable R code library with GitHub

September 21, 2013
By
Creating your personal, portable R code library with GitHub

As I discussed in a previous post, I have a few helper functions I’ve created that I commonly use in my work. Until recently, I manually included these functions at the start of my R scripts by either the tried and true copy-and-paste method, or by extracting them from a local file with the <code>source()</code> The post Creating...

Read more »

MLB Rankings Using the Bradley-Terry Model

August 31, 2013
By
MLB Rankings Using the Bradley-Terry Model

Today, I take my first shots at ranking Major League Baseball (MLB) teams. I see my efforts at prediction and ranking an ongoing process so that my models improve, the data I incorporate are more meaningful, and ultimately my predictions are largely accurate. For the first attempt, let’s rank MLB teams using the Bradley-Terry (BT) model. Before we discuss the rankings, we need...

Read more »

ggplot2 Chloropleth of Supreme Court Decisions: A Tutorial

July 4, 2013
By
ggplot2 Chloropleth of Supreme Court Decisions: A Tutorial

I don't do much GIS but I like to. It's rather enjoyable and involves a tremendous skill set. Often you will find your self grabbing data sets from some site, scraping, data cleaning and reshaping, and graphing. On the ride … Continue reading →

Read more »

Which airline should you be loyal to?

July 2, 2013
By
Which airline should you be loyal to?

LOYALTY PROGRAM CHOICE BASED ON DEPARTURE COUNT If you read Decision Science News, you’re probably a professor or grad student or researcher or policy type who flies around a lot to conferences, symposia, workshops, tutorials, summer schools, and all-hands meetings. You travel the globe to give talks and work with co-authors. All this flying around The post Which...

Read more »

Opel Corsa Diesel Usage

June 24, 2013
By
Opel Corsa Diesel Usage

I wanted to extend my car weight distribution calculation of June 16 from only 2000 to years 2000 to 2013. Unfortunately, come Sunday afternoon the code seemed too slow and not even the beginning of a post. So, I went on to another calculation I w...

Read more »

Logging Data in R Loops: Applied to Twitter.

May 26, 2013
By

A problem that many users face in R is storing the output from loop operations. In the case of Twitter, we may be requesting the last specified number of Tweets from a number of Twitter users. Several methods exist for … Continue reading →

Read more »