129 search results for "web scraping"

Using One Programming Language In the Context of Another – Python and R

January 22, 2014

Over the last couple of years, I’ve settled into using R and Python as my languages of choice for doing stuff: R, because RStudio is a nice environment, I can blend code and text using R Markdown and knitr, ggplot2 and rCharts make generating graphics easy, and reshapers such as plyr make wrangling with data...

Read more »

Statistics meets rhetoric: A text analysis of "I Have a Dream" in R

January 20, 2014

This article was first published on analyze stuff. It has been contributed to Anything but R-bitrary as the second article in its introductory series. By Max Ghenis. Today, we celebrate the would-be 85th birthday of Martin Luther King, Jr., a man remembered for pioneering the civil rights movement through his courage, moral leadership, and oratory prowess. This post...

Read more »

Statistics meets rhetoric: A text analysis of “I Have a Dream” in R

January 20, 2014

Today, we celebrate the would-be 85th birthday of Martin Luther King, Jr., a man remembered for pioneering the civil rights movement through his courage, moral leadership, and oratory prowess. This post focuses on his most famous speech, I Have a Dream given on the steps of the Lincoln Memorial to over...

Read more »

Second NYC R classes (announcement and teaching experience)

January 20, 2014

(The photo was from our first offering of R classes) We are going to offer our Data Science by R (beginner level) course again in February. The goal of this class is to get students to a point where they are self-sufficient in R, are proficient at analyzing data and can take these skills back to their full-time jobs....

Read more »

Calling Python from R with rPython

January 13, 2014

Python has generated a good bit of buzz over the past year as an alternative to R. Personal biases aside, an expert makes the best use of the available tools, and sometimes Python is better suited to a task. As a case in point, I recently wanted to pull data via the Reddit API. There...

Read more »

Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis

January 13, 2014

Many articles have been written on why R is better than Excel for data analysis. In this post, I will summarize the reasons why R is advantageous in most data... The post Why R is Better Than Excel for Fantasy Football (and most other) Data Analysis appeared first on Fantasy Football Analytics.

Read more »

College Basketball: Presence in the NBA over Time

November 7, 2013

Interested in practicing a bit of web-scraping, I decided to make use of a nice dataset provided by Databasebasketball.com in order to examine the representation of various college programs in the NBA/ABA over time. This dataset only includes retired players, and ends in 2010, so I decided to...

Read more »

Creating your personal, portable R code library with GitHub

September 21, 2013

As I discussed in a previous post, I have a few helper functions I’ve created that I commonly use in my work. Until recently, I manually included these functions at the start of my R scripts by either the tried-and-true copy-and-paste method, or by extracting them from a local file with the source()...

Read more »

MLB Rankings Using the Bradley-Terry Model

August 31, 2013

Today, I take my first shots at ranking Major League Baseball (MLB) teams. I see my efforts at prediction and ranking as an ongoing process so that my models improve, the data I incorporate are more meaningful, and ultimately my predictions are largely accurate. For the first attempt, let’s rank MLB teams using the Bradley-Terry (BT) model. Before we discuss the rankings, we need...

Read more »

ggplot2 Chloropleth of Supreme Court Decisions: A Tutorial

July 4, 2013

I don't do much GIS but I like to. It's rather enjoyable and involves a tremendous skill set. Often you will find yourself grabbing data sets from some site, scraping, data cleaning and reshaping, and graphing. On the ride …

Read more »