134 search results for "Web Scraping"

Data Science, Data Analysis, R and Python

The October 2012 issue of Harvard Business Review prominently features the words “Getting Control of Big Data” on the cover, and the magazine includes these three related articles:“Big Data: The Management Revolution,” by Andrew McAfee and Erik Brynjolfsson, pages 61 – 68;“Data Scientist: The Sexiest Job of the 21st Century,” by Thomas H. Davenport and D.J. Patil, pages...

Read more »

.Rhistory

October 27, 2012
By

Over the last couple of years I've been using R every now and then. When I stumbled upon an interesting topic, and I managed to get a hold of a data set, I tried to make sense of it using R.It's a bit like the Stat Labs approach: I might get started by...

Read more »

Hangman in R: A learning experience

July 28, 2012
By
Hangman in R: A learning experience

I love when people take a sophisticated tool and use it to play video games. Take R for example. I first saw someone create a game for R at talk.stats.com. My friend Dason inspired me to more efficiently waste time … Continue reading →

Read more »

Data Analysis Training

March 20, 2012
By
Data Analysis Training

I'm training some of my colleagues on Big'ish data analysis this week. Here's how I'm running the class. Would love your ideas to make it better. CLASS OBJECTIVES (LEARNING OUTCOMES)After completion of the course, you will be able to:Understand concept...

Read more »

Making an R Package: Not as hard as you think

January 11, 2012
By
Making an R Package: Not as hard as you think

I’ve been writing functions in R for a while to do various things like talking to APIs, web scraping, model testing and data visualisation (basically thing which can get a bit repetitive!), but have always been slightly intimidated about turning those functions into a package, which I could then call using library (package-name). Note that … Continue reading...

Read more »

Plotting Doctor Who Ratings (1963-2011) with R

January 3, 2012
By
Plotting Doctor Who Ratings (1963-2011) with R

Introduction First day back to work after New Year celebrations and my brain doesn’t really want to think too much. So I went out for lunch and had a nice walk in the park. Still had 15 minutes to kill before my lunch break was over and so decided to kill some time with a quick web

Read more »

GScholarXScraper: Hacking the GScholarScraper function with XPath

November 13, 2011
By
GScholarXScraper: Hacking the GScholarScraper function with XPath

Kay Cichini recently wrote a word-cloud R function called GScholarScraper on his blog which when given a search string will scrape the associated search results returned by Google Scholar, across pages, and then produce a word-cloud visualisation. This was of interest to me because around the same time I posted an independent Google Scholar scraper function  get_google_scholar_df()

Read more »

Facebook Graph API Explorer with R

November 10, 2011
By
Facebook Graph API Explorer with R

I wanted to play around with the Facebook Graph API  using the Graph API Explorer page as a coding exercise. This facility allows one to use the API with a temporary authorisation token. Now, I don’t know how to make an R package for the proper API where you have to register for an API key and

Read more »

UCLA Statistics: Analyzing Thesis/Dissertation Lengths

September 29, 2010
By
UCLA Statistics: Analyzing Thesis/Dissertation Lengths

As I am working on my dissertation and piecing together a mess of notes, code and output, I am wondering to myself “how long is this thing supposed to be?” I am definitely not into this to win the prize for longest dissertation. I just want to say my piece, make my point and move on. I’ve heard that...

Read more »

Cricket data analysis

September 4, 2010
By
Cricket data analysis

Cricket World Cup 2011 is approaching and I'm interested in analyzing one day international cricket data to predict some results and share interesting information about cricket.  For the analysis, I need cricket data and tried several things to ge...

Read more »