166 search results for "web scraping"

GScholarXScraper: Hacking the GScholarScraper function with XPath

November 13, 2011

Kay Cichini recently wrote a word-cloud R function called GScholarScraper on his blog which, when given a search string, scrapes the associated search results returned by Google Scholar across pages and then produces a word-cloud visualisation. This was of interest to me because around the same time I posted an independent Google Scholar scraper function, get_google_scholar_df()


Facebook Graph API Explorer with R

November 10, 2011

I wanted to play around with the Facebook Graph API using the Graph API Explorer page as a coding exercise. This facility allows one to use the API with a temporary authorisation token. Now, I don’t know how to make an R package for the proper API where you have to register for an API key and


UCLA Statistics: Analyzing Thesis/Dissertation Lengths

September 29, 2010

As I am working on my dissertation and piecing together a mess of notes, code and output, I am wondering to myself “how long is this thing supposed to be?” I am definitely not into this to win the prize for longest dissertation. I just want to say my piece, make my point and move on. I’ve heard that...


Cricket data analysis

September 4, 2010

Cricket World Cup 2011 is approaching, and I'm interested in analyzing one-day international cricket data to predict some results and share interesting information about cricket. For the analysis, I need cricket data and tried several things to ge...


What to Expect?

January 22, 2010

In 2007, I was introduced to Twitter via the written qualifying exam towards my Ph.D. At first, I did not know what to do with it. After a good year or so (maybe even sooner) had passed, I began to follow some very interesting people who share the same interests as me. It has transformed my academic experience. It is...


Analysing The Rock ‘n’ Roll Madrid Marathon

April 18, 2015

“Nobody’s going to win all the time. On the highway of life you can’t always be in the fast lane” (Haruki Murakami, What I Talk About When I Talk About Running). I started running two years ago and one of my dreams is to run a marathon someday. One month ago I ran my first …


Monitoring Price Fluctuations of Book Trade-In Values on Amazon

April 8, 2015

I am planning to finish school soon and I would like to shed some weight before moving on. I have collected a fair number of books that I will likely never use again and it would be nice to get some money for them. Sites like Amazon and eBay let you se...


More Airline Crashes via the Hadleyverse

March 31, 2015

I saw a fly-by #rstats mention of more airplane accident data on — of all places — LinkedIn (email) today which took me to a GitHub repo by @philjette. It seems there’s a web site (run by what seems to be a single human) that tracks plane crashes. Here’s a tweet from @philjette announcing it:


Knitr’s best hidden gem: spin

March 23, 2015

Stop knitting & start spinning: spin can help you write reports much faster and avoid repeating yourself. Anyone who loves the idea of dynamic report generation with R is probably a big fan of knitr and its flagship function, knit. But not many people seem to know about knit's awesome cousin -...


Fuzzy String Matching – a survival skill to tackle unstructured information

February 26, 2015

“The amount of information available on the internet grows every day.” Thank you, Captain Obvious! By now even my grandma is aware of that! Actually, the internet has increasingly become the first address for...
