Analyzing weblog data with R

February 23, 2012

(This article was first published on Revolutions, and kindly contributed to R-bloggers)

The R-chart blog explains how to read a weblog file into R, so you can analyze traffic to a website. For example, here's a page request chart created with R:

Now, charts like this are stock-in-trade for tools like Google Analytics, but this is still useful if you want to look at the performance of a site that hasn't been instrumented for analytics. It also allows you to analyze other data in the weblog, such as frequency of HTTP status codes:

(Looks like this particular site is experiencing a lot of page-not-found 404 errors.) With access to the raw weblog data in R, you could subject it to any kind of statistical analysis. 

You can find the R code and lots of tips for dealing with weblog data in R at the link below.

R-chart: Log File Analysis with R


To leave a comment for the author, please follow the link and comment on their blog: Revolutions. offers daily e-mail updates about R news and tutorials on topics such as: Data science, Big Data, R jobs, visualization (ggplot2, Boxplots, maps, animation), programming (RStudio, Sweave, LaTeX, SQL, Eclipse, git, hadoop, Web Scraping) statistics (regression, PCA, time series, trading) and more...

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...


Comments are closed.

Search R-bloggers


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)