Data I/O performance tips

April 20, 2010

[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

The R tag on StackOverflow recently topped 1000 questions, and continues to be a great community resource for practical tips on using the R language for data analysis and visualization. To take one example, "Efficiency of operations on R data structures" has been answered with some great tips on efficiently getting data in and out of the R system. Here’s three quick tips excerpted from user "doug"’s response:

  • For reading in flat files, the performance of read.table can be improved 5x (or more) just by opting out of a few of read.table’s default arguments
  • With only a little more hassle, you can make reading flat files even faster by using ‘scan’ instead of ‘read.table’, and
  • Paying attention to data types can often give you a performance boost and reduce your memory footprint.

See the post linked below for the complete details on how to implement these tips and power up the process of reading data into R. Efficiency of operations on R data structures 

To leave a comment for the author, please follow the link and comment on their blog: Revolutions. offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

If you got this far, why not subscribe for updates from the site? Choose your flavor: e-mail, twitter, RSS, or facebook...

Tags: ,

Comments are closed.

Search R-bloggers


Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)