data.table: an R package everyone should use

March 22, 2011

(This article was first published on Left Censored » R, and kindly contributed to R-bloggers)

I’m not sure how I missed this package, but I am sure glad I’ve found it. The data.table package for R provides something of a reconceptualization of the standard data.frame object, though it remains (mostly) compatible with data.frame. The advantage of data.table is that it treats tables as if they were tables in a database, which allows extremely fast and intuitive merging of tables. Subsetting and summarizing tables are likewise much quicker. So, if you often find yourself dealing with large data.frames that need to be merged, subsetted, and summarized, I encourage you to take a look at data.table. The R CRAN page and R-Forge development site are linked below.
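To give a flavor of the three operations mentioned above (merging, subsetting, and summarizing), here is a minimal sketch. The tables, column names, and values are all made up for illustration, and it assumes a reasonably recent data.table release in which the `.()` shorthand is available.

```r
library(data.table)

# Hypothetical example data: every name and value here is invented.
orders <- data.table(
  customer_id = c(1L, 2L, 2L, 3L, 3L, 3L),
  amount      = c(10, 20, 30, 40, 50, 60)
)
customers <- data.table(
  customer_id = c(1L, 2L, 3L),
  region      = c("east", "west", "west")
)

# Setting a key sorts each table by customer_id, much like a database index.
setkey(orders, customer_id)
setkey(customers, customer_id)

# Merge: join the two tables on their shared key column.
merged <- merge(orders, customers, by = "customer_id")

# Subset: keyed lookup of every order belonging to customer 3.
cust3 <- orders[.(3L)]

# Summarize: total order amount per region.
totals <- merged[, .(total = sum(amount)), by = region]
print(totals)
```

On tables this small the speed difference is invisible, of course. The point is only the syntax: once a key is set, joins and lookups are expressed inside the square brackets rather than through separate helper functions.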
