The function in question is
scriptSearch. I’m not much for superlatives — “most” and “best” imply one dimension, but we live in a multi-dimensional world. I’m making an exception.
The statistic I have in mind for this use of “useful” is the waiting time between calls to the function divided by the human time saved by the call.
I wrote a version of this for a company where I do consulting. There are few days working there that I don’t have at least one bout with it. Using
scriptSearch can easily save half an hour compared to what I would have done prior to having the function.
The two main inputs are:
- a string to search for
- a directory to search in
By default it only looks in R scripts in the directory (and its subdirectories).
Examples of directories to search are:
- directory holding a large collection of R scripts
- directory holding the source for local R packages
- personal directory with lots of subdirectories containing R scripts and functions
Examples of uses are:
- where is
- where are all the uses of
splishsplashin the local packages (because I want to change its arguments)?
- a few weeks ago I created a pdf called
factor_history, where is the code that produced that?
These uses might be done with something like:
scriptSearch("blimblam *<-", "path/to/scriptFarm", sub=FALSE)
You may be confused by the asterisk in the first call. The string to search for can be a regular expression. In this case the asterisk means that it will find assignments whether or not there is a space between the object name and the assignment arrow.
scriptSearch was the main motivation for updating the BurStMisc package to version 1.1. The package is on CRAN.
ntile function is also new to BurStMisc. It returns equally-sized ordered groups from a numeric vector — for example, quintiles or deciles.
A more primitive version of the function appeared in a blog post called “Miles of iles”. There is some discussion there of alternative functions.
While I was preparing the update to BurStMisc, I found that automating the writing of some tests using the
testthat package was both warranted and feasible. The
writeExpectTest function is the result.
The generally useful function that was already in BurStMisc is
corner. This allows you to see a few rows and columns of a large matrix or data frame — or higher dimensional array.
I want to spread the news
That if it feels this good getting used
You just keep on using me
— from “Use Me” by Bill Withers
The post Probably the most useful R function I’ve ever written appeared first on Burns Statistics.