Fortifying hyperSpec: Getting Ready for GSOC
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
hyperSpec
is an R
package for working with hyperspectral data sets. Hyperspectral data can take many forms, but a common application is a series of spectra collected over an x, y grid, for instance Raman imaging of medical specimens. hyperSpec
was originally written by Claudia Beleites and she currently guides a core group of contributors.1
Claudia, regular hyperSpec
contributor Roman Kiselev and myself have joined forces this summer in a Google Summer of Code project to fortify hyperSpec
. We are pleased to report that the project was accepted by R-GSOC administrators, and, as of a few days ago, the excellent proposal written by Erick Oduniyi was approved by Google. Erick is a senior computer engineering major at Wichita State University in Kansas. Erick gravitates toward interdisciplinary projects. This, and his experience with R
, Python
and related skills gives him an excellent background for this project.
The focus of this project is to fortify the infrastructure of hyperSpec
. Over the years, keeping hyperSpec
up-to-date has grown a bit unwieldy. While to-do lists always evolve, the current interrelated goals include:
- Distill2
hyperSpec
: PrunehyperSpec
back to it’s core functionality to keep it lightweight. Relocate portions, such as importing data, into their own dedicated packages. - Shield
hyperSpec
: Analyze the ecosystem ofhyperSpec
with an eye to reducing dependencies as much as possible and ensuring that necessary dependencies are the best choices. Avoid “re-inventing the wheel”, as long as the available “wheels” are computationally efficient and stable (code base and API). - Bridge
hyperSpec
: Having decided on how to reorganizehyperSpec
and which dependencies are necessary and optimal, ensure thathyperSpec
, the constellation of new sub-packages, and all dependencies are integrated efficiently. There are a number of data pre-processing and plotting functions that need to be streamlined and interfaced to external packages more consistently. Some portions may need substantial refactoring.
Addressing each of these goals will make hyperSpec
much easier to maintain, less fragile, and easier for others to contribute. Every step will bring enhanced documentation and vignettes, along with new unit tests. Work will begin in earnest on June 1st, and we are looking forward to a very productive summer.
Finally, on behalf of all participants, let me just say how grateful we are to Google for establishing the GSOC program and for supporting Erick’s work this summer!
A little history for the curious: the
hyperSpec
andChemoSpec
packages were written around the same time, independent of each other (~2009). Eventually, Claudia and I became aware of each other’s work, and we have collaborated in ways large and small ever since (I like working with Claudia because I always learn from her!). We have jointly mentored GSOC students twice before. One side project ishyperChemoBridge
, a small package that convertshyperSpec
objects intoSpectra
objects (the nativeChemoSpec
format) and vice-versa.↩︎The descriptors here are Erick’s clever choice of words.↩︎
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.