More Command-Line Text Munging Utilities
[This article was first published on Getting Genetics Done, and kindly contributed to R-bloggers.]
In a previous post I linked to gcol as a quick and intuitive alternative to awk. I just stumbled across yet another set of handy text file manipulation utilities, this time from the creators of the BEAGLE software for GWAS data imputation and analysis. In addition to several command-line utilities for converting and formatting BEAGLE files, there are several tools for basic text processing on the command line:
- changecolumn.jar – replace values in a column of an input file.
- changeline.jar – replace values in a line of an input file.
- cut.jar – extract columns from a file.
- filtercolumns.jar – filters columns of input data according to the values in a line.
- filterlines.jar – filters lines of input data according to the values in a column.
- paste.jar – pastes together files that have shared initial columns followed by data columns.
- transpose.jar – transposes rows and columns of an input file.
Much of what these tools do can probably be emulated with some creative use of Unix commands and pipes. But since these are all Java archives, they should work on any platform with a Java runtime, not just Unix/Linux. Hit the link below to see the full list and documentation.
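For comparison, here is a rough sketch of how three of these tasks might look with standard Unix tools. It assumes single-space-delimited input on stdin, and the function names (extract_cols, filter_lines, transpose) are made up for illustration. It does not reproduce the BEAGLE utilities' actual options or behavior, so check their documentation before treating the two as equivalent.

```shell
# Hypothetical helper names; each reads single-space-delimited text on stdin.

# Extract columns 1 and 3 (similar in spirit to cut.jar).
extract_cols() {
  cut -d' ' -f1,3
}

# Keep lines whose second column equals the given value
# (similar in spirit to filterlines.jar).
filter_lines() {
  awk -v val="$1" '$2 == val'
}

# Swap rows and columns (similar in spirit to transpose.jar).
# Holds the whole input in memory, so it suits modest file sizes.
transpose() {
  awk '{
    for (i = 1; i <= NF; i++) cell[NR, i] = $i
    if (NF > ncol) ncol = NF
  }
  END {
    for (i = 1; i <= ncol; i++) {
      line = cell[1, i]
      for (j = 2; j <= NR; j++) line = line " " cell[j, i]
      print line
    }
  }'
}
```

With these defined, something like `filter_lines 1 < markers.txt | extract_cols` would chain two of the steps in a pipe, where markers.txt stands in for whatever file you are working with.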
BEAGLE Utilities for text manipulation