Enveomics collection

A toolbox for microbial genomics and metagenomics

FastA.sample.rb

Samples a random set of sequences from a multi-FastA file.

    See the source code. Licensed under the Artistic License 2.0.

§ References

    Rodriguez-R & Konstantinidis, 2016, PeerJ Preprints.

§ Requirements

§ Usage

FastA.sample.rb --in in_file --out out_file [opts]

§ Arguments

Input file*
 --in in_file  Input FastA file (supports .gz compression).
Output file*
 --out out_file  Output FastA file (supports .gz compression).
Fraction
 --fraction float  Fraction of sequences to sample, between 0 and 1. Mandatory unless --number is provided.
Number
 --number integer  Number of sequences to sample. Mandatory unless --fraction is provided.
Replacement
 --replacement   Sample with replacement.
Quiet
 --quiet   Run quietly (no STDERR output).
* Mandatory.
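
§ Example (illustrative sketch)

The snippet below sketches how the Fraction, Number, and Replacement options interact. It is not the code of FastA.sample.rb: the helpers parse_fasta and sample_records are hypothetical names introduced for illustration, and rounding the fraction to the nearest integer is an assumption, since the page does not say how the script converts a fraction into a count.

```ruby
# Hypothetical helper (not part of FastA.sample.rb): parses FastA text
# into an array of [header, sequence] pairs.
def parse_fasta(text)
  records = []
  text.each_line do |line|
    line = line.chomp
    next if line.empty?
    if line.start_with?('>')
      # Start a new record; drop the leading '>' from the header.
      records << [line[1..-1], String.new]
    elsif !records.empty?
      # Append this sequence line to the current record.
      records.last[1] << line
    end
  end
  records
end

# Hypothetical helper (not part of FastA.sample.rb): draws a random
# sample of records, given either a count or a fraction.
def sample_records(records, number: nil, fraction: nil,
                   replacement: false, rng: Random.new)
  if number.nil? && fraction.nil?
    raise ArgumentError, 'provide either number or fraction'
  end
  # Assumption: a fraction is rounded to the nearest whole record count.
  n = number || (records.size * fraction).round
  if replacement
    # With replacement: each draw is independent, so a record may repeat
    # and n may exceed the number of input records.
    Array.new(n) { records.sample(random: rng) }
  else
    # Without replacement: each record appears at most once, so the
    # result is capped at records.size.
    records.sample(n, random: rng)
  end
end

records = parse_fasta(">a\nAC\nGT\n>b\nTT\n>c\nGG\n>d\nCC\n")
sample_records(records, fraction: 0.5).each do |header, seq|
  puts ">#{header}", seq
end
```

In this sketch, sampling without replacement can never return more records than the input contains, whereas sampling with replacement can, because the same sequence may be drawn more than once.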