From a set of contigs, returns the base sequence that corresponds to a region of the reference.
From a set of contigs, returns the base sequence that corresponds to a region of the reference.
Reference region over which to get sequence.
String of bases corresponding to reference sequence.
Throws exception if query region is not found.
Aggregates together a sequence dictionary from the different individual reference sequences used in this dataset.
Aggregates together a sequence dictionary from the different individual reference sequences used in this dataset.
A sequence dictionary describing the reference contigs in this dataset.
Rewrites the contig IDs of a FASTA reference set to match the contig IDs present in a different sequence dictionary.
Rewrites the contig IDs of a FASTA reference set to match the contig IDs present in a different sequence dictionary. Sequences are matched by name.
A sequence dictionary containing the preferred IDs for the contigs.
New set of contigs with IDs rewritten.
Contigs with names that aren't present in the provided dictionary are filtered out of the RDD.
For a single RDD element, returns 0+ sequence record elements.
For a single RDD element, returns 0+ sequence record elements.
Element from which to extract sequence records.
A seq of sequence records.