Filters out reads and contigs that should not be processed.
Filters out reads and contigs that should not be processed.
RDD of reads and associated metadata.
Arguments specifying the filters to apply.
Returns a new AlignmentRecordRDD where reads that we don't want to use in genotyping have been discarded, and where contigs that we don't want to genotype have been removed.
The arguments specifying which contigs to keep.
Returns a function that returns true if a contig with a given name should be kept.
Contig name to test for filtration.
Returns true if the contig matches the naming scheme for GRCh autosomal chromosomes.
Contig name to test for filtration.
Returns true if the contig matches the GRCh mitochondrial chromosome name.
Contig name to test for filtration.
Returns true if the contig matches the naming scheme for GRCh sex chromosomes.
Read to test for filtration.
Returns true if the read is aligned.
Read to test for filtration.
Only keep reads with a mapping quality above this value.
Returns true if the read is aligned.
Contig name to test for filtration.
Returns true if the contig matches the naming scheme for HG/UCSC autosomal chromosomes.
Contig name to test for filtration.
Returns true if the contig matches the HG/UCSC mitochondrial chromosome name.
Contig name to test for filtration.
Returns true if the contig matches the naming scheme for HG/UCSC sex chromosomes.
Read to test for filtration.
Returns true if the read is not a duplicate read.
The arguments specifying which reads to keep.
A function that determines which contigs should be kept, given the contig name.
Returns a function that returns true if a read should be kept.
Reifies an input AlignmentRecordRDD down to the contigs and reads we want to genotype.