org.bdgenomics.adam

models

package models

Visibility
  1. Public
  2. All

Type Members

  1. trait Alphabet extends AnyRef

    Created by bryan on 4/17/15.

    Created by bryan on 4/17/15.

    An alphabet of symbols and related operations

  2. case class Attribute(tag: String, tagType: TagType.Value, value: Any) extends Product with Serializable

    A wrapper around the attrTuple (key) and value pair.

    A wrapper around the attrTuple (key) and value pair. Includes the attrTuple-type explicitly, rather than embedding the corresponding information in the type of 'value', because otherwise it'd be difficult to extract the correct type for Byte and NumericSequence values.

    Roughly analogous to Picards SAMTagAndValue.

    tag

    The string key associated with this pair.

    tagType

    An enumerated value representing the type of the 'value' parameter.

    value

    The 'value' half of the pair.

  3. class BEDFeature extends BaseFeature

  4. class BaseFeature extends Serializable

  5. abstract class BlockExtractable extends Extractable

  6. case class CDS(transcriptId: String, strand: Boolean, region: ReferenceRegion) extends BlockExtractable with Product with Serializable

    Coding Sequence annotations, should be a subset of an Exon for a particular Transcript

    Coding Sequence annotations, should be a subset of an Exon for a particular Transcript

    transcriptId
    strand
    region

  7. case class Consensus(consensus: String, index: ReferenceRegion) extends Product with Serializable

  8. class DNAAlphabet extends Alphabet

    The standard DNA alphabet with A,T,C, and G

  9. case class Exon(exonId: String, transcriptId: String, strand: Boolean, region: ReferenceRegion) extends BlockExtractable with Product with Serializable

    An exon model (here represented as a value of the Exon class) is a representation of a single exon from a transcript in genomic coordinates.

    An exon model (here represented as a value of the Exon class) is a representation of a single exon from a transcript in genomic coordinates.

    NOTE: we're not handling shared exons here

    transcriptId

    the (unique) identifier of the transcript to which the exon belongs

    region

    The region (in genomic coordinates) to which the exon maps

  10. trait Extractable extends AnyRef

    A trait for values (usually regions or collections of regions) that can be subsetted or extracted out of a larger region string -- for example, exons or transcripts which have a sequence defined in terms of their coordinates against a reference chromosome.

    A trait for values (usually regions or collections of regions) that can be subsetted or extracted out of a larger region string -- for example, exons or transcripts which have a sequence defined in terms of their coordinates against a reference chromosome. Passing the sequence of the reference chromosome to a transcript's 'extractSequence' method will return the sequence of the transcript.

  11. class GTFFeature extends BaseFeature

  12. case class Gene(id: String, names: Seq[String], strand: Boolean, transcripts: Iterable[Transcript]) extends Product with Serializable

    A 'gene model' is a small, hierarchical collection of objects: Genes, Transcripts, and Exons.

    A 'gene model' is a small, hierarchical collection of objects: Genes, Transcripts, and Exons. Each Gene contains a collection of Transcripts, and each Transcript contains a collection of Exons, and together they describe how the genome is transcribed and translated into a family of related proteins (or other RNA products that aren't translated at all).

    This review, Gerstein et al. "What is a gene, post-ENCODE? History and updated definition" Genome Research (2007) http://genome.cshlp.org/content/17/6/669.full

    is a reasonably good overview both of what the term 'gene' has meant in the past as well as where it might be headed in the future.

    Here, we aren't trying to answer any of these questions about "what is a gene," but rather to provide the routines necessary to _re-assemble_ hierarchical models of genes that have been flattened into features (GFF, GTF, or BED)

    id

    A name, presumably unique within a gene dataset, of a Gene

    names

    Common names for the gene, possibly shared with other genes (for historical or ad hoc reasons)

    strand

    The strand of the Gene (this is from data, not derived from the Transcripts' strand(s), and we leave open the possibility that a single Gene will have Transcripts in _both_ directions, e.g. anti-sense transcripts)

    transcripts

    The Transcripts that are part of this gene model

  13. class IndelTable extends Serializable with Logging

  14. trait Interval extends AnyRef

    An interval is a region on a coordinate space that has a defined width.

    An interval is a region on a coordinate space that has a defined width. This can be used to express a region of a genome, a transcript, a gene, etc.

  15. class NarrowPeakFeature extends BaseFeature

    See: http://genome.ucsc.edu/FAQ/FAQformat.html#format12

  16. trait OptionalReferenceOrdering[T <: ReferenceRegion] extends Ordering[Option[T]]

  17. case class ProgramRecord(id: String, commandLine: Option[String], name: Option[String], version: Option[String], previousID: Option[String]) extends Product with Serializable

  18. case class ReadBucket(unpairedPrimaryMappedReads: Iterable[AlignmentRecord] = Seq.empty, pairedFirstPrimaryMappedReads: Iterable[AlignmentRecord] = Seq.empty, pairedSecondPrimaryMappedReads: Iterable[AlignmentRecord] = Seq.empty, unpairedSecondaryMappedReads: Iterable[AlignmentRecord] = Seq.empty, pairedFirstSecondaryMappedReads: Iterable[AlignmentRecord] = Seq.empty, pairedSecondSecondaryMappedReads: Iterable[AlignmentRecord] = Seq.empty, unmappedReads: Iterable[AlignmentRecord] = Seq.empty) extends Product with Serializable

    This class is similar to SingleReadBucket, except it breaks the reads down further.

    This class is similar to SingleReadBucket, except it breaks the reads down further.

    Rather than stopping at primary/secondary/unmapped, this will break it down further into whether they are paired or unpaired, and then whether they are the first or second of the pair.

    This is useful as this will usually map a single read in any of the sequences.

  19. class ReadBucketSerializer extends Serializer[ReadBucket]

  20. class RecordGroup extends Serializable

  21. class RecordGroupDictionary extends Serializable

    Builds a dictionary containing record groups.

    Builds a dictionary containing record groups. Record groups must have a unique name across all samples in the dictionary. This dictionary provides numerical IDs for each group; these IDs are only consistent when referencing a single dictionary.

    Exceptions thrown
    AssertionError

    Throws an assertion error if there are multiple record groups with the same name.

  22. trait ReferenceOrdering[T <: ReferenceRegion] extends Ordering[T]

  23. class ReferencePosition extends ReferenceRegion

  24. case class ReferencePositionPair(read1refPos: Option[ReferencePosition], read2refPos: Option[ReferencePosition]) extends Product with Serializable

  25. class ReferencePositionPairSerializer extends Serializer[ReferencePositionPair]

  26. class ReferencePositionSerializer extends Serializer[ReferencePosition]

  27. case class ReferenceRegion(referenceName: String, start: Long, end: Long, orientation: Strand = Strand.Independent) extends Comparable[ReferenceRegion] with Interval with Product with Serializable

    Represents a contiguous region of the reference genome.

    Represents a contiguous region of the reference genome.

    referenceName

    The name of the sequence (chromosome) in the reference genome

    start

    The 0-based residue-coordinate for the start of the region

    end

    The 0-based residue-coordinate for the first residue after the start which is not in the region -- i.e. [start, end) define a 0-based half-open interval.

  28. class ReferenceRegionSerializer extends Serializer[ReferenceRegion]

  29. class SAMFileHeaderWritable extends Serializable

  30. class SequenceDictionary extends Serializable

  31. class SequenceRecord extends Serializable

    Utility class within the SequenceDictionary; represents unique reference name-to-id correspondence

  32. case class SingleReadBucket(primaryMapped: Iterable[AlignmentRecord] = Seq.empty, secondaryMapped: Iterable[AlignmentRecord] = Seq.empty, unmapped: Iterable[AlignmentRecord] = Seq.empty) extends Product with Serializable

  33. class SingleReadBucketSerializer extends Serializer[SingleReadBucket]

  34. class SnpTable extends Serializable with Logging

  35. case class Symbol(label: Char, complement: Char) extends Product with Serializable

    A symbol in an alphabet

    A symbol in an alphabet

    label

    a character which represents the symbol

    complement

    acharacter which represents the complement of the symbol

  36. case class Transcript(id: String, names: Seq[String], geneId: String, strand: Boolean, exons: Iterable[Exon], cds: Iterable[CDS], utrs: Iterable[UTR]) extends Product with Serializable

    A transcript model (here represented as a value of the Transcript class) is a simple, hierarchical model containing a collection of exon models as well as an associated gene identifier, transcript identifier, and a set of common names (synonyms).

    A transcript model (here represented as a value of the Transcript class) is a simple, hierarchical model containing a collection of exon models as well as an associated gene identifier, transcript identifier, and a set of common names (synonyms).

    id

    the (unique) identifier of the Transcript

    names

    Common names for the transcript

    geneId

    The (unique) identifier of the gene to which the transcript belongs

    exons

    The set of exons in the transcript model; each of these contain a reference region whose coordinates are in genomic space.

    cds

    the set of CDS regions (the subset of the exons that are coding) for this transcript

    utrs

  37. case class UTR(transcriptId: String, strand: Boolean, region: ReferenceRegion) extends BlockExtractable with Product with Serializable

    UnTranslated Regions

    UnTranslated Regions

    transcriptId
    strand
    region

  38. case class VCFHeaderWritable(header: VCFHeader) extends Product with Serializable

  39. class VariantContext extends AnyRef

Value Members

  1. object Alphabet

  2. object BEDFeature extends Serializable

  3. object BaseFeature extends Serializable

  4. object Consensus extends Serializable

  5. object IndelTable extends Serializable

  6. object OptionalPositionOrdering extends OptionalReferenceOrdering[ReferencePosition]

  7. object OptionalRegionOrdering extends OptionalReferenceOrdering[ReferenceRegion]

  8. object PositionOrdering extends ReferenceOrdering[ReferencePosition]

  9. object ProgramRecord extends Serializable

  10. object ReadBucket extends Serializable

  11. object RecordGroup extends Serializable

  12. object RecordGroupDictionary extends Serializable

  13. object ReferencePosition extends Serializable

  14. object ReferencePositionPair extends Logging with Serializable

  15. object ReferenceRegion extends Serializable

  16. object ReferenceUtils

  17. object RegionOrdering extends ReferenceOrdering[ReferenceRegion]

  18. object SAMFileHeaderWritable extends Serializable

  19. object SequenceDictionary extends Serializable

    SequenceDictionary contains the (bijective) map between Ints (the referenceId) and Strings (the referenceName) from the header of a BAM file, or the combined result of multiple such SequenceDictionaries.

  20. object SequenceRecord extends Serializable

  21. object SingleReadBucket extends Logging with Serializable

  22. object SnpTable extends Serializable

  23. object TagType extends Enumeration

  24. object VariantContext

    Note: VariantContext inherits its name from the Picard VariantContext, and is not related to the SparkContext object.

    Note: VariantContext inherits its name from the Picard VariantContext, and is not related to the SparkContext object. If you're looking for the latter, see org.bdgenomics.adam.rdd.variation.VariationContext

Ungrouped