Interface GCParams

    • Method Detail

      • getCutOffTimestampPerRef

        @Nullable
        Map<String, Instant> getCutOffTimestampPerRef()
        Optional cutoff time per live reference.
      • getDefaultCutOffTimestamp

        Instant getDefaultCutOffTimestamp()
        Default cutoff time for all the references.
      • getDeadReferenceCutOffTimeStamp

        @Nullable
        Instant getDeadReferenceCutOffTimeStamp()
        Optional cutoff time for all the dead references.
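
The three cutoff settings above can be read as a per-reference override layered over a default. The sketch below is a minimal illustration of that reading, not Nessie GC's implementation: `CutoffSketch`, `liveCutoff`, and `deadCutoff` are hypothetical names, and the fallback of dead references to the default cutoff is an assumption that the descriptions above do not confirm.

```java
import java.time.Instant;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Nessie GC.
final class CutoffSketch {
  private CutoffSketch() {}

  /** Cutoff for a live reference: the per-reference entry if present, else the default. */
  static Instant liveCutoff(String refName, Map<String, Instant> perRef, Instant defaultCutoff) {
    if (perRef == null) {
      // getCutOffTimestampPerRef() is @Nullable, so the map may be absent entirely.
      return defaultCutoff;
    }
    return perRef.getOrDefault(refName, defaultCutoff);
  }

  /** Cutoff for dead references. ASSUMPTION: falls back to the default when unset. */
  static Instant deadCutoff(Instant deadReferenceCutoff, Instant defaultCutoff) {
    return deadReferenceCutoff != null ? deadReferenceCutoff : defaultCutoff;
  }
}
```

With a map containing only `main`, `liveCutoff("main", ...)` returns the override and any other reference name resolves to the default cutoff.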
      • getSparkPartitionsCount

        @Nullable
        Integer getSparkPartitionsCount()
        Optional Spark partition count to be used for distributing references. By default, the total reference count (live + dead) is used.
      • getCommitProtectionDuration

        @Default
        default Duration getCommitProtectionDuration()
        Commit protection duration, used to avoid expiring ongoing or recent commits. Default is 2 hours.
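
One way to picture the protection window is as a check against the time of the GC run. The sketch below assumes that the window is measured backwards from "now" and that its boundary is exclusive. Neither detail is stated above, and `CommitProtectionSketch` and `isProtected` are hypothetical names.

```java
import java.time.Duration;
import java.time.Instant;

// Hypothetical helper for illustration only; not part of Nessie GC.
final class CommitProtectionSketch {
  // Mirrors the documented default of 2 hours.
  static final Duration DEFAULT_PROTECTION = Duration.ofHours(2);

  private CommitProtectionSketch() {}

  /**
   * Returns true if the commit is recent enough to be protected from expiry.
   * ASSUMPTION: the window is [now - protection, now], measured at GC run time.
   */
  static boolean isProtected(Instant commitTime, Instant now, Duration protection) {
    Instant protectedSince = now.minus(protection);
    return commitTime.isAfter(protectedSince);
  }
}
```

Under these assumptions and the default duration, a commit made one hour before the run is protected, while one made three hours before is not.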
      • getBloomFilterExpectedEntries

        @Nullable
        Long getBloomFilterExpectedEntries()
        Optional expected number of live commit entries in the bloom filter, per reference. Defaults to the total number of commits in the default reference.
      • getBloomFilterFpp

        @Default
        default double getBloomFilterFpp()
        Optional bloom filter false positive probability (fpp). Default value is 0.03.
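
To get a feel for how the expected entry count and the fpp interact, the sketch below applies the textbook bloom filter sizing formulas, m = -n ln(p) / (ln 2)^2 bits and k = (m / n) ln 2 hash functions. It is an illustration only: the bloom filter implementation used by Nessie GC is not described here and may size itself differently, and `BloomSizingSketch` with its methods are hypothetical names.

```java
// Hypothetical helper for illustration only; not part of Nessie GC.
final class BloomSizingSketch {
  private BloomSizingSketch() {}

  /** Textbook optimal bit count for n expected entries at false positive probability p. */
  static long optimalBits(long expectedEntries, double fpp) {
    double ln2 = Math.log(2);
    return (long) Math.ceil(-expectedEntries * Math.log(fpp) / (ln2 * ln2));
  }

  /** Textbook optimal number of hash functions for the given bit count. */
  static int optimalHashFunctions(long expectedEntries, long bits) {
    double bitsPerEntry = (double) bits / expectedEntries;
    return Math.max(1, (int) Math.round(bitsPerEntry * Math.log(2)));
  }
}
```

With the default fpp of 0.03, these formulas give roughly 7.3 bits per expected entry and 5 hash functions, so the memory footprint grows linearly with the expected entry count.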
      • getNessieCatalogName

        String getNessieCatalogName()
        Nessie catalog name to be used with Spark to create the output results table.
      • getOutputBranchName

        String getOutputBranchName()
        Name of the branch to be used for creating the output table.

        If no branch with this name exists, one is created that points to the beginning of time (the NO_ANCESTOR hash).

      • getOutputTableIdentifier

        String getOutputTableIdentifier()
        Output table identifier (namespace and table name) to be used for storing the results on the branch named by getOutputBranchName().
      • validate

        @Check
        default void validate()