Interface GCParams

  • All Superinterfaces:
    java.io.Serializable

    @Immutable
    public interface GCParams
    extends java.io.Serializable
    Config params for GC.
    • Method Detail

      • getNessieClientConfigs

        java.util.Map<java.lang.String,​java.lang.String> getNessieClientConfigs()
        Nessie client configurations from NessieConfigConstants.
      • getCutOffTimestampPerRef

        @Nullable
        java.util.Map<java.lang.String,​java.time.Instant> getCutOffTimestampPerRef()
        Optional cutoff time per live reference.
      • getDefaultCutOffTimestamp

        java.time.Instant getDefaultCutOffTimestamp()
        Default cutoff time for all the references.
      • getDeadReferenceCutOffTimeStamp

        @Nullable
        java.time.Instant getDeadReferenceCutOffTimeStamp()
        Optional cutoff time for all the dead references.
      • getSparkPartitionsCount

        @Nullable
        java.lang.Integer getSparkPartitionsCount()
        Optional spark partitions count to be used for distributing references. Default total reference count (live + dead) will be used.
      • getCommitProtectionDuration

        @Default
        default java.time.Duration getCommitProtectionDuration()
        Commit protection duration to avoid expiring on going or recent commits. Default is 2 hours.
      • getBloomFilterExpectedEntries

        @Nullable
        java.lang.Long getBloomFilterExpectedEntries()
        Optional bloom filter expected live commits entries per reference. Default is total commits in the default reference.
      • getBloomFilterFpp

        @Default
        default double getBloomFilterFpp()
        Optional bloom filter fpp. Default value is 0.03d.
      • getNessieCatalogName

        java.lang.String getNessieCatalogName()
        Nessie catalog name to be used with spark to create the output results table.
      • getOutputBranchName

        java.lang.String getOutputBranchName()
        Branch's name to be used for creating the output table.

        If the branch doesn't exist for this name, branch with this name pointing to beginning of time (aka NO_ANCESTOR hash) will be created.

      • getOutputTableIdentifier

        java.lang.String getOutputTableIdentifier()
        Output table identifier (namespace and table name) to be used for storing the results in getOutputBranchName().
      • validate

        @Check
        default void validate()