Class IdentifiedResultsRepo


  • public final class IdentifiedResultsRepo
    extends Object
    Provides DDL and DML functionality for the "IdentifiedResult" table.
    • Constructor Detail

      • IdentifiedResultsRepo

        public IdentifiedResultsRepo(org.apache.spark.sql.SparkSession sparkSession,
                                     String catalog,
                                     String gcRefName,
                                     String gcTableIdentifier)
    • Method Detail

      • getSchema

        public org.apache.spark.sql.types.StructType getSchema()
      • collectExpiredContentsAsDataSet

        public org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> collectExpiredContentsAsDataSet(String runId)
        Collects the expired contents for the given run ID as a Spark Dataset.
        Parameters:
        runId - the run ID of a completed identify task.
        Returns:
        a Spark Dataset of rows, where each row holds the expired contents per content ID per reference.
      • getLatestCompletedRunID

        public Optional<String> getLatestCompletedRunID()
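
Because getLatestCompletedRunID returns an Optional<String>, callers need to handle the case where no value is present. The sketch below shows one way a caller might do that. It is not taken from the library: the class LatestRunExample, the method describeLatestRun, and the Supplier standing in for the repository call are hypothetical, and it assumes that an empty Optional means no identify run has completed yet. A real caller would pass repo::getLatestCompletedRunID and hand the run ID to collectExpiredContentsAsDataSet.

```java
import java.util.Optional;
import java.util.function.Supplier;

public class LatestRunExample {

    // Hypothetical helper: latestRunLookup stands in for
    // IdentifiedResultsRepo.getLatestCompletedRunID(), so the sketch
    // runs without a SparkSession.
    static String describeLatestRun(Supplier<Optional<String>> latestRunLookup) {
        return latestRunLookup.get()
                // Present: a real caller would pass runId to collectExpiredContentsAsDataSet.
                .map(runId -> "collect expired contents for run " + runId)
                // Absent: assumed here to mean no identify run has completed.
                .orElse("no completed identify run; nothing to collect");
    }

    public static void main(String[] args) {
        System.out.println(describeLatestRun(() -> Optional.of("run-42")));
        System.out.println(describeLatestRun(() -> Optional.empty()));
    }
}
```

Using map and orElse keeps the absent case explicit and avoids calling get() on an empty Optional.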