Class IdentifiedResultsRepo


  • public final class IdentifiedResultsRepo
    extends java.lang.Object
    DDL + DML functionality for the "IdentifiedResult" table.
    • Constructor Summary

      Constructors 
      Constructor Description
      IdentifiedResultsRepo​(org.apache.spark.sql.SparkSession sparkSession, java.lang.String catalog, java.lang.String gcRefName, java.lang.String gcTableIdentifier)  
    • Method Summary

      All Methods Instance Methods Concrete Methods 
      Modifier and Type Method Description
      org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> collectExpiredContentsAsDataSet​(java.lang.String runId)
      Collect the expired contents for the given run id as spark dataset.
      java.util.Optional<java.lang.String> getLatestCompletedRunID()  
      org.apache.spark.sql.types.StructType getSchema()  
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Constructor Detail

      • IdentifiedResultsRepo

        public IdentifiedResultsRepo​(org.apache.spark.sql.SparkSession sparkSession,
                                     java.lang.String catalog,
                                     java.lang.String gcRefName,
                                     java.lang.String gcTableIdentifier)
    • Method Detail

      • getSchema

        public org.apache.spark.sql.types.StructType getSchema()
      • collectExpiredContentsAsDataSet

        public org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> collectExpiredContentsAsDataSet​(java.lang.String runId)
        Collect the expired contents for the given run id as spark dataset.
        Parameters:
        runId - run id of completed identify task.
        Returns:
        spark dataset of row where each row is having the expired contents per content id per reference.
      • getLatestCompletedRunID

        public java.util.Optional<java.lang.String> getLatestCompletedRunID()