Package org.projectnessie.gc.base
Class IdentifiedResultsRepo
- java.lang.Object
-
- org.projectnessie.gc.base.IdentifiedResultsRepo
-
public final class IdentifiedResultsRepo extends Object
DDL + DML functionality for the "IdentifiedResult" table.
-
-
Constructor Summary
Constructors Constructor Description IdentifiedResultsRepo(org.apache.spark.sql.SparkSession sparkSession, String catalog, String gcRefName, String gcTableIdentifier)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description org.apache.spark.sql.Dataset<org.apache.spark.sql.Row>collectExpiredContentsAsDataSet(String runId)Collect the expired contents for the given run id as spark dataset.Optional<String>getLatestCompletedRunID()org.apache.spark.sql.types.StructTypegetSchema()
-
-
-
Method Detail
-
getSchema
public org.apache.spark.sql.types.StructType getSchema()
-
collectExpiredContentsAsDataSet
public org.apache.spark.sql.Dataset<org.apache.spark.sql.Row> collectExpiredContentsAsDataSet(String runId)
Collect the expired contents for the given run id as spark dataset.- Parameters:
runId- run id of completed identify task.- Returns:
- spark dataset of row where each row is having the expired contents per content id per reference.
-
-