Reads the version number and GitHub URL from configuration file bundled into the jar.
For convenience, case classes can mix in this single trait to implement PipelineRunnerSupport
Represents data in a persistent store.
Interface for defining how to persist a data type.
Class for writing that exposes a more restrictive interface than OutputStream In particular, we don't want clients to close the stream Also, we force character encoding to UTF-8.
Contains information about the origin of the compiled class implementing a Producer
Support for persisting to a column-delimited file.
Directory of files.
Flat file.
Generic data blob.
Persist a collection of string-serializable objects to a flat file, one line per object.
Persist an iterator of string-serializable objects to a flat file, one line per object.
Represents dependency between Producer instances
Maven-style version id
Producer implementations that do not need to be executed by PipelineRunner can mix in this convenience trait.
Represents a Producer instance with PipelineRunnerSupport
Executes a pipeline represented by a set of Producer instances Inspects the meta-info about the pipeline steps (represented by PipelineRunnerSupport interface) and builds a DAG representation of the pipeline.
This information is used by PipelineRunner to construct and visualize the DAG for a pipeline
An individual step in a data processing pipeline.
Artifact implementations using S3 storage.
Zip file stored in S3.
Acts as an identifier for a Producer instance.
Persist a single object to a flat file.
Serialize an object to/from a String
Artifact with nested structure, containing multiple data blobs identified by String names.
Represents code from an unspecified location
DAG representation of the execution of a set of Producers
Zip file.
Utility methods for Artifact reading/writing.
Given a function that converts an InputStream into an Iterator, this closes the InputStream when the Iterator has been fully consumed.