The default compression codec to use for files written to HDFS. This may be modified by specifying the following property: mothra.invariantpacker.compression.
Values typically supported by Hadoop include bzip2, gzip, lz4,
lzo, lzop, snappy, and default. The empty string indicates no
compression.
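As an illustration of how such a setting might be resolved, the following minimal sketch reads the property and treats the empty string as "no compression". The class name, the method name, and the placeholder default are assumptions made for this example; the actual value of DEFAULT_COMPRESSION is not stated here.

```java
import java.util.Optional;

final class CompressionSetting {
    // Placeholder only: the real DEFAULT_COMPRESSION value is not given in this section.
    static final String ASSUMED_DEFAULT_COMPRESSION = "";

    /** Returns the codec name, or an empty Optional when the setting is the empty string. */
    static Optional<String> resolve(String propertyValue) {
        // An unset property (null) falls back to the assumed default.
        String name = (propertyValue == null) ? ASSUMED_DEFAULT_COMPRESSION : propertyValue;
        return name.isEmpty() ? Optional.empty() : Optional.of(name);
    }

    public static void main(String[] args) {
        String raw = System.getProperty("mothra.invariantpacker.compression");
        System.out.println(resolve(raw).map(c -> "codec: " + c).orElse("no compression"));
    }
}
```

Running this with -Dmothra.invariantpacker.compression=gzip on the java command line would select gzip in this sketch.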
The default value for the maximum number of output files allowed to be open at once.
This value may be set at run time via the
mothra.invariantpacker.fileCacheSize Java property.
When an output file has not been opened yet, it is not opened until the oldest input file reaches a particular age, the number of input files reaches a count, or the sum of the sizes of the input files reaches a value. This holds the default setting for the age of the oldest input file, in seconds.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.maxInputAgeSeconds' Java property.
The default maximum number of threads that attempt to write to output files simultaneously.
This run-time behavior may be modified by setting the mothra.invariantpacker.maxThreads property.
When an output file has not been opened yet, it is not opened until the oldest input file reaches a particular age, the number of input files reaches a count, or the sum of the sizes of the input files reaches a value. This holds the default setting for the minimum count of input files.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.minInputCount' Java property.
When an output file has not been opened yet, it is not opened until the oldest input file reaches a particular age, the number of input files reaches a count, or the sum of the sizes of the input files reaches a value. This holds the default setting for the sum of the sizes of the input files, in octets.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.minInputSize' Java property.
The default observationDomainId to use for output files.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.observationDomainId' Java property.
The default maximum number of seconds to allow an idle output file to remain open so additional incoming records may be appended to it.
The run-time behavior may be modified by setting the
mothra.invariantpacker.outputIdleSeconds Java property.
Default value for how often the source directory is scanned (polled) for
IPFIX files to process. This value may be specified at run time by
setting the mothra.invariantpacker.pollingInterval Java property.
The minimum value allowed for the maximum number of output files that may be opened at once.
The minimum value allowed for the maximum file size as specified by the
mothra.invariantpacker.maximumSize Java property.
The minimum allowed value for the mothra.invariantpacker.outputIdleSeconds
Java property.
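One way a minimum of this kind could be enforced is sketched below. The helper name, the numeric values 900 and 60, and the choice to reject a too-small value with an exception are all assumptions for illustration; the application may instead clamp or ignore such values.

```java
final class BoundedSetting {
    /**
     * Parses an integer setting. Returns defaultValue when the property is unset and
     * rejects values below minimum (an assumed policy, not confirmed by the source).
     */
    static int parse(String name, String raw, int defaultValue, int minimum) {
        if (raw == null) {
            return defaultValue;
        }
        int value = Integer.parseInt(raw.trim()); // throws NumberFormatException on bad input
        if (value < minimum) {
            throw new IllegalArgumentException(
                name + " must be at least " + minimum + ", got " + value);
        }
        return value;
    }

    public static void main(String[] args) {
        String name = "mothra.invariantpacker.outputIdleSeconds";
        // 900 and 60 are hypothetical stand-ins for the default and the minimum.
        int idleSeconds = parse(name, System.getProperty(name), 900, 60);
        System.out.println(name + " = " + idleSeconds);
    }
}
```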
The compression codec used for files written to HDFS. This may be set via the "mothra.invariantpacker.compression" property. If that property is not set, DEFAULT_COMPRESSION is used.
The maximum number of output files allowed to be open at once. Defaults
to DEFAULT_FILE_CACHE_SIZE.
This value may be set at run time via the
mothra.invariantpacker.fileCacheSize Java property.
The Hadoop configuration
The information model
Have an unopened output file process its input files when there exists
an input file whose age is older than this (number of milliseconds),
regardless of the number or sizes of the input files. It defaults to
the value DEFAULT_MAX_AGE_SECONDS (converted to milliseconds).
The run-time behavior may be modified by setting the 'mothra.invariantpacker.maxInputAgeSeconds' Java property.
The maximum number of threads that attempt to write to output files
simultaneously. It defaults to the value DEFAULT_MAX_THREADS.
This run-time behavior may be modified by setting the mothra.invariantpacker.maxThreads property.
The (approximate) maximum size of a file to create. The default is no maximum. When a file's size exceeds this value, the file is closed and a new file is started. Typically a file's size will not exceed this value by more than the maximum size of an IPFIX message, 64 KiB.
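The rollover behavior described above can be sketched as a size check made after each message is written, which is why the overshoot is bounded by one message. The class and method names are hypothetical, and treating a non-positive limit as "no maximum" is an assumption of this sketch.

```java
final class RollingSizeTracker {
    private final long maxSize;      // <= 0 means no maximum (assumed encoding)
    private long currentSize = 0;    // octets written to the current file
    private long lastClosedSize = 0; // final size of the most recently closed file

    RollingSizeTracker(long maxSize) {
        this.maxSize = maxSize;
    }

    /** Records one written message; returns true if the file was closed and a new one started. */
    boolean write(int messageLength) {
        currentSize += messageLength;
        if (maxSize > 0 && currentSize > maxSize) {
            lastClosedSize = currentSize;
            currentSize = 0; // a new, empty file takes over
            return true;
        }
        return false;
    }

    long lastClosedSize() {
        return lastClosedSize;
    }

    public static void main(String[] args) {
        RollingSizeTracker tracker = new RollingSizeTracker(100_000L);
        for (int i = 0; i < 4; i++) {
            // 65535 octets is the largest possible IPFIX message.
            System.out.println("rolled over: " + tracker.write(65_535));
        }
    }
}
```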
Have an unopened output file process its input files when there are more than this number of input files.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.minInputCount' Java property.
Have an unopened output file process its input files when the sum of the sizes of the input files is larger than this number of octets.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.minInputSize' Java property.
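The three triggers described above (age of the oldest input file, number of input files, and total input size) can be combined as in the following minimal sketch. The class name, method name, and the numbers used in main are hypothetical; only the "any one condition suffices" rule and the strict comparisons are taken from the descriptions.

```java
import java.util.concurrent.TimeUnit;

final class InputThresholds {
    private final long maxInputAgeMillis;
    private final int minInputCount;
    private final long minInputSize;

    InputThresholds(long maxInputAgeSeconds, int minInputCount, long minInputSize) {
        // The age is configured in seconds but compared in milliseconds.
        this.maxInputAgeMillis = TimeUnit.SECONDS.toMillis(maxInputAgeSeconds);
        this.minInputCount = minInputCount;
        this.minInputSize = minInputSize;
    }

    /** True when any one of the three thresholds has been exceeded. */
    boolean shouldProcess(long oldestInputAgeMillis, int inputCount, long totalInputSize) {
        return oldestInputAgeMillis > maxInputAgeMillis
            || inputCount > minInputCount
            || totalInputSize > minInputSize;
    }

    public static void main(String[] args) {
        // Hypothetical settings: 15 minutes, 3 files, 1 MiB.
        InputThresholds thresholds = new InputThresholds(900, 3, 1L << 20);
        System.out.println(thresholds.shouldProcess(60_000L, 2, 4_096L));
        System.out.println(thresholds.shouldProcess(901_000L, 1, 4_096L));
    }
}
```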
The observationDomainId to use for output files.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.observationDomainId' Java property.
The maximum number of seconds to allow an idle output file to remain
open so additional incoming records may be appended to it. It defaults to
the value DEFAULT_OUTPUT_FILE_IDLE_SECONDS.
This run-time behavior may be modified by setting the
mothra.invariantpacker.outputIdleSeconds Java property.
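A minimal sketch of an idle timeout of this kind follows: each output file's last write time is tracked, and files idle for longer than the limit are closed. The class, its methods, and the use of path strings as keys are assumptions for illustration, not the application's actual data structures.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.TimeUnit;

final class IdleOutputTracker {
    private final long idleMillis;
    private final Map<String, Long> lastWrite = new HashMap<>();

    IdleOutputTracker(long outputIdleSeconds) {
        this.idleMillis = TimeUnit.SECONDS.toMillis(outputIdleSeconds);
    }

    /** Notes that records were appended to the given output file at nowMillis. */
    void recordWrite(String path, long nowMillis) {
        lastWrite.put(path, nowMillis);
    }

    /** Stops tracking, and returns, every file idle for longer than the limit. */
    List<String> closeIdle(long nowMillis) {
        List<String> closed = new ArrayList<>();
        lastWrite.entrySet().removeIf(entry -> {
            boolean idle = nowMillis - entry.getValue() > idleMillis;
            if (idle) {
                closed.add(entry.getKey());
            }
            return idle;
        });
        return closed;
    }

    public static void main(String[] args) {
        IdleOutputTracker tracker = new IdleOutputTracker(10); // hypothetical 10-second limit
        tracker.recordWrite("a.ipfix", 0L);
        tracker.recordWrite("b.ipfix", 8_000L);
        System.out.println(tracker.closeIdle(12_000L));
    }
}
```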
How often the incoming directory is scanned for files to process, in
seconds. Defaults to DEFAULT_POLL_INTERVAL.
The run-time behavior may be modified by setting the 'mothra.invariantpacker.pollingInterval' Java property.
Object to implement the InvariantPacker application.
Typical usage in a Spark environment:
spark-submit --class org.cert.netsa.mothra.packer.tools.InvariantPackerMain mothra-tools.jar [--one-shot] <sourceDir> <destinationDir> <partitionerFile>
Processes files created by super_mediator running in invariant mode and writes them into HDFS.