main
public static void main(String[] args)
throws IOException
generate sentence list files for a list of documents.
Takes 4 command line parameters:
props: Jet property list file
filelist: a list of the files to be processed
dataDir: the path of the directory containing the documents
outputDir: the path of the directory to contain the output
For each file in filelist, the document is read from
dataDir/file; and then processed as specified by the Jet
properties file. If the input file has no sentence tags, and none
are added by the script in the properties file, the
sentence splitter is invoked to add sentence tags to text within
TEXT XML elements. Finally the sentence file is written to
outputDir/file.nesent with XML tags.
- Throws:
IOException