public class SaltXMLImporter extends PepperImporterImpl implements PepperImporter
A PepperImporter which imports the SaltXML format into a Salt
model. This module assumes that each document is stored in a separate file,
which must contain the document structure. The corpus structure is
stored in a single file called saltProject followed by the Salt file ending.
The value of that ending is returned by the method
getSaltFileEnding().

| Modifier and Type | Field and Description |
|---|---|
| static String | FORMAT_NAME |
| static String | FORMAT_VERSION |
| static String | MODULE_NAME |
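
Fields inherited from class PepperImporterImpl: corpusDesc
Fields inherited from class PepperModuleImpl: isMultithreaded, logger, moduleController, resources, saltProject, sCorpusGraph, symbolicName, temproraries
Fields inherited from interface PepperImporter: NEGATIVE_FILE_EXTENSION_MARKER
Fields inherited from interface PepperModule: ENDING_ALL_FILES, ENDING_FOLDER, ENDING_LEAF_FOLDER, ENDING_TAB, ENDING_TXT, ENDING_XML

| Constructor and Description |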
|---|
| SaltXMLImporter() |

| Modifier and Type | Method and Description |
|---|---|
| PepperMapper | createPepperMapper(org.corpus_tools.salt.graph.Identifier id) Creates a mapper of type EXMARaLDA2SaltMapper. |
| SelfTestDesc | getSelfTestDesc() This method is called by the Pepper framework to run an integration test for this module. |
| void | importCorpusStructure(org.corpus_tools.salt.common.SCorpusGraph corpusGraph) Imports the corpus structure by a call of SaltProject#loadSCorpusStructure(URI) |
| Double | isImportable(org.eclipse.emf.common.util.URI corpusPath) Recursively reads the first file found and returns 1.0 if the file contains: <? |

Methods inherited from class PepperImporterImpl: addSupportedFormat, getCorpusDesc, getCorpusEndings, getDocumentEndings, getIdentifier2ResourceTable, getIgnoreEndings, getSupportedFormats, importCorpusStructureRec, readXMLResource, sampleFileContent, setCorpusDesc, setCorpusPathResolver, setTypeOfResource, start
Methods inherited from class PepperModuleImpl: activate, done, done, end, getComponentContext, getCorpusGraph, getDesc, getDocumentId2DC, getFingerprint, getMapperControllers, getMapperThreadGroup, getModuleController, getModuleType, getName, getProgress, getProgress, getProperties, getResources, getSaltProject, getStartProblems, getSupplierContact, getSupplierHomepage, getSymbolicName, getTemproraries, getVersion, isMultithreaded, isReadyToStart, proposeImportOrder, setCorpusGraph, setDesc, setIsMultithreaded, setMapperThreadGroup, setName, setPepperModuleController_basic, setPepperModuleController, setProperties, setResources, setSaltProject, setSupplierContact, setSupplierHomepage, setSymbolicName, setTemproraries, setVersion, start, toString, uncaughtException
Methods inherited from class java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface PepperImporter: addSupportedFormat, getCorpusDesc, getCorpusEndings, getDocumentEndings, getIdentifier2ResourceTable, getIgnoreEndings, getSupportedFormats, setCorpusDesc, setTypeOfResource
Methods inherited from interface PepperModule: done, done, end, getComponentContext, getCorpusGraph, getDesc, getFingerprint, getModuleController, getModuleType, getName, getProgress, getProgress, getProperties, getResources, getSaltProject, getStartProblems, getSupplierContact, getSupplierHomepage, getSymbolicName, getTemproraries, getVersion, isMultithreaded, isReadyToStart, proposeImportOrder, setCorpusGraph, setDesc, setIsMultithreaded, setPepperModuleController_basic, setPepperModuleController, setProperties, setResources, setSaltProject, setSupplierContact, setSupplierHomepage, setSymbolicName, setTemproraries, setVersion, start, start

public static final String MODULE_NAME
public static final String FORMAT_NAME
public static final String FORMAT_VERSION
public Double isImportable(org.eclipse.emf.common.util.URI corpusPath)
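The check summarized for isImportable (read the first file found under the corpus path, return 1.0 if it contains the expected markers) can be sketched as follows. This is a minimal, self-contained illustration and not the module's actual code: the class ImportableSketch and the helper rate are hypothetical, a java.nio.file.Path stands in for the EMF URI, the marker strings are passed in as a parameter, and returning 0.0 when a marker is missing or no file exists is an assumption, since only the 1.0 case is documented.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Optional;
import java.util.stream.Stream;

public class ImportableSketch {

    /**
     * Hypothetical helper: returns 1.0 if the content contains every marker,
     * otherwise 0.0 (the 0.0 value is an assumption, not documented behavior).
     */
    static Double rate(String content, List<String> markers) {
        for (String marker : markers) {
            if (!content.contains(marker)) {
                return 0.0;
            }
        }
        return 1.0;
    }

    /**
     * Walks corpusPath recursively, takes the first regular file in sorted
     * path order (sorting is an assumption made for determinism) and rates
     * its content against the given markers.
     */
    static Double isImportable(Path corpusPath, List<String> markers) {
        try (Stream<Path> paths = Files.walk(corpusPath)) {
            Optional<Path> first = paths.filter(Files::isRegularFile).sorted().findFirst();
            if (!first.isPresent()) {
                return 0.0; // assumption: an empty folder is rated as not importable
            }
            byte[] bytes = Files.readAllBytes(first.get());
            return rate(new String(bytes, StandardCharsets.UTF_8), markers);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Keeping the content check in a separate method makes it easy to test without touching the file system, for example `ImportableSketch.rate(text, Arrays.asList("<?"))`.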
Specified by: isImportable in interface PepperImporter
Overrides: isImportable in class PepperImporterImpl

public SelfTestDesc getSelfTestDesc()
Description copied from interface: PepperModule
This method is called by the Pepper framework to run an integration test
for this module. The SelfTestDesc object needs to
provide an input corpus path and an output corpus path.
When this module is:
- an importer: SelfTestDesc.getInputCorpusPath() should contain
the format to be imported. SelfTestDesc.getExpectedCorpusPath()
should contain the expected salt project (for control).
- a manipulator: SelfTestDesc.getInputCorpusPath() should
contain a salt project which is the module's input.
SelfTestDesc.getExpectedCorpusPath() should contain the expected
salt project (for control).
- an exporter: SelfTestDesc.getInputCorpusPath() should contain
a salt project which is the module's input.
SelfTestDesc.getExpectedCorpusPath() should contain the expected
corpus in output format.
For example:
return new IntegrationTestDesc(inputPath, outputPath);
When this module is an importer or a manipulator, the method
SelfTestDesc.compare(SaltProject, SaltProject) is called to
compare the output salt project with the expected salt project. When the module
is an exporter, the method SelfTestDesc.compare(URI, URI) is
called to compare the created output folder with an expected one. By
default this method checks whether the file structure and each file are
equal.
Specified by: getSelfTestDesc in interface PepperModule
Overrides: getSelfTestDesc in class PepperModuleImpl

public void importCorpusStructure(org.corpus_tools.salt.common.SCorpusGraph corpusGraph)
throws PepperModuleException
Imports the corpus structure by a call of SaltProject#loadSCorpusStructure(URI).
Specified by: importCorpusStructure in interface PepperImporter
Overrides: importCorpusStructure in class PepperImporterImpl
Parameters: corpusGraph - an empty graph given by Pepper, which shall contain the
corpus structure
Throws: PepperModuleException

public PepperMapper createPepperMapper(org.corpus_tools.salt.graph.Identifier id)
Creates a mapper of type EXMARaLDA2SaltMapper.
OVERRIDE THIS METHOD FOR CUSTOMIZED MAPPING.
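A customized override might look roughly like the sketch below, which initializes the mapper differently for a document and for a corpus. To stay self-contained it does not use the real Pepper or Salt classes: IdentifierStub, SDocumentStub, SCorpusStub and MapperStub are simplified stand-ins declared in the example itself, and MapperFactorySketch, createMapper and the mode field are hypothetical names chosen for illustration.

```java
public class MapperFactorySketch {

    // Simplified stand-ins for the Salt element types; NOT the real classes.
    interface ElementStub {}
    static final class SDocumentStub implements ElementStub {}
    static final class SCorpusStub implements ElementStub {}

    /** Stand-in for an identifier that knows the element it identifies. */
    static final class IdentifierStub {
        final String value;
        final ElementStub element;

        IdentifierStub(String value, ElementStub element) {
            this.value = value;
            this.element = element;
        }
    }

    /** Stand-in for a mapper; the real PepperMapper interface is larger. */
    static final class MapperStub {
        String mode = "unset";
    }

    /** Hypothetical counterpart of createPepperMapper(Identifier id). */
    static MapperStub createMapper(IdentifierStub id) {
        MapperStub mapper = new MapperStub();
        if (id.element instanceof SDocumentStub) {
            mapper.mode = "document"; // extra initialization for documents
        } else if (id.element instanceof SCorpusStub) {
            mapper.mode = "corpus";   // extra initialization for corpora
        }
        // The identifier and resource URI are deliberately not set here,
        // because the documentation states that the framework does this.
        return mapper;
    }

    public static void main(String[] args) {
        System.out.println(createMapper(new IdentifierStub("doc1", new SDocumentStub())).mode);
        System.out.println(createMapper(new IdentifierStub("corpus1", new SCorpusStub())).mode);
    }
}
```

In a real module the same branching would presumably live inside the overridden createPepperMapper method and operate on the actual Identifier, SDocument and SCorpus types.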
This method creates a customized PepperMapper object and returns
it. You can do some additional initialization here. Things like setting
the Identifier of the SDocument or SCorpus object
and the URI resource are done by the framework (more precisely,
in the method PepperModule.start()). The id parameter can be used if the
PepperMapper object should be created or initialized differently
depending on whether the object to map is an SDocument object or an
SCorpus object.
Specified by: createPepperMapper in interface PepperModule
Overrides: createPepperMapper in class PepperModuleImpl
Parameters: id - Identifier of the SCorpus or SDocument
to be processed.
Returns: PepperMapper object to do the mapping task for the object
connected to the given Identifier

Copyright © 2009–2021 Humboldt-Universität zu Berlin, INRIA. All rights reserved.