Class OCRDocumentPlugin

java.lang.Object
org.imixs.workflow.engine.plugins.AbstractPlugin
org.imixs.archive.documents.OCRDocumentPlugin
All Implemented Interfaces:
org.imixs.workflow.Plugin

public class OCRDocumentPlugin extends org.imixs.workflow.engine.plugins.AbstractPlugin
The TikaPlugin extracts the textual information from document attachments. The plug-in sends each new attached document to an instance of an Apache Tika Server to get the file content.

The TikaPlugin can be used instead of the TIKA_SERVICE_MODE = 'auto' which will react on the ProcessingEvent BEFORE_PROCESS. The plugin runs only in case the TIKA_SERVICE_MODE is NOT set to 'auto'!

Version:
1.0
Author:
rsoika
See Also:
  • Field Summary

    Fields inherited from class org.imixs.workflow.engine.plugins.AbstractPlugin

    INVALID_ITEMVALUE_FORMAT, INVALID_PROPERTYVALUE_FORMAT
  • Constructor Summary

    Constructors
    Constructor
    Description
     
  • Method Summary

    Modifier and Type
    Method
    Description
    void
    init(org.imixs.workflow.WorkflowContext actx)
     
    org.imixs.workflow.ItemCollection
    run(org.imixs.workflow.ItemCollection document, org.imixs.workflow.ItemCollection event)
    This method sends the document content to the tika server and updates the DMS information.

    Methods inherited from class org.imixs.workflow.engine.plugins.AbstractPlugin

    close, getCtx, getWorkflowService, mergeFieldList, uniqueList

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
  • Constructor Details

    • OCRDocumentPlugin

      public OCRDocumentPlugin()
  • Method Details

    • init

      public void init(org.imixs.workflow.WorkflowContext actx) throws org.imixs.workflow.exceptions.PluginException
      Specified by:
      init in interface org.imixs.workflow.Plugin
      Overrides:
      init in class org.imixs.workflow.engine.plugins.AbstractPlugin
      Throws:
      org.imixs.workflow.exceptions.PluginException
    • run

      public org.imixs.workflow.ItemCollection run(org.imixs.workflow.ItemCollection document, org.imixs.workflow.ItemCollection event) throws org.imixs.workflow.exceptions.PluginException
      This method sends the document content to the tika server and updates the DMS information.
      Throws:
      org.imixs.workflow.exceptions.PluginException