Tika extractor is quite simple since it does not handle with structure of documents (sheets in Excel, paragraphs in Word, etc.) The structure might have been
represented as various MediaUnits.
This class is an abstract handler decorator that should be specialised and instantiated inside the TikaService for the creation of WebLab document when
parsing the HTML Tika output.