Testing the extractor

After you write AQL statements to extract or filter text, you can test the extractor by running the AQL from the InfoSphere® BigInsights™ Tools for Eclipse and viewing the results.

Procedure

  1. Select how you want to run the extraction plan.
    Option: On the entire data collection
      1. In the Extraction Plan, right-click the root label.
      2. Select Run > Run the extraction plan on the entire data collection.
    Option: On selected documents
      1. Select one or more documents in the list of documents from the entire data collection.
      2. In the Extraction Plan, right-click the root label.
      3. Select Run > Run the extraction plan on the set of selected documents.
    Option: On labeled documents
      1. In the Extraction Plan, right-click the root label.
      2. Select Run > Run the extraction plan on the set of documents that are labeled.
  2. If the run succeeds and your extractor includes an output view statement, view the results in the Annotation Explorer pane.

    The Annotation Explorer shows each extracted field in the Span Attribute Value column, along with text from before and after the extracted text, which is known as the left and right context. Double-click one of the rows that contains the values on which you filtered to see the extracted text in the original document. The text and the instance that you clicked are displayed in the editor pane. The Annotations tree view also opens in the extraction plan window.
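
To make the idea of left and right context concrete, the following is a minimal Python sketch of how a span and its surrounding text relate. It is an illustration only: the `Span` class, the `with_context` function, and the sample sentence are hypothetical and are not part of the BigInsights tools or of AQL, and the context width that the Annotation Explorer actually uses is not stated here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Half-open character range [begin, end) within a document's text (hypothetical type)."""
    begin: int
    end: int


def with_context(text: str, span: Span, width: int = 20) -> tuple[str, str, str]:
    """Return (left context, extracted text, right context) for a span.

    `width` is an assumed number of characters to show on each side.
    """
    left = text[max(0, span.begin - width):span.begin]
    value = text[span.begin:span.end]
    right = text[span.end:span.end + width]
    return left, value, right


# Sample document text and a span covering the phone number in it.
doc = "Call the help desk at 555-0134 before noon."
begin = doc.index("555-0134")
span = Span(begin=begin, end=begin + len("555-0134"))

left, value, right = with_context(doc, span, width=10)
print(repr(left), repr(value), repr(right))
```

Here `value` plays the role of the Span Attribute Value column, and `left` and `right` correspond to the text shown before and after it.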