Setting InfoSphere BigInsights project properties

Projects with the BigInsights™ nature acquire specific properties for Text Analytics.

About this task

When you create a new BigInsights project or add the BigInsights nature to an existing project, the project acquires the following default properties:
  • The default location for source files is <project_directory>/textAnalytics/src.
  • The default location for a generated TAM file is <project_directory>/textAnalytics/bin.
  • The Provenance feature is enabled.
  • The Pagination feature is enabled.

If you want to disable the Provenance or Pagination feature, clear the Enable Provenance check box or the Enable Pagination check box in the Text Analytics Properties dialog.

Note: Only the modular structure is created for migrated projects. AQL and other related files must be manually updated in the corresponding folders.

Procedure

  1. From the Package or Project Explorer, right-click the BigInsights project, and select Properties.
  2. In the Project Properties window, select BigInsights > Text Analytics from the navigation pane.
  3. In the Text Analytics Properties window, select the General tab, and click Browse Workspace or Browse File System to select the locations of the required precompiled .TAM files. Do not store the precompiled .TAM files in the build output directory because it is cleared with every build.
  4. In the Text Analytics Properties window, select the Source tab, and click Browse to change the location for the source directory. The default source directory is <project_directory>/textAnalytics/src.
  5. In the Text Analytics Properties window, select the Source tab, and click Browse to change the location for the build output directory. The default output directory is <project_directory>/textAnalytics/bin. This build output directory is cleared every time the extractor is built, so ensure that no precompiled TAM files are stored in it.
  6. In the Text Analytics Properties window, select the Projects tab, and click Add to select the required projects.
  7. Optional: Change the tokenization configuration. By default, the InfoSphere® BigInsights Eclipse tools use a default multilingual tokenizer that is included in BigInsights. If you want to use a different tokenizer, complete the following steps:
    1. Click the Advanced tab in the Text Analytics Project Properties. If the Advanced tab is not visible, complete the following steps to make the tab visible:
      1. Close the Text Analytics Project Properties.
      2. Click Window > Preferences > BigInsights > Text Analytics.
      3. Click Show advanced tab in Text Analytics Property pages, and click OK.
      4. Reopen the Text Analytics Project Properties.
    2. Select either of the following parameters:
      Standard tokenizer
        This tokenizer is included with InfoSphere BigInsights.
      Custom multilingual tokenizer
        This is the multilingual tokenizer with a custom configuration that you must provide.
      For more information about the tokenizers, see Tokenization.
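
Steps 3 and 5 warn that precompiled TAM files must not be stored in the build output directory, because that directory is cleared with every build. The following is a minimal sketch of how that rule could be checked outside the Eclipse tools. It is not part of InfoSphere BigInsights: the function names, the project name my_project, and the example TAM file names are hypothetical, and only the default textAnalytics/src and textAnalytics/bin locations come from this topic.

```python
from pathlib import Path


def default_locations(project_dir):
    """Return the default source and build output directories named in this topic."""
    root = Path(project_dir) / "textAnalytics"
    return {"src": root / "src", "bin": root / "bin"}


def tams_at_risk(tam_paths, build_output_dir):
    """Return the TAM paths that lie inside the build output directory.

    Files in that directory are removed on every build, so any precompiled
    TAM file listed here should be moved elsewhere.
    """
    out = Path(build_output_dir).resolve()
    at_risk = []
    for tam in tam_paths:
        resolved = Path(tam).resolve()
        # A file is at risk if the output directory is one of its ancestors.
        if out in resolved.parents:
            at_risk.append(Path(tam))
    return at_risk


if __name__ == "__main__":
    project = Path("my_project")  # hypothetical project directory
    locations = default_locations(project)
    precompiled = [  # hypothetical precompiled TAM files
        project / "textAnalytics" / "bin" / "common.tam",
        project / "lib" / "dictionaries.tam",
    ]
    for tam in tams_at_risk(precompiled, locations["bin"]):
        print(f"Move {tam}: the build output directory is cleared on every build")
```

The check compares resolved paths rather than strings, so it also catches a TAM file that is referenced through a relative path. If you changed the build output directory in step 5, pass that location instead of the default.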

Results

If the Build Automatically flag is set to true, any change in the project properties results in a recompilation of the project.