public final class Dictionary
extends java.lang.Object
FSA automaton and metadata describing the
internals of dictionary entries' coding (DictionaryMetadata.
A dictionary consists of two files:
| Modifier and Type | Field and Description |
|---|---|
static java.util.WeakHashMap<java.lang.String,Dictionary> |
defaultDictionaries
Default loaded dictionaries.
|
morfologik.fsa.FSA |
fsa
FSA automaton with the compiled dictionary data. |
DictionaryMetadata |
metadata
Metadata associated with the dictionary.
|
static java.lang.String |
METADATA_FILE_EXTENSION
Expected metadata file extension.
|
| Constructor and Description |
|---|
Dictionary(morfologik.fsa.FSA fsa,
DictionaryMetadata metadata)
It is strongly recommended to use static methods in this class for
reading dictionaries.
|
| Modifier and Type | Method and Description |
|---|---|
static java.lang.CharSequence |
convertText(java.lang.CharSequence str,
java.util.Map<java.lang.String,java.lang.String> conversionMap)
Converts the words on input or output according to conversion tables.
|
static java.lang.String |
getExpectedFeaturesName(java.lang.String name)
Returns the expected name of the metadata file, based on the name of the
FSA dictionary file.
|
static Dictionary |
getForLanguage(java.lang.String languageCode)
Return a built-in dictionary for a given ISO language code.
|
static Dictionary |
read(java.io.File fsaFile)
Attempts to load a dictionary using the path to the FSA file and the
expected metadata extension.
|
static Dictionary |
read(java.net.URL fsaURL)
Attempts to load a dictionary using the URL to the FSA file and the
expected metadata extension.
|
static Dictionary |
readAndClose(java.io.InputStream fsaData,
java.io.InputStream featuresData)
Attempts to load a dictionary from opened streams of FSA dictionary data
and associated metadata.
|
public static final java.lang.String METADATA_FILE_EXTENSION
public final morfologik.fsa.FSA fsa
FSA automaton with the compiled dictionary data.public final DictionaryMetadata metadata
public static final java.util.WeakHashMap<java.lang.String,Dictionary> defaultDictionaries
public Dictionary(morfologik.fsa.FSA fsa,
DictionaryMetadata metadata)
fsa - An instantiated FSA instance.metadata - A map of attributes describing the compression format and
other settings not contained in the FSA automaton. For an
explanation of available attributes and their possible values,
see DictionaryMetadata.public static Dictionary read(java.io.File fsaFile) throws java.io.IOException
java.io.IOExceptionpublic static Dictionary read(java.net.URL fsaURL) throws java.io.IOException
Attempts to load a dictionary using the URL to the FSA file and the expected metadata extension.
This method can be used to load resource-based dictionaries, but be aware of JAR resource-locking issues that arise from resource URLs.
java.io.IOExceptionpublic static Dictionary readAndClose(java.io.InputStream fsaData, java.io.InputStream featuresData) throws java.io.IOException
java.io.IOExceptionpublic static java.lang.String getExpectedFeaturesName(java.lang.String name)
name and appending
METADATA_FILE_EXTENSION.public static Dictionary getForLanguage(java.lang.String languageCode)
java.lang.RuntimeException - Throws a RuntimeException if the dictionary is not
bundled with the library.public static java.lang.CharSequence convertText(java.lang.CharSequence str,
java.util.Map<java.lang.String,java.lang.String> conversionMap)
str - - input character sequence to be convertedconversionMap - - conversion map used to convert the string (a map
from String to String)Copyright © 2015. All Rights Reserved.