 |
LanguageWare is the new generation IBM linguistic platform. It was designed from the ground up to address the demands posed by today's global applications. |
|
|
 | A full description of our capabilities can be found in the documentation included with the LanguageWare Resource Workbench which is available for download from Alphaworks, http://alphaworks.ibm.com/tech/lrw/faq. Below is a quick list of the main features:
Dictionary Lookup [supporting lookup functions, such as synonym expansion, hyphenation, approximate/fuzzy string matching…]
Language Identification
Lexical Analysis [process through which the text-level analysis is performed]
Morphological Analysis [lemmatization, generate inflected form, inflect from model, …]
Morphological Guesser [for handling words not in the dictionary]
Multiword Units [finds terms made up of multiple words handling inflections, misses, ordering…]
Parsing [ability to build rules, regular expressions, or shallow grammars for identifying entities in text]
Part of Speech (POS) Tagging [identifying the POS of words or phrases, with disambiguation]
Semantic Analysis & Disambiguation [allows for concepts (as opposed to terms) to be spotted in texts and disambiguated with respect to other concepts present, leveraging semantic graphs that connect concepts]
Text Correction [error checking and spelling suggestion]
Text Segmentation [tokenization & sentence and paragraph detection]
Tooling [Eclipse-based tooling, LanguageWare Resource Workbench, for customizing LanguageWare - build dictionaries, rules, UIMA pipelines, compare analysis results, get statistics, performance benchmarking, …]
|
|
|
|
|  | |