resources..arabic-pipeline-metadata.long-desc.html Maven / Gradle / Ivy
Go to download
Show more of this group Show more artifacts with this name
Show all versions of lang-arabic Show documentation
Show all versions of lang-arabic Show documentation
Support for processing Arabic documents
The newest version!
A named entity recognition pipeline that identifies basic entity types, such
as Person, Location, Organization, Money
amounts, Time and Date expressions. It works on documents
in the Arabic language.
Default annotations
:Person
Standard named entity types
:Location
:Organization
:Date
:Address
Includes email and IP addresses as well as street addresses
Additional annotations available if selected
:Money
Monetary amounts
:Percent
Expressions representing percentages
:Token
The individual tokens of the text, with "category" feature for POS
:SpaceToken
The spaces between tokens
:Sentence
Sentences detected by the sentence splitter