
.languagedetector.language-detector.0.4.source-code.README.md Maven / Gradle / Ivy
Go to download
Show more of this group Show more artifacts with this name
Show all versions of language-detector Show documentation
Show all versions of language-detector Show documentation
Language Detection Library for Java.
## Abut the "languages" folder and files
These files are from the original software from Nakatani Shuyo.
Unfortunately, the data sources from which they were generated are not available.
It looks like it comes from Wikipedia pages.
To generate your own, use a LanguageProfileBuilder and then add text using a TextObject,
then finally store the result with a LanguageProfileWriter.
## Abut the "languages.shorttext" folder and files
These files are from the original software from Nakatani Shuyo.
Either they are for detecting language on short messages, or they are built from short message text, or
both, I don't know.
## Abut the "messages.properties" file
They are used in the CharNormalizer.
© 2015 - 2025 Weber Informatics LLC | Privacy Policy