All Downloads are FREE. Search and download functionalities are using the official Maven repository.

.languagedetector.language-detector.0.4.source-code.README.md Maven / Gradle / Ivy

There is a newer version: 0.6
Show newest version
## Abut the "languages" folder and files

These files are from the original software from Nakatani Shuyo.

Unfortunately, the data sources from which they were generated are not available.
It looks like it comes from Wikipedia pages.

To generate your own, use a LanguageProfileBuilder and then add text using a TextObject,
then finally store the result with a LanguageProfileWriter.


## Abut the "languages.shorttext" folder and files

These files are from the original software from Nakatani Shuyo.

Either they are for detecting language on short messages, or they are built from short message text, or
both, I don't know.


## Abut the "messages.properties" file

They are used in the CharNormalizer.





© 2015 - 2025 Weber Informatics LLC | Privacy Policy