Stopwords Corpus
This corpus contains lists of stop words for several languages. These
are high-frequency grammatical words which are usually ignored in text
retrieval applications.
They were obtained from:
http://anoncvs.postgresql.org/cvsweb.cgi/pgsql/src/backend/snowball/stopwords/
The English list has been augmented; see:
https://github.com/nltk/nltk_data/issues/22
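
As a rough illustration of how such a list is typically used in text
retrieval, the sketch below drops stop words from a tokenized string.
The small word set and the function name remove_stop_words are
hypothetical and chosen only for this example; they are not taken from
the corpus files, which are much longer.

```python
import re

# Hypothetical, tiny stop-word set for illustration only;
# the real corpus lists are much longer.
STOP_WORDS = {"the", "is", "a", "of", "and", "in", "to"}


def remove_stop_words(text, stop_words=STOP_WORDS):
    """Lowercase the text, split it into word tokens, and drop stop words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [t for t in tokens if t not in stop_words]


print(remove_stop_words("The cat is in the garden"))
```

In practice the set would be loaded from the list for the relevant
language rather than written inline.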