CofeehousePy/mods/stopwords/coffeehousemod_stopwords/data
netkas 5693ec0558 Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
..
README Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
arabic Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
azerbaijani Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
danish Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
dutch Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
english Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
finnish Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
french Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
german Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
greek Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
hungarian Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
indonesian Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
italian Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
kazakh Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
nepali Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
norwegian Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
portuguese Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
romanian Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
russian Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
slovene Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
spanish Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
swedish Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
tajik Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00
turkish Added CoffeeHouse Mods 2020-12-25 14:24:45 -05:00

README

Stopwords Corpus

This corpus contains lists of stop words for several languages.  These
are high-frequency grammatical words which are usually ignored in text
retrieval applications.

They were obtained from:
http://anoncvs.postgresql.org/cvsweb.cgi/pgsql/src/backend/snowball/stopwords/

The stop words for the Romanian language were obtained from:
http://arlc.ro/resources/

The English list has been augmented
https://github.com/nltk/nltk_data/issues/22

The German list has been corrected
https://github.com/nltk/nltk_data/pull/49

A Kazakh list has been added
https://github.com/nltk/nltk_data/pull/52

A Nepali list has been added
https://github.com/nltk/nltk_data/pull/83

An Azerbaijani list has been added
https://github.com/nltk/nltk_data/pull/100

A Greek list has been added
https://github.com/nltk/nltk_data/pull/103

An Indonesian list has been added
https://github.com/nltk/nltk_data/pull/112