nltk.stem package¶
Submodules¶
- nltk.stem.api module
- nltk.stem.arlstem module
- nltk.stem.arlstem2 module
- nltk.stem.cistem module
- nltk.stem.isri module
ISRIStemmer
ISRIStemmer.__init__()
ISRIStemmer.end_w5()
ISRIStemmer.end_w6()
ISRIStemmer.norm()
ISRIStemmer.pre1()
ISRIStemmer.pre32()
ISRIStemmer.pro_w4()
ISRIStemmer.pro_w53()
ISRIStemmer.pro_w54()
ISRIStemmer.pro_w6()
ISRIStemmer.pro_w64()
ISRIStemmer.stem()
ISRIStemmer.suf1()
ISRIStemmer.suf32()
ISRIStemmer.waw()
- nltk.stem.lancaster module
- nltk.stem.porter module
- nltk.stem.regexp module
- nltk.stem.rslp module
- nltk.stem.snowball module
ArabicStemmer
ArabicStemmer.is_defined
ArabicStemmer.is_noun
ArabicStemmer.is_verb
ArabicStemmer.prefix_step2a_success
ArabicStemmer.prefix_step3a_noun_success
ArabicStemmer.prefix_step3b_noun_success
ArabicStemmer.stem()
ArabicStemmer.suffix_noun_step1a_success
ArabicStemmer.suffix_noun_step2a_success
ArabicStemmer.suffix_noun_step2b_success
ArabicStemmer.suffix_noun_step2c2_success
ArabicStemmer.suffix_verb_step2a_success
ArabicStemmer.suffix_verb_step2b_success
ArabicStemmer.suffixe_noun_step1b_success
ArabicStemmer.suffixes_verb_step1_success
DanishStemmer
DutchStemmer
EnglishStemmer
FinnishStemmer
FrenchStemmer
GermanStemmer
HungarianStemmer
ItalianStemmer
NorwegianStemmer
PorterStemmer
PortugueseStemmer
RomanianStemmer
RussianStemmer
SnowballStemmer
SpanishStemmer
SwedishStemmer
demo()
- nltk.stem.util module
- nltk.stem.wordnet module
Module contents¶
NLTK Stemmers
Interfaces used to remove morphological affixes from words, leaving
only the word stem. Stemming algorithms aim to remove those affixes
required for eg. grammatical role, tense, derivational morphology
leaving only the stem of the word. This is a difficult problem due to
irregular words (eg. common verbs in English), complicated
morphological rules, and part-of-speech and sense ambiguities
(eg. ceil-
is not the stem of ceiling
).
StemmerI defines a standard interface for stemmers.