nltk.tokenize.sent_tokenize
- nltk.tokenize.sent_tokenize(text, language='english')
Return a sentence-tokenized copy of text, using NLTK’s recommended sentence tokenizer (currently PunktSentenceTokenizer for the specified language).
- Parameters
text – text to split into sentences
language – the model name in the Punkt corpus
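
The call described above can be sketched as follows. This is a minimal illustration, not part of the original reference: `split_sentences` is a hypothetical wrapper name, and the example assumes the Punkt model data for the chosen language is already installed (the resource name for `nltk.download` depends on the NLTK version).

```python
from nltk.tokenize import sent_tokenize


def split_sentences(text, language="english"):
    """Return the sentences of `text` as a list of strings.

    `split_sentences` is a hypothetical wrapper used only for this
    illustration; it simply forwards its arguments to `sent_tokenize`.
    """
    return sent_tokenize(text, language=language)


if __name__ == "__main__":
    # One-time setup (assumption: the Punkt data is not yet installed).
    # Depending on the NLTK version, the resource is named "punkt" or
    # "punkt_tab", for example:
    #   import nltk; nltk.download("punkt")
    sample = "Hello world. This is a test."
    for sentence in split_sentences(sample):
        print(sentence)
```

Passing a different `language` value, such as `'german'`, selects the Punkt model of that name. If the model data cannot be found locally, the call raises a `LookupError`.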