NLTK

Documentation

nltk.tokenize.sent_tokenize¶

nltk.tokenize.sent_tokenize(text, language='english')[source]¶

Return a sentence-tokenized copy of text, using NLTK’s recommended sentence tokenizer (currently PunktSentenceTokenizer for the specified language).

Parameters

text – text to split into sentences
language – the model name in the Punkt corpus