nltk.tokenize.sent_tokenize

nltk.tokenize.sent_tokenize(text, language='english')[source]

Return a sentence-tokenized copy of text, using NLTK’s recommended sentence tokenizer (currently PunktSentenceTokenizer for the specified language).

Parameters
  • text – text to split into sentences

  • language – the model name in the Punkt corpus