nltk.text.ConcordanceIndex¶

class nltk.text.ConcordanceIndex[source]¶

Bases: object

An index that can be used to look up the offset locations at which a given word occurs in a document.

__init__(tokens, key=<function ConcordanceIndex.<lambda>>)[source]¶

Construct a new concordance index.

Parameters

tokens – The document (list of tokens) that this concordance index was created from. This list can be used to access the context of a given word occurrence.
key – A function that maps each token to a normalized version that will be used as a key in the index. E.g., if you use key=lambda s:s.lower(), then the index will be case-insensitive.

offsets(word)[source]¶

Return type: list(int)
Returns: A list of the offset positions at which the given word occurs. If a key function was specified for the index, then given word’s key will be looked up.

find_concordance(word, width=80)[source]¶

Find all concordance lines given the query word.

Provided with a list of words, these will be found as a phrase.

print_concordance(word, width=80, lines=25)[source]¶: Print concordance lines given the query word. :param word: The target word or phrase (a list of strings) :type word: str or list :param lines: The number of lines to display (default=25) :type lines: int :param width: The width of each line, in characters (default=80) :type width: int :param save: The option to save the concordance. :type save: bool

NLTK