nltk package¶

Subpackages¶

Submodules¶

nltk.book module
- sents()
- texts()
nltk.cli module
nltk.collections module
- AbstractLazySequence
- LazyConcatenation
  - LazyConcatenation.__init__()
  - LazyConcatenation.iterate_from()
- LazyEnumerate
  - LazyEnumerate.__init__()
- LazyIteratorList
  - LazyIteratorList.__init__()
  - LazyIteratorList.iterate_from()
- LazyMap
  - LazyMap.__init__()
  - LazyMap.iterate_from()
- LazySubsequence
- LazyZip
  - LazyZip.__init__()
  - LazyZip.iterate_from()
- OrderedDict
- Trie
nltk.collocations module
- BigramCollocationFinder
- QuadgramCollocationFinder
- TrigramCollocationFinder
nltk.compat module
- add_py3_data()
- py3_data()
nltk.data module
- AUTO_FORMATS
- BufferedGzipFile()
- FORMATS
- FileSystemPathPointer
- GzipFileSystemPathPointer
  - GzipFileSystemPathPointer.open()
- LazyLoader
  - LazyLoader.__init__()
- OpenOnDemandZipFile
- PathPointer
- SeekableUnicodeStreamReader
- clear_cache()
- find()
- load()
- path
- retrieve()
- show_cfg()
nltk.decorators module
- decorator()
- getinfo()
- new_wrapper()
nltk.downloader module
- Downloading Packages
- Download Directory
- NLTK Download Server
- Collection
  - Collection.__init__()
  - Collection.children
  - Collection.fromxml()
  - Collection.id
  - Collection.name
  - Collection.packages
- Downloader
  - Downloader.DEFAULT_URL
  - Downloader.INDEX_TIMEOUT
  - Downloader.INSTALLED
  - Downloader.NOT_INSTALLED
  - Downloader.PARTIAL
  - Downloader.STALE
  - Downloader.__init__()
  - Downloader.clear_status_cache()
  - Downloader.collections()
  - Downloader.corpora()
  - Downloader.default_download_dir()
  - Downloader.download()
  - Downloader.download_dir
  - Downloader.incr_download()
  - Downloader.index()
  - Downloader.info()
  - Downloader.is_installed()
  - Downloader.is_stale()
  - Downloader.list()
  - Downloader.models()
  - Downloader.packages()
  - Downloader.status()
  - Downloader.update()
  - Downloader.url
  - Downloader.xmlinfo()
- DownloaderGUI
  - DownloaderGUI.COLUMNS
  - DownloaderGUI.COLUMN_WEIGHTS
  - DownloaderGUI.COLUMN_WIDTHS
  - DownloaderGUI.DEFAULT_COLUMN_WIDTH
  - DownloaderGUI.HELP
  - DownloaderGUI.INITIAL_COLUMNS
  - DownloaderGUI.__init__()
  - DownloaderGUI.about()
  - DownloaderGUI.c
  - DownloaderGUI.destroy()
  - DownloaderGUI.help()
  - DownloaderGUI.mainloop()
- DownloaderMessage
- DownloaderShell
  - DownloaderShell.__init__()
  - DownloaderShell.run()
- ErrorMessage
  - ErrorMessage.__init__()
- FinishCollectionMessage
  - FinishCollectionMessage.__init__()
- FinishDownloadMessage
  - FinishDownloadMessage.__init__()
- FinishPackageMessage
  - FinishPackageMessage.__init__()
- FinishUnzipMessage
  - FinishUnzipMessage.__init__()
- Package
  - Package.__init__()
  - Package.author
  - Package.checksum
  - Package.contact
  - Package.copyright
  - Package.filename
  - Package.fromxml()
  - Package.id
  - Package.license
  - Package.name
  - Package.size
  - Package.subdir
  - Package.svn_revision
  - Package.unzip
  - Package.unzipped_size
  - Package.url
- ProgressMessage
  - ProgressMessage.__init__()
- SelectDownloadDirMessage
  - SelectDownloadDirMessage.__init__()
- StaleMessage
  - StaleMessage.__init__()
- StartCollectionMessage
  - StartCollectionMessage.__init__()
- StartDownloadMessage
  - StartDownloadMessage.__init__()
- StartPackageMessage
  - StartPackageMessage.__init__()
- StartUnzipMessage
  - StartUnzipMessage.__init__()
- UpToDateMessage
  - UpToDateMessage.__init__()
- build_index()
- download()
- download_gui()
- download_shell()
- md5_hexdigest()
- unzip()
- update()
nltk.featstruct module
- Lightweight Feature Structures
- FeatDict
- FeatList
- FeatStruct
- FeatStructReader
- Feature
- RangeFeature
- SlashFeature
  - SlashFeature.read_value()
- conflicts()
- subsumes()
- unify()
nltk.grammar module
- CFG
- DependencyGrammar
- DependencyProduction
- Nonterminal
  - Nonterminal.__init__()
  - Nonterminal.symbol()
- PCFG
- ProbabilisticDependencyGrammar
  - ProbabilisticDependencyGrammar.__init__()
  - ProbabilisticDependencyGrammar.contains()
- ProbabilisticProduction
  - ProbabilisticProduction.__init__()
- Production
- induce_pcfg()
- nonterminals()
- read_grammar()
nltk.help module
- brown_tagset()
- claws5_tagset()
- upenn_tagset()
nltk.internals module
- Counter
  - Counter.__init__()
  - Counter.get()
- Deprecated
  - Deprecated.__new__()
- ElementWrapper
  - ElementWrapper.__init__()
  - ElementWrapper.__new__()
  - ElementWrapper.find()
  - ElementWrapper.findall()
  - ElementWrapper.getchildren()
  - ElementWrapper.getiterator()
  - ElementWrapper.makeelement()
  - ElementWrapper.unwrap()
- ReadError
  - ReadError.__init__()
- config_java()
- deprecated()
- find_binary()
- find_binary_iter()
- find_dir()
- find_file()
- find_file_iter()
- find_jar()
- find_jar_iter()
- find_jars_within_path()
- import_from_stdlib()
- is_writable()
- java()
- overridden()
- raise_unorderable_types()
- read_int()
- read_number()
- read_str()
- slice_bounds()
nltk.jsontags module
- JSONTaggedDecoder
  - JSONTaggedDecoder.decode()
  - JSONTaggedDecoder.decode_obj()
- JSONTaggedEncoder
  - JSONTaggedEncoder.default()
- register_tag()
nltk.langnames module
- inverse_dict()
- lang2q()
- langcode()
- langname()
- q2name()
- q2tag()
- tag2q()
nltk.lazyimport module
- LazyModule
  - LazyModule.__init__()
nltk.probability module
- ConditionalFreqDist
  - ConditionalFreqDist.N()
  - ConditionalFreqDist.__init__()
  - ConditionalFreqDist.conditions()
  - ConditionalFreqDist.copy()
  - ConditionalFreqDist.deepcopy()
  - ConditionalFreqDist.plot()
  - ConditionalFreqDist.tabulate()
- ConditionalProbDist
  - ConditionalProbDist.__init__()
- ConditionalProbDistI
  - ConditionalProbDistI.__init__()
  - ConditionalProbDistI.conditions()
- CrossValidationProbDist
  - CrossValidationProbDist.SUM_TO_ONE
  - CrossValidationProbDist.__init__()
  - CrossValidationProbDist.discount()
  - CrossValidationProbDist.freqdists()
  - CrossValidationProbDist.prob()
  - CrossValidationProbDist.samples()
- DictionaryConditionalProbDist
  - DictionaryConditionalProbDist.__init__()
- DictionaryProbDist
  - DictionaryProbDist.__init__()
  - DictionaryProbDist.logprob()
  - DictionaryProbDist.max()
  - DictionaryProbDist.prob()
  - DictionaryProbDist.samples()
- ELEProbDist
  - ELEProbDist.__init__()
- FreqDist
  - FreqDist.B()
  - FreqDist.N()
  - FreqDist.Nr()
  - FreqDist.__init__()
  - FreqDist.copy()
  - FreqDist.freq()
  - FreqDist.hapaxes()
  - FreqDist.max()
  - FreqDist.pformat()
  - FreqDist.plot()
  - FreqDist.pprint()
  - FreqDist.r_Nr()
  - FreqDist.setdefault()
  - FreqDist.tabulate()
  - FreqDist.update()
- HeldoutProbDist
  - HeldoutProbDist.SUM_TO_ONE
  - HeldoutProbDist.__init__()
  - HeldoutProbDist.base_fdist()
  - HeldoutProbDist.discount()
  - HeldoutProbDist.heldout_fdist()
  - HeldoutProbDist.max()
  - HeldoutProbDist.prob()
  - HeldoutProbDist.samples()
- ImmutableProbabilisticMixIn
  - ImmutableProbabilisticMixIn.set_logprob()
  - ImmutableProbabilisticMixIn.set_prob()
- KneserNeyProbDist
  - KneserNeyProbDist.__init__()
  - KneserNeyProbDist.discount()
  - KneserNeyProbDist.max()
  - KneserNeyProbDist.prob()
  - KneserNeyProbDist.samples()
  - KneserNeyProbDist.set_discount()
- LaplaceProbDist
  - LaplaceProbDist.__init__()
- LidstoneProbDist
  - LidstoneProbDist.SUM_TO_ONE
  - LidstoneProbDist.__init__()
  - LidstoneProbDist.discount()
  - LidstoneProbDist.freqdist()
  - LidstoneProbDist.max()
  - LidstoneProbDist.prob()
  - LidstoneProbDist.samples()
- MLEProbDist
  - MLEProbDist.__init__()
  - MLEProbDist.freqdist()
  - MLEProbDist.max()
  - MLEProbDist.prob()
  - MLEProbDist.samples()
- MutableProbDist
  - MutableProbDist.__init__()
  - MutableProbDist.logprob()
  - MutableProbDist.max()
  - MutableProbDist.prob()
  - MutableProbDist.samples()
  - MutableProbDist.update()
- ProbDistI
  - ProbDistI.SUM_TO_ONE
  - ProbDistI.__init__()
  - ProbDistI.discount()
  - ProbDistI.generate()
  - ProbDistI.logprob()
  - ProbDistI.max()
  - ProbDistI.prob()
  - ProbDistI.samples()
- ProbabilisticMixIn
  - ProbabilisticMixIn.__init__()
  - ProbabilisticMixIn.logprob()
  - ProbabilisticMixIn.prob()
  - ProbabilisticMixIn.set_logprob()
  - ProbabilisticMixIn.set_prob()
- SimpleGoodTuringProbDist
  - SimpleGoodTuringProbDist.SUM_TO_ONE
  - SimpleGoodTuringProbDist.__init__()
  - SimpleGoodTuringProbDist.check()
  - SimpleGoodTuringProbDist.discount()
  - SimpleGoodTuringProbDist.find_best_fit()
  - SimpleGoodTuringProbDist.freqdist()
  - SimpleGoodTuringProbDist.max()
  - SimpleGoodTuringProbDist.prob()
  - SimpleGoodTuringProbDist.samples()
  - SimpleGoodTuringProbDist.smoothedNr()
- UniformProbDist
  - UniformProbDist.__init__()
  - UniformProbDist.max()
  - UniformProbDist.prob()
  - UniformProbDist.samples()
- WittenBellProbDist
  - WittenBellProbDist.__init__()
  - WittenBellProbDist.discount()
  - WittenBellProbDist.freqdist()
  - WittenBellProbDist.max()
  - WittenBellProbDist.prob()
  - WittenBellProbDist.samples()
- add_logs()
- entropy()
- log_likelihood()
- sum_logs()
nltk.tabdata module
- MaxentDecoder
  - MaxentDecoder.tupkey2dict()
- MaxentEncoder
  - MaxentEncoder.tupdict2tab()
- PunktDecoder
  - PunktDecoder.tab2intdict()
- TabDecoder
- TabEncoder
- rm_nl()
nltk.text module
- ConcordanceIndex
- ContextIndex
- Text
- TextCollection
- TokenSearcher
  - TokenSearcher.__init__()
  - TokenSearcher.findall()
nltk.tgrep module
- TGrep search implementation for NLTK trees
- TgrepException
- ancestors()
- tgrep_compile()
- tgrep_nodes()
- tgrep_positions()
- tgrep_tokenize()
- treepositions_no_leaves()
- unique_ancestors()
nltk.toolbox module
- StandardFormat
- ToolboxData
  - ToolboxData.parse()
- ToolboxSettings
  - ToolboxSettings.__init__()
  - ToolboxSettings.parse()
- add_blank_lines()
- add_default_fields()
- demo()
- remove_blanks()
- sort_fields()
- to_settings_string()
- to_sfm_string()
nltk.treeprettyprinter module
- TreePrettyPrinter
nltk.treetransforms module
- chomsky_normal_form()
- collapse_unary()
- un_chomsky_normal_form()
nltk.util module
- Index
  - Index.__init__()
- acyclic_branches_depth_first()
- acyclic_breadth_first()
- acyclic_depth_first()
- acyclic_dic2tree()
- bigrams()
- binary_search_file()
- breadth_first()
- choose()
- clean_html()
- clean_url()
- cut_string()
- edge_closure()
- edges2dot()
- elementtree_indent()
- everygrams()
- filestring()
- flatten()
- guess_encoding()
- in_idle()
- invert_dict()
- invert_graph()
- ngrams()
- pad_sequence()
- pairwise()
- parallelize_preprocess()
- pr()
- print_string()
- re_show()
- set_proxy()
- skipgrams()
- tokenwrap()
- transitive_closure()
- trigrams()
- unique_list()
- unweighted_minimum_spanning_dict()
- unweighted_minimum_spanning_digraph()
- unweighted_minimum_spanning_tree()
nltk.wsd module
- lesk()

Module contents¶

The Natural Language Toolkit (NLTK) is an open source Python library for Natural Language Processing. A free online book is available. (If you use the library for academic research, please cite the book.)

Steven Bird, Ewan Klein, and Edward Loper (2009). Natural Language Processing with Python. O’Reilly Media Inc. https://www.nltk.org/book/

isort:skip_file

@version: 3.9.1

nltk.demo()[source]¶