PyCantonese
3.1.0
Corpus Data
The CHAT Transcription Format
Accessing Built-in Data
Accessing Custom Data
Corpus Reader Methods
The Representation of “Words”
A Note on the Access Methods
Full Reader API
Corpus Search Queries
Searching by a Jyutping Element
Searching by a Chinese Character
Searching by a Part-of-speech Tag
Searching by a Word or Sentence Range
Searching by Multiple Criteria
Output Format of Search Results
Jyutping Romanization
Characters-to-Jyutping Conversion
Parsing Jyutping Strings
Jyutping-to-Yale Conversion
Jyutping-to-TIPA Conversion
Stop Words
Word Segmentation
Customizing Segmentation
Part-of-Speech Tagging
API Reference
Corpus Data
pycantonese.read_chat
pycantonese.hkcancor
pycantonese.corpus.CantoneseCHATReader
pycantonese.corpus.CantoneseCHATReader.search
Jyutping Romanization
pycantonese.characters_to_jyutping
pycantonese.parse_jyutping
pycantonese.jyutping_to_yale
pycantonese.jyutping_to_tipa
Natural Language Processing
pycantonese.stop_words
pycantonese.segment
pycantonese.word_segmentation.Segmenter
pycantonese.pos_tag
pycantonese.pos_tagging.hkcancor_to_ud
Changelog
[Unreleased]
Added
Changed
Deprecated
Removed
Fixed
Security
[3.1.0] - 2021-02-21
Added
Fixed
[3.0.0] - 2020-10-25
Added
Changed
API-breaking Changes
Non-API-breaking Changes
Deprecated
Security
[2.4.1] - 2020-10-10
Fixed
[2.4.0] - 2020-10-10
Added
[2.3.0] - 2020-07-24
Added
Removed
[2.2.0] - 2018-06-30
Added
[2.1.0] - 2018-06-11
Added
Fixed
[2.0.0] - 2016-02-06
[1.0] - 2015-09-06
[1.0dev] - 2015-09-02
[0.2.1] - 2015-01-25
[0.2] - 2015-01-22
[0.1] - 2014-12-17
Research Outputs