PyCantonese
3.1.0
Corpus Data
The CHAT Transcription Format
Accessing Built-in Data
Accessing Custom Data
Corpus Reader Methods
The Representation of “Words”
A Note on the Access Methods
Full Reader API
Corpus Search Queries
Searching by a Jyutping Element
Searching by a Chinese Character
Searching by a Part-of-speech Tag
Searching by a Word or Sentence Range
Searching by Multiple Criteria
Output Format of Search Results
Jyutping Romanization
Characters-to-Jyutping Conversion
Parsing Jyutping Strings
Jyutping-to-Yale Conversion
Jyutping-to-TIPA Conversion
Stop Words
Word Segmentation
Customizing Segmentation
Part-of-Speech Tagging
API Reference
Corpus Data
pycantonese.read_chat
pycantonese.hkcancor
pycantonese.corpus.CantoneseCHATReader
pycantonese.corpus.CantoneseCHATReader.search
pycantonese.corpus.CantoneseCHATReader.search
Jyutping Romanization
pycantonese.characters_to_jyutping
pycantonese.parse_jyutping
pycantonese.jyutping_to_yale
pycantonese.jyutping_to_tipa
Natural Language Processing
pycantonese.stop_words
pycantonese.segment
pycantonese.word_segmentation.Segmenter
pycantonese.pos_tag
pycantonese.pos_tagging.hkcancor_to_ud
Changelog
[Unreleased]
Added
Changed
Deprecated
Removed
Fixed
Security
[3.1.0] - 2021-02-21
Added
Fixed
[3.0.0] - 2020-10-25
Added
Changed
API-breaking Changes
Non-API-breaking Changes
Deprecated
Security
[2.4.1] - 2020-10-10
Fixed
[2.4.0] - 2020-10-10
Added
[2.3.0] - 2020-07-24
Added
Removed
[2.2.0] - 2018-06-30
Added
[2.1.0] - 2018-06-11
Added
Fixed
[2.0.0] - 2016-02-06
[1.0] - 2015-09-06
[1.0dev] - 2015-09-02
[0.2.1] - 2015-01-25
[0.2] - 2015-01-22
[0.1] - 2014-12-17
Research Outputs
PyCantonese
»
Index
Index
_
|
A
|
C
|
D
|
F
|
H
|
I
|
J
|
L
|
M
|
N
|
P
|
R
|
S
|
T
|
U
|
W
_
__init__() (pycantonese.corpus.CantoneseCHATReader method)
(pycantonese.word_segmentation.Segmenter method)
A
abspath() (pycantonese.corpus.CantoneseCHATReader method)
add() (pycantonese.corpus.CantoneseCHATReader method)
age() (pycantonese.corpus.CantoneseCHATReader method)
C
CantoneseCHATReader (class in pycantonese.corpus)
,
[1]
character_sents() (pycantonese.corpus.CantoneseCHATReader method)
characters() (pycantonese.corpus.CantoneseCHATReader method)
characters_to_jyutping() (in module pycantonese)
clear() (pycantonese.corpus.CantoneseCHATReader method)
concordance() (pycantonese.corpus.CantoneseCHATReader method)
D
date_of_birth() (pycantonese.corpus.CantoneseCHATReader method)
dates_of_recording() (pycantonese.corpus.CantoneseCHATReader method)
F
filenames() (pycantonese.corpus.CantoneseCHATReader method)
from_chat_files() (pycantonese.corpus.CantoneseCHATReader class method)
from_chat_str() (pycantonese.corpus.CantoneseCHATReader class method)
H
headers() (pycantonese.corpus.CantoneseCHATReader method)
hkcancor() (in module pycantonese)
hkcancor_to_ud() (in module pycantonese.pos_tagging)
I
index_to_tiers() (pycantonese.corpus.CantoneseCHATReader method)
IPSyn() (pycantonese.corpus.CantoneseCHATReader method)
J
jyutping_sents() (pycantonese.corpus.CantoneseCHATReader method)
jyutping_to_tipa() (in module pycantonese)
jyutping_to_yale() (in module pycantonese)
jyutpings() (pycantonese.corpus.CantoneseCHATReader method)
L
languages() (pycantonese.corpus.CantoneseCHATReader method)
M
MLU() (pycantonese.corpus.CantoneseCHATReader method)
MLUm() (pycantonese.corpus.CantoneseCHATReader method)
MLUw() (pycantonese.corpus.CantoneseCHATReader method)
N
number_of_files() (pycantonese.corpus.CantoneseCHATReader method)
number_of_utterances() (pycantonese.corpus.CantoneseCHATReader method)
P
parse_jyutping() (in module pycantonese)
part_of_speech_tags() (pycantonese.corpus.CantoneseCHATReader method)
participant_codes() (pycantonese.corpus.CantoneseCHATReader method)
participants() (pycantonese.corpus.CantoneseCHATReader method)
pos_tag() (in module pycantonese)
R
read_chat() (in module pycantonese)
remove() (pycantonese.corpus.CantoneseCHATReader method)
S
search() (pycantonese.corpus.CantoneseCHATReader method)
,
[1]
segment() (in module pycantonese)
Segmenter (class in pycantonese.word_segmentation)
sents() (pycantonese.corpus.CantoneseCHATReader method)
stop_words() (in module pycantonese)
T
tagged_sents() (pycantonese.corpus.CantoneseCHATReader method)
tagged_words() (pycantonese.corpus.CantoneseCHATReader method)
TTR() (pycantonese.corpus.CantoneseCHATReader method)
U
update() (pycantonese.corpus.CantoneseCHATReader method)
utterances() (pycantonese.corpus.CantoneseCHATReader method)
W
word_frequency() (pycantonese.corpus.CantoneseCHATReader method)
word_ngrams() (pycantonese.corpus.CantoneseCHATReader method)
words() (pycantonese.corpus.CantoneseCHATReader method)