PyCantonese
3.4.0
Quickstart
Corpus Data
CHAT Format
Built-in Data
CHILDES and TalkBank Data
Custom Data
Corpus Reader Methods
Headers
Transcriptions and Annotations
Jyutping Romanization
Chinese Characters
Word Frequencies and Ngrams
Corpus Search Queries
Searching by a Jyutping Element
Searching by a Chinese Character
Searching by a Part-of-speech Tag
Searching by a Word or Utterance Range
Searching by Multiple Criteria
Output Format of Search Results
Complex Searches
Parsing Cantonese Text
Input 1: A Plain String
Input 2: A List of Strings
Input 3: A List of Tuples of Strings
Customizing Word Segmentation
Customizing Part-of-Speech Tagging
Outputting CHAT Data
More Customization
Jyutping Romanization
Characters-to-Jyutping Conversion
Parsing Jyutping Strings
Jyutping-to-Yale Conversion
Jyutping-to-TIPA Conversion
Stop Words
Word Segmentation
Customizing Segmentation
Part-of-Speech Tagging
API Reference
Corpus Data
pycantonese.read_chat
pycantonese.hkcancor
pycantonese.CHATReader
pycantonese.CHATReader.search
pycantonese.CHATReader.search
Jyutping Romanization
pycantonese.characters_to_jyutping
pycantonese.parse_jyutping
pycantonese.jyutping_to_yale
pycantonese.jyutping_to_tipa
Natural Language Processing
pycantonese.stop_words
pycantonese.parse_text
pycantonese.segment
pycantonese.word_segmentation.Segmenter
pycantonese.pos_tag
pycantonese.pos_tagging.hkcancor_to_ud
CHATReader
Token
Jyutping
Changelog
[Unreleased]
Added
Changed
Deprecated
Removed
Fixed
Security
[3.4.0] - 2021-12-28
Added
Changed
Removed
Security
[3.3.1] - 2021-05-14
Fixed
[3.3.0] - 2021-05-14
Changed
Fixed
[3.2.4] - 2021-05-07
Fixed
[3.2.3] - 2021-04-12
Fixed
[3.2.2] - 2021-03-23
Fixed
[3.2.1] - 2021-03-21
Fixed
[3.2.0] - 2021-03-20
Added
Changed
Deprecated
Fixed
[3.1.1] - 2021-03-18
Fixed
[3.1.0] - 2021-02-21
Added
Fixed
[3.0.0] - 2020-10-25
Added
Changed
API-breaking Changes
Non-API-breaking Changes
Deprecated
Security
[2.4.1] - 2020-10-10
Fixed
[2.4.0] - 2020-10-10
Added
[2.3.0] - 2020-07-24
Added
Removed
[2.2.0] - 2018-06-30
Added
[2.1.0] - 2018-06-11
Added
Fixed
[2.0.0] - 2016-02-06
[1.0] - 2015-09-06
[1.0dev] - 2015-09-02
[0.2.1] - 2015-01-25
[0.2] - 2015-01-22
[0.1] - 2014-12-17
Archives
Tutorials
Research Outputs
PyCantonese
»
Index
Index
_
|
A
|
C
|
D
|
E
|
F
|
H
|
I
|
J
|
L
|
M
|
N
|
P
|
R
|
S
|
T
|
U
|
W
_
__eq__() (pycantonese.jyutping.Jyutping method)
__hash__ (pycantonese.jyutping.Jyutping attribute)
__init__() (pycantonese.CHATReader method)
(pycantonese.jyutping.Jyutping method)
(pycantonese.word_segmentation.Segmenter method)
__repr__() (pycantonese.jyutping.Jyutping method)
__str__() (pycantonese.jyutping.Jyutping method)
A
ages() (pycantonese.CHATReader method)
append() (pycantonese.CHATReader method)
append_left() (pycantonese.CHATReader method)
C
characters() (pycantonese.CHATReader method)
characters_to_jyutping() (in module pycantonese)
CHATReader (class in pycantonese)
,
[1]
clear() (pycantonese.CHATReader method)
D
dates_of_recording() (pycantonese.CHATReader method)
E
extend() (pycantonese.CHATReader method)
extend_left() (pycantonese.CHATReader method)
F
file_paths() (pycantonese.CHATReader method)
filter() (pycantonese.CHATReader method)
final (pycantonese.jyutping.Jyutping property)
from_dir() (pycantonese.CHATReader class method)
from_files() (pycantonese.CHATReader class method)
from_strs() (pycantonese.CHATReader class method)
from_zip() (pycantonese.CHATReader class method)
H
head() (pycantonese.CHATReader method)
headers() (pycantonese.CHATReader method)
hkcancor() (in module pycantonese)
hkcancor_to_ud() (in module pycantonese.pos_tagging)
I
info() (pycantonese.CHATReader method)
ipsyn() (pycantonese.CHATReader method)
J
Jyutping (class in pycantonese.jyutping)
jyutping() (pycantonese.CHATReader method)
jyutping_to_tipa() (in module pycantonese)
jyutping_to_yale() (in module pycantonese)
L
languages() (pycantonese.CHATReader method)
M
mlu() (pycantonese.CHATReader method)
mlum() (pycantonese.CHATReader method)
mluw() (pycantonese.CHATReader method)
N
n_files() (pycantonese.CHATReader method)
P
parse_jyutping() (in module pycantonese)
parse_text() (in module pycantonese)
participants() (pycantonese.CHATReader method)
pop() (pycantonese.CHATReader method)
pop_left() (pycantonese.CHATReader method)
pos_tag() (in module pycantonese)
R
read_chat() (in module pycantonese)
S
search() (pycantonese.CHATReader method)
,
[1]
segment() (in module pycantonese)
Segmenter (class in pycantonese.word_segmentation)
sents() (pycantonese.CHATReader method)
stop_words() (in module pycantonese)
T
tagged_sents() (pycantonese.CHATReader method)
tagged_words() (pycantonese.CHATReader method)
tail() (pycantonese.CHATReader method)
to_chat() (pycantonese.CHATReader method)
to_strs() (pycantonese.CHATReader method)
Token (class in pycantonese.corpus)
tokens() (pycantonese.CHATReader method)
ttr() (pycantonese.CHATReader method)
U
utterances() (pycantonese.CHATReader method)
W
word_frequencies() (pycantonese.CHATReader method)
word_ngrams() (pycantonese.CHATReader method)
words() (pycantonese.CHATReader method)