API Reference¶
Corpus Data¶
|
Read Cantonese CHAT data files into a reader object. |
|
Create a corpus object for the Hong Kong Cantonese Corpus. |
|
A reader for Cantonese CHAT corpus files. |
Search the data for the given criteria. |
Jyutping Romanization¶
|
Convert Cantonese characters into Jyutping romanization. |
|
Parse Jyutping romanization into onset, nucleus, code, and tone. |
|
Convert Jyutping romanization into Yale romanization. |
|
Convert Jyutping romanization into LaTeX TIPA. |
Natural Language Processing¶
|
Return Cantonese stop words. |
|
Segment the unsegmented input. |
|
A customizable word segmentation model. |
|
Tag the words for their parts of speech. |
|
Map a part-of-speech tag from HKCanCor to Universal Dependencies. |