Archives

Tutorials

Working with Cantonese CHILDES Data (Jackson Lee, May 27, 2022)

Analyzing Cantopop with Python (Charles Lam, May 27, 2022)

Extracting Cantonese data from Hong Kong Chinese corpora (Chaak Ming Lau, May 27, 2022)

Accessing and Searching Cantonese Corpora in PyCantonese (Jackson Lee, May 16, 2021)

Sentence-Final Particles (Chaak Ming Lau, May 16, 2021)

Multiword Expressions / Discontinuous Constructions (Charles Lam, May 16, 2021)

Basic Python for Linguists (Jackson Lee, April 2021)

Research Outputs

Official paper:

Jackson L. Lee, Litong Chen, Charles Lam, Chaak Ming Lau, and Tsz-Him Tsui. 2022. PyCantonese: Cantonese Linguistics and NLP in Python. Proceedings of the 13th Language Resources and Evaluation Conference.

Earlier talks introducing PyCantonese:

Jackson L. Lee, Litong Chen, and Tsz-Him Tsui. 2016. PyCantonese: Developing computational tools for Cantonese linguistics. Talk at the 3rd Workshop on Innovations in Cantonese Linguistics, The Ohio State University. March 12, 2016. [Slides] [Handout]

Jackson L. Lee. 2015. PyCantonese: Cantonese linguistic research in the age of big data. Talk at the Childhood Bilingualism Research Centre, Chinese University of Hong Kong. September 15, 2015. [Notes+slides]