New in version 3.1.0.
As of November 2020, PyCantonese v3.1.0 hasn’t been released yet.
All functionality related to part-of-speech tagging
is available only through the GitHub repository for early testers.
Everything (what functions are provided, how they behave) is subject to
change while it is still under active development.
To download and install this (unstable) version of PyCantonese
--pre flag means a pre-release version):
$ pip install --pre --upgrade pycantonese
import pycantonese as pc; print(pc.__version__) will show a
version number similar to
If you notice any issues, please don’t hesitate to report them.
While the documentation below is minimal for now, it is going to be updated and expanded once the part-of-speech tagging functionality is finalized in a new PyCantonese release.
tags words in a segmented sentence or phrase for their parts of speech.
maps a part-of-speech tag from HKCanCor to Universal Dependencies v2.