Structsยง
- Analyzer
Analyzeranalyzes a text into a list of tokens.- Chinese
Tokenizer ChineseTokenizertokenizes a Chinese text.- English
Tokenizer EnglishTokenizertokenizes an English text.- JIEBA ๐
Constantsยง
- VALID_
ASCII_ ๐TOKEN - A-Z, a-z, 0-9, and โ_โ are true
Traitsยง
- Tokenizer
Tokenizertokenizes a text into a list of tokens.