Structsยง
- Analyzer
Analyzer
analyzes a text into a list of tokens.- Chinese
Tokenizer ChineseTokenizer
tokenizes a Chinese text.- English
Tokenizer EnglishTokenizer
tokenizes an English text.- JIEBA ๐
Constantsยง
- VALID_
ASCII_ ๐TOKEN - A-Z, a-z, 0-9, and โ_โ are true
Traitsยง
- Tokenizer
Tokenizer
tokenizes a text into a list of tokens.