Title: | Chinese Text Segmentation Data for jiebaR Package |
---|---|
Description: | jiebaR is a package for Chinese text segmentation, keyword extraction and speech tagging. This package provides the data files required by jiebaR. |
Authors: | Qin Wenfeng |
Maintainer: | Qin Wenfeng <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.1 |
Built: | 2025-02-16 04:45:35 UTC |
Source: | https://github.com/qinwf/jiebard |
jiebaR is a package for Chinese text segmentation, keyword extraction and speech tagging. This package provides the data files required by jiebaR. jiebaR supports four types of segmentation mode: Maximum Probability, Hidden Markov Model, Query Segment and Mix Segment.
You can use custom dictionary to be included in the jiebaR default dictionary. jiebaR can also identify new words, but adding your own new words will ensure a higher accuracy.
Qin Wenfeng <http://qinwenfeng.com>
CppJieba https://github.com/aszxqw/cppjieba;
JiebaR https://github.com/qinwf/jiebaR;
library("jiebaRD")
library("jiebaRD")