Package: jiebaR 0.11

Qin Wenfeng

jiebaR: Chinese Text Segmentation

Chinese text segmentation, keyword extraction and speech tagging For R.

Authors:Qin Wenfeng, Wu Yanyi

jiebaR_0.11.tar.gz
jiebaR_0.11.zip(r-4.5)jiebaR_0.11.zip(r-4.4)jiebaR_0.11.zip(r-4.3)
jiebaR_0.11.tgz(r-4.4-x86_64)jiebaR_0.11.tgz(r-4.4-arm64)jiebaR_0.11.tgz(r-4.3-x86_64)jiebaR_0.11.tgz(r-4.3-arm64)
jiebaR_0.11.tar.gz(r-4.5-noble)jiebaR_0.11.tar.gz(r-4.4-noble)
jiebaR_0.11.tgz(r-4.4-emscripten)jiebaR_0.11.tgz(r-4.3-emscripten)
jiebaR.pdf |jiebaR.html
jiebaR/json (API)
NEWS

# Install 'jiebaR' in R:
install.packages('jiebaR', repos = c('https://qinwf.r-universe.dev', 'https://cloud.r-project.org'))

Peer review:

Bug tracker:https://github.com/qinwf/jiebar/issues

Uses libs:
  • c++– GNU Standard C++ Library v3

On CRAN:

chinesechinese-text-segmentationcppjiebajiebalexical-analysisnlp

10.42 score 342 stars 6 packages 402 scripts 2.7k downloads 2 mentions 32 exports 2 dependencies

Last updated 5 years agofrom:a984fb8813. Checks:OK: 1 NOTE: 8. Indexed: yes.

TargetResultDate
Doc / VignettesOKOct 25 2024
R-4.5-win-x86_64NOTEOct 25 2024
R-4.5-linux-x86_64NOTEOct 25 2024
R-4.4-win-x86_64NOTEOct 25 2024
R-4.4-mac-x86_64NOTEOct 25 2024
R-4.4-mac-aarch64NOTEOct 25 2024
R-4.3-win-x86_64NOTEOct 25 2024
R-4.3-mac-x86_64NOTEOct 25 2024
R-4.3-mac-aarch64NOTEOct 25 2024

Exports:apply_listDICTPATHdistanceedit_dictfile_codingfilecodingfilter_segmentfreqget_idfget_qsegmodelget_tupleHMMPATHIDFPATHkeywordsnew_user_wordqsegreset_qsegmodelsegmentset_qsegmodelshow_dictpathsimhashsimhash_distsimhash_dist_matSTOPPATHtaggingtobinUSERPATHvector_distancevector_keywordsvector_simhashvector_tagworker

Dependencies:jiebaRDRcpp

Quick Start Guide - jiebaR

Rendered fromQuick_Start_Guide.Rmdusingknitr::rmarkdownon Oct 25 2024.

Last update: 2019-12-13
Started: 2014-11-23

Readme and manuals

Help Manual

Help pageTopics
Keywords symbol<=.keywords [.keywords
Quick mode symbol<=.qseg qseg [.qseg
Text segmentation symbol<=.segment [.segment
Simhash symbol<=.simhash [.simhash
Tagger symbol<=.tagger [.tagger
Apply list input to a workerapply_list
The path of dictionaryDICTPATH HMMPATH IDFPATH STOPPATH USERPATH
Hamming distance of wordsdistance vector_distance
Edit default user dictionaryedit_dict
Files encoding detectionfilecoding file_coding
Filter segmentation resultfilter_segment
The frequency of wordsfreq
generate IDF dictget_idf
Set quick mode modelget_qsegmodel reset_qsegmodel set_qsegmodel
get tuple from the segmentation resultget_tuple
A package for Chinese text segmentationjiebaR-package jiebaR
Keyword extractionkeywords vector_keywords
Add user wordnew_user_word
Print worker settingsprint.inv print.jieba print.keywords print.qseg print.simhash
Chinese text segmentation functionsegment
Show default path of dictionariesshow_dictpath
Simhash computationsimhash vector_simhash
Compute Hamming distance of Simhash valuesimhash_dist simhash_dist_mat
Speech Taggingtagging
simhash value to binarytobin
Tag the a character vectorvector_tag
Initialize jiebaR workerworker