Learn R Programming

⚠️There's a newer version (0.11.1) of this package.Take me there.

jiebaR (version 0.11)

Chinese Text Segmentation

Description

Chinese text segmentation, keyword extraction and speech tagging For R.

Copy Link

Version

Install

install.packages('jiebaR')

Monthly Downloads

42

Version

0.11

License

MIT + file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

Qin Wenfeng

Last Published

December 13th, 2019

Functions in jiebaR (0.11)

DICTPATH

The path of dictionary
apply_list

Apply list input to a worker
get_tuple

get tuple from the segmentation result
<=.keywords

Keywords symbol
keywords

Keyword extraction
new_user_word

Add user word
tagging

Speech Tagging
print.inv

Print worker settings
simhash_dist

Compute Hamming distance of Simhash value
jiebaR

A package for Chinese text segmentation
<=.segment

Text segmentation symbol
<=.qseg

Quick mode symbol
<=.simhash

Simhash symbol
vector_tag

Tag the a character vector
tobin

simhash value to binary
<=.tagger

Tagger symbol
worker

Initialize jiebaR worker
get_qsegmodel

Set quick mode model
show_dictpath

Show default path of dictionaries
segment

Chinese text segmentation function
simhash

Simhash computation
freq

The frequency of words
get_idf

generate IDF dict
distance

Hamming distance of words
edit_dict

Edit default user dictionary
file_coding

Files encoding detection
filter_segment

Filter segmentation result