jiebaR (version 0.10.99)

edit_dict: Edit default user dictionary

Description

Edit the default user dictionary.

Usage

edit_dict(name = "user")

Arguments

name

the name of dictionary including user, system, stop_word.

Details

There are three column in the system dictionary. Each column is seperated by space. The first column is the word, and the second column is the frequency of word. The third column is speech tag using labels compatible with ictclas.

There are two column in the user dictionary. The first column is the word, and the second column is speech tag using labels compatible with ictclas. Frequencies of words in the user dictionary is set by user_weight in worker function. If you want to provide the frequency of a new word, you can put it in the system dictionary.

Only one column in the stop words dictionary, and it contains the stop words.

References

The ictclas speech tag : http://t.cn/RAEj7e1