tm (version 0.5-10)

stemCompletion: Complete Stems

Description

Heuristically complete stemmed words.

Usage

## S3 method for class 'character':
stemCompletion(x, dictionary,
               type = c("prevalent", "first", "longest",
                        "none", "random", "shortest"))
## S3 method for class 'PlainTextDocument':
stemCompletion(x, dictionary,
               type = c("prevalent", "first", "longest",
                        "none", "random", "shortest"))

Arguments

x
A PlainTextDocument or character vector of stems to be completed.
dictionary
A Corpus or character vector to be searched for possible completions.
type
A character naming the heuristics to be used: [object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Value

  • A plain text document or character vector with completed words.

References

Ingo Feinerer (2010). Analysis and Algorithms for Stemming Inversion. Information Retrieval Technology --- 6th Asia Information Retrieval Societies Conference, AIRS 2010, Taipei, Taiwan, December 1--3, 2010. Proceedings, volume 6458 of Lecture Notes in Computer Science, pages 290--299. Springer-Verlag, December 2010.

Examples

Run this code
data("crude")
stemCompletion(c("compan", "entit", "suppl"), crude)
(s <- stemDocument(crude[[1]]))
stemCompletion(s, crude)

Run the code above in your browser using DataLab