morphemepiece

The goal of morphemepiece is to let you tokenize words into morphemes (the smallest units of meaning).
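As a rough illustration of that goal, the sketch below passes a short string to morphemepiece_tokenize, which appears in the function list for this package. It assumes that the function's default vocabulary and lookup arguments are usable out of the box (an assumption about version 1.2.3, not something this page confirms). The wrapper tokenize_if_available is a hypothetical helper, added only so the snippet runs harmlessly when the package is not installed.

```r
# Hypothetical helper (not part of morphemepiece): tokenize only if the
# package is installed, otherwise return NULL with a message.
tokenize_if_available <- function(text) {
  if (!requireNamespace("morphemepiece", quietly = TRUE)) {
    message("morphemepiece is not installed; skipping tokenization.")
    return(NULL)
  }
  # Assumption: the default vocabulary and lookup arguments work without
  # further setup. If they do not, supply them explicitly.
  morphemepiece::morphemepiece_tokenize(text)
}

tokens <- tokenize_if_available("Surprisingly easy")

# Print the result only when tokenization actually ran.
if (!is.null(tokens)) {
  print(tokens)
}
```

If the defaults are not available in your installation, the vocabulary and lookup can presumably be loaded with load_or_retrieve_vocab and load_or_retrieve_lookup from the function list and passed in explicitly.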

Installation

You can install the released version of morphemepiece from CRAN with:

install.packages("morphemepiece")

You can install the development version from GitHub with:

# install.packages("devtools")
devtools::install_github("macmillancontentscience/morphemepiece")

Code of Conduct

Please note that the morphemepiece project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

Disclaimer

This is not an officially supported Macmillan Learning product.

Contact information

Questions or comments should be directed to Jonathan Bratt (jonathan.bratt@macmillan.com) and Jon Harmon (jonthegeek@gmail.com).

Monthly Downloads

260

Version

1.2.3

License

Apache License (>= 2)

Maintainer

Jonathan Bratt

Last Published

April 16th, 2022

Functions in morphemepiece (1.2.3)

load_lookup: Load a morphemepiece lookup file
load_or_retrieve_lookup: Load a lookup file, or retrieve from cache
morphemepiece_tokenize: Tokenize Sequence with Morpheme Pieces
set_morphemepiece_cache_dir: Set a Cache Directory for Morphemepiece
prepare_vocab: Format a Token List as a Vocabulary
.process_mp_vocab: Process a Morphemepiece Vocabulary for Tokenization
reexports: Objects exported from other packages
.mp_tokenize_word_lookup: Tokenize a Word Including Lookup
.validate_morphemepiece_vocabulary: Validator for Objects of Class morphemepiece_vocabulary
morphemepiece-package: morphemepiece: Morpheme Tokenization
morphemepiece_cache_dir: Retrieve Directory for Morphemepiece Cache
.mp_tokenize_word_bidir: Tokenize a Word Bidirectionally
.mp_tokenize_word: Tokenize a Word
.new_morphemepiece_vocabulary: Constructor for Class morphemepiece_vocabulary
.mp_tokenize_single_string: Tokenize an Input Word-by-word
.infer_case_from_vocab: Determine Vocabulary Casedness
load_vocab: Load a vocabulary file
load_or_retrieve_vocab: Load a vocabulary file, or retrieve from cache