lexicon (version 1.2.1)

hash_grady_pos: Grady Ward's Moby Parts of Speech

Description

A dataset containing a hash lookup of Grady Ward's parts of speech from the Moby project. The words with non-ASCII characters removed.

grady_pos_feature - A function for augmenting hash_grady_pos with 3 additional columns: (1) n_pos - the number of parts of speech a word has, (2) space - logical; indicating if a word contains a space, & (3) primary - logical; indicating if this is the most likely part of speech given the word.

Usage

data(hash_grady_pos)

grady_pos_feature(data)

Arguments

data

This should be lexicon::hash_grady_pos.

Format

A data frame with 246,691 rows and 3 variables

Details

  • word. The word.

  • pos. The part of speech; one of :Adjective, Adverb, Conjunction, Definite Article, Interjection, Noun, Noun Phrase, Plural, Preposition, Pronoun, Verb (intransitive), Verb (transitive), or Verb (usu participle). Note that the first part of speech for a word is its primary use; all other uses are secondary.

Examples

Run this code
# NOT RUN {
library(data.table)

hash_grady_pos <- grady_pos_feature(hash_grady_pos)
hash_grady_pos['dog']
hash_grady_pos[primary == TRUE, ]
hash_grady_pos[primary == TRUE & space == FALSE, ]
# }

Run the code above in your browser using DataLab