lexicon (version 0.1.1)

hash_grady_pos: Grady Ward's Moby Parts of Speech

Description

A dataset containing a hash lookup of Grady Ward's parts of speech from the Moby project. Words with non-ASCII characters have been removed.

Usage

data(hash_grady_pos)

Format

A data frame with 250,892 rows and 5 variables.

Details

  • word. The word.
  • pos. The part of speech; one of: Adjective, Adverb, Conjunction, Definite Article, Interjection, Noun, Noun Phrase, Plural, Preposition, Pronoun, Verb (intransitive), Verb (transitive), or Verb (usu participle). Note that the first part of speech for a word is its primary use; all other uses are secondary.
  • n_pos. The number of parts of speech associated with a word. Useful for filtering.
  • space. logical. If TRUE, the word contains a space. Useful for filtering.
  • primary. logical. If TRUE, the row holds the word's primary part of speech.
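
As a rough illustration of how the n_pos, space, and primary columns might be used for filtering, the sketch below builds a small stand-in data frame with the same five column names and subsets it with base R. The object name toy_pos and all of its rows are invented for illustration and are not taken from the dataset; the real hash_grady_pos may behave differently if it is stored as a keyed data.table.

```r
# Toy stand-in for hash_grady_pos: same column names, invented rows.
toy_pos <- data.frame(
  word    = c("dog", "dog", "hot dog", "quickly"),
  pos     = c("Noun", "Verb (transitive)", "Noun", "Adverb"),
  n_pos   = c(2L, 2L, 1L, 1L),
  space   = c(FALSE, FALSE, TRUE, FALSE),
  primary = c(TRUE, FALSE, TRUE, TRUE),
  stringsAsFactors = FALSE
)

# Keep only the row giving each word's primary part of speech.
primary_only <- toy_pos[toy_pos$primary, ]

# Keep words without a space that have exactly one part of speech.
unambiguous <- toy_pos[toy_pos$n_pos == 1L & !toy_pos$space, ]

print(primary_only)
print(unambiguous)
```

The same logical conditions should carry over to the real dataset, though the subsetting syntax may be shorter when data.table is loaded.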

References

Moby Thesaurus List by Grady Ward: http://icon.shef.ac.uk/Moby/mpos.html

Examples

## Not run: ------------------------------------
# library(data.table)
# 
# hash_grady_pos['dog']
# hash_grady_pos[primary == TRUE, ]
# hash_grady_pos[primary == TRUE & space == FALSE, ]
## ---------------------------------------------
