
Get the number of documents or features in an object.
ndoc(x)nfeature(x)
an integer (count) of the number of documents or features
ndoc
returns the number of documents in a corpus,
dfm, or tokens object, or a readtext object from the
readtext package
nfeature
returns the number of features in a dfm
nfeature
returns the number of features from a dfm; it is an
alias for ntype
when applied to dfm objects. This function is only
defined for dfm objects because only these have "features". (To count
tokens, see ntoken
.)
# NOT RUN {
# number of documents
ndoc(data_corpus_inaugural)
ndoc(corpus_subset(data_corpus_inaugural, Year > 1980))
ndoc(tokens(data_corpus_inaugural))
ndoc(dfm(corpus_subset(data_corpus_inaugural, Year > 1980)))
# number of features
nfeature(dfm(corpus_subset(data_corpus_inaugural, Year > 1980), remove_punct = FALSE))
nfeature(dfm(corpus_subset(data_corpus_inaugural, Year > 1980), remove_punct = TRUE))
# }
Run the code above in your browser using DataLab