Get the number of documents or features in an object.
ndoc(x)
nfeat(x)
An integer giving the number of documents or features.
ndoc returns the number of documents in an object whose texts are organized as "documents" (a corpus, dfm, or tokens object, or a readtext object from the readtext package).
nfeat returns the number of features from a dfm; it is an alias for ntype when applied to dfm objects. This function is only defined for dfm objects because only these have "features". (To count tokens, see ntoken.)
# NOT RUN {
# number of documents
ndoc(data_corpus_inaugural)
ndoc(corpus_subset(data_corpus_inaugural, Year > 1980))
ndoc(tokens(data_corpus_inaugural))
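# tokenize before building the dfm: recent quanteda releases no longer
# accept a corpus (or tokens options such as remove_punct) in dfm()
ndoc(dfm(tokens(corpus_subset(data_corpus_inaugural, Year > 1980))))
# number of features, with and without punctuation
nfeat(dfm(tokens(corpus_subset(data_corpus_inaugural, Year > 1980), remove_punct = FALSE)))
nfeat(dfm(tokens(corpus_subset(data_corpus_inaugural, Year > 1980), remove_punct = TRUE)))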
# }
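As a further illustration, the sketch below checks the two counts against the dimensions of a small dfm. It assumes the quanteda package is installed; the two toy texts and the names txt, toks, and dfmat are made up for this example and are not part of the package.

```r
library(quanteda)

# two tiny documents, invented for illustration only
txt <- c(d1 = "a b c", d2 = "a b d e")
toks <- tokens(txt)
dfmat <- dfm(toks)

# documents are the rows of the dfm, features are its columns
ndoc(dfmat)
nfeat(dfmat)

# the counts should agree with the matrix dimensions
stopifnot(ndoc(dfmat) == nrow(dfmat))
stopifnot(nfeat(dfmat) == ncol(dfmat))
```

Because a dfm stores one row per document and one column per feature, ndoc and nfeat can be read as named shortcuts for its row and column counts.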