Learn R Programming

multicastR (version 1.0.1)

mc_count_clauses: Count clause units in a multicastR table

Description

Counts number of clause units (bounded by the <##>, <#>, or <%> annotation symbols) in a multicastR table.

Usage

mc_count_clauses(text)

Arguments

text

A data.table in multicastR format, containing minimally a corpus column with the names of the corpora and a graid column with GRAID annotation values.

Value

A data.table with the number of valid clause units in each corpus, the total number of clause units, the number of non-analyzed clause units ("NC"), and the percentage the later make up of the total.

See Also

multicast

Examples

Run this code
# NOT RUN {
# count clause units in the most recent version
# of the Multi-CAST data, by corpus
n <- mc_count_clauses(multicast())

# number of clauses units in the whole collection
sum(n$nClauses)

# }

Run the code above in your browser using DataLab