threshold on the p-value used when selecting the characteristic words (by default 0.05)
maxDocs
maximum number of documents in the working lexical table (by default 20). See details
maxCharDoc
maximum number of characteristic source-documents to extract (by default 10). See details
maxPrnDoc
maximum length to be printed for a characteristic document (by default 100 characters)
Value
Returns a list including:
CharWord
characteristic words of all the documents
CharDoc
characteristic source-documents of all the aggregate-documents
Details
The lexical table provided by TextData can consider either source-documents or aggregate-documents, in accordance with the value of argument "var.agg" in TextData.
Extracting the characteristic words for a too high number of documents is of no interest and time-consuming. So that, this function can be applied only when the number of documents in the lexical table is under or equal to maxDocs (by default 20).
In the case of aggregate documents, extracting the characteristic source-documents is possible but of interest only if the source-documents are not too long.
In any case, only the first maxPrnDoc characters of each characteristic document are printed (by default 100).
References
Lebart, L., Salem, A., & Berry, L. (1998). Exploring textual data. (D. Kluwer, Ed.).