Performs Correspondence Analysis on the working lexical table contained in TextData object. Supplementary documents, words, segments, contextual quantitative and qualitative variables can be considered if previously selected in TextData function.
LexCA(object, ncp=5, context.sup="ALL", doc.sup=NULL, word.sup=NULL,
segment=FALSE, graph=TRUE, axes=c(1, 2), lmd=3, lmw=3)
Returns a list including:
matrix with the eigenvalues, the percentages of inertia and the cumulative percentages of inertia
list of matrices with all the results for the documents (coordinates, square cosines, contributions, inertia)
list of matrices with all the results for the words (coordinates, square cosines, contributions, inertia)
if row.sup is non-NULL, list of matrices with all the results for the supplementary documents (coordinates, square cosines)
if col.sup is non-NULL, list of matrices with all the results for the supplementary words (coordinates, square cosines)
if quanti.sup is non-NULL, list of matrices containing the results for the supplementary quantitative variables (coordinates, square cosines)
if quali.sup is non-NULL, list of matrices with all the results for the supplementary categorical variables; see section details
list of the documents/words whose contribution is over lmd/lmw times the average document/word contribution
Cramer's V coefficient
total inertia
information about the corpus
if segment is TRUE, list of matrices with the results for the repeated segments (coordinates, square cosines)
name of the aggregation variable in the case of an aggregate correspondence analysis
a list with some statistics
object of TextData class
number of dimensions kept in the results (by default 5)
column index(es) or name(s) of the contextual qualitative or quantitative variables among those selected in TextData function (by default "ALL")
vector indicating the index(es) or name(s) of the supplementary documents (rows) (by default NULL)
vector indicating the index(es) or name(s) of the supplementary words (columns) (by default NULL)
if TRUE, the repeated segments identified by TextData function will be considered as supplementary columns (by default FALSE)
if TRUE, basic graphs are displayed; use plot.LexCA to obtain more graphs (by default TRUE)
length-2 vector indicating the axes to plot (by default axes=c(1,2))
only the documents whose contribution is over lmd times the average-document-contribution are plotted (by default lmd=3)
only the words whose contribution is over lmw times the average-word-contribution are plotted (by default lmw=3)
Ramón Alvarez-Esteban ramon.alvarez@unileon.es, Mónica Bécue-Bertaut, Josep-Anton
Sánchez-Espigares
In the case of a direct CA, DocTerm is a non-aggregate table and:
the contextual quantitative variables are considered as supplementary quantitative columns in CA.
the categories of the contextual qualitative variables are considered as supplementary columns in CA.
In the case of an aggregate CA, DocTerm is an aggregate table and:
the contextual quantitative variables are considered as supplementary quantitative columns in CA; the value of an active aggregate-document for a variable is the mean of the values corresponding to the source-documents belonging to this aggregate-document.
the categories of the contextual qualitative variables are threatened as supplementary rows in CA; these rows contain the frequency with which each the set of documents belonging to this category has used the different words.
Benzécri, J, P. (1981). Pratique de l'analyse des donnees. Linguistique & lexicologie (Vol.3). (P. Dunod., Ed).
Husson F., Lê S., Pagès J. (2011). Exploratory Multivariate Analysis by Example Using R. Chapman & Hall/CRC. tools:::Rd_expr_doi("10.1201/b10345").
Lebart, L., Salem, A., & Berry, L. (1998). Exploring textual data. (D. Kluwer, Ed.). tools:::Rd_expr_doi("10.1007/978-94-017-1525-6").
Murtagh F. (2005). Correspondence Analysis and Data Coding with R and Java. Chapman & Hall/CRC.
TextData
, print.LexCA
, plot.LexCA
, summary.LexCA
, ellipseLexCA
data(open.question)
if (FALSE) {
### non-aggregate CA
res.TD<-TextData(open.question, var.text=c(9,10), Fmin=10, Dmin=10,
remov.number=TRUE, stop.word.tm=TRUE)
res.LexCA<-LexCA(res.TD, lmd=0, lmw=1)
}
### aggregate CA
res.TD<-TextData(open.question, var.text=c(9,10), var.agg="Age_Group", Fmin=10, Dmin=10,
remov.number=TRUE, stop.word.tm=TRUE)
res.LexCA<-LexCA(res.TD, lmd=0, lmw=1)
Run the code above in your browser using DataLab