qdap (version 1.2.0)

formality: Formality Score

Description

Transcript apply formality score by grouping variable(s) and optionally plot the breakdown of the model.

Usage

formality(text.var, grouping.var = NULL, order.by.formality = TRUE,
  digits = 2, ...)

Arguments

text.var
The text variable (or an object from pos, pos_by or formality. Passing the later three ob
grouping.var
The grouping variables. Default NULL generates one word list for all text. Also takes a single grouping variable or a list of 1 or more grouping variables.
order.by.formality
logical. If TRUE orders the results by formality score.
digits
The number of digits displayed.
...
Other arguments passed to pos_by.

Value

  • A list containing at the following components:
  • textThe text variable
  • POStaggedRaw part of speech for every word of the text variable
  • POSpropPart of speech proportion for every word of the text variable
  • POSfreqPart of speech count for every word of the text variable
  • pos.by.freqThe part of speech count for every word of the text variable by grouping variable(s)
  • pos.by.propThe part of speech proportion for every word of the text variable by grouping variable(s)
  • form.freq.byThe nine broad part of speech categories count for every word of the text variable by grouping variable(s)
  • form.prop.byThe nine broad part of speech categories proportion for every word of the text variable by grouping variable(s)
  • formalityFormality scores by grouping variable(s)
  • pos.reshapedAn expanded formality scores output (grouping, word.count, pos & form.class) by word

Warning

Heylighen & Dewaele (2002) state, "At present, a sample would probably need to contain a few hundred words for the measure to be minimally reliable. For single sentences, the F-value should only be computed for purposes of illustration" (p. 24).

Details

Heylighen & Dewaele(2002)'s formality score is calculated as: $$F = 50(\frac{n_{f}-n_{c}}{N} + 1)$$ Where: $$f = \left {noun, \;adjective, \;preposition, \;article\right }$$ $$c = \left {pronoun, \;verb, \;adverb, \;interjection\right }$$ $$N = \sum{(f \;+ \;c \;+ \;conjunctions)}$$

References

Heylighen, F., & Dewaele, J.M. (2002). Variation in the contextuality of language: An empirical measure. Context in Context, Special issue of Foundations of Science, 7 (3), 293-340.

Examples

Run this code
with(DATA, formality(state, person))
(x1 <- with(DATA, formality(state, list(sex, adult))))
plot(x1)
plot(x1, short.names = FALSE)

scores(x1)
counts(x1)
proportions(x1)
preprocessed(x1)

plot(scores(x1))
plot(counts(x1))
plot(proportions(x1), high="darkgreen")
plot(preprocessed(x1))

data(rajPOS) #A data set consisting of a pos list object
x2 <- with(raj, formality(rajPOS, act))
plot(x2)
x3 <- with(raj, formality(rajPOS, person))
plot(x3, bar.colors="Dark2")
plot(x3, bar.colors=c("Dark2", "Set1"))
x4 <- with(raj, formality(rajPOS, list(person, act)))
plot(x4, bar.colors=c("Dark2", "Set1"))

rajDEM <- key_merge(raj, raj.demographics) #merge demographics with transcript.
x5 <- with(rajDEM, formality(rajPOS, sex))
plot(x5, bar.colors="RdBu")
x6 <- with(rajDEM, formality(rajPOS, list(fam.aff, sex)))
plot(x6, bar.colors="RdBu")
x7 <- with(rajDEM, formality(rajPOS, list(died, fam.aff)))
plot(x7, bar.colors="RdBu",  point.cex=2, point.pch = 3)
x8 <- with(rajDEM, formality(rajPOS, list(died, sex)))
plot(x8, bar.colors="RdBu",  point.cex=2, point.pch = "|")

names(x8)
colsplit2df(x8$formality)

#pass an object from pos or pos_by
ltruncdf(with(raj, formality(x8 , list(act, person))), 6, 4)

Run the code above in your browser using DataLab