qdap v2.4.3





Bridging the Gap Between Qualitative Data and Quantitative Analysis

Automates many of the tasks associated with quantitative discourse analysis of transcripts, including frequency counts of sentence types, words, sentences, turns of talk, and syllables, along with other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions let the user aggregate data by any number of grouping variables, and the results integrate with other R packages that undertake higher-level analysis and visualization of text. This affords the user a more efficient and targeted analysis. 'qdap' is designed for transcript analysis; however, many functions are applicable to other areas of Text Mining/Natural Language Processing.



Project Status: Inactive. The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows.

qdap (Quantitative Discourse Analysis Package) is an R package designed to assist in quantitative discourse analysis. The package stands as a bridge between qualitative transcripts of dialogue and statistical analysis & visualization.
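As a rough illustration of aggregating by grouping variables, the sketch below counts words in the fictitious classroom dialogue `DATA` that ships with qdap, first per speaker and then per combination of two grouping variables. It assumes qdap and its dependencies are installed; the object names (`wc`, `words_by_person`, `freq_by_person`, `freq_by_sex_adult`) are illustrative only.

```r
# Minimal sketch, assuming qdap (and its dependencies) is installed.
# The guard keeps the script runnable on machines without qdap.
if (requireNamespace("qdap", quietly = TRUE)) {
  library(qdap)

  # DATA is the fictitious classroom dialogue listed in the function index;
  # its `state` column holds the text and `person` the speaker.
  wc <- word_count(DATA$state)          # one word count per row of text

  # Aggregate the per-row counts by a single grouping variable (base R).
  words_by_person <- tapply(wc, DATA$person, sum, na.rm = TRUE)
  print(words_by_person)

  # Word frequency matrix grouped by speaker ...
  freq_by_person <- wfm(DATA$state, DATA$person)

  # ... and by more than one grouping variable, passed as a list.
  freq_by_sex_adult <- wfm(DATA$state, list(DATA$sex, DATA$adult))
  print(head(freq_by_sex_adult))
}
```

Passing a list of vectors as the grouping argument is the pattern most qdap functions use when more than one grouping variable is needed.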


To download the development version of qdap:

Download the zip ball or tar ball, decompress it, and run R CMD INSTALL on it, or use the pacman package to install the development version (the user may want to install the dev version of reports first):

if (!require("pacman")) install.packages("pacman")
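One way the pacman route could look is sketched below. The helper name `install_qdap_dev` is hypothetical, and the GitHub repository `trinker/qdap` is an assumption about where the development sources live; neither is stated on this page. The function is only defined here, not called, so nothing is installed until the user runs it.

```r
# Hypothetical helper (not part of qdap or pacman): wraps the install steps
# so that nothing is downloaded until the function is actually called.
install_qdap_dev <- function() {
  # Install pacman from CRAN if it is not available yet.
  if (!requireNamespace("pacman", quietly = TRUE)) {
    install.packages("pacman")
  }
  # p_load_gh() installs a package from GitHub (if needed) and loads it.
  # "trinker/qdap" is an assumed repository path.
  pacman::p_load_gh("trinker/qdap")
}

# Usage (requires network access):
# install_qdap_dev()
```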



You are welcome to:

Note: If you are reporting a bug, make sure you have first read the Cleaning Text & Debugging vignette.

Functions in qdap

Name Description
Animate.lexical_classification Animate Formality
automated_readability_index Readability Measures
Animate.polarity Animate Polarity
Search Search Columns of a Data Frame
adjacency_matrix Takes a Matrix and Generates an Adjacency Matrix
Animate Generic Animate Method
Animate.character Animate Character
all_words Searches Text Column for Words
Filter.all_words Filter
NAer Replace Missing Values (NA)
+.Network Add themes to a Network object.
wfm Word Frequency Matrix
check_spelling Check Spelling
check_spelling_interactive.character Check Spelling
Animate.discourse_map Discourse Map
cm_code.exclude Exclude Codes
cm_code.overlap Find Co-occurrence Between Codes
Animate.formality Animate Formality
bracketX Bracket Parsing
build_qdap_vignette Replace Temporary Introduction to qdap Vignette
Network Generic Network Method
DATA Fictitious Classroom Dialogue
Network.formality Network Formality
DATA.SPLIT Fictitious Split Sentence Classroom Dialogue
Trim Remove Leading/Trailing White Space
Title Add Title to Select qdap Plots
cm_range2long Transform Codes to Start-End Durations
cm_time.temp Time Span Code Sheet
Animate.gantt Gantt Durations
Animate.gantt_plot Gantt Plot
Network.lexical_classification Network Lexical Classification
DATA2 Fictitious Repeated Measures Classroom Dialogue
Dissimilarity Dissimilarity Statistics
cm_2long A Generic to Long Function
clean Remove Escaped Characters
add_incomplete Detect Incomplete Sentences; Add | Endmark
add_s Make Plural (or Verb to Singular) Versions of Words
capitalizer Capitalize Select Words
%&% qdap Chaining
comma_spacer Ensure Space After Comma
blank2NA Replace Blanks in a dataframe
beg2char Grab Begin/End of String to Character
common Find Common Words Between Groups
Network.polarity Network Polarity
counts.flesch_kincaid Readability Measures
counts.formality Formality
counts.word_length Word Length Counts
check_text Check Text For Potential Problems
cm_code.transform Transform Codes
as.tdm tm Package Compatibility Tools: Apply to or Convert to/from Term Document Matrix or Document Term Matrix
chunker Break Text Into Ordered Word Chunks
cm_combine.dummy Find Co-occurrence Between Dummy Codes
bag_o_words Bag of Words
check_spelling_interactive.check_spelling Check Spelling
counts.word_position Word Position
check_spelling_interactive.factor Check Spelling
cm_code.blank Blank Code Transformation
gradient_cloud Gradient Word Cloud
cm_code.combine Combine Codes
cm_df.fill Range Coding
cm_df.temp Break Transcript Dialogue into Blank Code Matrix
cm_long2dummy Stretch and Dummy Code cm_xxx2long
cm_range.temp Range Code Sheet
colcomb2class Combine Columns to Class
colsplit2df Wrapper for colSplit that Returns Dataframe(s)
counts.coleman_liau Readability Measures
counts.end_mark_by Question Counts
cm_time2long Transform Codes to Start-End Times
cm_distance Distance Matrix Between Codes
counts.pos_by Parts of Speech
cm_dummy2long Convert cm_combine.dummy Back to Long
dir_map Map Transcript Files from a Directory to a Script
cm_df2long Transform Codes to Start-End Durations
cm_df.transcript Transcript With Word Number
counts Generic Counts Method
counts.SMOG Readability Measures
incomplete_replace Denote Incomplete End Marks With "|"
counts.pos Parts of Speech
common.list list Method for common
condense Condense Dataframe Columns
counts.fry Readability Measures
counts.linsear_write Readability Measures
duplicates Find Duplicated Words in a Text String
counts.automated_readability_index Readability Measures
colSplit Separate a Column Pasted by paste2
counts.character_table Term Counts
htruncdf Dataframe Viewing
counts.question_type Question Counts
counts.pronoun_type Question Counts
plot.gantt Plots a gantt object
counts.subject_pronoun_type Question Counts
paste2 Paste an Unspecified Number Of Text Columns
end_mark Sentence End Marks
counts.termco Term Counts
dispersion_plot Lexical Dispersion Plot
cumulative Cumulative Scores
counts.object_pronoun_type Question Counts
hamlet Hamlet (Complete & Split by Sentence)
mraja1 Romeo and Juliet: Act 1 Dialogue Merged with Demographics
vertex_apply Apply Parameter to List of Igraph Vertices/Edges
plot.lexical_classification_score Plots a lexical_classification_score Object
imperative Intuitively Remark Sentences as Imperative
plot.cumulative_animated_polarity Plots a cumulative_animated_polarity Object
name2sex Names to Gender
plot.end_mark_by_score Plots an end_mark_by_score Object
plot.pronoun_type Plots a pronoun_type Object
discourse_map Discourse Mapping
end_inc Test for Incomplete Sentences
gantt_wrap Gantt Plot
formality Formality Score
gantt_rep Generate Unit Spans for Repeated Measures
pos Parts of Speech Tagging
new_project Project Template
plot.cumulative_animated_lexical_classification Plots a cumulative_animated_lexical_classification Object
env.syl Syllable Lookup Environment
phrase_net Phrase Nets
sentiment_frame Power Score (Sentiment Analysis)
mraja1spl Romeo and Juliet: Act 1 Dialogue Merged with Demographics and Split
left_just Text Justification
pres_debate_raw2012 First 2012 U.S. Presidential Debate
gantt Gantt Durations
freq_terms Find Frequent Terms
plot.diversity Plots a diversity object
plot.discourse_map Plots a discourse_map Object
plot.end_mark_by_preprocessed Plots an end_mark_by_preprocessed Object
plot.flesch_kincaid Plots a flesch_kincaid Object
lexical_classification Lexical Classification Score
gantt_plot Gantt Plot
plot.end_mark Plots an end_mark Object
inspect_text Inspect Text Vectors
plot.linsear_write Plots a linsear_write Object
plot.Network Plots a Network Object
is.global Test If Environment is Global
plot.linsear_write_scores Plots a linsear_write_scores Object
outlier_detect Detect Outliers in Text
plot.combo_syllable_sum Plots a combo_syllable_sum Object
plot.lexical_classification_preprocessed Plots a lexical_classification_preprocessed Object
plot.kullback_leibler Plots a kullback_leibler object
plot.table_proportion Plots a table_proportion Object
preprocessed Generic Preprocessed Method
plot.wfm Plots a wfm object
plot.question_type Plots a question_type Object
mcsv_r Read/Write Multiple csv Files at a Time
plot.lexical_classification Plots a lexical_classification Object
print.animated_formality Prints an animated_formality Object
print.polarity_count Prints a polarity_count Object
plot.table_score Plots a table_score Object
counts.polarity Polarity
plot.cmspans Plots a cmspans object
plot.SMOG Plots a SMOG Object
plot.coleman_liau Plots a coleman_liau Object
counts.word_stats Word Stats
exclude Exclude Elements From a Vector
plot.word_cor Plots a word_cor object
object_pronoun_type Count Object Pronouns Per Grouping Variable
plot.cumulative_lexical_classification Plots a cumulative_lexical_classification Object
print.animated_lexical_classification Prints an animated_lexical_classification Object
rank_freq_mplot Rank Frequency Plot
preprocessed.word_position Word Position
print.diversity Prints a diversity object
plot.syllable_freq Plots a syllable_freq Object
multiscale Nested Standardization
multigsub Multiple gsub
plot.cumulative_formality Plots a cumulative_formality Object
print.cumulative_end_mark Prints a cumulative_end_mark Object
plot.automated_readability_index Plots an automated_readability_index Object
plot.animated_polarity Plots an animated_polarity Object
print.cumulative_combo_syllable_sum Prints a cumulative_combo_syllable_sum Object
plot.lexical Plots a lexical Object
print.termco Prints a termco object.
print.discourse_map Prints a discourse_map Object
preprocessed.pos_by Parts of Speech
print.pronoun_type Prints a pronoun_type object
plot.animated_discourse_map Plots an animated_discourse_map Object
outlier_labeler Locate Outliers in Numeric String
plot.animated_character Plots an animated_character Object
preprocessed.lexical_classification Lexical Classification
plot.character_table Plots a character_table Object
preprocessed.pronoun_type Question Counts
plot.end_mark_by_proportion Plots an end_mark_by_proportion Object
plot.cm_distance Plots a cm_distance object
plot.end_mark_by Plots an end_mark_by Object
preprocessed.subject_pronoun_type Question Counts
plot.end_mark_by_count Plots an end_mark_by_count Object
plot.cumulative_animated_formality Plots a cumulative_animated_formality Object
plot.readability_count Plots a readability_count Object
plot.object_pronoun_type Plots an object_pronoun_type Object
print.formality Prints a formality Object
dist_tab SPSS Style Frequency Tables
plot.formality Plots a formality Object
plot.question_type_preprocessed Plots a question_type_preprocessed Object
print.lexical_classification_by Prints a lexical_classification Object
print.boolean_qdap Prints a boolean_qdap object
pres_debates2012 2012 U.S. Presidential Debates
preprocessed.formality Formality
print.formality_scores Prints a formality_scores object
plot.pos_by Plots a pos_by Object
preprocessed.object_pronoun_type Question Counts
polarity Polarity Score (Sentiment Analysis)
plot.type_token_ratio Plots a type_token_ratio Object
print.coleman_liau Prints a coleman_liau Object
print.all_words Prints an all_words Object
plot.word_stats_counts Plots a word_stats_counts Object
print.adjacency_matrix Prints an adjacency_matrix Object
plot.termco Plots a termco object
print.end_mark Prints an end_mark object
print.lexical_classification_preprocessed Prints a lexical_classification_preprocessed Object
print.trunc Prints a trunc object
plot.word_proximity Plots a word_proximity object
plot.linsear_write_count Plots a linsear_write_count Object
print.wfm Prints a wfm Object
diversity Diversity Statistics
ngrams Generate ngrams
delete Easy File Handling
plot.polarity_score Plots a polarity_score Object
plot.pos Plots a pos Object
print.polarity_score Prints a polarity_score Object
plot.table_count Plots a table_count Object
plot.pos_preprocessed Plots a pos_preprocessed Object
print.subject_pronoun_type Prints a subject_pronoun_type object
print.character_table Prints a character_table object
potential_NA Search for Potential Missing Values
print.syllable_sum Prints a syllable_sum object
print.colsplit2df Prints a colsplit2df Object.
print.end_mark_by Prints an end_mark_by object
print.type_token_ratio Prints a type_token_ratio Object
qdap qdap: Quantitative Discourse Analysis Package
key_merge Merge Demographic Information with Person/Text Transcript
plot.animated_formality Plots an animated_formality Object
replace_contraction Replace Contractions
plot.wfdf Plots a wfdf object
print.linsear_write_count Prints a linsear_write_count Object
proportions Generic Proportions Method
print.qdap_context Prints a qdap_context object
print.linsear_write_scores Prints a linsear_write_scores Object
print.question_type Prints a question_type object
print.cumulative_lexical_classification Prints a cumulative_lexical_classification Object
plot.readability_score Plots a readability_score Object
plot.weighted_wfm Plots a weighted_wfm object
plot.rmgantt Plots a rmgantt object
print.Dissimilarity Prints a Dissimilarity object
print.qdapProj Prints a qdapProj Object
plot.sent_split Plots a sent_split Object
kullback_leibler Kullback Leibler Statistic
plot.cumulative_polarity Plots a cumulative_polarity Object
print.check_text Prints a check_text Object
plot.word_length Plots a word_length Object
print.word_cor Prints a word_cor object
print.word_associate Prints a word_associate object
scores.character_table Term Counts
qcv Quick Character Vector
preprocessed.question_type Question Counts
pronoun_type Count Object/Subject Pronouns Per Grouping Variable
plot.cumulative_combo_syllable_sum Plots a cumulative_combo_syllable_sum Object
plot.word_stats Plots a word_stats object
prop Convert Raw Numeric Matrix or Data Frame to Proportions
plot.animated_lexical_classification Plots an animated_lexical_classification Object
plot.cumulative_end_mark Plots a cumulative_end_mark Object
preprocessed.check_spelling_interactive Check Spelling
plot.word_position Plots a word_position object
print.word_stats Prints a word_stats object
plot.formality_scores Plots a formality_scores Object
print.animated_polarity Prints an animated_polarity Object
print.check_spelling Prints a check_spelling Object
preprocessed.pos Parts of Speech
print.cumulative_animated_formality Prints a cumulative_animated_formality Object
qdap_df Create qdap Specific Data Structure
print.pos_preprocessed Prints a pos_preprocessed object
print.automated_readability_index Prints an automated_readability_index Object
print.cm_distance Prints a cm_distance Object
print.cumulative_formality Prints a cumulative_formality Object
print.phrase_net Prints a phrase_net Object
plot.cumulative_syllable_freq Plots a cumulative_syllable_freq Object
print.table_count Prints a table_count object
proportions.word_position Word Position
replace_number Replace Numbers With Text Representation
plot.polarity Plots a polarity Object
plot.freq_terms Plots a freq_terms Object
scores.SMOG Readability Measures
print.word_length Prints a word_length object
print.Network Prints a Network Object
plot.sum_cmspans Plot Summary Stats for a Summary of a cmspans Object
plot.sums_gantt Plots a sums_gantt object
plot.subject_pronoun_type Plots a subject_pronoun_type Object
plot.polarity_count Plots a polarity_count Object
print.animated_character Prints an animated_character Object
raj.act.5 Romeo and Juliet: Act 5
print.kullback_leibler Prints a kullback_leibler Object.
qheat Quick Heatmap
print.cumulative_polarity Prints a cumulative_polarity Object
replacer Replace Cells in a Matrix or Data Frame
speakerSplit Break and Stretch if Multiple Persons per Cell
scores.automated_readability_index Readability Measures
trans_context Print Context Around Indices
rm_row Remove Rows That Contain Markers
scores.word_stats Word Stats
print.cumulative_animated_lexical_classification Prints a cumulative_animated_lexical_classification Object
raw.time.span Minimal Raw Time Span Data Set
trans_cloud Word Clouds by Grouping Variable
print.which_misspelled Prints a which_misspelled Object
print.fry Prints a fry Object
print.lexical_classification Prints a lexical_classification Object
print.pos_by Prints a pos_by Object.
print.word_position Prints a word_position object.
print.polarity Prints a polarity Object
scrubber Clean Imported Text
proportions.character_table Term Counts
raj.demographics Romeo and Juliet Demographics
spaste Add Leading/Trailing Spaces
preprocessed.end_mark_by Question Counts
raj.act.2 Romeo and Juliet: Act 2
rajPOS Romeo and Juliet Split in Parts of Speech
print.sub_holder Prints a sub_holder object
print.check_spelling_interactive Prints a check_spelling_interactive Object
proportions.object_pronoun_type Question Counts
visual.discourse_map Discourse Map
scores.coleman_liau Readability Measures
qcombine Combine Columns
print.SMOG Prints a SMOG Object
raj.act.3 Romeo and Juliet: Act 3
print.cumulative_animated_polarity Prints a cumulative_animated_polarity Object
print.polysyllable_sum Prints a polysyllable_sum object
qtheme Add themes to a Network object.
print.end_mark_by_preprocessed Prints an end_mark_by_preprocessed object
print.animated_discourse_map Prints an animated_discourse_map Object
print.combo_syllable_sum Prints a combo_syllable_sum object
print.lexical_classification_score Prints a lexical_classification_score Object
type_token_ratio Type-Token Ratio
word_network_plot Word Network Plot
print.flesch_kincaid Prints a flesch_kincaid Object
print.inspect_text Prints an inspect_text Object
raj Romeo and Juliet (Unchanged & Complete)
proportions.termco Term Counts
scores.object_pronoun_type Question Counts
print.word_proximity Prints a word_proximity object
print.readability_score Prints a readability_score Object
print.word_list Prints a word_list Object
weight Weight a qdap Object
word_position Word Position
print.wfm_summary Prints a wfm_summary Object
print.pos Prints a pos Object.
print.linsear_write Prints a linsear_write Object
print.cumulative_syllable_freq Prints a cumulative_syllable_freq Object
print.sum_cmspans Prints a sum_cmspans object
print.ngrams Prints an ngrams object
print.question_type_preprocessed Prints a question_type_preprocessed object
scores Generic Scores Method
scores.end_mark_by Question Counts
print.readability_count Prints a readability_count Object
print.object_pronoun_type Prints an object_pronoun_type object
summary.cmspans Summarize a cmspans object
scores.pronoun_type Question Counts
scores.question_type Question Counts
word_diff_list Differences In Word Use Between Groups
proportions.pos Parts of Speech
print.table_proportion Prints a table_proportion object
print.sent_split Prints a sent_split object
print.sums_gantt Prints a sums_gantt object
read.transcript Read Transcripts Into R
print.word_stats_counts Prints a word_stats_counts object
print.table_score Prints a table_score object
word_count Word Counts
summary.wfdf Summarize a wfdf object
trans_venn Venn Diagram by Grouping Variable
raj.act.1 Romeo and Juliet: Act 1
rm_stopwords Remove Stop Words
termco_c Combine Columns from a termco Object
proportions.end_mark_by Question Counts
termco Search For and Count Terms
proportions.subject_pronoun_type Question Counts
proportions.formality Formality
proportions.question_type Question Counts
qprep Quick Preparation of Text
proportions.word_length Word Length Counts
question_type Count of Question Type
syllable_sum Syllabication
unique_by Find Unique Words by Grouping Variable
word_associate Find Associated Words
scores.subject_pronoun_type Question Counts
scores.fry Readability Measures
raj.act.4 Romeo and Juliet: Act 4
scores.lexical_classification Lexical Classification
scores.termco Term Counts
rajSPLIT Romeo and Juliet (Complete & Split)
proportions.pos_by Parts of Speech
proportions.pronoun_type Question Counts
synonyms Search For Synonyms
replace_ordinal Replace Mixed Ordinal Numbers With Text Representation
space_fill Replace Spaces
random_sent Generate Random Dialogue Data
replace_abbreviation Replace Abbreviations
strip Strip Text
scores.word_position Word Position
sample.time.span Minimal Time Span Data Set
scores.linsear_write Readability Measures
raj.act.1POS Romeo and Juliet: Act 1 Parts of Speech by Person. A dataset containing a list from pos_by using the mraja1spl data set (see pos_by for more information).
scores.flesch_kincaid Readability Measures
replace_symbol Replace Symbols With Word Equivalents
scores.polarity Polarity
scores.formality Formality
scores.word_length Word Length Counts
sentSplit Sentence Splitting
word_cor Find Correlated Words
stemmer Stem Text
scores.pos_by Parts of Speech
word_length Count of Word Lengths Type
visual Generic visual Method
subject_pronoun_type Count Subject Pronouns Per Grouping Variable
tot_plot Visualize Word Length by Turn of Talk
strWrap Wrap Character Strings to Format Paragraphs
summary.wfm Summarize a wfm object
word_list Raw Word Lists/Frequency Counts
word_proximity Proximity Matrix Between Words
word_stats Descriptive Word Statistics
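
A few of the text-preparation functions listed above can be chained roughly as follows. This is a sketch that assumes qdap is installed; the sample string and the variable names (`raw`, `trimmed`, `expanded`, `tokens`, `split_df`) are made up for illustration.

```r
# Minimal sketch, assuming qdap (and its dependencies) is installed.
if (requireNamespace("qdap", quietly = TRUE)) {
  library(qdap)

  # Illustrative input with stray white space and contractions.
  raw <- "  I'm not sure we've got time. Let's move on!  "

  trimmed  <- Trim(raw)                    # remove leading/trailing white space
  expanded <- replace_contraction(trimmed) # replace contractions with long forms
  tokens   <- bag_o_words(expanded)        # reduce the text to a vector of words
  print(tokens)

  # Split the `state` text column of the bundled DATA set into one sentence
  # per row; the other columns are repeated for each new row.
  split_df <- sentSplit(DATA, "state")
  print(head(split_df))
}
```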