qdap v2.4.1


Monthly downloads



Bridging the Gap Between Qualitative Data and Quantitative Analysis

Automates many of the tasks associated with quantitative discourse analysis of transcripts containing discourse including frequency counts of sentence types, words, sentences, turns of talk, syllables and other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions enable the user to aggregate data by any number of grouping variables, providing analysis and seamless integration with other R packages that undertake higher level analysis and visualization of text. This affords the user a more efficient and targeted analysis. 'qdap' is designed for transcript analysis, however, many functions are applicable to other areas of Text Mining/ Natural Language Processing.



Project Status: Inactive – The project has reached a stable, usable state but is no longer being actively developed; support/maintenance will be provided as time allows. Build Status DOI

qdap (Quantitative Discourse Analysis Package) is an R package designed to assist in quantitative discourse analysis. The package stands as a bridge between qualitative transcripts of dialogue and statistical analysis & visualization.


To download the development version of qdap:

Download the zip ball or tar ball, decompress and run R CMD INSTALL on it, or use the pacman package to install the development version (The user may want to install the dev version of reports first):

if (!require("pacman")) install.packages("pacman")



You are welcome to:

Note: If you are reporting a bug make sure you have first read the Cleaning Text & Debugging vignette

Functions in qdap

Name Description
Animate Generic Animate Method
Animate.polarity Animate Polarity
automated_readability_index Readability Measures
Trim Remove Leading/Trailing White Space
Search Search Columns of a Data Frame
Title Add Title to Select qdap Plots
beg2char Grab Begin/End of String to Character
Animate.character Animate Character
blank2NA Replace Blanks in a dataframe
cm_2long A Generic to Long Function
Network.formality Network Formality
as.tdm tm Package Compatibility Tools: Apply to or Convert to/from Term Document Matrix or Document Term Matrix
Network Generic Network Method
Animate.gantt Gantt Durations
Animate.discourse_map Discourse Map
Animate.formality Animate Formality
Network.polarity Network Polarity
Network.lexical_classification Network Lexical Classification
clean Remove Escaped Characters
cm_df2long Transform Codes to Start-End Durations
common.list list Method for common
counts.formality Formality
counts.flesch_kincaid Readability Measures
condense Condense Dataframe Columns
cm_df.transcript Transcript With Word Number
add_incomplete Detect Incomplete Sentences; Add | Endmark
counts.word_length Word Length Counts
check_spelling_interactive.check_spelling Check Spelling
add_s Make Plural (or Verb to Singular) Versions of Words
check_spelling Check Spelling
Animate.gantt_plot Gantt Plot
check_spelling_interactive.character Check Spelling
cm_combine.dummy Find Co-occurrence Between Dummy Codes
counts.word_position Word Position
bag_o_words Bag of Words
cm_range2long Transform Codes to Start-End Durations
dist_tab SPSS Style Frequency Tables
+.Network Add themes to a Network object.
cm_time.temp Time Span Code Sheet
capitalizer Capitalize Select Words
check_spelling_interactive.factor Check Spelling
DATA2 Fictitious Repeated Measures Classroom Dialogue
exclude Exclude Elements From a Vector
cm_code.transform Transform Codes
cm_code.combine Combine Codes
delete Easy File Handling
diversity Diversity Statistics
common Find Common Words Between Groups
counts.object_pronoun_type Question Counts
cm_code.blank Blank Code Transformation
cm_time2long Transform Codes to Start-End Times
counts.polarity Polarity
counts.fry Readability Measures
cm_df.temp Break Transcript Dialogue into Blank Code Matrix
wfm Word Frequency Matrix
cm_df.fill Range Coding
Dissimilarity Dissimilarity Statistics
%&% qdap Chaining
comma_spacer Ensure Space After Comma
new_project Project Template
counts.subject_pronoun_type Question Counts
counts.termco Term Counts
name2sex Names to Gender
counts.linsear_write Readability Measures
cumulative Cumulative Scores
duplicates Find Duplicated Words in a Text String
inspect_text Inspect Text Vectors
counts.word_stats Word Stats
plot.automated_readability_index Plots a automated_readability_index Object
plot.coleman_liau Plots a coleman_liau Object
plot.animated_polarity Plots an animated_polarity Object
plot.cmspans Plots a cmspans object
colSplit Separate a Column Pasted by paste2
check_text Check Text For Potential Problems
lexical_classification Lexical Classification Score
end_inc Test for Incomplete Sentences
hamlet Hamlet (Complete & Split by Sentence)
incomplete_replace Denote Incomplete End Marks With "|"
colcomb2class Combine Columns to Class
cm_long2dummy Stretch and Dummy Code cm_xxx2long
discourse_map Discourse Mapping
plot.end_mark_by_preprocessed Plots a end_mark_by_preprocessed Object
gradient_cloud Gradient Word Cloud
plot.end_mark_by_count Plots a end_mark_by_count Object
counts.coleman_liau Readability Measures
counts.end_mark_by Question Counts
cm_range.temp Range Code Sheet
chunker Break Text Into Ordered Word Chunks
counts.pos_by Parts of Speech
counts.pos Parts of Speech
mraja1 Romeo and Juliet: Act 1 Dialogue Merged with Demographics
colsplit2df Wrapper for colSplit that Returns Dataframe(s)
outlier_detect Detect Outliers in Text
outlier_labeler Locate Outliers in Numeric String
end_mark Sentence End Marks
plot.lexical Plots a lexical Object
plot.lexical_classification Plots a lexical_classification Object
plot.animated_character Plots an animated_character Object
env.syl Syllable Lookup Environment
gantt Gantt Durations
plot.lexical_classification_preprocessed Plots a lexical_classification_preprocessed Object
plot.animated_discourse_map Plots an animated_discourse_map Object
plot.cumulative_animated_formality Plots a cumulative_animated_formality Object
plot.combo_syllable_sum Plots a combo_syllable_sum Object
dispersion_plot Lexical Dispersion Plot
formality Formality Score
plot.lexical_classification_score Plots a lexical_classification_score Object
plot.pos_by Plots a pos_by Object
plot.table_proportion Plots a table_proportion Object
plot.pos_preprocessed Plots a pos_preprocessed Object
plot.cumulative_formality Plots a cumulative_formality Object
gantt_plot Gantt Plot
plot.word_proximity Plots a word_proximity object
plot.table_score Plots a table_score Object
freq_terms Find Frequent Terms
plot.word_stats Plots a word_stats object
plot.gantt Plots a gantt object
plot.cumulative_lexical_classification Plots a cumulative_lexical_classification Object
vertex_apply Apply Parameter to List of Igraph Vertices/Edges
plot.kullback_leibler Plots a kullback_leibler object
is.global Test If Environment is Global
plot.readability_score Plots a readability_score Object
plot.object_pronoun_type Plots an object_pronoun_type Object
plot.weighted_wfm Plots a weighted_wfm object
plot.rmgantt Plots a rmgantt object
plot.linsear_write_scores Plots a linsear_write_scores Object
plot.wfdf Plots a wfdf object
left_just Text Justification
preprocessed.word_position Word Position
preprocessed.check_spelling_interactive Check Spelling
pres_debate_raw2012 First 2012 U.S. Presidential Debate
preprocessed.end_mark_by Question Counts
print.animated_formality Prints a animated_formality Object
imperative Intuitively Remark Sentences as Imperative
plot.SMOG Plots a SMOG Object
plot.Network Plots a Network Object
ngrams Generate ngrams
plot.cm_distance Plots a cm_distance object
plot.character_table Plots a character_table Object
plot.word_length Plots a word_length Object
multigsub Multiple gsub
plot.end_mark_by Plots a end_mark_by Object
object_pronoun_type Count Object Pronouns Per Grouping Variable
plot.end_mark Plots an end_mark Object
plot.formality_scores Plots a formality_scores Object
multiscale Nested Standardization
plot.freq_terms Plots a freq_terms Object
print.animated_lexical_classification Prints an animated_lexical_classification Object
plot.animated_lexical_classification Plots an animated_lexical_classification Object
plot.animated_formality Plots a animated_formality Object
plot.cumulative_combo_syllable_sum Plots a cumulative_combo_syllable_sum Object
plot.cumulative_end_mark Plots a cumulative_end_mark Object
plot.cumulative_syllable_freq Plots a cumulative_syllable_freq Object
plot.end_mark_by_proportion Plots a end_mark_by_proportion Object
plot.polarity_score Plots a polarity_score Object
plot.cumulative_polarity Plots a cumulative_polarity Object
plot.end_mark_by_score Plots a end_mark_by_score Object
plot.pos Plots a pos Object
plot.polarity Plots a polarity Object
plot.pronoun_type Plots an pronoun_type Object
plot.word_position Plots a word_position object
print.cm_distance Prints a cm_distance Object
print.check_text Prints a check_text Object
print.discourse_map Prints a discourse_map Object
sentiment_frame Power Score (Sentiment Analysis)
print.diversity Prints a diversity object
print.kullback_leibler Prints a kullback_leibler Object.
plot.polarity_count Plots a polarity_count Object
print.lexical_classification Prints an lexical_classification Object
plot.question_type_preprocessed Plots a question_type_preprocessed Object
plot.readability_count Plots a readability_count Object
plot.sums_gantt Plots a sums_gantt object
plot.sum_cmspans Plot Summary Stats for a Summary of a cmspans Object
plot.question_type Plots a question_type Object
print.phrase_net Prints a phrase_net Object
preprocessed Generic Preprocessed Method
pres_debates2012 2012 U.S. Presidential Debates
print.polarity Prints an polarity Object
print.question_type_preprocessed Prints a question_type_preprocessed object
print.readability_count Prints a readability_count Object
preprocessed.object_pronoun_type Question Counts
potential_NA Search for Potential Missing Values
preprocessed.pos Parts of Speech
pos Parts of Speech Tagging
print.Network Prints a Network Object
plot.word_stats_counts Plots a word_stats_counts Object
plot.syllable_freq Plots a syllable_freq Object
polarity Polarity Score (Sentiment Analysis)
print.SMOG Prints an SMOG Object
plot.table_count Plots a table_count Object
print.boolean_qdap Prints a boolean_qdap object
print.syllable_sum Prints an syllable_sum object
DATA Fictitious Classroom Dialogue
Filter.all_words Filter
print.animated_polarity Prints an animated_polarity Object
print.automated_readability_index Prints an automated_readability_index Object
print.Dissimilarity Prints a Dissimilarity object
print.character_table Prints a character_table object
adjacency_matrix Takes a Matrix and Generates an Adjacency Matrix
NAer Replace Missing Values (NA)
print.combo_syllable_sum Prints an combo_syllable_sum object
print.cumulative_animated_polarity Prints a cumulative_animated_polarity Object
all_words Searches Text Column for Words
DATA.SPLIT Fictitious Split Sentence Classroom Dialogue
print.cumulative_animated_lexical_classification Prints a cumulative_animated_lexical_classification Object
bracketX Bracket Parsing
print.cumulative_formality Prints a cumulative_formality Object
preprocessed.pos_by Parts of Speech
print.cumulative_animated_formality Prints a cumulative_animated_formality Object
print.cumulative_lexical_classification Prints a cumulative_lexical_classification Object
build_qdap_vignette Replace Temporary Introduction to qdap Vignette
cm_code.exclude Exclude Codes
print.linsear_write Prints an linsear_write Object
print.ngrams Prints an ngrams object
print.end_mark Prints an end_mark object
print.question_type Prints a question_type object
print.lexical_classification_score Prints a lexical_classification_score Object
print.end_mark_by Prints an end_mark_by object
print.object_pronoun_type Prints a object_pronoun_type object
print.qdap_context Prints a qdap_context object
raj.act.1 Romeo and Juliet: Act 1
print.table_count Prints a table_count object
print.formality Prints a formality Object
print.word_position Prints a word_position object.
print.word_proximity Prints a word_proximity object
tot_plot Visualize Word Length by Turn of Talk
qcombine Combine Columns
raj.act.1POS Romeo and Juliet: Act 1 Parts of Speech by Person A dataset containing a list from pos_by using the mraja1spl data set (see pos_by for more information).
proportions Generic Proportions Method
print.sub_holder Prints a sub_holder object
print.type_token_ratio Prints a type_token_ratio Object
print.subject_pronoun_type Prints a subject_pronoun_type object
print.formality_scores Prints a formality_scores object
print.wfm Prints a wfm Object
preprocessed.pronoun_type Question Counts
proportions.object_pronoun_type Question Counts
proportions.character_table Term Counts
syllable_sum Syllabication
cm_code.overlap Find Co-occurrence Between Codes
proportions.question_type Question Counts
proportions.subject_pronoun_type Question Counts
preprocessed.question_type Question Counts
print.linsear_write_count Prints a linsear_write_count Object
print.linsear_write_scores Prints a linsear_write_scores Object
scores.linsear_write Readability Measures
raj Romeo and Juliet (Unchanged & Complete)
cm_distance Distance Matrix Between Codes
cm_dummy2long Convert cm_combine.dummy Back to Long
preprocessed.subject_pronoun_type Question Counts
proportions.pos Parts of Speech
print.check_spelling Prints a check_spelling Object
proportions.word_position Word Position
print.coleman_liau Prints an coleman_liau Object
print.check_spelling_interactive Prints a check_spelling_interactive Object
counts Generic Counts Method
counts.SMOG Readability Measures
print.pronoun_type Prints a pronoun_type object
termco_c Combine Columns from a termco Object
rajPOS Romeo and Juliet Split in Parts of Speech
scores.pronoun_type Question Counts
scores.fry Readability Measures
rm_row Remove Rows That Contain Markers
print.colsplit2df Prints a colsplit2df Object.
scores.lexical_classification Lexical Classification
raj.demographics Romeo and Juliet Demographics
counts.automated_readability_index Readability Measures
question_type Count of Question Type
scores.object_pronoun_type Question Counts
print.cumulative_polarity Prints a cumulative_polarity Object
print.qdapProj Prints a qdapProj Object
scores.question_type Question Counts
print.cumulative_syllable_freq Prints a cumulative_syllable_freqObject
replacer Replace Cells in a Matrix or Data Frame
print.table_proportion Prints a table_proportion object
scores.subject_pronoun_type Question Counts
print.word_list Prints a word_list Object
print.word_length Prints a word_length object
print.table_score Prints a table_score object
counts.question_type Question Counts
counts.character_table Term Counts
random_sent Generate Random Dialogue Data
counts.pronoun_type Question Counts
proportions.termco Term Counts
read.transcript Read Transcripts Into R
print.word_stats Prints a word_stats object
print.inspect_text Prints an inspect_text Object
print.polarity_score Prints a polarity_score Object
print.fry Prints an fry Object
print.pos_preprocessed Prints a pos_preprocessed object
print.polarity_count Prints a polarity_count Object
print.pos_by Prints a pos_by Object.
replace_abbreviation Replace Abbreviations
print.word_stats_counts Prints a word_stats_counts object
rm_stopwords Remove Stop Words
print.trunc Prints a trunc object
print.which_misspelled Prints a which_misspelled Object
scores Generic Scores Method
print.wfm_summary Prints a wfm_summary Object
print.termco Prints a termco object.
summary.cmspans Summarize a cmspans object
dir_map Map Transcript Files from a Directory to a Script
htruncdf Dataframe Viewing
sample.time.span Minimal Time Span Data Set
summary.wfdf Summarize a wfdf object
scores.termco Term Counts
word_proximity Proximity Matrix Between Words
word_count Word Counts
sentSplit Sentence Splitting
visual.discourse_map Discourse Map
summary.wfm Summarize a wfm object
raj.act.3 Romeo and Juliet: Act 3
proportions.word_length Word Length Counts
space_fill Replace Spaces
proportions.end_mark_by Question Counts
proportions.formality Formality
word_diff_list Differences In Word Use Between Groups
key_merge Merge Demographic Information with Person/Text Transcript
proportions.pos_by Parts of Speech
mraja1spl Romeo and Juliet: Act 1 Dialogue Merged with Demographics and Split
replace_contraction Replace Contractions
raj.act.2 Romeo and Juliet: Act 2
weight Weight a qdap Object
scores.coleman_liau Readability Measures
replace_number Replace Numbers With Text Representation
kullback_leibler Kullback Leibler Statistic
proportions.pronoun_type Question Counts
mcsv_r Read/Write Multiple csv Files at a Time
gantt_wrap Gantt Plot
unique_by Find Unique Words by Grouping Variable
rajSPLIT Romeo and Juliet (Complete & Split)
synonyms Search For Synonyms
scores.end_mark_by Question Counts
scores.polarity Polarity
qdap_df Create qdap Specific Data Structure
plot.cumulative_animated_lexical_classification Plots a cumulative_animated_lexical_classification Object
plot.cumulative_animated_polarity Plots a cumulative_animated_polarity Object
phrase_net Phrase Nets
scores.pos_by Parts of Speech
scores.word_length Word Length Counts
plot.discourse_map Plots a discourse_map Object
word_network_plot Word Network Plot
plot.diversity Plots a diversity object
scores.automated_readability_index Readability Measures
stemmer Stem Text
paste2 Paste an Unspecified Number Of Text Columns
plot.flesch_kincaid Plots a flesch_kincaid Object
plot.formality Plots a formality Object
scores.SMOG Readability Measures
gantt_rep Generate Unit Spans for Repeated Measures
scores.character_table Term Counts
scores.word_position Word Position
strip Strip Text
qheat Quick Heatmap
visual Generic visual Method
word_stats Descriptive Word Statistics
plot.linsear_write Plots a linsear_write Object
trans_cloud Word Clouds by Grouping Variable
word_associate Find Associated Words
word_cor Find Correlated Words
plot.sent_split Plots a sent_split Object
plot.linsear_write_count Plots a linsear_write_count Object
subject_pronoun_type Count Subject Pronouns Per Grouping Variable
plot.subject_pronoun_type Plots an subject_pronoun_type Object
plot.termco Plots a termco object
plot.type_token_ratio Plots a type_token_ratio Object
strWrap Wrap Character Strings to Format Paragraphs
plot.wfm Plots a wfm object
plot.word_cor Plots a word_cor object
raj.act.5 Romeo and Juliet: Act 5
trans_context Print Context Around Indices
preprocessed.lexical_classification Lexical Classification
preprocessed.formality Formality
print.adjacency_matrix Prints an adjacency_matrix Object
print.all_words Prints an all_words Object
print.animated_character Prints an animated_character Object
termco Search For and Count Terms
print.animated_discourse_map Prints an animated_discourse_map Object
print.cumulative_end_mark Prints a cumulative_end_mark Object
print.cumulative_combo_syllable_sum Prints a cumulative_combo_syllable_sum Object
print.end_mark_by_preprocessed Prints a end_mark_by_preprocessed object
print.lexical_classification_by Prints a lexical_classification Object
print.flesch_kincaid Prints an flesch_kincaid Object
print.lexical_classification_preprocessed Prints a lexical_classification_preprocessed Object
raj.act.4 Romeo and Juliet: Act 4
print.polysyllable_sum Prints an polysyllable_sum object
word_position Word Position
print.pos Prints a pos Object.
print.readability_score Prints a readability_score Object
print.sums_gantt Prints a sums_gantt object
print.sum_cmspans Prints a sum_cmspans object
print.sent_split Prints a sent_split object
print.word_associate Prints a word_associate object
print.word_cor Prints a word_cor object
pronoun_type Count Object/Subject Pronouns Per Grouping Variable
qcv Quick Character Vector
prop Convert Raw Numeric Matrix or Data Frame to Proportions
qprep Quick Preparation of Text
scores.word_stats Word Stats
qtheme Add themes to a Network object.
rank_freq_mplot Rank Frequency Plot
qdap qdap: Quantitative Discourse Analysis Package
raw.time.span Minimal Raw Time Span Data Set
scores.flesch_kincaid Readability Measures
type_token_ratio Type-Token Ratio
scrubber Clean Imported Text
scores.formality Formality
replace_symbol Replace Symbols With Word Equivalents
trans_venn Venn Diagram by Grouping Variable
speakerSplit Break and Stretch if Multiple Persons per Cell
replace_ordinal Replace Mixed Ordinal Numbers With Text Representation
spaste Add Leading/Trailing Spaces
word_list Raw Word Lists/Frequency Counts
word_length Count of Word Lengths Type
Animate.lexical_classification Animate Formality
No Results!

Last month downloads


Include our badge in your README