There's a newer version (2.4.6) of this package.

qdap (version 1.2.0)

Bridging the gap between qualitative data and quantitative analysis

Description

This package automates many of the tasks associated with quantitative discourse analysis of transcripts, including frequency counts of sentence types, words, sentences, turns of talk, and syllables, along with other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions let the user aggregate data by any number of grouping variables, providing analysis and seamless integration with other R packages that undertake higher-level analysis and visualization of text. This affords the user a more efficient and targeted analysis. qdap is designed for transcript analysis; however, many functions are applicable to other areas of text mining and natural language processing.

Install

install.packages('qdap')

Monthly Downloads

1,883

Version

1.2.0

License

GPL-2

Maintainer

Tyler Rinker

Last Published

March 6th, 2014

Functions in qdap (1.2.0)

blank2NA

Replace Blanks in a dataframe
Dissimilarity

Dissimilarity Statistics
cm_code.exclude

Exclude Codes
cm_range2long

Transform Codes to Start-End Durations
cm_combine.dummy

Find Co-occurrence Between Dummy Codes
clean

Remove Escaped Characters
bag_o_words

Bag of Words
cm_code.overlap

Find Co-occurrence Between Codes
cm_distance

Distance Matrix Between Codes
cm_code.transform

Transform Codes
paste2

Paste an Unspecified Number Of Text Columns
Trim

Remove Leading/Trailing White Space
beg2char

Grab Begin/End of String to Character
adjacency_matrix

Takes a Matrix and Generates an Adjacency Matrix
cm_long2dummy

Stretch and Dummy Code cm_xxx2long
dir_map

Map Transcript Files from a Directory to a Script
cm_code.combine

Combine Codes
counts.word_stats

Word Stats
cm_df.fill

Range Coding
cm_dummy2long

Convert cm_combine.dummy Back to Long
hms2sec

Convert h:m:s to Seconds
cm_2long

A Generic to Long Function
cm_time.temp

Time Span Code Sheet
colSplit

Separate a Column Pasted by paste2
plot.flesch_kincaid

Plots a flesch_kincaid Object
dist_tab

SPSS Style Frequency Tables
counts.character_table

Term Counts
cm_range.temp

Range Code Sheet
cm_time2long

Transform Codes to Start-End Times
formality

Formality Score
raj.act.5

Romeo and Juliet: Act 5
dispersion_plot

Lexical Dispersion Plot
gantt_plot

Gantt Plot
id

ID By Row Number or Sequence Along
plot.coleman_liau

Plots a coleman_liau Object
duplicates

Find Duplicated Words in a Text String
cm_df.transcript

Transcript With Word Number
hash

Hash/Dictionary Lookup
colsplit2df

Wrapper for colSplit that Returns Dataframe(s)
counts.fry

Readability Measures
print.automated_readability_index

Prints an automated_readability_index Object
automated_readability_index

Readability Measures
gradient_cloud

Gradient Word Cloud
counts.flesch_kincaid

Readability Measures
colcomb2class

Combine Columns to Class
DATA2

Fictitious Repeated Measures Classroom Dialogue
diversity

Diversity Statistics
print.formality

Prints a formality Object
print.formality_scores

Prints a formality_scores object
plot.pos

Plots a pos Object
counts.question_type

Question Counts
plot.diversity

Plots a diversity object
plot.SMOG

Plots a SMOG Object
cm_df.temp

Break Transcript Dialogue into Blank Code Matrix
gantt

Generates start and end times of supplied text selections (i.e., text selections are determined by any number of grouping variables).
plot.pos_by

Plots a pos_by Object
counts.termco

Term Counts
mraja1spl

Romeo and Juliet: Act 1 Dialogue Merged with Demographics and Split
counts.pos

Parts of Speech
raw.time.span

Minimal Raw Time Span Data Set
Search

Search Columns of a Data Frame
counts.linsear_write

Readability Measures
common

Find Common Words Between Groups
replace_contraction

Replace Contractions
plot.automated_readability_index

Plots an automated_readability_index Object
htruncdf

Dataframe Viewing
plot.kullback_leibler

Plots a kullback_leibler object
counts.formality

Formality
plot.table_score

Plots a table_score Object
print.fry

Prints a fry Object
print.polarity

Prints a polarity Object
ngrams

Generate ngrams
mraja1

Romeo and Juliet: Act 1 Dialogue Merged with Demographics
plot.polarity_count

Plots a polarity_count Object
counts

Generic Counts Method
plot.sent_split

Plots a sent_split Object
plot.linsear_write_count

Plots a linsear_write_count Object
polarity

Polarity Score (Sentiment Analysis)
common.list

list Method for common
counts.SMOG

Readability Measures
tot_plot

Visualize Word Length by Turn of Talk
plot.formality_scores

Plots a formality_scores Object
wfm

Word Frequency Matrix
cm_code.blank

Blank Code Transformation
DATA.SPLIT

Fictitious Split Sentence Classroom Dialogue
counts.polarity

Polarity
left_just

Text Justification
preprocessed.question_type

Question Counts
raj.act.1

Romeo and Juliet: Act 1
qprep

Quick Preparation of Text
sec2hms

Convert Seconds to h:m:s
counts.automated_readability_index

Readability Measures
pres_debate_raw2012

First 2012 U.S. Presidential Debate
print.word_stats_counts

Prints a word_stats_counts object
end_inc

Test for Incomplete Sentences
print.adjacency_matrix

Prints an adjacency_matrix Object
end_mark

Sentence End marks
plot.sum_cmspans

Plot Summary Stats for a Summary of a cmspans Object
plot.word_stats

Plots a word_stats object
print.polarity_count

Prints a polarity_count Object
outlier_detect

Detect Outliers in Text
print.colsplit2df

Prints a colsplit2df Object.
freq_terms

Find Frequent Terms
lookup

Hash Table/Dictionary Lookup
print.readability_score

Prints a readability_score Object
cm_df2long

Transform Codes to Start-End Durations
outlier_labeler

Locate Outliers in Numeric String
print.flesch_kincaid

Prints a flesch_kincaid Object
print.word_associate

Prints a word_associate object
kullback_leibler

Kullback Leibler Statistic
name2sex

Names to Gender Prediction
print.linsear_write_scores

Prints a linsear_write_scores Object
plot.readability_count

Plots a readability_count Object
print.trunc

Prints a trunc object
counts.coleman_liau

Readability Measures
pos

Parts of Speech Tagging
print.table_count

Prints a table_count object
scores

Generic Scores Method
plot.linsear_write_scores

Plots a linsear_write_scores Object
t.TermDocumentMatrix

Transposes a TermDocumentMatrix object
plot.cmspans

Plots a cmspans object
scores.fry

Readability Measures
print.readability_count

Prints a readability_count Object
print.pos_preprocessed

Prints a pos_preprocessed object
plot.linsear_write

Plots a linsear_write Object
print.coleman_liau

Prints a coleman_liau Object
rm_stopwords

Remove Stop Words
qcv

Quick Character Vector
plot.formality

Plots a formality Object
synonyms

Search For Synonyms
incomplete_replace

Denote Incomplete End Marks With "|"
scrubber

Clean Imported Text
multigsub

Multiple gsub
t.DocumentTermMatrix

Transposes a DocumentTermMatrix object
plot.sums_gantt

Plots a sums_gantt object
raj

Romeo and Juliet (Unchanged & Complete)
plot.question_type_preprocessed

Plots a question_type_preprocessed Object
plot.weighted_wfm

Plots a weighted_wfm object
multiscale

Nested Standardization
proportions.formality

Formality
rajPOS

Romeo and Juliet Split in Parts of Speech
print.question_type_preprocessed

Prints a question_type_preprocessed object
proportions.pos_by

Parts of Speech
preprocessed.pos

Parts of Speech
plot.gantt

Plots a gantt object
print.character_table

Prints a character_table object
print.sent_split

Prints a sent_split object
sentSplit

Sentence Splitting
replacer

Replace Cells in a Matrix or Data Frame
speakerSplit

Break and Stretch if Multiple Persons per Cell
plot.pos_preprocessed

Plots a pos_preprocessed Object
plot.wfdf

Plots a wfdf object
text2color

Map Words to Colors
key_merge

Merge Demographic Information with Person/Text Transcript
proportions.pos

Parts of Speech
print.sums_gantt

Prints a sums_gantt object
plot.polarity_score

Plots a polarity_score Object
plot.word_proximity

Plots a word_proximity object
proportions

Generic Proportions Method
print.Dissimilarity

Prints a Dissimilarity object
plot.question_type

Plots a question_type Object
spaste

Add Leading/Trailing Spaces
list2df

List/Matrix/Vector to Dataframe
summary.cmspans

Summarize a cmspans object
preprocessed.formality

Formality
print.sum_cmspans

Prints a sum_cmspans object
word_associate

Find Associated Words
plot.word_stats_counts

Plots a word_stats_counts Object
preprocessed

Generic Preprocessed Method
plot.word_cor

Plots a word_cor object
prop

Convert Raw Numeric Matrix or Data Frame to Proportions
pres_debates2012

2012 U.S. Presidential Debates
print.question_type

Prints a question_type object
replace_symbol

Replace Symbols With Word Equivalents
print.word_stats

Prints a word_stats object
print.boolean_qdap

Prints a boolean_qdap object
exclude

Exclude Elements From a Vector
gantt_rep

Generate Unit Spans for Repeated Measures
qheat

Quick Heatmap
plot.readability_score

Plots a readability_score Object
strWrap

Wrap Character Strings to Format Paragraphs
summary.wfdf

Summarize a wfdf object
new_project

Project Template
qdap

qdap: Quantitative Discourse Analysis Package
trans_cloud

Word Clouds by Grouping Variable
proportions.character_table

Term Counts
scores.character_table

Term Counts
plot.table_count

Plots a table_count Object
word_list

Raw Word Lists/Frequency Counts
proportions.termco

Term Counts
print.cm_distance

Prints a cm_distance Object
condense

Condense Dataframe Columns
trans_venn

Venn Diagram by Grouping Variable
print.table_proportion

Prints a table_proportion object
v_outer

Vectorized Version of outer
scores.question_type

Question Counts
NAer

Replace Missing Values (NA)
word_cor

Find Correlated Words
rm_url

Remove/Replace URLs
word_stats

Descriptive Word Statistics
scores.SMOG

Readability Measures
print.qdap_context

Prints a qdap_context object
scores.automated_readability_index

Readability Measures
strip

Strip Text
summary.wfm

Summarize a wfm object
tdm

tm Package Compatibility Tools: Apply to or Convert to/from Term Document Matrix or Document Term Matrix
print.word_cor

Prints a word_cor object
print.polarity_score

Prints a polarity_score Object
print.word_list

Prints a word_list Object
scores.termco

Term Counts
print.v_outer

Prints a v_outer Object.
print.linsear_write_count

Prints a linsear_write_count Object
scores.formality

Formality
word_proximity

Proximity Matrix Between Words
print.wfm

Prints a wfm Object
raj.act.3

Romeo and Juliet: Act 3
qcombine

Combine Columns
sample.time.span

Minimal Time Span Data Set
replace_number

Replace Numbers With Text Representation
scores.polarity

Polarity
rm_row

Remove Rows That Contain Markers
url_dl

Download Instructional Documents
proportions.question_type

Question Counts
scores.coleman_liau

Readability Measures
rajSPLIT

Romeo and Juliet (Complete & Split)
word_diff_list

Differences In Word Use Between Groups
scores.pos_by

Parts of Speech
scores.flesch_kincaid

Readability Measures
print.linsear_write

Prints a linsear_write Object
raj.act.4

Romeo and Juliet: Act 4
syllable_sum

Syllabication
space_fill

Replace Spaces
print.word_proximity

Prints a word_proximity object
all_words

Searches Text Column for Words
rank_freq_mplot

Rank Frequency Plot
print.diversity

Prints a diversity object
read.transcript

Read Transcripts Into R
imperative

Intuitively Remark Sentences as Imperative
word_count

Word Counts
replace_abbreviation

Replace Abbreviations
plot.wfm

Plots a wfm object
plot.freq_terms

Plots a freq_terms Object
print.ngrams

Prints an ngrams object
word_network_plot

Word Network Plot
termco

Search For and Count Terms
print.pos

Prints a pos Object.
question_type

Count of Question Type
preprocessed.pos_by

Parts of Speech
print.termco

Prints a termco object.
bracketX

Bracket Parsing
capitalizer

Capitalize Select Words
plot.rmgantt

Plots a rmgantt object
potential_NA

Search for Potential Missing Values
print.kullback_leibler

Prints a kullback_leibler Object.
raj.act.2

Romeo and Juliet: Act 2
print.pos_by

Prints a pos_by Object.
raj.demographics

Romeo and Juliet Demographics
scores.word_stats

Word Stats
scores.linsear_write

Readability Measures
stemmer

Stem Text
DATA

Fictitious Classroom Dialogue
counts.pos_by

Parts of Speech
mcsv_r

Read/Write Multiple csv Files at a Time
mtabulate

Tabulate Frequency Counts for Multiple Vectors
plot.character_table

Plots a character_table Object
plot.polarity

Plots a polarity Object
plot.termco

Plots a termco object
print.SMOG

Prints a SMOG Object
print.all_words

Prints an all_words Object
print.qdapProj

Prints a qdapProj Object
print.table_score

Prints a table_score object
termco_c

Combine Columns from a termco Object
trans_context

Print Context Around Indices
plot.table_proportion

Plots a table_proportion Object
gantt_wrap

Gantt Plot