Learn R Programming

⚠️There's a newer version (2.4.6.1) of this package.Take me there.

qdap (version 1.3.2)

Bridging the gap between qualitative data and quantitative analysis

Description

This package automates many of the tasks associated with quantitative discourse analysis of transcripts containing discourse including frequency counts of sentence types, words, sentences, turns of talk, syllables and other assorted analysis tasks. The package provides parsing tools for preparing transcript data. Many functions enable the user to aggregate data by any number of grouping variables providing analysis and seamless integration with other R packages that undertake higher level analysis and visualization of text. This affords the user a more efficient and targeted analysis. qdap is designed for transcript analysis, however, many functions are applicable to other areas of Text Mining/Natural Language Processing.

Copy Link

Version

Install

install.packages('qdap')

Monthly Downloads

3,763

Version

1.3.2

License

GPL-2

Maintainer

Tyler Rinker

Last Published

March 17th, 2014

Functions in qdap (1.3.2)

DATA

Fictitious Classroom Dialogue
cm_code.transform

Transform Codes
cm_2long

A Generic to Long Function
DATA.SPLIT

Fictitious Split Sentence Classroom Dialogue
colcomb2class

Combine Columns to Class
cm_code.exclude

Exclude Codes
cm_df2long

Transform Codes to Start-End Durations
mtabulate

Tabulate Frequency Counts for Multiple Vectors
imperative

Intuitively Remark Sentences as Imperative
common.list

list Method for common
NAer

Replace Missing Values (NA)
beg2char

Grab Begin/End of String to Character
plot.linsear_write_scores

Plots a linsear_write_scores Object
cm_df.transcript

Transcript With Word Number
wfm

Word Frequency Matrix
counts.linsear_write

Readability Measures
print.flesch_kincaid

Prints an flesch_kincaid Object
counts.character_table

Term Counts
colSplit

Separate a Column Pasted by paste2
counts.flesch_kincaid

Readability Measures
clean

Remove Escaped Characters
end_inc

Test for Incomplete Sentences
outlier_labeler

Locate Outliers in Numeric String
automated_readability_index

Readability Measures
htruncdf

Dataframe Viewing
all_words

Searches Text Column for Words
counts.termco

Term Counts
adjacency_matrix

Takes a Matrix and Generates an Adjacency Matrix
id

ID By Row Number or Sequence Along
Trim

Remove Leading/Trailing White Space
cm_distance

Distance Matrix Between Codes
counts.pos

Parts of Speech
cm_time2long

Transform Codes to Start-End Times
kullback_leibler

Kullback Leibler Statistic
Filter.TermDocumentMatrix

Filter
plot.diversity

Plots a diversity object
plot.automated_readability_index

Plots a automated_readability_index Object
plot.pos

Plots a pos Object
cm_df.fill

Range Coding
plot.question_type_preprocessed

Plots a question_type_preprocessed Object
gradient_cloud

Gradient Word Cloud
mcsv_r

Read/Write Multiple csv Files at a Time
mraja1spl

Romeo and Juliet: Act 1 Dialogue Merged with Demographics and Split
counts.fry

Readability Measures
left_just

Text Justification
freq_terms

Find Frequent Terms
condense

Condense Dataframe Columns
blank2NA

Replace Blanks in a dataframe
plot.sum_cmspans

Plot Summary Stats for a Summary of a cmspans Object
Dissimilarity

Dissimilarity Statistics
preprocessed.formality

Formality
cm_range2long

Transform Codes to Start-End Durations
cm_df.temp

Break Transcript Dialogue into Blank Code Matrix
plot.table_proportion

Plots a table_proportion Object
new_project

Project Template
plot.kullback_leibler

Plots a kullback_leibler object
plot.cmspans

Plots a cmspans object
bag_o_words

Bag of Words
pos

Parts of Speech Tagging
exclude

Exclude Elements From a Vector
plot.character_table

Plots a character_table Object
hash

Hash/Dictionary Lookup
plot.readability_count

Plots a readability_count Object
end_mark

Sentence End marks
plot.cm_distance

Plots a cm_distance object
duplicates

Find Duplicated Words in a Text String
plot.freq_terms

Plots a freq_terms Object
counts

Generic Counts Method
counts.formality

Formality
plot.linsear_write

Plots a linsear_write Object
polarity

Polarity Score (Sentiment Analysis)
cm_time.temp

Time Span Code Sheet
gantt_plot

Gantt Plot
counts.SMOG

Readability Measures
counts.pos_by

Parts of Speech
DATA2

Fictitious Repeated Measures Classroom Dialogue
counts.coleman_liau

Readability Measures
plot.SMOG

Plots a SMOG Object
cm_combine.dummy

Find Co-occurrence Between Dummy Codes
cm_range.temp

Range Code Sheet
capitalizer

Capitalize Select Words
formality

Formality Score
cm_long2dummy

Stretch and Dummy Code cm_xxx2long
cm_code.overlap

Find Co-occurrence Between Codes
pres_debate_raw2012

First 2012 U.S. Presidential Debate
bracketX

Bracket Parsing
cm_dummy2long

Convert cm_combine.dummy Back to Long
plot.formality_scores

Plots a formality_scores Object
incomplete_replace

Denote Incomplete End Marks With "|"
gantt

Generates start and end times of supplied text selections (i.e., text selections are determined by any number of grouping variables).
mraja1

Romeo and Juliet: Act 1 Dialogue Merged with Demographics
plot.wfdf

Plots a wfdf object
counts.polarity

Polarity
print.coleman_liau

Prints an coleman_liau Object
qcombine

Combine Columns
key_merge

Merge Demographic Information with Person/Text Transcript
plot.word_stats_counts

Plots a word_stats_counts Object
print.boolean_qdap

Prints a boolean_qdap object
print.cm_distance

Prints a cm_distance Object
plot.formality

Plots a formality Object
preprocessed.pos_by

Parts of Speech
replacer

Replace Cells in a Matrix or Data Frame
print.formality

Prints a formality Object
proportions

Generic Proportions Method
summary.wfdf

Summarize a wfdf object
print.trunc

Prints a trunc object
dir_map

Map Transcript Files from a Directory to a Script
sample.time.span

Minimal Time Span Data Set
strWrap

Wrap Character Strings to Format Paragraphs
plot.polarity_count

Plots a polarity_count Object
common

Find Common Words Between Groups
scores.polarity

Polarity
plot.word_proximity

Plots a word_proximity object
print.formality_scores

Prints a formality_scores object
plot.word_stats

Plots a word_stats object
print.linsear_write_count

Prints a linsear_write_count Object
plot.sent_split

Plots a sent_split Object
print.sums_gantt

Prints a sums_gantt object
print.termco

Prints a termco object.
qcv

Quick Character Vector
synonyms

Search For Synonyms
print.polarity

Prints an polarity Object
termco

Search For and Count Terms
plot.polarity_score

Plots a polarity_score Object
print.word_stats

Prints a word_stats object
tot_plot

Visualize Word Length by Turn of Talk
scores.word_stats

Word Stats
print.fry

Prints an fry Object
print.diversity

Prints a diversity object
print.word_associate

Prints a word_associate object
plot.pos_by

Plots a pos_by Object
multiscale

Nested Standardization
print.qdap_context

Prints a qdap_context object
plot.question_type

Plots a question_type Object
print.automated_readability_index

Prints an automated_readability_index Object
plot.rmgantt

Plots a rmgantt object
plot.readability_score

Plots a readability_score Object
print.colsplit2df

Prints a colsplit2df Object.
dispersion_plot

Lexical Dispersion Plot
outlier_detect

Detect Outliers in Text
proportions.pos

Parts of Speech
raj.act.1

Romeo and Juliet: Act 1
gantt_wrap

Gantt Plot
word_list

Raw Word Lists/Frequency Counts
name2sex

Names to Gender Prediction
ngrams

Generate ngrams
print.qdapProj

Prints a qdapProj Object
hms2sec

Convert h:m:s to Seconds
print.pos

Prints a pos Object.
gantt_rep

Generate Unit Spans for Repeated Measures
plot.table_count

Plots a table_count Object
plot.table_score

Plots a table_score Object
plot.termco

Plots a termco object
scores.fry

Readability Measures
plot.gantt

Plots a gantt object
rajPOS

Romeo and Juliet Split in Parts of Speech
counts.word_stats

Word Stats
lookup

Hash Table/Dictionary Lookup
plot.flesch_kincaid

Plots a flesch_kincaid Object
plot.weighted_wfm

Plots a weighted_wfm object
print.readability_score

Prints a readability_score Object
print.Dissimilarity

Prints a Dissimilarity object
prop

Convert Raw Numeric Matrix or Data Frame to Proportions
multigsub

Multiple gsub
print.sum_cmspans

Prints a sum_cmspans object
paste2

Paste an Unspecified Number Of Text Columns
preprocessed.question_type

Question Counts
speakerSplit

Break and Stretch if Multiple Persons per Cell
strip

Strip Text
plot.word_cor

Plots a word_cor object
print.ngrams

Prints an ngrams object
tdm

tm Package Compatibility Tools: Apply to or Convert to/from Term Document Matrix or Document Term Matrix
question_type

Count of Question Type
potential_NA

Search for Potential Missing Values
scores.flesch_kincaid

Readability Measures
replace_abbreviation

Replace Abbreviations
proportions.formality

Formality
t.TermDocumentMatrix

Transposes a TermDocumentMatrix object
print.linsear_write_scores

Prints a linsear_write_scores Object
raw.time.span

Minimal Raw Time Span Data Set
word_proximity

Proximity Matrix Between Words
proportions.question_type

Question Counts
print.character_table

Prints a character_table object
scores.coleman_liau

Readability Measures
summary.wfm

Summarize a wfm object
sec2hms

Convert Seconds to h:m:s
print.word_list

Prints a word_list Object
print.adjacency_matrix

Prints an adjacency_matrix Object
print.pos_preprocessed

Prints a pos_preprocessed object
raj.act.4

Romeo and Juliet: Act 4
t.DocumentTermMatrix

Transposes a DocumentTermMatrix object
print.word_cor

Prints a word_cor object
proportions.termco

Term Counts
scores.pos_by

Parts of Speech
syllable_sum

Syllabication
trans_cloud

Word Clouds by Grouping Variable
scrubber

Clean Imported Text
trans_venn

Venn Diagram by Grouping Variable
rm_stopwords

Remove Stop Words
v_outer

Vectorized Version of outer
raj.demographics

Romeo and Juliet Demographics
word_cor

Find Correlated Words
read.transcript

Read Transcripts Into R
scores.question_type

Question Counts
print.word_proximity

Prints a word_proximity object
preprocessed.pos

Parts of Speech
print.wfm

Prints a wfm Object
word_diff_list

Differences In Word Use Between Groups
replace_symbol

Replace Symbols With Word Equivalents
rm_row

Remove Rows That Contain Markers
scores.character_table

Term Counts
raj.act.3

Romeo and Juliet: Act 3
print.question_type

Prints a question_type object
plot.pos_preprocessed

Plots a pos_preprocessed Object
print.kullback_leibler

Prints a kullback_leibler Object.
print.polarity_count

Prints a polarity_count Object
word_count

Word Counts
word_associate

Find Associated Words
rank_freq_mplot

Rank Frequency Plot
print.linsear_write

Prints an linsear_write Object
scores.formality

Formality
print.pos_by

Prints a pos_by Object.
print.readability_count

Prints a readability_count Object
print.polarity_score

Prints a polarity_score Object
rm_url

Remove/Replace URLs
scores.termco

Term Counts
print.table_count

Prints a table_count object
print.table_proportion

Prints a table_proportion object
scores.linsear_write

Readability Measures
qheat

Quick Heatmap
replace_number

Replace Numbers With Text Representation
trans_context

Print Context Around Indices
print.all_words

Prints an all_words Object
print.question_type_preprocessed

Prints a question_type_preprocessed object
print.SMOG

Prints an SMOG Object
print.table_score

Prints a table_score object
rajSPLIT

Romeo and Juliet (Complete & Split)
raj

Romeo and Juliet (Unchanged & Complete)
scores

Generic Scores Method
proportions.character_table

Term Counts
spaste

Add Leading/Trailing Spaces
termco_c

Combine Columns from a termco Object
proportions.pos_by

Parts of Speech
raj.act.2

Romeo and Juliet: Act 2
raj.act.5

Romeo and Juliet: Act 5
space_fill

Replace Spaces
url_dl

Download Instructional Documents
scores.automated_readability_index

Readability Measures
Search

Search Columns of a Data Frame
counts.question_type

Question Counts
dist_tab

SPSS Style Frequency Tables
diversity

Diversity Statistics
plot.coleman_liau

Plots a coleman_liau Object
plot.linsear_write_count

Plots a linsear_write_count Object
plot.sums_gantt

Plots a sums_gantt object
preprocessed

Generic Preprocessed Method
pres_debates2012

2012 U.S. Presidential Debates
print.wfm_summary

Prints a wfm_summary Object
print.v_outer

Prints a v_outer Object.
qprep

Quick Preparation of Text
sentSplit

Sentence Splitting
text2color

Map Words to Colors
scores.SMOG

Readability Measures
word_network_plot

Word Network Plot
cm_code.blank

Blank Code Transformation
cm_code.combine

Combine Codes
counts.automated_readability_index

Readability Measures
colsplit2df

Wrapper for colSplit that Returns Dataframe(s)
list2df

List/Matrix/Vector to Dataframe
plot.polarity

Plots a polarity Object
plot.wfm

Plots a wfm object
print.sent_split

Prints a sent_split object
print.word_stats_counts

Prints a word_stats_counts object
qdap

qdap: Quantitative Discourse Analysis Package
replace_contraction

Replace Contractions
stemmer

Stem Text
summary.cmspans

Summarize a cmspans object
word_stats

Descriptive Word Statistics