Learn R Programming

stylo (version 0.7.5)

Stylometric Multivariate Analyses

Description

Supervised and unsupervised multivariate methods, supplemented by GUI and some visualizations, to perform various analyses in the field of computational stylistics, authorship attribution, etc. For further reference, see Eder et al. (2016), . You are also encouraged to visit the Computational Stylistics Group's website , where a reasonable amount of information about the package and related projects are provided.

Copy Link

Version

Install

install.packages('stylo')

Monthly Downloads

1,058

Version

0.7.5

License

GPL (>= 3)

Issues

Pull Requests

Stars

Forks

Maintainer

Maciej Eder

Last Published

April 3rd, 2024

Functions in stylo (0.7.5)

dist.minmax

Min-Max Distance (aka Ruzicka Distance)
gui.classify

GUI for the function classify
imposters.optimize

Tuning Parameters for the Imposters Method
load.corpus

Load text files
load.corpus.and.parse

Load text files and perform pre-processing
make.ngrams

Make text n-grams
make.table.of.frequencies

Prepare a table of (relative) word frequencies
oppose

Contrastive analysis of texts
lee

Table of word frequencies (Lee, Capote, Faulkner, Styron, etc.)
make.frequency.list

Make List of the Most Frequent Elements (e.g. Words)
parse.corpus

Perform pre-processing (tokenization, n-gram extracting, etc.)
make.samples

Split text to samples
novels

A selection of 19th-century English novels
perform.nsc

Nearest Shrunken Centroids classifier
perform.culling

Exclude variables (e.g. words, n-grams) from a frequency table that are too characteristic for some samples
perform.knn

k-Nearest Neighbor classifier
perform.delta

Distance-based classifier
perform.svm

Support Vector Machines classifier
parse.pos.tags

Extract POS-tags or Words from Annotated Corpora
perform.impostors

An Authorship Verification Classifier Known as the Impostors Method. ATTENTION: this function is obsolete; refer to a new implementation, aka the imposters() function!
plot.sample.size

Plot Classification Accuracy for Short Text Samples
perform.naivebayes

Naive Bayes classifier
performance.measures

Accuracy, Precision, Recall, and the F Measure
stylo.pronouns

List of pronouns
stylo.default.settings

Setting variables for the package stylo
stylo.network

Bootstrap consensus networks, with D3 visualization
samplesize.penalize

Determining Minimal Sample Size for Text Classification
zeta.craig

Compare two subcorpora using Craig's Zeta
stylo

Stylometric multidimensional analyses
zeta.chisquare

Compare two subcorpora using a home-brew variant of Craig's Zeta
rolling.delta

Sequential stylometric analysis
txt.to.words.ext

Split text into words: extended version
txt.to.words

Split text into words
rolling.classify

Sequential machine-learning classification
zeta.eder

Compare two subcorpora using Eder's Zeta
txt.to.features

Split string of words or other countable features
assign.plot.colors

Assign colors to samples
change.encoding

Change character encoding
classify

Machine-learning supervised classification
crossv

Function to Perform Cross-Validation
dist.cosine

Cosine Distance
delete.stop.words

Exclude stop words (e.g. pronouns, particles, etc.) from a dataset
check.encoding

Check character encoding in corpus folder
dist.delta

Delta Distance
define.plot.area

Define area for scatterplots
delete.markup

Delete HTML or XML tags
galbraith

Table of word frequencies (Galbraith, Rowling, Coben, Tolkien, Lewis)
gui.stylo

GUI for stylo
dist.wurzburg

Cosine Delta Distance (aka Wurzburg Distance)
dist.entropy

Entropy Distance
gui.oppose

GUI for the function oppose
dist.simple

Cosine Distance
imposters

Authorship Verification Classifier Known as the Imposters Method