Note: a newer version (0.7.5) of this package is available.

stylo (version 0.6.9)

Stylometric Multivariate Analyses

Description

Supervised and unsupervised multivariate methods, supplemented by a GUI and some visualizations, for analyses in computational stylistics, authorship attribution, and related fields. For further reference, see Eder et al. (2016). You are also encouraged to visit the Computational Stylistics Group's website, which provides information about the package and related projects.

Install

install.packages('stylo')

Monthly Downloads

819

Version

0.6.9

License

GPL (>= 3)

Maintainer

Maciej Eder

Last Published

January 21st, 2019

Functions in stylo (0.6.9)

imposters.optimize

Tuning Parameters for the Imposters Method
define.plot.area

Define area for scatterplots
gui.stylo

GUI for stylo
gui.oppose

GUI for the function oppose
perform.naivebayes

Naive Bayes classifier
lee

Table of word frequencies (Lee, Capote, Faulkner, Styron, etc.)
load.corpus

Load text files
perform.knn

k-Nearest Neighbor classifier
txt.to.features

Split string of words or other countable features
txt.to.words

Split text into words
load.corpus.and.parse

Load text files and perform pre-processing
perform.culling

Exclude variables (e.g. words, n-grams) from a frequency table that are too characteristic for some samples
dist.wurzburg

Cosine Delta Distance (aka Wurzburg Distance)
dist.simple

Eder's Simple Distance
make.ngrams

Make text n-grams
dist.entropy

Entropy Distance
stylo.network

Bootstrap consensus networks, with D3 visualization
make.samples

Split text to samples
dist.minmax

Min-Max Distance (aka Ruzicka Distance)
make.frequency.list

Make List of the Most Frequent Elements (e.g. Words)
change.encoding

Change character encoding
parse.pos.tags

Extract POS-tags or Words from Annotated Corpora
rolling.classify

Sequential machine-learning classification
rolling.delta

Sequential stylometric analysis
galbraith

Table of word frequencies (Galbraith, Rowling, Coben, Tolkien, Lewis)
stylo.pronouns

List of pronouns
stylo

Stylometric multidimensional analyses
oppose

Contrastive analysis of texts
perform.nsc

Nearest Shrunken Centroids classifier
perform.impostors

Authorship verification classifier known as the Impostors Method. Note: this function is obsolete; use the newer implementation, imposters(), instead.
gui.classify

GUI for the function classify
perform.svm

Support Vector Machines classifier
zeta.craig

Compare two subcorpora using Craig's Zeta
stylo.default.settings

Setting variables for the package stylo
novels

A selection of 19th-century English novels
zeta.eder

Compare two subcorpora using Eder's Zeta
make.table.of.frequencies

Prepare a table of (relative) word frequencies
perform.delta

Distance-based classifier
txt.to.words.ext

Split text into words: extended version
parse.corpus

Perform pre-processing (tokenization, n-gram extracting, etc.)
zeta.chisquare

Compare two subcorpora using a home-brew variant of Craig's Zeta
dist.cosine

Cosine Distance
delete.markup

Delete HTML or XML tags
assign.plot.colors

Assign colors to samples
delete.stop.words

Exclude stop words (e.g. pronouns, particles, etc.) from a dataset
check.encoding

Check character encoding in corpus folder
imposters

Authorship Verification Classifier Known as the Imposters Method
crossv

Function to Perform Cross-Validation
dist.delta

Delta Distance
classify

Machine-learning supervised classification
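
Example

A few of the text-processing functions listed above can be chained together. The following is a minimal sketch, not taken from the package documentation: it assumes stylo is installed, uses a made-up toy string, and relies on the default arguments of each function. The variable names are illustrative only.

```r
# Assumes the stylo package is installed: install.packages('stylo')
library(stylo)

# A toy string invented for this sketch; any text would do
text <- "The quick brown fox jumps over the lazy dog. The dog sleeps."

# Split the text into word tokens (lowercased by default)
words <- txt.to.words(text)

# Combine consecutive words into word bigrams
bigrams <- make.ngrams(words, ngram.size = 2)

# List the distinct words, most frequent first
frequent.words <- make.frequency.list(words)

print(words)
print(bigrams)
print(frequent.words)
```

For real corpora, load.corpus.and.parse and make.table.of.frequencies from the list above are the more likely starting points; consult their help pages for the exact arguments.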