Learn R Programming

⚠️There's a newer version (0.7.5) of this package.Take me there.

stylo (version 0.6.7)

Stylometric Analyses

Description

Supervised and unsupervised multivariate methods, supplemented by GUI and some visualizations, to perform various analyses in the field of computational stylistics, authorship attribution, etc. For further reference, see Eder et al. (2016), .

Copy Link

Version

Install

install.packages('stylo')

Monthly Downloads

648

Version

0.6.7

License

GPL (>= 3)

Issues

Pull Requests

Stars

Forks

Maintainer

Maciej Eder

Last Published

May 26th, 2018

Functions in stylo (0.6.7)

crossv

Function to Perform Cross-Validation
dist.delta

Delta Distance
dist.simple

Cosine Distance
lee

Table of word frequencies (Lee, Capote, Faulkner, Styron, etc.)
oppose

Contrastive analysis of texts
stylo

Stylometric multidimensional analyses
perform.nsc

Nearest Shrunken Centroids classifier
dist.wurzburg

Cosine Delta Distance (aka Wurzburg Distance)
parse.corpus

Perform pre-processing (tokenization, n-gram extracting, etc.)
load.corpus

Load text files
stylo.default.settings

Setting variables for the package stylo
rolling.classify

Sequential machine-learning classification
txt.to.words.ext

Split text into words: extended version
rolling.delta

Sequential stylometric analysis
zeta.craig

Compare two subcorpora using Craig's Zeta
perform.svm

Support Vector Machines classifier
zeta.chisquare

Compare two subcorpora using a home-brew variant of Craig's Zeta
zeta.eder

Compare two subcorpora using Eder's Zeta
assign.plot.colors

Assign colors to samples
stylo.network

Bootstrap consensus networks, with D3 visualization
stylo.pronouns

List of pronouns
classify

Machine-learning supervised classification
dist.entropy

Entropy Distance
galbraith

Table of word frequencies (Galbraith, Rowling, Cobenm, Tolkien, Lewis)
gui.classify

GUI for the function classify
dist.minmax

Min-Max Distance (aka Ruzicka Distance)
load.corpus.and.parse

Load text files and perform pre-processing
make.frequency.list

Make List of the Most Frequent Elements (e.g. Words)
imposters

Authorship Verification Classifier Known as the Imposters Method
imposters.optimize

Tuning Parameters for the Imposters Method
parse.pos.tags

Extract POS-tags or Words from Annotated Corpora
perform.impostors

An Authorship Verification Classifier Known as the Impostors Method. ATTENTION: this function is obsolete; refer to a new implementation, aka the imposters() function!
perform.delta

Distance-based classifier
delete.markup

Delete HTML or XML tags
perform.culling

Exclude variables (e.g. words, n-grams) from a frequency table that are too characteristic for some samples
delete.stop.words

Exclude stop words (e.g. pronouns, particles, etc.) from a dataset
txt.to.words

Split text into words
txt.to.features

Split string of words or other countable features
make.ngrams

Make text n-grams
perform.knn

k-Nearest Neighbor classifier
make.samples

Split text to samples
perform.naivebayes

Naive Bayes classifier
dist.cosine

Cosine Distance
gui.stylo

GUI for stylo
define.plot.area

Define area for scatterplots
gui.oppose

GUI for the function oppose
make.table.of.frequencies

Prepare a table of (relative) word frequencies
novels

A selection of 19th-century English novels