
There's a newer version (0.8.9) of this package.

polmineR (version 0.8.0)

Toolkit for Corpus Analysis

Description

Library for corpus analysis using the Corpus Workbench as an efficient back end for indexing and querying large corpora. The package offers functionality to flexibly create partitions and to carry out basic statistical operations (count, co-occurrences etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. Respective data structures (document term matrices, term co-occurrence matrices etc.) can be created based on the indexed corpora.

Install

install.packages('polmineR')
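
Once the package is installed, a first session might look like the sketch below. It assumes that the sample corpora shipped with polmineR (REUTERS is used here) become available after calling use() with the package name; the query term "oil" is only an illustration.

```r
# Load the package (installation as shown above).
library(polmineR)

# Assumption: use("polmineR") adds the sample corpora bundled with the
# package (such as REUTERS) to the session registry.
use("polmineR")

# List the corpora that are available in this session.
available <- corpus()
print(available)

# Count occurrences of a single token in the REUTERS sample corpus.
oil <- count("REUTERS", query = "oil")
print(oil)
```

If the sample corpora are not found, checking the registry directory is a reasonable first step before trying any of the other functions.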

Monthly Downloads

342

Version

0.8.0

License

GPL-3

Maintainer

Andreas Blaette

Last Published

December 17th, 2019

Functions in polmineR (0.8.0)

Cooccurrences-class

Cooccurrences class for corpus/partition.
blapply

Apply a function over a list or bundle
chisquare

Perform chi-square test.
bundle-class

Bundle Class
as.VCorpus

Get VCorpus.
context-class

Context class.
as.markdown

Get markdown-formatted full text of a partition.
annotations

Annotation functionality
cooccurrences-class

Cooccurrences class.
Cooccurrences,character-method

Get all cooccurrences in corpus/partition.
cooccurrences

Get cooccurrence statistics.
dispersion

Dispersion of a query or multiple queries.
decode

Decode corpus or subcorpus.
encodings

Conversion between corpus and native encoding.
kwic-class

S4 kwic class
as.TermDocumentMatrix

Generate TermDocumentMatrix / DocumentTermMatrix.
count

Get counts.
count_class

Count class.
encoding

Get and set encoding.
mail

Send the result of an analysis by Email.
dotplot

dotplot
ll

Compute Log-likelihood Statistics.
enrich

Enrich an object.
hits_class

Hits class.
html

Generate html from object.
context

Analyze context of a node word.
partition_bundle

Generate bundle of partitions.
kwic

Perform keyword-in-context (KWIC) analysis.
noise

Detect noise
ngrams_class

Ngrams class.
polmineR-generics

Generic methods defined in the polmineR package
polmineR-defunct

Defunct methods and functions.
partition_class

Partition class and methods.
registry_reset

Reset registry directory.
renamed

Renamed Functions
reexports

Objects exported from other packages
context_bundle-class

S4 context_bundle class
cpos

Get corpus positions for a query or queries.
trim

Trim an object
cqp

Tools for CQP queries.
use

Add corpora in R data packages to session registry.
features-class

Feature selection by comparison.
as.sparseMatrix

Type conversion - get sparseMatrix.
as.speeches

Split corpus or partition into speeches.
corpus-methods

Corpus class methods
corpus-class

Corpus class initialization
features

Get features by comparison.
get_token_stream

Get Token Stream.
regions

Regions of a CWB corpus.
means

Calculate means
get_type

Get corpus/partition type.
s_attributes

Get s-attributes.
size

Get Number of Tokens.
slice

Virtual class slice.
registry_move

Get registry and data directories.
partition

Initialize a partition.
weigh

Apply Weight to Matrix
store

Store objects as Excel-file.
partition_bundle-class

Bundle of partitions (partition_bundle class).
registry_get_name

Evaluate registry file.
view

Inspect object using View().
highlight

Highlight tokens in text output.
ngrams

Get N-Grams
partition_to_string

Decode as String.
hits

Get hits for query
pmi

Calculate Pointwise Mutual Information (PMI).
ocpu_exec

Execute code on OpenCPU server
polmineR-package

polmineR-package
subset

Subsetting corpora and subcorpora
textstat-class

S4 textstat superclass.
p_attributes

Get p-attributes.
t_test

Perform t-test.
read

Display full text.
tooltips

Add tooltips to text output.
subcorpus

The S4 subcorpus class.
get_template

Get and set templates.
subcorpus_bundle-class

Bundled subcorpora
terms

Get terms in partition or corpus.
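
To show how some of the functions listed above fit together, here is a minimal sketch that builds a partition and runs two analyses on it. It assumes the GERMAPARLMINI sample corpus is available via use("polmineR") and that it has an s-attribute named date with the value "2009-11-10"; the query term "Integration" is purely illustrative.

```r
library(polmineR)

# Assumption: the GERMAPARLMINI sample corpus ships with polmineR.
use("polmineR")

# Inspect the structural attributes (metadata) of the corpus.
s_attributes("GERMAPARLMINI")

# Create a partition: the part of the corpus for one date.
# The s-attribute name and value are assumptions about the sample data.
p <- partition("GERMAPARLMINI", date = "2009-11-10")

# Number of tokens in the partition.
n_tokens <- size(p)
print(n_tokens)

# Keyword-in-context display for a query within the partition.
k <- kwic(p, query = "Integration")

# Cooccurrence statistics for the same query.
co <- cooccurrences(p, query = "Integration")
```

The same calls should also accept a corpus name in place of the partition, so an analysis can be tried on the whole corpus first and narrowed down afterwards.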