Learn R Programming

⚠️There's a newer version (0.8.9) of this package.Take me there.

polmineR (version 0.8.5)

Verbs and Nouns for Corpus Analysis

Description

Package for corpus analysis using the Corpus Workbench ('CWB', ) as an efficient back end for indexing and querying large corpora. The package offers functionality to flexibly create subcorpora and to carry out basic statistical operations (count, co-occurrences etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. Respective data structures (document-term matrices, term-co-occurrence matrices etc.) can be created based on the indexed corpora.

Copy Link

Version

Install

install.packages('polmineR')

Monthly Downloads

425

Version

0.8.5

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Andreas Blaette

Last Published

September 22nd, 2020

Functions in polmineR (0.8.5)

as.markdown

Get markdown-formatted full text of a partition.
as.sparseMatrix

Type conversion - get sparseMatrix.
annotations

Annotation functionality
bundle-class

Bundle Class
blapply

apply a function over a list or bundle
as.speeches

Split corpus or partition into speeches.
Cooccurrences-class

Cooccurrences class for corpus/partition.
as.TermDocumentMatrix

Generate TermDocumentMatrix / DocumentTermMatrix.
as.VCorpus

Get VCorpus.
Cooccurrences,corpus-method

Get all cooccurrences in corpus/partition.
chisquare

Perform chisquare-text.
context_bundle-class

S4 context_bundle class
context

Analyze context of a node word.
context-class

Context class.
corpus-class

Corpus class initialization
corpus-methods

Corpus class methods
decode

Decode corpus or subcorpus.
dispersion

Dispersion of a query or multiple queries.
cooccurrences

Get cooccurrence statistics.
cqp

Tools for CQP queries.
cooccurrences-class

Cooccurrences class.
encodings

Conversion between corpus and native encoding.
cpos

Get corpus positions for a query or queries.
count

Get counts.
enrich

Enrich an object.
count_class

Count class.
dotplot

dotplot
hits_class

Hits class.
features-class

Feature selection by comparison.
html

Generate html from object.
kwic-class

S4 kwic class
get_token_stream

Get Token Stream.
noise

detect noise
ocpu_exec

Execute code on OpenCPU server
features

Get features by comparison.
kwic

Perform keyword-in-context (KWIC) analysis.
polmineR-defunct

Defunct functionality
encoding

Get and set encoding.
highlight

Highlight tokens in text output.
partition_class

Partition class and methods.
polmineR-package

polmineR-package
read

Display full text.
partition_to_string

Decode as String.
t_test

Perform t-test.
subset

Subsetting corpora and subcorpora
polmineR-generics

Generic methods defined in the polmineR package
slice

Virtual class slice.
size

Get Number of Tokens.
ll

Compute Log-likelihood Statistics.
hits

Get hits for query
means

calculate means
p_attributes

Get p-attributes.
registry_move

Get registry and data directories.
get_template

Get template for reconstructing full text.
partition

Initialize a partition.
registry_get_name

Evaluate registry file.
subcorpus_bundle-class

Bundled subcorpora
subcorpus

The S4 subcorpus class.
get_type

Get corpus/partition type.
ngrams

Get N-Grams
textstat-class

S4 textstat superclass.
tooltips

Add tooltips to text output.
reexports

Objects exported from other packages
partition_bundle

Generate bundle of partitions.
partition_bundle-class

Bundle of partitions (partition_bundle class).
ngrams_class

Ngrams class.
regions

Regions of a CWB corpus.
trim

trim an object
pmi

Calculate Pointwise Mutual Information (PMI).
phrases

Manage and use phrases
restore

Restore S4 object with data.table slots
s_attributes

Get s-attributes.
terms

Get terms in partition or corpus.
weigh

Apply Weight to Matrix
view

Inspect object using View().
registry_reset

Reset registry directory.
use

Add corpora in R data packages to session registry.
renamed

Renamed Functions