Learn R Programming

⚠️There's a newer version (0.8.9) of this package.Take me there.

polmineR (version 0.8.7)

Verbs and Nouns for Corpus Analysis

Description

Package for corpus analysis using the Corpus Workbench ('CWB', ) as an efficient back end for indexing and querying large corpora. The package offers functionality to flexibly create subcorpora and to carry out basic statistical operations (count, co-occurrences etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. Respective data structures (document-term matrices, term-co-occurrence matrices etc.) can be created based on the indexed corpora.

Copy Link

Version

Install

install.packages('polmineR')

Monthly Downloads

342

Version

0.8.7

License

GPL-3

Issues

Pull Requests

Stars

Forks

Maintainer

Andreas Blaette

Last Published

August 26th, 2022

Functions in polmineR (0.8.7)

as.VCorpus

Get VCorpus.
as.speeches

Split corpus or partition into speeches.
as.markdown

Get markdown-formatted full text of a partition.
annotations

Annotation functionality
as.TermDocumentMatrix

Generate TermDocumentMatrix / DocumentTermMatrix.
as.sparseMatrix

Type conversion - get sparseMatrix.
bundle-class

Bundle Class
blapply

apply a function over a list or bundle
Cooccurrences-class

Cooccurrences class for corpus/partition.
Cooccurrences,corpus-method

Get all cooccurrences in corpus/partition.
context

Analyze context of a node word.
corpus-methods

Corpus class methods
context-class

Context class.
count_class

Count class.
context_bundle-class

S4 context_bundle class
corpus-class

Corpus class initialization
cooccurrences-class

Cooccurrences class.
chisquare

Perform chisquare-text.
count

Get counts.
cooccurrences

Get cooccurrence statistics.
encodings

Conversion between corpus and native encoding.
cpos

Get corpus positions for a query or queries.
features-class

Feature selection by comparison.
cqp

Tools for CQP queries.
dotplot

dotplot
decode

Decode corpus or subcorpus.
enrich

Enrich an object.
features

Get features by comparison.
encoding

Get and set encoding.
dispersion

Dispersion of a query or multiple queries.
kwic

Perform keyword-in-context (KWIC) analysis.
means

calculate means
html

Generate html from object.
ll

Compute Log-likelihood Statistics.
kwic-class

S4 kwic class
highlight

Highlight tokens in text output.
hits_class

S4 class to represent hits for queries.
get_token_stream

Get Token Stream.
hits

Get hits for query
get_type

Get corpus/partition type.
noise

detect noise
ngrams_class

Ngrams class.
partition_bundle

Generate bundle of partitions.
partition_class

Partition class and methods.
ngrams

Get N-Grams
partition_bundle-class

Bundle of partitions (partition_bundle class).
partition

Initialize a partition.
p_attributes

Get p-attributes.
ocpu_exec

Execute code on OpenCPU server
partition_to_string

Decode as String.
ranges

Get ranges for query.
polmineR-generics

Generic methods defined in the polmineR package
reexports

Objects exported from other packages
regions

Regions of a CWB corpus.
phrases

Manage and use phrases
polmineR-defunct

Defunct functionality
polmineR-package

polmineR-package
pmi

Calculate Pointwise Mutual Information (PMI).
s_attributes

Get s-attributes.
ranges-class

Ranges of query matches.
read

Display full text.
size

Get Number of Tokens.
subcorpus_bundle-class

Bundled subcorpora
registry_reset

Reset registry directory.
renamed

Renamed Functions
subset

Subsetting corpora and subcorpora
registry_move

Get registry and data directories.
registry_get_name

Evaluate registry file.
t_test

Perform t-test.
weigh

Apply Weight to Matrix
view

Inspect object using View().
tooltips

Add tooltips to text output.
use

Add corpora in R data packages to session registry.
get_template

Get template for reconstructing full text.
terms

Get terms in partition or corpus.
slice

Virtual class slice.
subcorpus

The S4 subcorpus class.
trim

Trim an object.
textstat-class

S4 textstat superclass.