A newer version (0.8.9) of this package is available.

polmineR (version 0.8.0)

Toolkit for Corpus Analysis

Description

Library for corpus analysis using the Corpus Workbench as an efficient back end for indexing and querying large corpora. The package offers functionality to flexibly create partitions and to carry out basic statistical operations (count, co-occurrences etc.). The original full text of documents can be reconstructed and inspected at any time. Beyond that, the package is intended to serve as an interface to packages implementing advanced statistical procedures. Respective data structures (document term matrices, term co-occurrence matrices etc.) can be created based on the indexed corpora.
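
The workflow described above (partitioning, counting, co-occurrences, keyword-in-context) might look roughly like the sketch below. It assumes the sample corpora REUTERS and GERMAPARLMINI that ship with the package and are activated with use("polmineR"). The query term "oil" and the choice of the first available date are illustrative only, and argument defaults may differ between package versions.

```r
library(polmineR)

# Make the sample corpora bundled with the package available in this session
use("polmineR")

# Look up the values of the s-attribute 'date' rather than hard-coding one
dates <- s_attributes("GERMAPARLMINI", "date")

# Create a partition (a subset of the corpus) for the first available date
p <- partition("GERMAPARLMINI", date = dates[1])

# Basic statistical operations on the REUTERS sample corpus
hits <- count("REUTERS", query = "oil")          # frequency of a query term
co   <- cooccurrences("REUTERS", query = "oil")  # co-occurrence statistics
k    <- kwic("REUTERS", query = "oil")           # keyword-in-context lines
```

Printing hits, co or k in an interactive session shows the respective results. The partition p can be passed to the same functions in place of the corpus name.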

Install

install.packages('polmineR')

Monthly Downloads

432

Version

0.8.0

License

GPL-3

Repository

https://www.github.com/PolMine/polmineR

Maintainer

Andreas Blaette

Last Published

December 17th, 2019

Functions in polmineR (0.8.0)

Cooccurrences-class

Cooccurrences class for corpus/partition.

Apply a function over a list or bundle.

Perform chi-squared test.

Get markdown-formatted full text of a partition.

Annotation functionality

cooccurrences-class

Cooccurrences class.

Cooccurrences,character-method

Get all cooccurrences in corpus/partition.

Get cooccurrence statistics.

Dispersion of a query or multiple queries.

Decode corpus or subcorpus.

Conversion between corpus and native encoding.

as.TermDocumentMatrix

Generate TermDocumentMatrix / DocumentTermMatrix.

Get and set encoding.

Send the result of an analysis by Email.

Compute Log-likelihood Statistics.

Enrich an object.

Generate html from object.

Analyze context of a node word.

partition_bundle

Generate bundle of partitions.

Perform keyword-in-context (KWIC) analysis.

polmineR-generics

Generic methods defined in the polmineR package

polmineR-defunct

Defunct methods and functions.

partition_class

Partition class and methods.

Reset registry directory.

Renamed Functions

Objects exported from other packages

context_bundle-class

S4 context_bundle class

Get corpus positions for a query or queries.

Tools for CQP queries.

Add corpora in R data packages to session registry.

Feature selection by comparison.

as.sparseMatrix

Type conversion - get sparseMatrix.

Split corpus or partition into speeches.

Corpus class methods

Corpus class initialization

Get features by comparison.

get_token_stream

Get Token Stream.

Regions of a CWB corpus.

Calculate means.

Get corpus/partition type.

Get s-attributes.

Get Number of Tokens.

Virtual class slice.

Get registry and data directories.

Initialize a partition.

Apply Weight to Matrix

Store objects as Excel-file.

partition_bundle-class

Bundle of partitions (partition_bundle class).

registry_get_name

Evaluate registry file.

Inspect object using View().

Highlight tokens in text output.

partition_to_string

Decode as String.

Get hits for query.

Calculate Pointwise Mutual Information (PMI).

Execute code on OpenCPU server

polmineR-package

polmineR-package

Subsetting corpora and subcorpora

S4 textstat superclass.

Get p-attributes.

Perform t-test.

Display full text.

Add tooltips to text output.

The S4 subcorpus class.

Get and set templates.

subcorpus_bundle-class

Bundled subcorpora

Get terms in partition or corpus.