Learn R Programming

polmineR (version 0.6.1)

polmineR-package: polmineR-package

Description

Machinery for mining CWB corpora

Usage

polmineR()

Arguments

Details

The package provides functions for basic text statistics for corpora that are managed by the Corpus Workbench (CWB). A core feature is to generate subcorpora/partitions based on metadata. The package is also meant to serve as an interface between the CWB and R-packages implementing more sophisticated statistical procedures (e.g. lsa, lda, topicmodels) or providing further functionality for text mining (e.g. tm). Any analysis using this package will usually start with setting up a subcorpus/partition (with partition). A set of partitions can be generated with partitionBundle. Once a partition or a set of partitions has been set up, core functions are context and compare. Based on a partition bundle, a term-document matrix (class 'TermDocumentMatrix' from the tm package) can be generated (with as.TermDocumentMatrix). This opens the door to the wealth of statistical methods implemented in R.

References

http://polmine.sowi.uni-due.de