Learn R Programming

⚠️There's a newer version (0.4.1) of this package.Take me there.

cwbtools (version 0.1.0)

Tools to create, modify and manage 'CWB' Corpora

Description

The 'Corpus Workbench' ('CWB', ) offers a classic and mature approach for working with large, linguistically and structurally annotated corpora. The 'CWB' is memory efficient and its design makes running queries fast (Evert and Hardie 2011, ). The 'cwbtools' package offers pure R tools to create indexed corpus files as well as high-level wrappers for the original C implementation of CWB as exposed by the 'RcppCWB' package . Additional functionality to add and modify annotations of corpora from within R makes working with CWB indexed corpora much more flexible and convenient. The 'cwbtools' package in combination with the R packages 'RcppCWB' () and 'polmineR' () offers a leightweight infrastructure to support the combination of quantitative and qualitative approaches for working with textual data.

Copy Link

Version

Install

install.packages('cwbtools')

Monthly Downloads

104

Version

0.1.0

License

GPL-3

Maintainer

Andreas Blaette

Last Published

October 21st, 2019

Functions in cwbtools (0.1.0)

cwbtools-package

cwbtools-package
s_attribute_encode

Read, process and write data on structural attributes.
registry_file_parse

Parse and create registry files.
p_attribute_encode

Encode Positional Attribute(s).
pkg_utils

Create and manage packages with corpus data.
corpus_install

Install and manage corpora.
cwb_install

Utilities to install Corpus Workbench.
CorpusData

Manage Corpus Data and Encode CWB Corpus.
conll_get_regions

Extract regions from NER annotations (CoNNL format).
get_encoding

Get Encoding of Character Vector.