cooccurrence

cooccurrence.character

cooccurrence.cooccurrence

cooccurrence.data.frame

either<ul>
<li>a data.frame with 1 row per document/term,
 in which case you need to provide <code>group</code> and <code>term</code>. This uses cooccurrence.data.frame.</li>
<li>a character vector with terms. This uses cooccurrence.character.</li>
<li>an object of class <code>cooccurrence</code>. This uses cooccurrence.cooccurrence.</li>
</ul>

logical indicating whether to sort the output from high to low cooccurrences. Defaults to TRUE.

order

other arguments passed on to the methods

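character string with the name of a column in the data frame <code>x</code> identifying the group (e.g. document, sentence or paragraph) within which terms co-occur. To be used if <code>x</code> is a data.frame.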

group

character string with the name of a column in the data frame <code>x</code>, containing 1 term per row. To be used if <code>x</code> is a data.frame.

term

A cooccurrence data.frame indicates how many times each term co-occurs with another term.
This type of dataset is a data.frame with fields term1, term2 and cooc, where cooc indicates how many times
term1 and term2 co-occurred.
The dataset can be constructed from a data frame, in which case we check within each group whether 2 terms both occurred.
It can also be constructed from a vector of words, in which case we count how many times each word is
followed by another word.
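
The "each word is followed by another word" counting can be illustrated with a small base R sketch. This is not the package's implementation; the toy `words` vector and the use of `aggregate` are assumptions made purely for illustration, and only the output fields term1, term2 and cooc follow the description above.

```r
# Toy token sequence (made up for illustration)
words <- c("the", "cat", "saw", "the", "cat")

# Pair each word with the word that directly follows it
pairs <- data.frame(term1 = head(words, -1),
                    term2 = tail(words, -1),
                    stringsAsFactors = FALSE)

# Count how often each (term1, term2) pair occurs
pairs$cooc <- 1L
cooc <- aggregate(cooc ~ term1 + term2, data = pairs, FUN = sum)

# Sort from high to low counts, mirroring the default order = TRUE
cooc <- cooc[order(cooc$cooc, decreasing = TRUE), ]
rownames(cooc) <- NULL
print(cooc)
```

Here the pair ("the", "cat") appears twice in the sequence, so it ends up at the top of the sorted result.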

This natural language processing toolkit provides language-agnostic
'tokenization', 'parts of speech tagging', 'lemmatization' and 'dependency
parsing' of raw text. Next to text parsing, the package also allows you to train
annotation models based on data of 'treebanks' in 'CoNLL-U' format as provided
at <http://universaldependencies.org/format.html>. The techniques are explained
in detail in the paper: 'Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0
with UDPipe', available at <doi:10.18653/v1/K17-3009>.

Jan Wijffels

udpipe

Tokenization, Parts of Speech Tagging, Lemmatization and
Dependency Parsing with the 'UDPipe' 'NLP' Toolkit

BNOSAC 

Institute of Formal and Applied Linguistics, Faculty of Mathematics and Physics, Charles University in Prague, Czech Republic 

Milan Straka 

Jana Straková

cooccurrence function

<ul>
<li><code>character</code>: Create a cooccurrence data.frame based on a vector of terms</li>
<li><code>cooccurrence</code>: Aggregate co-occurrence statistics by summing the cooc by term1/term2</li>
<li><code>data.frame</code>: Create a cooccurrence data.frame based on a data.frame where you look within a document / sentence / paragraph / group 
if terms co-occur</li>
</ul>

cooccurrence: Create a cooccurrence data.frame

Description

Usage

Arguments

Value

Methods (by class)

Examples
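
A minimal usage sketch, assuming the udpipe package is installed. The toy data frame and its column names `doc_id` and `lemma` are hypothetical and chosen only for illustration; the arguments `group` and `term` are the ones documented above.

```r
library(udpipe)

# Hypothetical toy data: 1 row per document/term
x <- data.frame(doc_id = c("d1", "d1", "d1", "d2", "d2"),
                lemma  = c("beer", "bar", "music", "beer", "bar"),
                stringsAsFactors = FALSE)

# data.frame method: terms that occur together within the same doc_id
cooc_df <- cooccurrence(x, group = "doc_id", term = "lemma")
head(cooc_df)

# character method: how many times each word is followed by another word
cooc_chr <- cooccurrence(c("beer", "bar", "beer", "bar"))
head(cooc_chr)
```

Both calls should return a data.frame with the fields term1, term2 and cooc, sorted from high to low cooccurrences because `order` defaults to TRUE.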