rmSharedWords

This function allows removing shared words, ie triming to non-redundant words.

The efficient treatment and convenient analysis of experimental high-throughput (omics) data gets facilitated through this collection of diverse functions.
Several functions address advanced object-conversions, like manipulating lists of lists or lists of arrays, reorganizing lists to arrays or into separate vectors, merging of multiple entries, etc.
Another set of functions provides speed-optimized calculation of standard deviation (sd), coefficient of variance (CV) or standard error of the mean (SEM)
for data in matrixes or means per line with respect to additional grouping (eg n groups of replicates).
A group of functions facilitate dealing with non-redundant information, by indexing unique, adding counters to redundant or eliminating lines with respect redundancy in a given reference-column, etc.
Help is provided to identify very closely matching numeric values to generate (partial) distance matrixes for very big data in a memory efficient manner or to reduce the complexity of large data-sets by combining very close values.
Other functions help aligning a matrix or data.frame to a reference using partial matching or to mine an experimental setup to extract patterns of replicate samples.
Many times large experimental datasets need some additional filtering, adequate functions are provided.
Convenient data normalization is supported in various different modes, parameter estimation via permutations or boot-strap as well as flexible testing of multiple pair-wise combinations using the framework of 'limma' is provided, too.
Batch reading (or writing) of sets of files and combining data to arrays is supported, too.

Wolfgang Raffelsberger

wrMisc

Analyze Experimental High-Throughput (Omics) Data

rmSharedWords function

<dl><dt>x</dt>
<dd>(character) main input for making non-redundant</dd>
<dt>sep</dt>
<dd>(character) separator(s) to be used</dd>
<dt>anySep</dt>
<dd>(logical) if <code>TRUE</code>, will consider all separators at one time (), thus combinations with different separators won't be distinguished</dd>
<dt>newSep</dt>
<dd>(character) new (uniform) separator between words, if <code>NULL</code> the first value/separator of if <code>sep</code> will be used</dd>
<dt>minLe</dt>
<dd>(integer) minimum length for allowing being recognised as 'word'</dd>
<dt>na.omit</dt>
<dd>(logical) if <code>TRUE NA</code>s will be removed from output</dd>
<dt>fixed</dt>
<dd>(logical) will be transmitted to argument <code>fixed</code> of <code><a href="/link/strsplit()?package=wrMisc&version=1.15.2" data-mini-rdoc="wrMisc::strsplit()">strsplit()</a></code>; if <code>TRUE</code> regular expressions are allowed/used</dd>
<dt>silent</dt>
<dd>(logical) suppress messages</dd>
<dt>debug</dt>
<dd>(logical) additional messages for debugging</dd>
<dt>callFrom</dt>
<dd>(character) allows easier tracking of messages produced</dd></dl>

Arguments

Trim/Remove Redundant Words — rmSharedWords

<dl>

<dt>x</dt>
<dd>(character) main input for making non-redundant</dd>


<dt>sep</dt>
<dd>(character) separator(s) to be used</dd>


<dt>anySep</dt>
<dd>(logical) if <code>TRUE</code>, will consider all separators at one time (), thus combinations with different separators won't be distinguished</dd>


<dt>newSep</dt>
<dd>(character) new (uniform) separator between words, if <code>NULL</code> the first value/separator of if <code>sep</code> will be used</dd>


<dt>minLe</dt>
<dd>(integer) minimum length for allowing being recognised as 'word'</dd>


<dt>na.omit</dt>
<dd>(logical) if <code>TRUE NA</code>s will be removed from output</dd>


<dt>fixed</dt>
<dd>(logical) will be transmitted to argument <code>fixed</code> of <code><a href='https://rdrr.io/r/base/strsplit.html'>strsplit()</a></code>; if <code>TRUE</code> regular expressions are allowed/used</dd>


<dt>silent</dt>
<dd>(logical) suppress messages</dd>


<dt>debug</dt>
<dd>(logical) additional messages for debugging</dd>


<dt>callFrom</dt>
<dd>(character) allows easier tracking of messages produced</dd>

</dl>

rmSharedWords: Trim/Remove Redundant Words

Description

Usage

Value

Arguments

Details

See Also

Examples