rainette_uc_index docvar
merge_segments(dtm, min_segment_size = 10, doc_id = NULL)the original dtm with a new rainette_uc_id docvar.
dtm of segments
minimum number of forms by segment
character name of a dtm docvar which identifies source documents.
If min_segment_size == 0, no segments are merged together.
If min_segment_size > 0 then doc_id must be provided
unless the corpus comes from split_segments, in this case
segment_source is used by default.