relist and split are 2 common ways of grouping the elements
of a vector-like object into a list-like object. The IRanges and
S4Vectors packages define relist and split methods
that operate on a Vector object and return a List object.
Note that the split methods defined in
S4Vectors delegate to the splitAsList function defined in
IRanges and documented below. Because relist and split both impose restrictions on
the kind of grouping that they support (e.g. every element in the input
object needs to go in a group and can only go in one group), the
IRanges package introduces the extractList generic function
for performing arbitrary groupings.
## relist()
## --------
"relist"(flesh, skeleton)
"relist"(flesh, skeleton)
## splitAsList()
## -------------
splitAsList(x, f, drop=FALSE, ...)
## extractList()
## -------------
extractList(x, i)
## regroup()
## ---------
regroup(x, g)skeleton matters. Its exact content is ignored.
f is a factor).
skeleton, the content here matters
(see Details section below).
Note that i can be a Ranges object (a particular type of
list-like object), and, in that case, extractList is particularly
fast (this is a common use case).
regroup, g groups the elements of x.
relist methods behave like utils::relist except that they
return a List object. If skeleton has names, then they are
propagated to the returned value.splitAsList behaves like base::split except that the
former returns a List object instead of an ordinary list.extractList returns a list-like object parallel to i and with
the same "shape" as i (i.e. same element lengths).
If i has names, then they are propagated to the returned value.All these functions return a list-like object where the list elements have
the same class as x. relistToClass gives
the exact class of the returned object.
relist, split, and extractList have in common that
they return a list-like object where each list element has the same class
as the original vector-like object. Thus they need to be able to select
the appropriate List concrete subclass to use for this returned
value. This selection is performed by relistToClass
and is based only on the class of the original object. By default, extractList(x, i) is equivalent to:
relist(x[unlist(i)], i)An exception is made when
x is a data-frame-like object. In that
case x is subsetted along the rows, that is, extractList(x, i)
is equivalent to:
relist(x[unlist(i), ], i)This is more or less how the default method is implemented, except for some optimizations when
i is a Ranges object. relist and split (or splitAsList) can be seen as
special cases of extractList:
relist(flesh, skeleton) is equivalent to
extractList(flesh, PartitioningByEnd(skeleton)) split(x, f) is equivalent to
extractList(x, split(seq_along(f), f))
It is good practise to use extractList only for cases not covered
by relist or split. Whenever possible, using relist
or split is preferred as they will always perform more efficiently.
In addition their names carry meaning and are familiar to most R
users/developers so they'll make your code easier to read/understand.
Note that the transformation performed by relist or split
is always reversible (via unlist and unsplit, respectively),
but not the transformation performed by extractList (in general).
The regroup function splits the elements of unlist(x)
into a list according to the grouping g. Each element of
unlist(x) inherits its group from its parent element of
x. regroup is different from relist and
split, because x is already grouped, and the goal is to
combine groups.
unlist and relist
functions in the base and utils packages, respectively.
split and unsplit
functions in the base package.
split methods defined in the
S4Vectors package.
relistToClass is documented in the man
page for List objects.
## On an Rle object:
x <- Rle(101:105, 6:2)
i <- IRanges(6:10, 16:12, names=letters[1:5])
extractList(x, i)
## On a DataFrame object:
df <- DataFrame(X=x, Y=LETTERS[1:20])
extractList(df, i)
Run the code above in your browser using DataLab