InputFeatures_FE
An InputFeatures object is a single set of (input) features of data used for (output) feature extraction. Note that this class is similar to the InputFeatures class used for simple sequence classification, but with annoying differences in parameter names. The two will eventually be standardized; until then, check parameter names. (RBERT issue #28.)
.InputFeatures_EF(unique_id, tokens, input_ids, input_mask, input_type_ids)
unique_id: Integer or character; a unique id for this example.
tokens: Character vector; the actual tokens in this example.
input_ids: Integer vector; the sequence of token ids in this example.
input_mask: Integer vector; sequence of 1s (for "real" tokens) and 0s (for padding tokens).
input_type_ids: Integer vector; aka token_type_ids. Indicators for which sentence (or sequence) each token belongs to. Classical BERT supports only 0s and 1s (for first and second sentence, respectively).
An object of class InputFeatures_FE.
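The following is a minimal sketch of how the five arguments fit together. The constructor documented here is internal to RBERT, so the example uses a hypothetical stand-in, `new_input_features()`, that only mirrors the documented argument names and class name; it is not RBERT's actual implementation. The token ids are illustrative values, and the sketch assumes the id, mask, and type vectors are padded to a common length (6 here) while `tokens` holds only the real tokens.

```r
# Hypothetical stand-in for the internal constructor (NOT RBERT's code).
# It mirrors the documented arguments and returns an S3 object.
new_input_features <- function(unique_id, tokens, input_ids,
                               input_mask, input_type_ids) {
  # Assumption: the three integer vectors share one (padded) length,
  # and the mask contains only 0s and 1s.
  stopifnot(
    length(input_ids) == length(input_mask),
    length(input_ids) == length(input_type_ids),
    all(input_mask %in% c(0L, 1L))
  )
  structure(
    list(
      unique_id = unique_id,
      tokens = tokens,
      input_ids = as.integer(input_ids),
      input_mask = as.integer(input_mask),
      input_type_ids = as.integer(input_type_ids)
    ),
    class = "InputFeatures_FE"
  )
}

# A single-sentence example padded to a hypothetical max length of 6.
features <- new_input_features(
  unique_id = 1L,
  tokens = c("[CLS]", "hello", "world", "[SEP]"),
  input_ids = c(101L, 7592L, 2088L, 102L, 0L, 0L),  # illustrative ids
  input_mask = c(1L, 1L, 1L, 1L, 0L, 0L),           # 1 = real, 0 = padding
  input_type_ids = c(0L, 0L, 0L, 0L, 0L, 0L)        # all first sentence
)

# Count the real (non-padding) positions from the mask.
n_real <- sum(features$input_mask)
```

In this sketch the count of 1s in `input_mask` matches the number of entries in `tokens`, which is a quick sanity check that padding was applied only after the real tokens.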