One of "numerical", "categorical", "boolean",
"datetime", "id".
For categorical columns the level order used by the synthesizer follows
the input: a factor keeps its levels() order (including ordered
factors), while a plain character column gets a sorted unique-value order
for determinism. The sort is lexicographic, so numeric-like character
columns (c("2", "10")) come back ordered "10", "2". Coerce these to
factor with the desired level order before fitting if order matters.