- id
[integer(1)]
Data set ID, autogenerated by the server. Ignored when set manually.
- name
[character(1)]
The name of the data set.
- version
[character(1)]
Version of the data set, autogenerated by the server. Ignored when set manually.
- description
[character(1)]
Description of the data set, given by the uploader.
- format
[character(1)]
Format of the data set. At the moment this is always "ARFF".
- creator
[character]
The person(s), that created this data set. Optional.
- contributor
[character]
People, that contibuted to this version of the data set (e.g., by reformatting). Optional.
- collection.date
[character(1)]
The date the data was originally collected. Given by the uploader. Optional.
- upload.date
[POSIXt]
The date the data was uploaded. Added by the server. Ignored when set manually.
- language
[character(1)]
Language in which the data is represented. Starts with 1 upper case letter, rest lower case,
e.g. 'English'
- licence
[character(1)]
Licence of the data. NA means: Public Domain or "don't know/care".
- url
[character(1)]
Valid URL that points to the data file.
- default.target.attribute
[character]
The default target attribute, if it exists. Of course, tasks can be defined that use
another attribute as target.
- row.id.attribute
[character(1)]
The attribute that represents the row-id column, if present in the data set. Else NA.
- ignore.attribute
[character]
Attributes that should be excluded in modelling, such as identifiers and indexes. Optional.
- version.label
[character(1)]
Version label provided by user, something relevant to the user. Can also be a date,
hash, or some other type of id.
- citation
[character(1)]
Reference(s) that should be cited when building on this data.
- visibility
[character(1)]
Who can see the data set. Typical values: 'Everyone', 'All my friends', 'Only me'.
Can also be any of the user's circles.
- original.data.url
[character(1)]
For derived data, the url to the original data set.
This can be an OpenML data set, e.g. 'http://openml.org/d/1'.
- paper.url
[character(1)]
Link to a paper describing the data set.
- update.comment
[character(1)]
When the data set is updated, add an explanation here.
- md5.checksum
[character(1)]
MD5 checksum to check if the data set is downloaded without corruption.
Can be ignored by user.
- status
[character(1)]
The status of the data set, autogenerated by the server. Ignored when set manually.
- tags
[character]
Optional tags for the data set.