Learn R Programming

⚠️There's a newer version (0.4.2) of this package.Take me there.

mldr.datasets (version 0.3.1)

R Ultimate Multilabel Dataset Repository

Description

Large collection of multilabel datasets along with the functions needed to export them to several formats, to make partitions, and to obtain bibliographic information.

Copy Link

Version

Install

install.packages('mldr.datasets')

Monthly Downloads

2,173

Version

0.3.1

License

LGPL (>= 3) | file LICENSE

Issues

Pull Requests

Stars

Forks

Maintainer

David Charte

Last Published

November 24th, 2015

Functions in mldr.datasets (0.3.1)

cal500

Dataset with music data along with labels for emotions, instruments, genres, etc.
corel16k009

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
mediamill

Dataset with features extracted from video sequences and semantic concepts assigned as labels
nuswide_BoW

Dataset obtained from the NUS-WIDE database with BoW representation
rcv1sub4

Dataset from the Reuters corpus (subset 4)
reutersk500

Dataset from the Reuters Corpus with the 500 most relevant features selected
stackex_cooking

Dataset from the Stack Exchange's cooking forum
yahoo_business

Dataset generated from the Yahoo! web site index (business category)
yahoo_society

Dataset generated from the Yahoo! web site index (society category)
corel16k002

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
corel16k003

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
eurlexdc_test

List with 10 folds of the test data from the EUR-Lex directory codes dataset
rcv1sub3

Dataset from the Reuters corpus (subset 3)
stackex_coffee

Dataset from the Stack Exchange's coffee forum
stackex_cs

Dataset from the Stack Exchange's computer science forum
yahoo_education

Dataset generated from the Yahoo! web site index (arts education)
bibtex

Dataset with BibTeX entries
birds

Dataset with sounds produced by birds and the species they belong to
corel16k010

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
langlog

Dataset with data from the Language forum discussion
ng20

Dataset with news messages and the news groups they belong to
tmc2007_500

Dataset from airplanes failures reports (500 most relevant features extracted)
tmc2007

Dataset from airplanes failures reports
eurlexsm_tra

List with 10 folds of the train data from the EUR-Lex subject matters dataset
stackex_chemistry

Dataset from the Stack Exchange's chemistry forum
flags

Dataset with features correspoinding to world flags
yeast

Dataset with protein profiles and their categories
yahoo_health

Dataset generated from the Yahoo! web site index (health category)
corel16k001

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
toBibtex.mldr

BibTeX entry associated to an mldr object
corel16k006

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
corel16k007

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
delicious

Dataset generated from the del.icio.us site bookmarks
bookmarks

Dataset with data from web bookmarks and their categories
eurlexev_test

List with 10 folds of the test data from the EUR-Lex EUROVOC descriptors dataset
rcv1sub5

Dataset from the Reuters corpus (subset 5)
slashdot

Dataset generated from slashdot.org site entries
stackex_philosophy

Dataset from the Stack Exchange's philosophy forum
imdb

Dataset generated from the IMDB film database
rcv1sub1

Dataset from the Reuters corpus (subset 1)
yahoo_computers

Dataset generated from the Yahoo! web site index (computers category)
yahoo_social

Dataset generated from the Yahoo! web site index (social category)
yahoo_recreation

Dataset generated from the Yahoo! web site index (recreation category)
yahoo_arts

Dataset generated from the Yahoo! web site index (arts category)
yahoo_entertainment

Dataset generated from the Yahoo! web site index (arts entertainment)
emotions

Dataset with features extracted from music tracks and the emotions they produce
eurlexsm_test

List with 10 folds of the test data from the EUR-Lex subject matters dataset
eurlexdc_tra

List with 10 folds of the train data from the EUR-Lex directory codes dataset
random.kfolds

Partition an mldr object into k folds
nuswide_VLAD

Dataset obtained from the NUS-WIDE database with cVLAD+ representation
scene

Dataset from images with different natural scenes
enron

Dataset with email messages and the folders where the users stored them
corel5k

Dataset with data from the Corel image collection
genbase

Dataset with genes data and their functional expression
medical

Dataset generated from medical reports
mldrs

Obtain and show a list of additional datasets available to download
corel16k005

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
corel16k004

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
write.mldr

Export an mldr object or set of mldr objects to different file formats
stackex_chess

Dataset from the Stack Exchange's chess forum
corel16k008

Datasets with data from the Corel image collection. There are 10 subsets in corel16k
yahoo_reference

Dataset generated from the Yahoo! web site index (reference category)
yahoo_science

Dataset generated from the Yahoo! web site index (science category)
stratified.kfolds

Partition an mldr object into k folds
ohsumed

Dataset generated from a subset of the Medline database
eurlexev_tra

List with 10 folds of the train data from the EUR-Lex EUROVOC descriptors dataset
rcv1sub2

Dataset from the Reuters corpus (subset 2)
check_n_load.mldr

Check if an mldr object is locally available and download it if needed