Learn R Programming

⚠️There's a newer version (0.4.2) of this package.Take me there.

mldr.datasets (version 0.3.1)

R Ultimate Multilabel Dataset Repository

Description

Large collection of multilabel datasets along with the functions needed to export them to several formats, to make partitions, and to obtain bibliographic information.

Copy Link

Version

Install

install.packages('mldr.datasets')

Monthly Downloads

2,173

Version

0.3.1

License

LGPL (>= 3) | file LICENSE

Issues

Pull Requests

Stars

Forks

Repository

https://github.com/fcharte/mldr.datasets

Maintainer

David Charte

Last Published

November 24th, 2015

Functions in mldr.datasets (0.3.1)

Dataset with music data along with labels for emotions, instruments, genres, etc.

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

Dataset with features extracted from video sequences and semantic concepts assigned as labels

Dataset obtained from the NUS-WIDE database with BoW representation

Dataset from the Reuters corpus (subset 4)

Dataset from the Reuters Corpus with the 500 most relevant features selected

stackex_cooking

Dataset from the Stack Exchange's cooking forum

Dataset generated from the Yahoo! web site index (business category)

Dataset generated from the Yahoo! web site index (society category)

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

List with 10 folds of the test data from the EUR-Lex directory codes dataset

Dataset from the Reuters corpus (subset 3)

Dataset from the Stack Exchange's coffee forum

Dataset from the Stack Exchange's computer science forum

yahoo_education

Dataset generated from the Yahoo! web site index (arts education)

Dataset with BibTeX entries

Dataset with sounds produced by birds and the species they belong to

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

Dataset with data from the Language forum discussion

Dataset with news messages and the news groups they belong to

Dataset from airplanes failures reports (500 most relevant features extracted)

Dataset from airplanes failures reports

List with 10 folds of the train data from the EUR-Lex subject matters dataset

stackex_chemistry

Dataset from the Stack Exchange's chemistry forum

Dataset with features correspoinding to world flags

Dataset with protein profiles and their categories

Dataset generated from the Yahoo! web site index (health category)

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

BibTeX entry associated to an mldr object

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

Dataset generated from the del.icio.us site bookmarks

Dataset with data from web bookmarks and their categories

List with 10 folds of the test data from the EUR-Lex EUROVOC descriptors dataset

Dataset from the Reuters corpus (subset 5)

Dataset generated from slashdot.org site entries

stackex_philosophy

Dataset from the Stack Exchange's philosophy forum

Dataset generated from the IMDB film database

Dataset from the Reuters corpus (subset 1)

yahoo_computers

Dataset generated from the Yahoo! web site index (computers category)

Dataset generated from the Yahoo! web site index (social category)

yahoo_recreation

Dataset generated from the Yahoo! web site index (recreation category)

Dataset generated from the Yahoo! web site index (arts category)

yahoo_entertainment

Dataset generated from the Yahoo! web site index (arts entertainment)

Dataset with features extracted from music tracks and the emotions they produce

List with 10 folds of the test data from the EUR-Lex subject matters dataset

List with 10 folds of the train data from the EUR-Lex directory codes dataset

Partition an mldr object into k folds

Dataset obtained from the NUS-WIDE database with cVLAD+ representation

Dataset from images with different natural scenes

Dataset with email messages and the folders where the users stored them

Dataset with data from the Corel image collection

Dataset with genes data and their functional expression

Dataset generated from medical reports

Obtain and show a list of additional datasets available to download

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

Export an mldr object or set of mldr objects to different file formats

Dataset from the Stack Exchange's chess forum

Datasets with data from the Corel image collection. There are 10 subsets in corel16k

yahoo_reference

Dataset generated from the Yahoo! web site index (reference category)

Dataset generated from the Yahoo! web site index (science category)

stratified.kfolds

Partition an mldr object into k folds

Dataset generated from a subset of the Medline database

List with 10 folds of the train data from the EUR-Lex EUROVOC descriptors dataset

Dataset from the Reuters corpus (subset 2)

check_n_load.mldr

Check if an mldr object is locally available and download it if needed