softmaxreg (version 1.2)

word2vec: Pre-trained Word2vec Dataset from Corpus

Description

This dataset is a small pre-trained word2vec dataset with 20 dimensions and 5296 words.

Usage

data("word2vec")

Arguments

Format

A data frame with 12853 observations on the following 21 variables.

Source

Reuter_50_50 dataset,UCI Machine Learning Repository[https://archive.ics.uci.edu/ml/datasets/Reuter_50_50]

References

word2vec model are trained based on below text corpus:

Reuter_50_50 dataset,UCI Machine Learning Repository[https://archive.ics.uci.edu/ml/datasets/Reuter_50_50], Author: Zhi Liu, National Engineering Research Center For E-Learning Technology, Hubei Wuhan, China

Examples

data(word2vec)