classify: Classification Function

Description

The classify() function combines random-projection-based dimension reduction and classification in a single call. The target dimension for the training and test data is the value returned by the dimension() function. A projection matrix is then generated by form_matrix() according to the input parameter "projection". The training and test data are projected into the low-dimensional space by multiplying them with the projection matrix. Finally, the reduced data are passed to the classifier. The function returns the confusion matrix of the classifier, from which its performance can be calculated.
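The projection steps described above can be sketched in base R. This is only an illustration under stated assumptions, not RandPro's internal code: jl_dim() is a hypothetical helper that uses the standard Johnson-Lindenstrauss bound (the exact formula behind dimension() is not given here), the data are synthetic, and eps = 0.5 is chosen only to keep the example small.

```r
# Illustrative sketch in base R; not RandPro's implementation.
set.seed(1)

# Hypothetical helper: minimum target dimension from the standard
# Johnson-Lindenstrauss bound (assumed; dimension() may differ).
jl_dim <- function(n, eps) {
  ceiling(4 * log(n) / (eps^2 / 2 - eps^3 / 3))
}

# Synthetic high-dimensional training and test data
n_train <- 50
n_test  <- 20
p       <- 2000
train <- matrix(rnorm(n_train * p), nrow = n_train)
test  <- matrix(rnorm(n_test * p),  nrow = n_test)

# Step 1: choose the reduced dimension from the number of training rows
k <- jl_dim(n_train, eps = 0.5)

# Step 2: build a Gaussian projection matrix (p x k); the 1/sqrt(k)
# scaling keeps squared norms unchanged in expectation
R <- matrix(rnorm(p * k, mean = 0, sd = 1 / sqrt(k)), nrow = p, ncol = k)

# Step 3: project both data sets with the same matrix
train_low <- train %*% R
test_low  <- test %*% R

dim(train_low)
dim(test_low)
```

The same matrix is applied to the training and test data so that both end up in the same low-dimensional space before classification.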

Usage

classify(train_data, test_data, train_label, test_label, eps = 0.1,
  projection = "gaussian", classifier = "knn")

Arguments

train_data

- Training data of either matrix or data frame

test_data

- Test data of either matrix or data frame

train_label

- Training label of either vector or data frame

test_label

- Test label of either vector or data frame

eps

- Error tolerance (epsilon), with default 0.1

projection

- Projection function, with default "gaussian"

classifier

- Classifier, with default "knn"

Value

Confusion Matrix
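
As a rough illustration of how performance can be read from a confusion matrix, the sketch below builds one with base R's table() from made-up labels. The labels are hypothetical, and the object returned by classify() may be richer than a plain table; only the idea of counting agreements and disagreements is shown.

```r
# Hypothetical actual and predicted labels, for illustration only
actual    <- factor(c("a", "a", "b", "b", "b", "c"), levels = c("a", "b", "c"))
predicted <- factor(c("a", "b", "b", "b", "c", "c"), levels = c("a", "b", "c"))

# Rows are predictions, columns are the reference labels
cm <- table(Predicted = predicted, Actual = actual)
cm

# Overall accuracy: correctly classified cases (diagonal) over all cases
accuracy <- sum(diag(cm)) / sum(cm)

# Per-class recall: diagonal counts over the column (reference) totals
recall <- diag(cm) / colSums(cm)
```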

Details

The parameters train_data, test_data, train_label and test_label are mandatory. The eps argument is the error tolerance parameter and must satisfy \(0.0 < eps < 1.0\). Its default value is 0.1, which means that an error of 10 percent is acceptable during projection. The supported projection functions are "gaussian", "probability", "li" and "achlioptas"; the default is "gaussian". Full details of the projection functions are given in the documentation of form_matrix(). The final argument, "classifier", selects the classifier used to train the model. The supported classifiers are:

"knn" - k-nearest neighbor classification

"svmlinear" - Support Vector Machine

"nb" - Naive Bayes Classifier
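
To give a feel for one of the non-default projection options named in the Details, the sketch below builds a sparse projection matrix in the style usually attributed to Achlioptas: entries are sqrt(3) times +1, 0 or -1 with probabilities 1/6, 2/3 and 1/6. This is an assumption-laden illustration in base R; the sizes p and k are arbitrary, and form_matrix() may sample or scale its matrix differently.

```r
# Illustrative Achlioptas-style sparse projection; not RandPro's code.
set.seed(42)

p <- 1000   # original dimension (arbitrary for this sketch)
k <- 50     # reduced dimension (arbitrary for this sketch)

# Entries drawn from sqrt(3) * {+1, 0, -1} with probabilities 1/6, 2/3, 1/6,
# then scaled by 1/sqrt(k) so squared norms are preserved in expectation
vals <- sqrt(3) * c(1, 0, -1)
R <- matrix(
  sample(vals, size = p * k, replace = TRUE, prob = c(1/6, 2/3, 1/6)) / sqrt(k),
  nrow = p, ncol = k
)

# About two thirds of the entries are exactly zero
sparsity <- mean(R == 0)

# Ratio of squared norms after and before projecting a random vector;
# it should be reasonably close to 1
x <- rnorm(p)
ratio <- sum((x %*% R)^2) / sum(x^2)
```

Because most entries are zero, such a matrix is cheaper to store and multiply than a dense Gaussian one, which is the usual motivation for choosing a sparse projection.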

References

[1] Cannings, T. I. and Samworth, R. J. (2015), "Random projection ensemble classification".

[2] Bingham, E. and Mannila, H. (2001), "Random projection in dimensionality reduction: Applications to image and text data".

Examples


# NOT RUN {
# Load Library
library(RandPro)

#Load Iris Data
data("iris")

#Split the data into training and test sets in a 75:25 ratio.
set.seed(101)
train_idx <- sample.int(n = nrow(iris), size = floor(.75*nrow(iris)), replace = FALSE)
trainn <- iris[train_idx, ]
testt  <- iris[-train_idx, ]

#Extract the train label and test label
trainl <- trainn$Species
testl <- testt$Species
typeof(trainl)  # "integer": a factor is stored as integer codes

#Remove the label from training set and test set
trainn <- trainn[,1:4]
testt <- testt[,1:4]

#Classify the Iris data with the default k-NN classifier.
res <- classify(trainn, testt, trainl, testl)
res

# }
