
word.alignment (version 1.0.9)

Evaluation1: Evaluation of Word Alignment Quality

Description

Computes the Precision, Recall, AER, and F-measure metrics to evaluate the quality of a word alignment.

Usage

Evaluation1(file_gold = 'gold.RData',
            file_align = 'alignment.-1.3.RData',
            agn = c('my.agn', 'an.agn'), alpha = 0.3)

Arguments

file_gold

the gold standard file name.

file_align

the alignment file name.

agn

a character string taking one of two values. If "my.agn", the alignment to be evaluated is a one-to-many word alignment produced by the word_alignIBM1 function in this package. If "an.agn", the alignment to be evaluated was produced by other software.

alpha

a parameter that sets the trade-off between Precision and Recall.
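
As a rough illustration of what such a trade-off parameter does, the sketch below uses the alpha-weighted F-measure described by Fraser and Marcu (2007), F = 1 / (alpha / Precision + (1 - alpha) / Recall). The helper name f_measure is hypothetical and is not part of word.alignment, and it is an assumption that Evaluation1 uses exactly this form.

```r
# Hypothetical helper (not part of word.alignment): alpha-weighted F-measure
# in the form given by Fraser and Marcu (2007).
f_measure <- function(precision, recall, alpha = 0.3) {
  stopifnot(alpha >= 0, alpha <= 1, precision > 0, recall > 0)
  1 / (alpha / precision + (1 - alpha) / recall)
}

# alpha = 0.5 gives the balanced harmonic mean of Precision and Recall.
f_balanced <- f_measure(precision = 0.8, recall = 0.5, alpha = 0.5)

# alpha = 0.3 puts more weight on Recall, so the lower Recall
# pulls the score down further than in the balanced case.
f_recall_weighted <- f_measure(precision = 0.8, recall = 0.5, alpha = 0.3)
```

Under this form, alpha = 1 reduces the score to Precision alone and alpha = 0 reduces it to Recall alone.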

Value

A list.

Recall

A decimal number.

Precision

A decimal number.

AER

A decimal number.

F_measure.PS

A decimal number.

F_measure.S

A decimal number.
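
For orientation, Precision, Recall, and AER are commonly defined as in Och and Ney (2003): with A the predicted links, S the sure gold links, and P the possible gold links (S being a subset of P), Precision = |A and P| / |A|, Recall = |A and S| / |S|, and AER = 1 - (|A and S| + |A and P|) / (|A| + |S|). The sketch below computes these on a toy example. The function name alignment_metrics and the "i-j" string encoding of links are assumptions made for illustration; the package's internal data format and computation may differ.

```r
# Hypothetical helper (not part of word.alignment).
# Each link is an "i-j" string: source position i aligned to target position j.
# Assumes each vector contains no duplicate links.
alignment_metrics <- function(predicted, sure, possible) {
  possible <- union(sure, possible)              # sure links also count as possible
  a_s <- length(intersect(predicted, sure))      # predicted links that are sure
  a_p <- length(intersect(predicted, possible))  # predicted links that are possible
  list(
    Precision = a_p / length(predicted),
    Recall    = a_s / length(sure),
    AER       = 1 - (a_s + a_p) / (length(predicted) + length(sure))
  )
}

# Toy data, invented purely for illustration.
predicted <- c("1-1", "2-3", "3-2", "4-4")
sure      <- c("1-1", "2-2", "4-4")
possible  <- c("3-2")

m <- alignment_metrics(predicted, sure, possible)
# Here m$Precision is 3/4, m$Recall is 2/3 and m$AER is 2/7.
```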

Details

To evaluate word alignment quality, we need a "reference alignment" (a gold standard for the word alignment) of a test set. To read the gold standard into R and compare it with the word alignment results, the gold standard file name must be given in file_gold.

References

Fraser A., Marcu D. (2007), "Measuring Word Alignment Quality for Statistical Machine Translation.", Computational Linguistics, 33(3), 293-303.

Koehn P. (2010), "Statistical Machine Translation.", Cambridge University Press, New York.

Och F., Ney H. (2003), "A Systematic Comparison of Various Statistical Alignment Models.", Computational Linguistics, 29(1), 19-51.

Wang X. "Evaluation of Two Word Alignment Systems.", Final Thesis, Department of Computer and Information Science.

See Also

cons.agn, align_test.set, word_alignIBM1