evaluation: Evaluation of Word Alignment Quality

Description

It measures Precision, Recall, AER, and F_measurs metrics to evaluate the quality of word alignment.

Usage

evaluation(file.gold = 'gold.RData', 
           file.align = 'alignment.-1.3.RData', 
           agn = c('my.agn','an.agn'), alpha = 0.3)

Arguments

file.gold

the gold standarad file name.

file.align

the alignment file name.

agn

character string including two values. If "my.agn", the user wants to evaluate one-to-many word alignment using the align.ibm1 function in this package. If "an.agn", the user wants to evaluate word alignment results which are obtained by another software.

alpha

is a parameter that sets the trade-off between Precision and Recall.

Value

A list.

Recall

A decimal number.

Precision

A decimal number.

AER

A decimal number.

F_measure.PS

A decimal number.

F_measure.S

A decimal number.

Details

To evaluate word alignment quality, we need to a "reference alignment" (a gold standard for the word alignment) of a test set. In order to read the gold into RData format and to compare it with the word alignment results, the gold standard file name must be set in file.gold.

References

Fraser F., Marcu D. (2007), "MeasuringWord Alignment Quality for Statistical Machine Translation.", Computational Linguistics, 33(3), 293-303.

Koehn P. (2010), "Statistical Machine Translation.", Cambridge University, New York.

Och F., Ney H.(2003)."A Systematic Comparison Of Various Statistical Alignment Models.", 2003 Association for Computational Linguistics, J03-1002, 29(1).

Wang X. "Evaluation of Two Word Alignment Systems.", Final Thesis, Department of Computer and Information Science.

Description

Usage

Arguments

Value

Details

References

See Also