ISCA Archive Interspeech 2004
ISCA Archive Interspeech 2004

From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition

Andrew Cameron Morris, Viktoria Maier, Phil Green

The word error rate (WER), commonly used in ASR assessment, measures the cost of restoring the output word sequence to the original input sequence. However, for most CSR applications apart from dictation machines a more meaningful performance measure would be given by the proportion of information communicated. In this article we introduce two new absolute CSR performance measures: MER (match error rate) and WIL (word information lost). MER is the proportion of I/O word matches which are errors. WIL is a simple approximation to the proportion of word information lost which overcomes the problems associated with the RIL (relative information lost) measure that was proposed half a century ago. Issues relating to ideal performance measurement are discussed and the commonly used Viterbi input/output alignment procedure, with zero weight for hits and equal weight for substitutions, deletions and insertions, is shown to be optimal.


doi: 10.21437/Interspeech.2004-668

Cite as: Morris, A.C., Maier, V., Green, P. (2004) From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition. Proc. Interspeech 2004, 2765-2768, doi: 10.21437/Interspeech.2004-668

@inproceedings{morris04_interspeech,
  author={Andrew Cameron Morris and Viktoria Maier and Phil Green},
  title={{From WER and RIL to MER and WIL: improved evaluation measures for connected speech recognition}},
  year=2004,
  booktitle={Proc. Interspeech 2004},
  pages={2765--2768},
  doi={10.21437/Interspeech.2004-668}
}