A benchmark dataset and evaluation methodology for Chinese zero pronoun translation

Mingzhou Xu, Longyue Wang, Siyou Liu, Derek F. Wong, Shuming Shi, Zhaopeng Tu

研究成果: Article同行評審

2 引文 斯高帕斯(Scopus)

摘要

The phenomenon of zero pronoun (ZP) has attracted increasing interest in the machine translation community due to its importance and difficulty. However, previous studies generally evaluate the quality of translating ZPs with BLEU score on MT testsets, which is not expressive or sensitive enough for accurate assessment. To bridge the data and evaluation gaps, we propose a benchmark testset and evaluation metric for target evaluation on Chinese ZP translation. The human-annotated testset covers five challenging genres, which reveal different characteristics of ZPs for comprehensive evaluation. We systematically revisit advanced models on ZP translation and identify current challenges for future exploration. We release data, code, and trained models, which we hope can significantly promote research in this field.

原文English
頁(從 - 到)1263-1293
頁數31
期刊Language Resources and Evaluation
57
發行號3
DOIs
出版狀態Published - 9月 2023

指紋

深入研究「A benchmark dataset and evaluation methodology for Chinese zero pronoun translation」主題。共同形成了獨特的指紋。

引用此