University of Macau Library | UML Digital Resources Hub

UM Dissertations & Theses Collection (澳門大學電子學位論文庫)

Full Text

Title

Exploiting rich feature representation for SMT N-best reranking

English Abstract

EXPLOITING RICH FEATURE REPRESENTATION FOR SMT -BEST RERANKING by Yu Tong Thesis Supervisor: Derek F. Wong and Lidia S. Chao Master of Science in Computer Science In the field of Statistical Machine Translation, there is one issue that the poor quality of translation result which generated by machine translation system. One effective way to improve the translation quality is reranking the translation candidates which are from -best list. The sentences in -best lists represent the translation results of one source sentence. The translation result which is considered as the best during the process of decoder may be not optimal. The traditional method is to rerank -best list through training a reranking model by using the standard model features generated from translation system. For a rerankig model, standard model features can not well represent and distinguish the similar translation candidates from -best list. In this thesis, to solve this problem, instead of using standard model features, we focus on extracting semantic and syntactic features to represent the similar translation hypotheses for training a better reranking model. Our work focuses on investigating the performance on different kinds of features working on reranking tasks. We describe our work and evaluate it on different language pairs with different reranking algorithms. Applying our proposed method to do the experiment, the improvements up to three percent in the Bi-Lingual Evaluation Understudy (BLEU) score are obtained.

Issue date

2016.

Author

Tong, Yu

Faculty

Faculty of Science and Technology

Department

Department of Computer and Information Science

Degree

M.Sc.

Subject

Machine translating

Supervisor

Wong, Fai

Chao, Sam

Files In This Item

Full-text (Internet)

Location

1/F Zone C

Library URL

991001950389706306