Đang chuẩn bị nút TẢI XUỐNG, xin hãy chờ
Tải xuống
Inspired by the incremental TER alignment, we re-designed the Indirect HMM (IHMM) alignment, which is one of the best hypothesis alignment methods for conventional MT system combination, in an incremental manner. One crucial problem of incremental alignment is to align a hypothesis to a confusion network (CN). Our incremental IHMM alignment is implemented in three different ways: 1) treat CN spans as HMM states and define state transition as distortion over covered ngrams between two spans; 2) treat CN spans as HMM states and define state transition as distortion over words in component translations in the CN; and 3). | Incremental HMM Alignment for MT System Combination Chi-Ho Li Microsoft Research Asia 49 Zhichun Road Beijing China chl@microsoft.com Yupeng Liu Harbin Institute of Technology 92 Xidazhi Street Harbin China ypliu@mtlab.hit.edu.cn Abstract Inspired by the incremental TER alignment we re-designed the Indirect HMM IHMM alignment which is one of the best hypothesis alignment methods for conventional MT system combination in an incremental manner. One crucial problem of incremental alignment is to align a hypothesis to a confusion network CN . Our incremental IHMM alignment is implemented in three different ways 1 treat CN spans as HMM states and define state transition as distortion over covered ngrams between two spans 2 treat CN spans as HMM states and define state transition as distortion over words in component translations in the CN and 3 use a consensus decoding algorithm over one hypothesis and multiple IHMMs each of which corresponds to a component translation in the CN. All these three approaches of incremental alignment based on IHMM are shown to be superior to both incremental TER alignment and conventional IHMM alignment in the setting of the Chinese-to-English track of the 2008 NIST Open MT evaluation. 1 Introduction Word-level combination using confusion network Matusov et al. 2006 and Rosti et al. 2007 is a widely adopted approach for combining Machine Translation MT systems output. Word alignment between a backbone or skeleton translation and a hypothesis translation is a key problem in this approach. Translation Edit Rate TER Snover et al. 2006 based alignment proposed in Sim Xiaodong He Microsoft Research One Microsoft Way Redmond USA xiaohe@microsoft.com Ning Xi Nanjing University 8 Hankou Road Nanjing China xin@nlp.nju.edu.cn et al. 2007 is often taken as the baseline and a couple of other approaches such as the Indirect Hidden Markov Model IHMM He et al. 2008 and the ITG-based alignment Karakos et al. 2008 were recently proposed with better results