Integration of an Arabic Transliteration Module into a Statistical Machine Translation System

  1. (PDF, 485 KB)
AuthorSearch for: ; Search for: ; Search for: ; Search for: ; Search for:
ConferenceProceedings of Association for Computational Linguistics (ACL) Second Workshop on Statistical Machine Translation (WMT07), June 23, 2007., Prague, Czech Republic
AbstractWe provide an in-depth analysis of the integration of an Arabic-to-English transliteration system into a general-purpose phrase-based statistical machine translation system. We study the integration from different aspects and evaluate the improvement that can be attributed to the integration using the BLEU metric. Our experiments show that a transliteration module can help significantly in the situation where the test data is rich with previously unseen named entities. We obtain 70% and 53% of the theoretical maximum improvement we could achieve, as measured by an oracle on development and test sets respectively for OOV words (out of vocabulary source words not appearing in the phrase table).
Publication date
AffiliationNRC Institute for Information Technology; National Research Council Canada
Peer reviewedNo
NRC number50332
NPARC number8913876
Export citationExport as RIS
Report a correctionReport a correction
Record identifier764bce6d-2c60-429b-9216-390c0ab3a2cd
Record created2009-04-22
Record modified2016-05-09
Bookmark and share
  • Share this page with Facebook (Opens in a new window)
  • Share this page with Twitter (Opens in a new window)
  • Share this page with Google+ (Opens in a new window)
  • Share this page with Delicious (Opens in a new window)
Date modified: