节点文献

基于模板的英蒙机器翻译系统的研究

Study on Template-Based English-Mongolian Machine Tranlation

【作者】 吉日木图

【导师】 敖其尔;

【作者基本信息】 内蒙古大学 , 计算机应用, 2005, 硕士

【摘要】 机器翻译(Machine Translation)是利用计算机把一种自然语言转换成另一种自然语言的过程。它是同时属于自然语言理解与处理、计算语言学以及人工智能领域的边缘学科。目前情况下,相对于英语和汉语而言蒙古语作为目标语言或者源语言的机器翻译理论还在初级阶段,没有一个比较成熟的理论和商品化的翻译系统。本课题是由国家自然科学基金项目——英蒙机器翻译研究课题资助。英蒙机器翻译研究是一项基础性研究工作。它对蒙古文的信息化和民族地区与其他地区之间的文化交流具有很大的推动作用。 本文首先介绍了机器翻译的发展史和背景知识,详细阐述了英-蒙机器翻译的模板匹配算法、蒙古文生成模型。 对于一些语法结构完整的简单句的翻译,本文提出了基于规则的翻译方法。通过英语语法分析得到英语的语法树,然后按照英语和蒙古语之间的转换规则把英语语法树转换成蒙古语语法树,再从蒙古语语法树生成符合蒙古语语法的蒙古文句子。 对于常用的复合句或一些固定短语的翻译,本文提出了基于模板的翻译方法。利用现有的模板进行模板匹配,得到比较准确的译文。在初级阶段,人工生成模板,而系统下一步目标为,通过基于词典的机器学习之后从现成语料库中自动提取模板。 对于模板和规则都不能正确处理的句子的翻译,本文提出了基于HMM模型的蒙古文生成方法。基于HMM模型的蒙古文生成方法采用词典驱动模型和HMM模型从单语料生成蒙古文译文。

【Abstract】 Machine translation is the procedure which uses the computer to transform a kind of natural language into another kind. It is an edge subject of natural language understanding and dealing, computational linguistics and artificial intelligence field. Under the situation at present, compared with English and Chinese, Mongolian is also in primary stage of the machine translation theory as a goal language or a source language. There isn’t a mature theory and commercialized translation system about Mongolian. This subject is first subject for research of English-Mongolian machine translation sponsored by the China’s natural science foundation project. The English-Mongolian machine translation study is a basic research work that has very great impetus to the cultural exchanges between minority area and other areas and to the informationization of Mongolian.This paper introduces development history and background knowledge of machine translation at first, and then explain template match algorithm of English-Mongolian machine translation and Mongolian generating model in detail.To the translations of some simple sentences, this paper has been put forward a rule-based translation method. Through English analyze, get grammar tree of English, change English grammar tree into Mongolian grammar tree according to English-Mongolian conversion rule, and then produce the Mongolian sentence which accords with Mongolian grammar from Mongolian grammar tree.To the translations of commonly used compound sentences or some regular phrases, this paper has been put forward the translation method based on template. Use existing template to match template and get very accurate translation. In primary stage, template was produced artificially; the further perfection goal of system is to draw the template automatically from the available language material through dictionary-based machine study.To the translations of sentences that neither template nor rule can translate correctly, this paper has been put forward Mongolian generating algorithm based on HMM. It produces Mongolian translation from the single language material through use of dictionary-based model and Hidden Markov Model.

【关键词】 机器翻译模板HMM模型蒙古文英文
【Key words】 EMMTEnglish-Mongolian Machine TranslationTemplateHMM Model
  • 【网络出版投稿人】 内蒙古大学
  • 【网络出版年期】2006年 02期
  • 【分类号】TP391.2
  • 【被引频次】7
  • 【下载频次】169
节点文献中: 

本文链接的文献网络图示:

本文的引文网络