节点文献

中国数字书法检索与作品真伪鉴别的研究

Retrieval and Verification of Chinese Calligraphy

【作者】 章夏芬

【导师】 庄越挺;

【作者基本信息】 浙江大学 , 计算机应用, 2006, 博士

【摘要】 数字化技术和数字存储技术的发展推动了中华民族五千年文化遗留下来的书法作品的数字化进程。原始的历史书法作品每件都是唯一的,存在于纸张、石头、绢丝或者竹简上,因容易破损而珍藏在博物馆里不允许随便翻阅。数字化使历史书法作品拥有数字形式的备份,得以在数字图书馆中长久保存,使用户能随时、随地、随意地浏览和欣赏民族文化瑰宝,更好地传承了民族文化。 然而,数字化书法作品同时也带来了新挑战:在当前OCR技术无法将书法字识别成文本从而进行检索的情况下,如何为这些作品提供有效的检索服务?在当前依靠主观经验难以人工鉴别大数据量数字书法作品真伪的情况下,如何在计算机辅助下为书法作品真伪鉴别提供客观、量化的评判? 本论文的目的是提出数字书法处理的理论、方法和技术,研究如何在不识别书法字的情况下检索书法字,研究如何提高数字图书馆中大数据量书法字检索速度,探索如何结合图像处理技术和人工智能技术以底层图像特征量化表达高层语义(书法风格)特征,从而辅助进行书法作品的真伪鉴别。主要完成了如下工作: 1.基于形状的书法字检索。提取书法字轮廓特征,在不对书法字图像进行识别的情况下而直接根据样例检索书法字。其思路在于构造书法字形状矩阵,检索在形状上与样例相似的书法字。对检索目标不在书法数据库中的情况,提出并展示了合成用户所需风格的新书法字的方法。 2.书法字快速检索。系统响应时间对拥有大数据量书法作品的数字图书馆至关重要,本文提出了三种加快书法字检索速度的途径:通过先粗略剪枝后精确匹配的方案缩短总体计算时间;通过动态投影时序校准算法、支持不等长序列的2D动态校准算法,缩短匹配算法所需的计算时间;采用高维索引的PK树索引书法字的高维特征,加快存取和查找的时间。 3.书法风格学习。书法风格是描述书法字、辨别不同书法家作品的关键,因此本文采用机器学习的方法,描述了如何用底层图像特征量化表征高层书法风格特征;其思路在于,从书法字的笔划形态特征和结体形态出发,统计和分析不同书法家书法字的稳定性书写特征,提出了个性风格特征的选取及其权重计算方法,构造了个性风格特征向量。 4.书法作品真伪鉴别。与数字图书馆书法收藏联系在一起的是书法真伪鉴别,因此本文提出了计算机辅助书法作品真伪鉴别的系统框架、工作原理;提出针对可疑作品所宣称的作者构造真迹模型的方法,对比分析真迹模型与可疑作品的每一个特征,检测可疑点并给出作品是真迹的可能性概率,同时提供量化的“可怀疑证据”及置信度。

【Abstract】 With rapid development of digitizing technology, the digitization of Chinese historical calligraphy works is marching into a new stage. The original historical calligraphy works are exists in papers, stones, silks or bamboo slips. They are unique, fragile, and can be easily destroyed. The digitizing gives them a secondary medium to keep and to enable universal accessible. But digitizing the historical calligraphy works brings new challenges: As the existing Optical Character Recognition technologies can’t convert these calligraphy images into texts, how to retrieve the calligraphy characters and enable the searching services in digital library? As calligraphy works’ verification has long been investigated by calligraphists subjectively from the art’s field. How to distinguish the genuine calligraphy works from those of forged with objective measures and evidences? This thesis discusses theories, methodologies and technologies of how to retrieve calligraphy works and how to bring objective measures to verify the historical calligraphy works. The main contributions of this thesis are as follows:1. Retrieval of shape-based calligraphy character. Chapter 3 introduces how to extract meaningful features from individual calligraphy character images segmented from the calligraphy image page. Chapter 4 proposes approaches of how to retrieve calligraphy characters without recognition. First shape matrix is construcedt for each calligraphy character based on the contour point’s property. Then retrieve the candidate similar to the query in terms of shape. Meanwhile, if the database contains no calligraphy character that meets the user’s demands, then a new style of calligraphy character will be generated by learning from two similar styles.2. Fast search engine of calligraphy character. The response time is important for the service of digital library, and so Chapter 5 proposes 3 strateges to speed up the retrieval time. The first is to use a coarse-to-fine process to shorten the total computation time. The second is to employ good algorithms of the Dyanamic projecting histogram warping and the extended 2D Dyanamic Programing Warpping to speed up the shape matching process. The third is to introduce high-dimensional indexer to minimize the access and index time.3. Learning of calligraphy writing style. Calligraphy style is the key to understand different calligraphy and to distingsuish calligraphy works created by differenct calligraphist in different dynasty. Thus Chapter 6 discusses about how to employ machine learning methodologies to represent calligraphy style by suitable features. The features, mainly in the stroke level and in the character level, are selected and the corresponding weights are computed to construct a personal calligraphy style vector.4. Verification of calligraphy works. Chapter 7 presents the system architecture and the basic working principle of calligraphy verification. First, the calligraphyist’s genuine mode is proposed according to the name that the suspicious works claimed. Then based on this genuine mode, the features of the suspicious works and the guniue’s works are analysized and the accepting probabilities are computed. The final decision is based on the accumulated probabilities and the

  • 【网络出版投稿人】 浙江大学
  • 【网络出版年期】2007年 02期
节点文献中: 

本文链接的文献网络图示:

本文的引文网络