Construction of an Insect Gene Database Platform and Research on Its Key Technologies

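【Author】 Zhang Yiqun (张艺群)

【Advisors】 Xu Huanliang (徐焕良); Li Fei (李飞)

【Author information】 Nanjing Agricultural University, Computer Application Technology, 2012, Master's thesis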

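【Abstract (translated from the Chinese)】 With continuing advances in molecular biology, genome sequencing for many organisms has been completed or soon will be, generating large volumes of biological data. This has led to rapid growth in the number of bioinformatics databases. At present, insect gene sequence data come from two kinds of database: specialized databases dedicated to individual species, and large comprehensive databases that store gene data for many species. Species-specific databases are few, while the comprehensive databases cover such a variety of species and data types that they are neither convenient nor fine-grained enough for research focused on one group of species. Building a gene database dedicated to a specific group of species is therefore of considerable value. This thesis first introduces the concept and classification of bioinformatics databases and summarizes their state of development in China and abroad. It then examines in depth several key technologies involved in implementing the insect gene database platform. Because insect gene data from different sources differ in structure, a detailed conceptual design and storage design was produced for each data structure, and on this basis an insect gene database was built with MySQL as the back-end database. The database contains transcriptome data for eight agricultural pests sequenced in the laboratory, insect EST data, and genome data for six insects: the pea aphid, the mosquito Anopheles gambiae, the honey bee, the silkworm, Drosophila melanogaster, and the red flour beetle. To better meet users' needs, the thesis designs and implements sequence retrieval and download, literature retrieval, BLAST sequence alignment, and GBrowse graphical genome display. The platform provides keyword-based sequence retrieval; whole-database and per-category downloads, as well as immediate download of search results; retrieval of related literature; an integrated BLAST-based sequence alignment tool offered as an online service; and GBrowse-based graphical genome display for the six sequenced species. The result is a bioinformatics database dedicated to insects that gives researchers a more convenient platform for using these data.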

【Abstract】 With the development of sequencing techniques, the genomes of many important organisms have been sequenced, producing a huge amount of biological data. There are currently two types of database source: specialized databases for individual species, and large-scale comprehensive databases that contain data for many species. Though the comprehensive databases contain much more data, they are not convenient for specialized purposes. Building a specialized insect gene database is therefore of great significance. This thesis introduces the classification of bioinformatics databases and then summarizes their current development. The insect gene database was constructed with MySQL as its back-end database, on the basis of a detailed conceptual design and storage design for each of the different data structures. The database collects transcriptome data for eight agricultural pests sequenced in our laboratory, insect EST data, and genome data for Acyrthosiphon pisum, Anopheles gambiae, Apis mellifera, Bombyx mori, Drosophila melanogaster, and Tribolium castaneum. We constructed an insect bioinformatics platform that offers data retrieval and download, literature retrieval, a BLAST sequence alignment service, and a GBrowse viewer for genome data. The platform provides keyword-based sequence retrieval and data downloads, and it supports retrieval of relevant literature. It also integrates an online analysis tool based on the BLAST algorithm for sequence alignment, and it displays genome data graphically using the GBrowse software. The result is a specialized insect gene database that provides a more convenient data platform for researchers.
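
As an illustration of the keyword-based sequence retrieval and result download mentioned in the abstract, the following is a minimal sketch only, not the thesis's implementation. It assumes a single hypothetical table named `sequence` with made-up columns and placeholder rows, and it uses Python's built-in `sqlite3` module as a stand-in for the MySQL back end so that the example is self-contained.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical `sequence` table.

    The schema and all rows are placeholders invented for this sketch;
    the thesis's actual MySQL schema is not given in the abstract.
    """
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sequence ("
        " seq_id TEXT PRIMARY KEY,"
        " species TEXT NOT NULL,"
        " data_type TEXT NOT NULL,"      # e.g. genome, EST, transcriptome
        " description TEXT NOT NULL,"
        " residues TEXT NOT NULL)"
    )
    rows = [
        ("demo_0001", "Bombyx mori", "genome",
         "placeholder chitinase-like gene", "ATGGCTAGCTAGGCTA"),
        ("demo_0002", "Tribolium castaneum", "EST",
         "placeholder cytochrome P450 fragment", "ATGCCGTTAGGC"),
        ("demo_0003", "Acyrthosiphon pisum", "genome",
         "placeholder chitinase precursor", "ATGAAACCCGGGTTT"),
    ]
    conn.executemany("INSERT INTO sequence VALUES (?, ?, ?, ?, ?)", rows)
    return conn


def search_sequences(conn, keyword):
    """Return records whose description or species contains the keyword.

    The keyword is passed as a bound parameter, never interpolated into
    the SQL text. SQL wildcards inside the keyword are not escaped here.
    """
    pattern = f"%{keyword}%"
    cursor = conn.execute(
        "SELECT seq_id, species, description, residues FROM sequence"
        " WHERE description LIKE ? OR species LIKE ?"
        " ORDER BY seq_id",
        (pattern, pattern),
    )
    return cursor.fetchall()


def to_fasta(records, width=60):
    """Format search results as FASTA text for download."""
    lines = []
    for seq_id, species, description, residues in records:
        lines.append(f">{seq_id} {species} {description}")
        # Wrap the sequence into fixed-width lines.
        for start in range(0, len(residues), width):
            lines.append(residues[start:start + width])
    return "".join(line + "\n" for line in lines)


if __name__ == "__main__":
    connection = build_demo_db()
    hits = search_sequences(connection, "chitinase")
    print(to_fasta(hits), end="")
```

Here the search results are turned directly into FASTA text, which corresponds to the "immediate download of search results" idea. A production system on MySQL would likely add a full-text index rather than rely on `LIKE` scans.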

  • 【Classification code】Q811.4; TP311.13