节点文献
滑皮金柑全长转录组测序分析
Full-length Transcriptome Sequencing and Comparative Genomics Analysis of Fortunella crassifolia Swingle
【摘要】 滑皮金柑(Fortunella crassifolia Swingle)为我国重要的柑橘类水果之一,为获得其完整的遗传信息,我们采用PacBio单细胞长链测序技术对滑皮金柑进行全长转录组测序,总获得13.81 G原始数据.经处理得到12 040个全长转录本,其中已知转录本为5 033个,已知基因的新转录本为5 589个,新基因转录本为1 418个,确定的新转录本占比为58%.全长转录本对应于6 984个基因,其中878个为新基因.经结构分析发现3 031个基因发生了可变剪切事件,占比为43.4%;6 106个基因至少含有一个poly(A)位点,且其中含有2个和2个以上的poly(A)位点的基因占比为42.3%;596个转录因子分布于82个转录因子家族;360个LncRNA序列和75个融合基因.通过对PacBio数据的进一步挖掘,该研究还发现了6个抗病基因,为滑皮金柑的抗病品种培育打下基础.
【Abstract】 Fortunella crassifolia Swingle is one of the most important citrus fruits crops in China. In order to obtain complete genetic information, single-molecule long read isoform sequencing(SMRT-seq) technology of PacBio was performed to sequence the full-length transcriptome of Huapi kumquat, and a total of 13.81 G raw data was produced. After analysis, we get 12,040 full-length transcripts including 5,033 known transcripts, 5,589 new transcripts of known genes, 1,418 new gene transcripts. 58% of transcripts were newly excavated transcripts that had not been previously annotated. 6,984 genes containing 878 new genes were annotated, and 3,031 genes(43.4%) were found to have alternative splicing events. 6,106 genes contained at least one poly(A) sites, and the proportion of genes containing two or more poly(A) sites was 42.3%. 596 transcription factors were identified and distributed in 82 transcription factor families. 360 LncRNAs and 75 fusion genes were identified. Beside, we also identified 6 resistance genes which will lay foundation for the breeding of disease-resistant varieties of Fortunella crassifolia Swingle in the future.
【Key words】 Fortunella crassifolia swingle; full-length transcriptome; alternative polyadenylation; fused genes; resistance genes;
- 【文献出处】 赣南师范大学学报 ,Journal of Gannan Normal University , 编辑部邮箱 ,2022年06期
- 【分类号】S666
- 【网络出版时间】2022-11-25 09:15:00
- 【下载频次】113