节点文献

基于PU-learning的磷酸激酶预测算法

Prediction algorithm of phosphokinase based on PU-learning

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 王艺琪王明举张进彭智才魏森谢多双

【Author】 WANG Yiqi;WANG Mingju;ZHANG Jin;PENG Zhicai;WEI Sen;XIE Duoshuang;Department of Information Resource,Taihe Hospital;

【通讯作者】 谢多双;

【机构】 太和医院

【摘要】 目的蛋白质磷酸化是通过激酶催化特定位点把磷酸基转移到底物蛋白质氨基酸残基的过程,是研究蛋白质活力及功能的重要机制。目前已鉴定的数千个磷酸化位点大多缺失激酶信息,为此本研究提出基于PU-learning的磷酸激酶预测算法,通过迭代标记磷酸位点,可以准确预测催化磷酸肽的磷酸激酶。方法首先该算法以PU-learning为框架,利用最大熵方差对不同种类的磷酸激酶自动筛选最佳阈值,从而提取每条磷酸肽上潜在的磷酸化位点,然后根据统计分析确定磷酸化位点对应的激酶,最后通过五折交叉验证该算法在Phospho. ELM数据库上的预测性能,并与现有算法对比。结果该算法的交叉验证特异性和灵敏度比现有最好算法在单个数据集上最高提高4%及10%,其预测Phospho. ELM中数据准确度达到79. 52%。结论基于PU-learning的磷酸激酶预测算法显著优于现有算法,且可以准确预测Phospho. ELM数据库中未知激酶信息的磷酸肽,在磷酸化实验中具有较强的指导意义。

【Abstract】 Objective Protein phosphorylation is a process by which a kinase catalyzes the transfer of a phosphate group to a protein residue at a specific site,as an important mechanism of protein activity and function. Most of identified phosphorylation sites are lack of kinase information. To this end,a prediction algorithm of phosphokinase based on PU-learning is proposed. By iterative phosphate site labeling,the phosphokinase that catalyzes the phosphopeptide can be accurately predicted. Methods The algorithm uses PUlearning as the framework to automatically screen the optimal thresholds for different kinds of phosphokinases by using the maximum entropy variance,so as to extract the potential phosphorylation sites on each phosphopeptide,and then determines the corresponding phosphorylation sites according to statistical analysis.Finally,the prediction performance is verified by a five-fold cross validation on the Phospho. ELM database and compared with existing algorithms. Results The cross-validation specificity and sensitivity of this algorithm are4% and 10% higher than those of the best existing approach on single data set,and the prediction accuracy on Phospho. ELM is as high as 79. 52%. Conclusions The prediction algorithm of phosphokinase based on PUlearning is significantly better than the existing algorithms,and can accurately predict the phosphopeptides of unknown kinase information in the Phospho. ELM database,which has a strong guiding significance in phosphorylation experiments.

【基金】 国家自然科学基金青年基金(31501070);湖北省自然科学基金(2017CFB137)资助
  • 【文献出处】 北京生物医学工程 ,Beijing Biomedical Engineering , 编辑部邮箱 ,2019年04期
  • 【分类号】Q55;Q811.4
  • 【网络出版时间】2019-08-15 16:53
  • 【下载频次】61
节点文献中: 

本文链接的文献网络图示:

本文的引文网络