基于计算机算法的生物序列分析研究

Journal: Advances in Computer and Autonomous Intelligence Research DOI: 10.32629/acair.v4i1.19357

刘珂伶

西华大学

Abstract

生物序列承载着生命体遗传信息,其分析是生物信息学的核心课题。本文以计算机算法为支撑,系统探讨生物序列分析的基础理论、核心算法及应用拓展。首先阐述生物序列类型、特征与算法分类,明确性能评价指标;随后聚焦序列比对、特征提取与预测三大核心方向,剖析各类算法的原理、优势及优化路径;最后针对海量数据处理、智能算法融合等实际需求,提出算法改进策略。研究旨在为生物序列分析提供算法层面的理论参考,助力基因与蛋白质功能研究、疾病诊断等领域的技术突破,推动生物信息学向精准化、高效化发展。

Keywords

生物序列;计算机算法;序列比对;特征提取;智能融合

References

[1] 穆学琛.高维非结构化生物序列数据的深度学习表征及降维算法[D].吉林大学,2025.
[2] 王艳,冀松,刘静.基于KMP算法的生物序列模式自动识别应用研究[J].电脑知识与技术,2024,20(36):42-44.
[3] 李飞.面向生物功能序列识别的多模型整合算法研究[D].吉林大学,2024.
[4] 郭育洲,周小安,林洋.基于神经网络的生物序列分类探析[J].数字技术与应用,2024,42(11):152-156.
[5] 卫泽刚,陈旭,张小丹,等.基于Edlib的启发式生物序列聚类算法[J].宝鸡文理学院学报(自然科学版),2024,44(03):50-55.
[6] Jumper J,Evans R,Pritzel A,et al.Highly accurate protein structure prediction with AlphaFold[J].Nature,2021,596(7873):583-589.
[7] Baek M,DiMaio F,Anishchenko I,et al.Accurate prediction of protein structures and interactions using a three-track neural network[J].Science,2021,373(6557):871-876.
[8] Lin Z,Akin H, Rao R,et al.Evolutionary-scale prediction of atomic-level protein structure with a language model[J]. Science,2023,379(6637):1123-1130.
[9] Rives A, Meier J, Sercu T, et al. Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences[J]. Proceedings of the National Academy of Sciences,2021,118(15):e2016239118.
[10] Elnaggar A,Heinzinger M,Dallago C,et al.ProtTrans: Toward Cracking the Language of Life’s Code Through SelfSupervised Deep Learning and High Performance Computing[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2022,44(10):7112-7127 .
[11] Ji Y,Zhou Z,Liu H,Davuluri R V.DNABERT:pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome[J].Bioinformatics,2021,37(15):2112-2120.
[12] Dalla-Torre H,Gonzalez L,et al.The Nucleotide Transformer:Building and Evaluating Foundation Models for DNA[J].Nature Methods,2025,22:287-297 .
[13] GligorijevićV,Renfrew P D,Kosciolek T,et al. Structure-based protein function prediction using graph convolutional networks[J].Nature Communications,2021,12:3168.
[14] Mirdita M,Schütze K,Moriwaki Y,et al.ColabFold:making protein folding accessible to all[J].Nature Methods,2022,19(6):679-682.

Copyright © 2026 刘珂伶

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License