RMSAGen: Integrating Multiple Sequence Alignment for Function RNA Design
DOI:
https://doi.org/10.1609/aaai.v40i1.37012Abstract
Biological sequences, including RNAs and proteins, share similarities with natural languages, enabling the application of advanced language models to various biological tasks. However, due to its flexibility and lack of experimental data, RNA is a particularly challenging biological ``language'' compared to other biological sequences like proteins. RNA multiple sequence alignments (MSAs), which align evolutionarily related RNA sequences, can greatly enhance RNA biology modeling, as evidenced by their significant roles in structure prediction and function annotation. This raises the question of whether RNA MSAs can also benefit RNA design, which remains unexplored. This paper introduces RMSAGen, a model comprising RMSA-Encoder and RMSA-Decoder, that leverages MSAs to design functional RNA sequences. RMSA-Encoder effectively extracts MSA features, enhancing performance in functional prediction and solvent accessibility prediction tasks and supporting RMSA-Decoder in accurate RNA generation. RMSAGen can design RNA sequences that effectively bind to target RNA-binding proteins, and the design performance improves with an increasing number of sequences. In addition, the ribozymes designed with structural features by RMSAGen show strong computational metrics and exhibit biological activity during gel electrophoresis. These results highlight the effectiveness of RMSAGen, establishing it as a powerful tool and a new direction for RNA design.Downloads
Published
2026-03-14
How to Cite
Jiang, J., Chen, Y., Zhang, Q., Li, J., Shi, X., Zhou, C., Lin, Z., Wang, J., He, D., Hong, L., Li, Q., Chen, P., Chen, J., Zhang, X., Yuan, J., Zhang, T., & Li, Y. (2026). RMSAGen: Integrating Multiple Sequence Alignment for Function RNA Design. Proceedings of the AAAI Conference on Artificial Intelligence, 40(1), 489-497. https://doi.org/10.1609/aaai.v40i1.37012
Issue
Section
AAAI Technical Track on Application Domains I