RMSAGen: Integrating Multiple Sequence Alignment for Function RNA Design

Authors

  • Jiyue Jiang The Chinese University of Hong Kong
  • Yanyu Chen The Chinese University of Hong Kong
  • Qingchuan Zhang The Chinese University of Hong Kong
  • Jiayi Li The Chinese University of Hong Kong
  • Xiangyu Shi The Chinese University of Hong Kong
  • Chang Zhou The Chinese University of Hong Kong
  • Ziqian Lin The Chinese University of Hong Kong
  • Jiuming Wang The Chinese University of Hong Kong
  • Dongchen He The Chinese University of Hong Kong
  • Liang Hong The Chinese University of Hong Kong
  • Qintong Li The University of Hong Kong
  • Pengan Chen The Chinese University of Hong Kong
  • Jiayang Chen The Chinese University of Hong Kong
  • Xinrui Zhang The Chinese University of Hong Kong
  • Jiao Yuan Guangzhou National Laboratory Guangzhou Medical University
  • Tianqing Zhang Hangzhou Institute of Medicine, Chinese Academy of Sciences
  • Yu Li The Chinese University of Hong Kong

DOI:

https://doi.org/10.1609/aaai.v40i1.37012

Abstract

Biological sequences, including RNAs and proteins, share similarities with natural languages, enabling the application of advanced language models to various biological tasks. However, due to its flexibility and lack of experimental data, RNA is a particularly challenging biological ``language'' compared to other biological sequences like proteins. RNA multiple sequence alignments (MSAs), which align evolutionarily related RNA sequences, can greatly enhance RNA biology modeling, as evidenced by their significant roles in structure prediction and function annotation. This raises the question of whether RNA MSAs can also benefit RNA design, which remains unexplored. This paper introduces RMSAGen, a model comprising RMSA-Encoder and RMSA-Decoder, that leverages MSAs to design functional RNA sequences. RMSA-Encoder effectively extracts MSA features, enhancing performance in functional prediction and solvent accessibility prediction tasks and supporting RMSA-Decoder in accurate RNA generation. RMSAGen can design RNA sequences that effectively bind to target RNA-binding proteins, and the design performance improves with an increasing number of sequences. In addition, the ribozymes designed with structural features by RMSAGen show strong computational metrics and exhibit biological activity during gel electrophoresis. These results highlight the effectiveness of RMSAGen, establishing it as a powerful tool and a new direction for RNA design.

Downloads

Published

2026-03-14

How to Cite

Jiang, J., Chen, Y., Zhang, Q., Li, J., Shi, X., Zhou, C., Lin, Z., Wang, J., He, D., Hong, L., Li, Q., Chen, P., Chen, J., Zhang, X., Yuan, J., Zhang, T., & Li, Y. (2026). RMSAGen: Integrating Multiple Sequence Alignment for Function RNA Design. Proceedings of the AAAI Conference on Artificial Intelligence, 40(1), 489-497. https://doi.org/10.1609/aaai.v40i1.37012

Issue

Section

AAAI Technical Track on Application Domains I