Disturbance-based Discretization, Differentiable IDS Channel, and an IDS-Correcting Code for DNA-based Storage

Authors

  • Alan J.X. Guo Tianjin University
  • Mengyi Wei Tianjin University
  • Yufan Dai Tianjin University
  • Yali Wei Tianjin University
  • Pengchen Zhang Tianjin University

DOI:

https://doi.org/10.1609/aaai.v40i26.39289

Abstract

With recent advancements in next-generation data storage, especially in biological molecule-based storage, insertion, deletion, and substitution (IDS) error-correcting codes have garnered increased attention. However, a universal method for designing tailored IDS-correcting codes across varying channel settings remains underexplored. We present an autoencoder-based approach, THEA-code, aimed at efficiently generating IDS-correcting codes for complex IDS channels. In the work, a disturbance-based discretization is proposed to discretize the features of the autoencoder, and a simulated differentiable IDS channel is developed as a differentiable alternative for IDS operations. These innovations facilitate the successful convergence of the autoencoder, producing channel-customized IDS-correcting codes that demonstrate commendable performance across complex IDS channels, particularly in realistic DNA-based storage channels.

Downloads

Published

2026-03-14

How to Cite

Guo, A. J., Wei, M., Dai, Y., Wei, Y., & Zhang, P. (2026). Disturbance-based Discretization, Differentiable IDS Channel, and an IDS-Correcting Code for DNA-based Storage. Proceedings of the AAAI Conference on Artificial Intelligence, 40(26), 21423–21431. https://doi.org/10.1609/aaai.v40i26.39289

Issue

Section

AAAI Technical Track on Machine Learning III