Generative Adversarial Networks for DNA Storage Channel Simulator
- Abstract
- DNA data storage systems have rapidly developed with novel error-correcting techniques, random access algorithms, and query systems. However, designing an algorithm for DNA storage systems is challenging, mainly due to the unpredictable nature of errors and the extremely high price of experiments. Thus, a simulator is of interest that can imitate the error statistics of a DNA storage system and replace the experiments in developing processes. We introduce novel generative adversarial networks that learn DNA storage channel statistics. Our simulator takes oligos (DNA sequences to write) as an input and generates a FASTQ file that includes output DNA reads and quality scores as if the oligos are synthesized and sequenced. We trained the proposed simulator with data from a single experiment consisting of 14,400 input oligo strands and 12,108,573 output reads. The error statistics between the input and the output of the trained generator match the actual error statistics, including the error rate at each position, the number of errors for each nucleotide, and high-order statistics. The code is available at https://github.com/gyfbianhuanyun/DNA_storage_simulator_GAN .
- Issued Date
- 2023
Sanghoon Kang
Yunfei Gao
Jaeho Jeong
Seong-Joon Park
Jae-Won Kim
Jong-Seon No
- Type
- Article
- Keyword
- Channel simulator; DNA storage; generative adversarial networks; recurrent neural networks; transformer
- DOI
- 10.1109/ACCESS.2023.3235201
- URI
- https://oak.ulsan.ac.kr/handle/2021.oak/17877
- Publisher
- IEEE ACCESS
- Language
- 영어
- ISSN
- 2169-3536
- Citation Volume
- 11
- Citation Start Page
- 3781
- Citation End Page
- 3793
-
Appears in Collections:
- Engineering > IT Convergence
- 공개 및 라이선스
-
- 파일 목록
-
Items in Repository are protected by copyright, with all rights reserved, unless otherwise indicated.