Iterative coding scheme satisfying GC balance and run-length constraints for DNA storage with robustness to error propagation

In this paper, we propose a novel iterative encoding algorithm for DNA storage to satisfy both the GC balance and run-length constraints using a greedy algorithm. DNA strands with run-length more than three and the GC balance ratio far from 50% are known to be prone to errors. The proposed encoding...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:Journal of communications and networks Ročník 24; číslo 3; s. 283 - 291
Hlavní autoři: Park, Seong-Joon, Lee, Yongwoo, No, Jong-Seon
Médium: Journal Article
Jazyk:angličtina
Vydáno: Seoul The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 01.06.2022
한국통신학회
Témata:
ISSN:1229-2370, 1976-5541
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:In this paper, we propose a novel iterative encoding algorithm for DNA storage to satisfy both the GC balance and run-length constraints using a greedy algorithm. DNA strands with run-length more than three and the GC balance ratio far from 50% are known to be prone to errors. The proposed encoding algorithm stores data with high flexibility of run-length at most m and GC balance between 0.5 ± α for arbitrary m and α. More importantly, we propose a novel mapping method to reduce the average bit error compared to the randomly generated mapping method. By using the proposed method, the average bit error caused by the one base error is 2.3455 bits, which is reduced by 20.5%, compared to the randomized mapping. Also, it is robust to error propagation since the input sequence is partitioned into small blocks during the mapping step. The proposed algorithm is implemented through iterative encoding, consisting of three main steps: randomization, M-ary mapping, and verification. It has an information density of 1.833 bits/nt in the case of m = 3 and α = 0.05.
Bibliografie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1229-2370
1976-5541
DOI:10.23919/JCN.2022.000008