RUS  ENG
Full version
JOURNALS // Problemy Peredachi Informatsii // Archive

Probl. Peredachi Inf., 2022 Volume 58, Issue 2, Pages 12–23 (Mi ppi2365)

This article is cited in 4 papers

Coding Theory

On the maximum number of non-confusable strings evolving under short tandem duplications

M. Kovačević

Faculty of Technical Sciences, University of Novi Sad, Novi Sad, Serbia

Abstract: The set of all q-ary strings that do not contain repeated substrings of length $\leqslant3$ (i.e., that do not contain substrings of the form aa, abab,andabcabc) constitutes a code cor recting an arbitrary number of tandem-duplication mutations of length $\leqslant3$. In other words, any two such strings are non-confusable in the sense that they cannot produce the same string while evolving under tandem duplications of length $\leqslant3$. We demonstrate that this code is asymptotically optimal in terms of rate, meaning that it represents the largest set of non-con fusable strings up to subexponential factors. This result settles the zero-error capacity problem for the last remaining case of tandem-duplication channels satisfying the “root-uniqueness” property.

Keywords: tandem duplication, tandem repeat, duplication error, repetition error, sticky in sertion, DNA storage, error correction, zero-error capacity, constrained code, square-free string.

UDC: 621.391 : 519.724.2

Received: 04.10.2021
Revised: 20.04.2022
Accepted: 21.04.2022

DOI: 10.31857/S0555292322020028


 English version:
Problems of Information Transmission, 2022, 58:2, 111–121


© Steklov Math. Inst. of RAS, 2026