DNA Codes with Constant GC-Content Constructed from Hadamard Matrices

Young-Sik KIM
Hosung PARK
Sang-Hyo KIM

IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences   Vol.E100-A    No.11    pp.2408-2415
Publication Date: 2017/11/01
Online ISSN: 1745-1337
DOI: 10.1587/transfun.E100.A.2408
Type of Manuscript: PAPER
Category: Coding Theory
binary pseudo random sequences,  Hadamard matrix,  DNA codes,  inverse Gray mapping,  minimum Hamming distance,  

Full Text: PDF>>
Buy this Article

To construct good DNA codes based on biologically motivated constraints, it is important that they have a large minimum Hamming distance and the number of GC-content is kept constant. Also, maximizing the number of codewords in a DNA code is required for given code length, minimum Hamming distance, and number of GC-content. In most previous works on the construction of DNA codes, quaternary constant weight codes were directly used because the alphabet of DNA strands is quaternary. In this paper, we propose new coding theoretic constructions of DNA codes based on the binary Hadamard matrix from a binary sequence with ideal autocorrelation. The proposed DNA codes have a greater number of codewords than or the equal number to existing DNA codes constructed from quaternary constant weight codes. In addition, it is numerically shown that for the case of codes with length 8 or 16, the number of codewords in the proposed DNA code sets is the largest with respect to the minimum reverse complementary Hamming distances, compared to all previously known results.