DNA codes for additive stem similarity
We study two new concepts of combinatorial coding theory: additive stem similarity and additive stem distance between q -ary sequences. For q = 4, the additive stem similarity is applied to describe a mathematical model of thermodynamic similarity, which reflects the “hybridization potential” of two...
Saved in:
| Published in: | Problems of information transmission Vol. 45; no. 2; pp. 124 - 144 |
|---|---|
| Main Authors: | , |
| Format: | Journal Article |
| Language: | English |
| Published: |
Dordrecht
SP MAIK Nauka/Interperiodica
01.06.2009
|
| Subjects: | |
| ISSN: | 0032-9460, 1608-3253 |
| Online Access: | Get full text |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Abstract | We study two new concepts of combinatorial coding theory: additive stem similarity and additive stem distance between
q
-ary sequences. For
q
= 4, the additive stem similarity is applied to describe a mathematical model of thermodynamic similarity, which reflects the “hybridization potential” of two DNA sequences. Codes based on the additive stem distance are called DNA codes. We develop methods to prove upper and lower bounds on the rate of DNA codes analogous to the well-known Plotkin upper bound and random coding lower bound (the Gilbert-Varshamov bound). These methods take into account both the “Markovian” character of the additive stem distance and the structure of a DNA code specified by its invariance under the Watson-Crick transformation. In particular, our lower bound is established with the help of an ensemble of random codes where distribution of independent codewords is defined by a stationary Markov chain. |
|---|---|
| AbstractList | We study two new concepts of combinatorial coding theory: additive stem similarity and additive stem distance between
q
-ary sequences. For
q
= 4, the additive stem similarity is applied to describe a mathematical model of thermodynamic similarity, which reflects the “hybridization potential” of two DNA sequences. Codes based on the additive stem distance are called DNA codes. We develop methods to prove upper and lower bounds on the rate of DNA codes analogous to the well-known Plotkin upper bound and random coding lower bound (the Gilbert-Varshamov bound). These methods take into account both the “Markovian” character of the additive stem distance and the structure of a DNA code specified by its invariance under the Watson-Crick transformation. In particular, our lower bound is established with the help of an ensemble of random codes where distribution of independent codewords is defined by a stationary Markov chain. |
| Author | D’yachkov, A. G. Voronina, A. N. |
| Author_xml | – sequence: 1 givenname: A. G. surname: D’yachkov fullname: D’yachkov, A. G. email: agd-msu@yandex.ru organization: Probability Theory Chair, Faculty of Mechanics and Mathematics, Lomonosov Moscow State University – sequence: 2 givenname: A. N. surname: Voronina fullname: Voronina, A. N. organization: Probability Theory Chair, Faculty of Mechanics and Mathematics, Lomonosov Moscow State University |
| BookMark | eNp9jz1PwzAQhi1UJNLCD2DLxBY423HijFX5lCoYgDm65GzkKh_INkj99yQqE0hd7ob3fe70LNliGAfD2CWHa85lfvMKIEWVFwAVCIBcnbCEF6AzKZRcsGSOszk_Y8sQdgB8KsmEXd0-r9N2JBNSO_oUiVx03yYN0fRpcL3r0Lu4P2enFrtgLn73ir3f371tHrPty8PTZr3NWqF1zFBxS9qCBhIom7wBKAVSaxsi0pqagmSlisZWViqstLGlyA2BQjVNNHLF-OFu68cQvLH1p3c9-n3NoZ5F63-iE1P-YVoXMbpxiB5dd5QUBzJMX4YP4-vd-OWHSfAI9ANiA2da |
| CitedBy_id | crossref_primary_10_1016_j_ffa_2015_10_008 crossref_primary_10_1002_wics_1364 |
| Cites_doi | 10.1109/18.850695 10.1089/cmb.2007.0083 10.1146/annurev.biophys.32.110601.141800 10.1109/ISIT.2000.866628 10.1109/ISIT.2008.4595399 |
| ContentType | Journal Article |
| Copyright | Pleiades Publishing, Ltd. 2009 |
| Copyright_xml | – notice: Pleiades Publishing, Ltd. 2009 |
| DBID | AAYXX CITATION |
| DOI | 10.1134/S0032946009020045 |
| DatabaseName | CrossRef |
| DatabaseTitle | CrossRef |
| DatabaseTitleList | |
| DeliveryMethod | fulltext_linktorsrc |
| Discipline | Engineering Computer Science |
| EISSN | 1608-3253 |
| EndPage | 144 |
| ExternalDocumentID | 10_1134_S0032946009020045 |
| GroupedDBID | -5B -5G -BR -EM -Y2 -~C -~X .86 .DC .VR 06D 0R~ 0VY 123 1N0 29O 29~ 2J2 2JN 2JY 2KG 2KM 2LR 2P1 2VQ 2~H 30V 4.4 408 409 40D 40E 5VS 67Z 6NX 8TC 95- 95. 95~ 96X AAAVM AABHQ AACDK AAHNG AAIAL AAJBT AAJKR AANZL AARHV AARTL AASML AATNV AATVU AAUYE AAWCG AAYIU AAYQN AAYTO AAYZH ABAKF ABBBX ABBXA ABDZT ABECU ABFTD ABFTV ABHQN ABJNI ABJOX ABKCH ABKTR ABMNI ABMQK ABNWP ABQBU ABQSL ABSXP ABTEG ABTHY ABTKH ABTMW ABULA ABWNU ABXPI ACAOD ACBXY ACDTI ACGFS ACHSB ACHXU ACKNC ACMDZ ACMLO ACOKC ACOMO ACPIV ACSNA ACZOJ ADHHG ADHIR ADINQ ADKNI ADKPE ADRFC ADTPH ADURQ ADYFF ADZKW AEBTG AEFQL AEGAL AEGNC AEJHL AEJRE AEMSY AENEX AEOHA AEPYU AETLH AEVLU AEXYK AFBBN AFFNX AFGCZ AFLOW AFQWF AFWTZ AFZKB AGAYW AGDGC AGJBK AGMZJ AGQMX AGRTI AGWIL AGWZB AGYKE AHAVH AHBYD AHKAY AHSBF AHYZX AIAKS AIGIU AIIXL AILAN AITGF AJBLW AJRNO ALMA_UNASSIGNED_HOLDINGS ALWAN AMKLP AMXSW AMYLF AMYQR AOCGG ARMRJ ASPBG AVWKF AXYYD AZFZN B-. BA0 BDATZ BGNMA BSONS CAG COF CS3 CSCUP DDRTE DL5 DNIVK DPUIP DU5 EBLON EBS EIOEI EJD ESBYG FEDTE FERAY FFXSO FIGPU FINBP FNLPD FRRFC FSGXE FWDCC GGCAI GGRSB GJIRD GNWQR GQ6 GQ7 GQ8 GXS H13 HF~ HG6 HLICF HMJXF HQYDN HRMNR HVGLF HZ~ H~9 IHE IJ- IKXTQ IWAJR IXC IXD IXE IZIGR IZQ I~X I~Z J-C JBSCW JCJTX JZLTJ KDC KOV LAK LLZTM M4Y MA- N2Q NB0 NPVJJ NQJWS NU0 O9- O93 O9J OAM OVD P2P P9O PF0 PT4 QOS R89 R9I RIG RNI RNS ROL RPX RSV RZC RZE S16 S1Z S27 S3B SAP SDH SEG SHX SISQX SJYHP SNE SNPRN SNX SOHCF SOJ SPISZ SRMVM SSLCW STPWE SZN T13 TEORI TN5 TSG TSK TSV TUC U2A UG4 UOJIU UTJUX UZXMN VC2 VFIZW W23 W48 WH7 WK8 XU3 YLTOR Z88 ZMTXR ~A9 AAPKM AAYXX ABDBE ABFSG ABJCF ABRTQ ACSTC ADHKG AEZWR AFDZB AFFHD AFHIU AFKRA AFOHR AGQPQ AHPBZ AHWEU AIXLP ARAPS ATHPR BENPR BGLVJ CCPQU CITATION HCIFZ K7- M7S PHGZM PHGZT PQGLB PTHSS |
| ID | FETCH-LOGICAL-c288t-a51fd8f080d2a3b4b0072adcfbddd88db6d3956bf9f35a98ef724ed05a5ed0ae3 |
| IEDL.DBID | RSV |
| ISICitedReferencesCount | 7 |
| ISICitedReferencesURI | http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000268246600004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| ISSN | 0032-9460 |
| IngestDate | Tue Nov 18 22:23:45 EST 2025 Sat Nov 29 06:21:57 EST 2025 Fri Feb 21 02:35:36 EST 2025 |
| IsPeerReviewed | true |
| IsScholarly | true |
| Issue | 2 |
| Keywords | Information Transmission Large Deviation Principle Cross Hybridization Moment Generate Function Random Code |
| Language | English |
| License | http://www.springer.com/tdm |
| LinkModel | DirectLink |
| MergedId | FETCHMERGED-LOGICAL-c288t-a51fd8f080d2a3b4b0072adcfbddd88db6d3956bf9f35a98ef724ed05a5ed0ae3 |
| PageCount | 21 |
| ParticipantIDs | crossref_primary_10_1134_S0032946009020045 crossref_citationtrail_10_1134_S0032946009020045 springer_journals_10_1134_S0032946009020045 |
| PublicationCentury | 2000 |
| PublicationDate | 2009-06-01 |
| PublicationDateYYYYMMDD | 2009-06-01 |
| PublicationDate_xml | – month: 06 year: 2009 text: 2009-06-01 day: 01 |
| PublicationDecade | 2000 |
| PublicationPlace | Dordrecht |
| PublicationPlace_xml | – name: Dordrecht |
| PublicationTitle | Problems of information transmission |
| PublicationTitleAbbrev | Probl Inf Transm |
| PublicationYear | 2009 |
| Publisher | SP MAIK Nauka/Interperiodica |
| Publisher_xml | – name: SP MAIK Nauka/Interperiodica |
| References | Dembo, Zeitouni (CR10) 1993 CR4 Blahut (CR13) 1983 CR3 CR6 Bishop, D’yachkov, Macula, Renz, Rykov (CR2) 2007; 14 SantaLucia, Hicks (CR8) 2004; 33 MacWilliams, Sloane (CR5) 1977 Tutubalin (CR9) 1992 D’yachkov, Torney (CR7) 2000; 46 Gallager (CR11) 1968 D’yachkov, Vilenkin, Ismagilov, Sarbaev, Macula, Torney, White (CR1) 2005; 41. Berlekamp (CR12) 1984 J. SantaLucia Jr. (2004_CR8) 2004; 33 A. Dembo (2004_CR10) 1993 M.A. Bishop (2004_CR2) 2007; 14 F.J. MacWilliams (2004_CR5) 1977 A.G. D’yachkov (2004_CR7) 2000; 46 A.G. D’yachkov (2004_CR1) 2005; 41. 2004_CR3 R.G. Gallager (2004_CR11) 1968 R.E. Blahut (2004_CR13) 1983 V.N. Tutubalin (2004_CR9) 1992 2004_CR6 2004_CR4 E.R. Berlekamp (2004_CR12) 1984 |
| References_xml | – volume: 41. start-page: 57 issue: 4 year: 2005 end-page: 77 ident: CR1 article-title: On DNA Codes publication-title: Probl. Peredachi Inf. – volume: 46 start-page: 1558 issue: 4 year: 2000 end-page: 1564 ident: CR7 article-title: On Similarity Codes publication-title: IEEE Trans. Inform. Theory doi: 10.1109/18.850695 – year: 1984 ident: CR12 publication-title: Algebraic Coding Theory – year: 1983 ident: CR13 publication-title: Theory and Practice of Error Control Codes – year: 1977 ident: CR5 publication-title: The Theory of Error-Correcting Codes – year: 1968 ident: CR11 publication-title: Information Theory and Reliable Communication – ident: CR3 – ident: CR4 – volume: 14 start-page: 1088 issue: 8 year: 2007 end-page: 1104 ident: CR2 article-title: Free Energy Gap and Statistical Thermodynamic Fidelity of DNA Codes publication-title: J. Comput. Biol. doi: 10.1089/cmb.2007.0083 – ident: CR6 – year: 1993 ident: CR10 publication-title: Large Deviations Techniques and Applications – volume: 33 start-page: 415 year: 2004 end-page: 440 ident: CR8 article-title: The Thermodynamics of DNA Structural Motifs publication-title: Annu. Rev. Biophys. Biomol. Struct. doi: 10.1146/annurev.biophys.32.110601.141800 – year: 1992 ident: CR9 publication-title: Teoriya veroyatnostei i sluchainykh protsessov – volume: 14 start-page: 1088 issue: 8 year: 2007 ident: 2004_CR2 publication-title: J. Comput. Biol. doi: 10.1089/cmb.2007.0083 – ident: 2004_CR6 doi: 10.1109/ISIT.2000.866628 – volume-title: Theory and Practice of Error Control Codes year: 1983 ident: 2004_CR13 – volume-title: Teoriya veroyatnostei i sluchainykh protsessov year: 1992 ident: 2004_CR9 – volume: 41. start-page: 57 issue: 4 year: 2005 ident: 2004_CR1 publication-title: Probl. Peredachi Inf. – volume-title: Algebraic Coding Theory year: 1984 ident: 2004_CR12 – ident: 2004_CR3 – ident: 2004_CR4 doi: 10.1109/ISIT.2008.4595399 – volume-title: Information Theory and Reliable Communication year: 1968 ident: 2004_CR11 – volume: 46 start-page: 1558 issue: 4 year: 2000 ident: 2004_CR7 publication-title: IEEE Trans. Inform. Theory doi: 10.1109/18.850695 – volume-title: The Theory of Error-Correcting Codes year: 1977 ident: 2004_CR5 – volume: 33 start-page: 415 year: 2004 ident: 2004_CR8 publication-title: Annu. Rev. Biophys. Biomol. Struct. doi: 10.1146/annurev.biophys.32.110601.141800 – volume-title: Large Deviations Techniques and Applications year: 1993 ident: 2004_CR10 |
| SSID | ssj0010043 |
| Score | 1.785288 |
| Snippet | We study two new concepts of combinatorial coding theory: additive stem similarity and additive stem distance between
q
-ary sequences. For
q
= 4, the additive... |
| SourceID | crossref springer |
| SourceType | Enrichment Source Index Database Publisher |
| StartPage | 124 |
| SubjectTerms | Coding Theory Communications Engineering Control Electrical Engineering Engineering Information Storage and Retrieval Networks Systems Theory |
| Title | DNA codes for additive stem similarity |
| URI | https://link.springer.com/article/10.1134/S0032946009020045 |
| Volume | 45 |
| WOSCitedRecordID | wos000268246600004&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D |
| hasFullText | 1 |
| inHoldings | 1 |
| isFullTextHit | |
| isPrint | |
| journalDatabaseRights | – providerCode: PRVAVX databaseName: SpringerLINK Contemporary 1997-Present customDbUrl: eissn: 1608-3253 dateEnd: 99991231 omitProxy: false ssIdentifier: ssj0010043 issn: 0032-9460 databaseCode: RSV dateStart: 20010101 isFulltext: true titleUrlDefault: https://link.springer.com/search?facet-content-type=%22Journal%22 providerName: Springer Nature |
| link | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8QwEB509aAHV1fF9UUO4kEptEnaJsdFXTzIIr7YW0maCSy4q2xXf79JH4uLD9BLT9NSppnMTL_M9wGcpBgrNC6QeMKTwOVbGUiGURDllmue0pipclD4Jh0MxHAob-s57qI57d5AkuVOXemOcD_Ty6jkLkHL0H_aeBlWXLYTXq_h7v5pDh14bKviYqReP76BMr99xGIyWkRCywTTb__r1TZho64nSa9aAFuwhJMOtButBlKHbgfWPxEPbsPp5aBH_DR7QVzRSvyhIr_tEc_qTIrReOT6XVee78Bj_-rh4jqoFROCnAoxC1QcWSOsqwINVUxz7YnBlcmtNsYIYXRimGuItJWWxUoKtCnlaMJYxe6qkO1Ca_IywT0gmGhmkSlJteaJsw2FSV05hhhRhZR1IWxcl-U1nbhXtXjOyraC8eyLV7pwNr_lteLS-M34vPF1VodV8bP1_p-sD2CtAoX8z5RDaM2mb3gEq_n7bFRMj8vl9AHvvb_q |
| linkProvider | Springer Nature |
| linkToHtml | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1LS8QwEB50FdSDq6vi-sxBPCiFNknb5Lj4YMW1iK6yt5I0CSzoKtvV32_Sx-LiA_TS07SUaSbfTL_MNwBHsQ6FVjaQaEQjz-It9zjRgRdkhkoa45CIolG4FycJGwz4bdXHnden3WtKstipy7kj1PX0EsypBWjuu08bzsMCtYDlBPPv7h-n1IHjtkotRuzmx9dU5rePmAWjWSa0AJjL5r9ebQ1Wq3wSdcoFsA5zetSCZj2rAVWh24KVT8KDG3B8nnSQ62bPkU1akTtU5LY95FSdUT58Htp616bnm_BwedE_63rVxAQvw4xNPBEGRjFjs0CFBZFUOmFwoTIjlVKMKRkpYgsiabghoeBMmxhTrfxQhPYqNNmCxuhlpLcB6UgSo4ngWEoaWVufqdimY1oHWGhM2uDXrkuzSk7cTbV4SouygtD0i1facDK95bXU0vjN-LT2dVqFVf6z9c6frA9hqdu_6aW9q-R6F5ZLgsj9WNmDxmT8pvdhMXufDPPxQbG0PgD4_8LO |
| linkToPdf | http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwpV1LS8QwEB50FdGDj1VxfeYgHpRim6SPHBfXRXEpCz7YW0maBBa0Ltvq7zfpY3HxAeKlp2koaaYz02_m-wBOQ-VzJY0j0YAGjom3zGFEeY6XaipoiH3Cy0HhQRjH0WjEhrXOad50uzeQZDXTYFmasuJyInWtQULtfC_BjJpgzVz7mv1FWKK2j96W6_dPMxjB4lwVLyO2WvINrPntEvOBaR4VLYNNf-Pfj7kJ63WeibrVwdiCBZW1YaPRcEC1S7dh7RMh4Tac9eIuslPuOTLJLLLNRvZziCzbM8rHL2NTB5u0fQce-9cPVzdOraTgpDiKCof7npaRNtmhxJwIKixhOJepFlLKKJIikMQUSkIzTXzOIqVDTJV0fe6bK1dkF1rZa6b2AKlAEK0IZ1gIGhhbN5KhSdOU8jBXmHTAbbYxSWuacat28ZyU5QahyZdd6cD57JZJxbHxm_FFs-9J7W75z9b7f7I-gZVhr58MbuO7A1itcCP7v-UQWsX0TR3BcvpejPPpcXnKPgC5Lcuy |
| openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=DNA+codes+for+additive+stem+similarity&rft.jtitle=Problems+of+information+transmission&rft.au=D%E2%80%99yachkov%2C+A.+G.&rft.au=Voronina%2C+A.+N.&rft.date=2009-06-01&rft.issn=0032-9460&rft.eissn=1608-3253&rft.volume=45&rft.issue=2&rft.spage=124&rft.epage=144&rft_id=info:doi/10.1134%2FS0032946009020045&rft.externalDBID=n%2Fa&rft.externalDocID=10_1134_S0032946009020045 |
| thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0032-9460&client=summon |
| thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0032-9460&client=summon |
| thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0032-9460&client=summon |