✦ LIBER ✦

The exploitation of assembly language instructions in biological text manipulation: I. Nucleotide sequences

✍ Scribed by D.A. Mac Dónaill; N.H. Buttimore

Publisher: Elsevier Science
Year: 1996
Tongue: English
Weight: 581 KB
Volume: 32
Category: Article
ISSN: 0898-1221
DOI: 10.1016/s0898-1221(96)00194-0

No coin nor oath required. For personal study only.

✦ Synopsis

We explore the numerical interpretation of biological texts with a view to exploiting the efficiency with which digital computers manipulate binary strings. The central feature of our proposition is that not all numerical interpretations of biological text are digitally equivalent. Certain specific biological to numerical text mappings permit the exploitation of assembly instructions for processing entire strings in a single instruction cycle, thereby avoiding expensive digit by digit manipulation for nucleotide or amino acid sequences.

It is shown that the choice of mapping from biological to numerical text can critically effect efficiency. For nucleotide texts, we find that, of the 24 possible bijective mappings from the nucleotide alphabet to the quaternary number system, one subset of eight mappings corresponds to the interpretation of polymerase as a base-4 3's complement arithmetic operator, allowing polymerase to be particularly efficiently modelled using the assembly language NOT instruction.

📜 SIMILAR VOLUMES

The exploitation of assembly language in

The exploitation of assembly language instructions in biological text manipulation: II. Amino acid sequences

✍ N.H. Buttimore; D.A. Mac Dónaill 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 485 KB

Amino acid residues may be divided into groups according to similarity of function, or evolutionary history, or other useful criteria. A grouping of amino acids into the eight sets based upon functionality allows a representation involving a three-bit code that can be of value in string matching sea