✦ LIBER ✦

The exploitation of assembly language instructions in biological text manipulation: II. Amino acid sequences

✍ Scribed by N.H. Buttimore; D.A. Mac Dónaill

Publisher: Elsevier Science
Year: 1996
Tongue: English
Weight: 485 KB
Volume: 32
Category: Article
ISSN: 0898-1221
DOI: 10.1016/s0898-1221(96)00195-2

No coin nor oath required. For personal study only.

✦ Synopsis

Amino acid residues may be divided into groups according to similarity of function, or evolutionary history, or other useful criteria. A grouping of amino acids into the eight sets based upon functionality allows a representation involving a three-bit code that can be of value in string matching searches. An amino acid residue may be identified uniquely by employing a further two bits. We propose that amino acid sequence data and search strings be preprocessed to form strings of highest bits, strings of the next highest bits, and so on. Machine assembly language instructions on the separate bit-strings provide a hierarchical measure of homology. We study a number of preprocessing strategies arranged to accord with the kind of search contemplated.

📜 SIMILAR VOLUMES

The exploitation of assembly language in

The exploitation of assembly language instructions in biological text manipulation: I. Nucleotide sequences

✍ D.A. Mac Dónaill; N.H. Buttimore 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 581 KB

We explore the numerical interpretation of biological texts with a view to exploiting the efficiency with which digital computers manipulate binary strings. The central feature of our proposition is that not all numerical interpretations of biological text are digitally equivalent. Certain specific