We consider the problem of aligning of k sequences of length n. The cost function is sum of pairs, and satisfies triangle inequality. Earlier results on finding approximation algorithms for this problem are due to Gusfield (1991) who achieved an approximation ratio of 2 -2/k, and Pevzner (1992) who
Non-approximability of weighted multiple sequence alignment
β Scribed by Bodo Manthey
- Publisher
- Elsevier Science
- Year
- 2003
- Tongue
- English
- Weight
- 167 KB
- Volume
- 296
- Category
- Article
- ISSN
- 0304-3975
No coin nor oath required. For personal study only.
β¦ Synopsis
We consider a weighted generalization of multiple sequence alignment (MSA) with sum-of-pair score. MSA without weights is known to be NP-complete and can be approximated within a constant factor, but it is unknown whether it has a polynomial time approximation scheme. Weighted multiple sequence alignment (WMSA) can be approximated within a factor of O(log 2 n) where n is the number of sequences.
We prove that WMSA alignment is MAX SNP-hard and establish a numerical lower bound on its approximability, namely 324 323 -. This lower bound is obtained already for the simple binary weighted case where the weights are restricted to 0 and 1. Furthermore, we show that WMSA and its restriction to binary weights can be approximated to the same degree.
π SIMILAR VOLUMES
Let S be a set of weighted axis-parallel rectangles such that for each axis no projection of one rectangle properly contains that of another. Two rectangles are in con ict if two projections of the rectangles on an axis intersect. The problem we consider in this paper is to ΓΏnd a maximum weighted su
Automatic identi"cation of sub-structures in multi-aligned sequences is of great importance for e!ective and objective structural/functional domain annotation, phylogenetic treeing and other molecular analyses. We present a segmentation algorithm that optimally partitions a given multi-alignment int