Computational Methods for the Analysis of Differential Conservation in Groups of Similar DNA Sequences

Nikola Stojanovic (nick@cse.uta.edu)

Department of Computer Science and Engineering, University of Texas at Arlington, Arlington, TX 76019, USA


Abstract

Multiple sequence alignments are a powerful tool for identifying the regions of DNA which have been constrained in evolutionary divergence, presumably due to their functional role. However, such constraints rarely manifest themselves as perfect conservation of a site clearly standing out in its broader environment, as they reflect the species-specific differences in proteins, as well as the ability of some proteins to interact with multiple variants of their binding sequence. In this paper we explore the use of alignment column uncertainty as an aid in locating differential phylogenetic footprints, which refer to the sites in DNA where groups of related species exhibit sequence conservation, but where the pattern may vary between the groups. We use efficient, linear-time algorithms to locate such sites. We have performed a study of the mammalian CAV2-CAV1 gene region using our software, and we conclude with several observations concerning the differential conservation and the use of computational methods for its detection. The software developed for this project is available, free of charge, by contacting the author.

[ Full-text PDF | Table of Contents ]


Japanese Society for Bioinformatics