Codon test : modeling amino acid substitution preferences in coding sequences

dc.contributor.author: Delport, Wayne
dc.contributor.author: Scheffler, Konrad
dc.contributor.author: Botha, Gordon
dc.contributor.author: Gravenor, Mike B.
dc.contributor.author: Muse, Spencer V.
dc.contributor.author: Pond, Sergei L. Kosakovsky
dc.date.accessioned: 2013-02-25T14:23:03Z
dc.date.available: 2013-02-25T14:23:03Z
dc.date.issued: 2010-08
dc.description: The original publication is available at www.ploscompbiol.org [en_ZA]
dc.description.abstract: Codon models of evolution have facilitated the interpretation of selective forces operating on genomes. These models, however, assume a single rate of non-synonymous substitution irrespective of the nature of amino acids being exchanged. Recent developments have shown that models which allow for amino acid pairs to have independent rates of substitution offer improved fit over single rate models. However, these approaches have been limited by the necessity for large alignments in their estimation. An alternative approach is to assume that substitution rates between amino acid pairs can be subdivided into K rate classes, dependent on the information content of the alignment. However, given the combinatorially large number of such models, an efficient model search strategy is needed. Here we develop a Genetic Algorithm (GA) method for the estimation of such models. A GA is used to assign amino acid substitution pairs to a series of K rate classes, where K is estimated from the alignment. Other parameters of the phylogenetic Markov model, including substitution rates, character frequencies and branch lengths are estimated using standard maximum likelihood optimization procedures. We apply the GA to empirical alignments and show improved model fit over existing models of codon evolution. Our results suggest that current models are poor approximations of protein evolution and thus gene and organism specific multi-rate models that incorporate amino acid substitution biases are preferred. We further anticipate that the clustering of amino acid substitution rates into classes will be biologically informative, such that genes with similar functions exhibit similar clustering, and hence this clustering will be useful for the evolutionary fingerprinting of genes. [en_ZA]
dc.description.sponsorship: This research was supported by the Joint DMS/NIGMS Mathematical Biology Initiative through Grant NSF-0714991, the National Institutes of Health (AI47745), and by a University of California, San Diego Center for AIDS Research/NIAID Developmental Award to WD and SLKP (AI36214). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. [en_ZA]
dc.description.version: Publishers version [en_ZA]
dc.format.extent: 17 p. : ill.
dc.identifier.citation: Delport, W. et al. 2010. Codon test: modeling amino acid substitution preferences in coding sequences. PLoS Computational Biology, 6(8):1-17. doi:10.1371/journal.pcbi.1000885. [en_ZA]
dc.identifier.issn: 1553-7358 (online)
dc.identifier.issn: 1553-734X (print)
dc.identifier.other: doi:10.1371/journal.pcbi.1000885
dc.identifier.uri: http://hdl.handle.net/10019.1/79608
dc.language.iso: en_ZA [en_ZA]
dc.publisher: PLOS Computational Biology [en_ZA]
dc.rights.holder: The author holds the copyright [en_ZA]
dc.subject: Codon models [en_ZA]
dc.subject: Amino acids [en_ZA]
dc.title: Codon test : modeling amino acid substitution preferences in coding sequences [en_ZA]
dc.type: Article [en_ZA]
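
The abstract states that the number of ways to group amino acid substitution pairs into K rate classes is combinatorially large, which is why a search heuristic such as a GA is used. The sketch below is only an illustration of that scale and is not taken from the paper. It assumes that rate classes are unlabeled and non-empty, so the count is a Stirling number of the second kind, and it uses all 190 unordered pairs of the 20 amino acids as an illustrative upper bound; the paper itself may consider a smaller set of exchangeable pairs. The function name `stirling2` is hypothetical.

```python
from math import comb


def stirling2(n: int, k: int) -> int:
    """Count partitions of n labeled items into k non-empty, unlabeled classes."""
    # row[j] holds S(i, j) for the current i; start from S(0, 0) = 1.
    row = [1] + [0] * k
    for i in range(1, n + 1):
        new = [0] * (k + 1)
        for j in range(1, min(i, k) + 1):
            # Item i either joins one of the j existing classes
            # or opens a new class on its own.
            new[j] = j * row[j] + row[j - 1]
        row = new
    return row[k]


# Assumption: every unordered pair of the 20 amino acids is a candidate,
# giving C(20, 2) = 190 pairs to distribute over K rate classes.
n_pairs = comb(20, 2)

if __name__ == "__main__":
    for k in range(2, 6):
        count = stirling2(n_pairs, k)
        print(f"K={k}: about 10^{len(str(count)) - 1} candidate models")
```

Even for small K the count is far beyond exhaustive enumeration, which is consistent with the abstract's call for an efficient model search strategy.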
Files
Original bundle
Name: delport_codon_2010.pdf
Size: 1.17 MB
Format: Adobe Portable Document Format
Description: Publishers' Version
License bundle
Name: license.txt
Size: 1.95 KB
Format: Item-specific license agreed upon to submission
Description: