TY - JOUR
T1 - Quantifying the mechanisms for segmental duplications in mammalian genomes by statistical analysis and modeling
AU - Zhou, Yi
AU - Mishra, Bud
PY - 2005/3/15
Y1 - 2005/3/15
N2 - A large number of the segmental duplications in mammalian genomes have been cataloged by genome-wide sequence analyses. The molecular mechanisms involved in these duplications mostly remain a matter of speculation. To uncover, test, and further quantify the hypotheses on the mechanisms for the recent duplications in the mammalian genomes, we have performed a series of statistical analyses on the sequences flanking the duplicated segments and proposed a dynamic model for the duplication process. The model, when applied to the human duplication data, indicates that ≈30% of the recent human segmental duplications were caused by a recombination-like mechanism, among which 12% were mediated by the most recently active repeat, Alu. But a significant proportion of the duplications are caused by some mechanism independent of the repeat distribution. A less sure but similar picture is found in the rodent genomes. A further analysis on the physical features of the flanking sequences suggests that one of the uncharacterized duplication mechanisms shared by the mammalian genomes is surprisingly well correlated with the physical instability in the DNA sequences.
AB - A large number of the segmental duplications in mammalian genomes have been cataloged by genome-wide sequence analyses. The molecular mechanisms involved in these duplications mostly remain a matter of speculation. To uncover, test, and further quantify the hypotheses on the mechanisms for the recent duplications in the mammalian genomes, we have performed a series of statistical analyses on the sequences flanking the duplicated segments and proposed a dynamic model for the duplication process. The model, when applied to the human duplication data, indicates that ≈30% of the recent human segmental duplications were caused by a recombination-like mechanism, among which 12% were mediated by the most recently active repeat, Alu. But a significant proportion of the duplications are caused by some mechanism independent of the repeat distribution. A less sure but similar picture is found in the rodent genomes. A further analysis on the physical features of the flanking sequences suggests that one of the uncharacterized duplication mechanisms shared by the mammalian genomes is surprisingly well correlated with the physical instability in the DNA sequences.
KW - Copy number fluctuation
KW - Genomic instability
KW - Interspersed transposable elements
KW - Markov models
KW - Segmental duplication
UR - http://www.scopus.com/inward/record.url?scp=15244351968&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=15244351968&partnerID=8YFLogxK
U2 - 10.1073/pnas.0407957102
DO - 10.1073/pnas.0407957102
M3 - Article
C2 - 15741274
AN - SCOPUS:15244351968
SN - 0027-8424
VL - 102
SP - 4051
EP - 4056
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 11
ER -