Is there a unifying principle in biology that can be deciphered from the structures of all the present day genomes? Can one hope to determine the general structure and organization of cellular information as well as elucidate the underlying evolutionary dynamics, by systematically exploring various statistical characteristics of the genomic and proteomic data at many levels? A large-scale software and hardware system, Valis, developed by NYU bioinformatics group, aims to do just that. We analyze word-frequency and domain family size distributions across various genomes, and the distribution of the potential "hot-spots" for segmental duplications in the human genome. We hypothesize and test, by computational analysis, that such a pattern is the result of a generic dominating evolution mechanism, "evolution by duplication", originally suggested by Susumu Ohno. We examine what implications these duplications may have in determining the translocations of male-biased genes from sex chromosomes, genome structure of sex chromosomes, copy-number fluctuations (amplifications of oncogenes and homozygous or homozygous deletions of tumor suppressor genes) in cancer genomes, etc. We examine, through our explorations with Valis, how important a role information technology is likely to play in elucidating biology at all levels.
|Original language||English (US)|
|Number of pages||1|
|Journal||Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)|
|State||Published - 2003|
ASJC Scopus subject areas
- Theoretical Computer Science
- Computer Science(all)