TY - JOUR
T1 - Adaptive Evolution Signatures in Prochlorococcus
T2 - Open Reading Frame (ORF)eome Resources and Insights from Comparative Genomics
AU - Daakour, Sarah
AU - Nelson, David R.
AU - Fu, Weiqi
AU - Jaiswal, Ashish
AU - Dohai, Bushra
AU - Alzahmi, Amnah Salem
AU - Koussa, Joseph
AU - Huang, Xiaoluo
AU - Shen, Yue
AU - Twizere, Jean Claude
AU - Salehi-Ashtiani, Kourosh
N1 - Publisher Copyright:
© 2024 by the authors.
PY - 2024/8
Y1 - 2024/8
N2 - Prochlorococcus, a cyanobacteria genus of the smallest and most abundant oceanic phototrophs, encompasses ecotype strains adapted to high-light (HL) and low-light (LL) niches. To elucidate the adaptive evolution of this genus, we analyzed 40 Prochlorococcus marinus ORFeomes, including two cornerstone strains, MED4 and NATL1A. Employing deep learning with robust statistical methods, we detected new protein family distributions in the strains and identified key genes differentiating the HL and LL strains. The HL strains harbor genes (ABC-2 transporters) related to stress resistance, such as DNA repair and RNA processing, while the LL strains exhibit unique chlorophyll adaptations (ion transport proteins, HEAT repeats). Additionally, we report the finding of variable, depth-dependent endogenous viral elements in the 40 strains. To generate biological resources to experimentally study the HL and LL adaptations, we constructed the ORFeomes of two representative strains, MED4 and NATL1A synthetically, covering 99% of the annotated protein-coding sequences of the two species, totaling 3976 cloned, sequence-verified open reading frames (ORFs). These comparative genomic analyses, paired with MED4 and NATL1A ORFeomes, will facilitate future genotype-to-phenotype mappings and the systems biology exploration of Prochlorococcus ecology.
AB - Prochlorococcus, a cyanobacteria genus of the smallest and most abundant oceanic phototrophs, encompasses ecotype strains adapted to high-light (HL) and low-light (LL) niches. To elucidate the adaptive evolution of this genus, we analyzed 40 Prochlorococcus marinus ORFeomes, including two cornerstone strains, MED4 and NATL1A. Employing deep learning with robust statistical methods, we detected new protein family distributions in the strains and identified key genes differentiating the HL and LL strains. The HL strains harbor genes (ABC-2 transporters) related to stress resistance, such as DNA repair and RNA processing, while the LL strains exhibit unique chlorophyll adaptations (ion transport proteins, HEAT repeats). Additionally, we report the finding of variable, depth-dependent endogenous viral elements in the 40 strains. To generate biological resources to experimentally study the HL and LL adaptations, we constructed the ORFeomes of two representative strains, MED4 and NATL1A synthetically, covering 99% of the annotated protein-coding sequences of the two species, totaling 3976 cloned, sequence-verified open reading frames (ORFs). These comparative genomic analyses, paired with MED4 and NATL1A ORFeomes, will facilitate future genotype-to-phenotype mappings and the systems biology exploration of Prochlorococcus ecology.
KW - MED4
KW - NALT1A
KW - Prochlorococcus
KW - comparative genomics
KW - deep learning
KW - endogenous viral elements
KW - light adaptations
UR - http://www.scopus.com/inward/record.url?scp=85202620130&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85202620130&partnerID=8YFLogxK
U2 - 10.3390/microorganisms12081720
DO - 10.3390/microorganisms12081720
M3 - Article
AN - SCOPUS:85202620130
SN - 2076-2607
VL - 12
JO - Microorganisms
JF - Microorganisms
IS - 8
M1 - 1720
ER -