Predicting malaria interactome classifications from time-course transcriptomic data along the intraerythrocytic developmental cycle

Antonina Mitrofanova, Samantha Kleinberg, Jane Carlton, Simon Kasif, Bud Mishra

Research output: Contribution to journalArticlepeer-review


Objective: Even though a vaccine for malaria infections has been under intense study for many years, it has resisted several different lines of attack attempted by biologists. More than half of Plasmodium proteins still remain uncharacterized and therefore cannot be used in clinical trials. The task is further complicated by the metamorphic life-cycle of the parasite, which allows for rapid evolutionary changes and diversity among related strains, thus making precise targeting of the appropriate proteins for vaccination a technical challenge. We propose an automated method for predicting functions for the malaria parasite, which capitalizes on the importance of the intraerythrocytic developmental cycle data and expression changes during its five phases, as determined computationally by our segmentation algorithm. Materials and methods: Our method combines temporal gene expression profiles with protein-protein interaction data, sequence similarity scores, and metabolic pathway information to produce a set of predicted protein functions that can be used as targets for vaccine development. We use a Bayesian approach, which assigns a probability of having (or not having) a particular function to each protein, given the various sources of evidence. In our method, each data source is represented by either a functional linkage graph or a categorical feature vector. Results and conclusions: The methods are tested on Plasmodium falciparum, the species responsible for the deadliest malaria infections. The algorithm was able to assign meaningful functions to 628 out of 1439 previously unannotated proteins, which are first-choice candidates for experimental vaccine research. We conclude that analyzing time-course gene expression profiles in separate phases leads to much higher prediction accuracy when compared with Pearson correlation coefficients computed across the time course as a whole. Additionally, we demonstrate that temporal expression profiles alone are able to improve the predictive power of the integrated data.

Original languageEnglish (US)
Pages (from-to)167-176
Number of pages10
JournalArtificial Intelligence in Medicine
Issue number3
StatePublished - Jul 2010


  • Bayesian probabilistic approach
  • Intraerythrocytic developmental cycle
  • N-terminal host targeting motif
  • Pexel
  • Plasmodium falciparum
  • Protein function prediction
  • Red blood cell membrane proteins
  • Time-course gene expression data

ASJC Scopus subject areas

  • Medicine (miscellaneous)
  • Artificial Intelligence


Dive into the research topics of 'Predicting malaria interactome classifications from time-course transcriptomic data along the intraerythrocytic developmental cycle'. Together they form a unique fingerprint.

Cite this