TY - JOUR
T1 - GRAPES
T2 - A Software for Parallel Searching on Biological Graphs Targeting Multi-Core Architectures
AU - Giugno, Rosalba
AU - Bonnici, Vincenzo
AU - Bombieri, Nicola
AU - Pulvirenti, Alfredo
AU - Ferro, Alfredo
AU - Shasha, Dennis
PY - 2013/10/22
Y1 - 2013/10/22
N2 - Biological applications, from genomics to ecology, deal with graphs that represents the structure of interactions. Analyzing such data requires searching for subgraphs in collections of graphs. This task is computationally expensive. Even though multicore architectures, from commodity computers to more advanced symmetric multiprocessing (SMP), offer scalable computing power, currently published software implementations for indexing and graph matching are fundamentally sequential. As a consequence, such software implementations (i) do not fully exploit available parallel computing power and (ii) they do not scale with respect to the size of graphs in the database. We present GRAPES, software for parallel searching on databases of large biological graphs. GRAPES implements a parallel version of well-established graph searching algorithms, and introduces new strategies which naturally lead to a faster parallel searching system especially for large graphs. GRAPES decomposes graphs into subcomponents that can be efficiently searched in parallel. We show the performance of GRAPES on representative biological datasets containing antiviral chemical compounds, DNA, RNA, proteins, protein contact maps and protein interactions networks.
AB - Biological applications, from genomics to ecology, deal with graphs that represents the structure of interactions. Analyzing such data requires searching for subgraphs in collections of graphs. This task is computationally expensive. Even though multicore architectures, from commodity computers to more advanced symmetric multiprocessing (SMP), offer scalable computing power, currently published software implementations for indexing and graph matching are fundamentally sequential. As a consequence, such software implementations (i) do not fully exploit available parallel computing power and (ii) they do not scale with respect to the size of graphs in the database. We present GRAPES, software for parallel searching on databases of large biological graphs. GRAPES implements a parallel version of well-established graph searching algorithms, and introduces new strategies which naturally lead to a faster parallel searching system especially for large graphs. GRAPES decomposes graphs into subcomponents that can be efficiently searched in parallel. We show the performance of GRAPES on representative biological datasets containing antiviral chemical compounds, DNA, RNA, proteins, protein contact maps and protein interactions networks.
UR - http://www.scopus.com/inward/record.url?scp=84886012840&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84886012840&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0076911
DO - 10.1371/journal.pone.0076911
M3 - Article
C2 - 24167551
AN - SCOPUS:84886012840
SN - 1932-6203
VL - 8
JO - PloS one
JF - PloS one
IS - 10
M1 - e76911
ER -