TY - JOUR
T1 - ArcMatch
T2 - high-performance subgraph matching for labeled graphs by exploiting edge domains
AU - Bonnici, Vincenzo
AU - Grasso, Roberto
AU - Micale, Giovanni
AU - Maria, Antonio di
AU - Shasha, Dennis
AU - Pulvirenti, Alfredo
AU - Giugno, Rosalba
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024
Y1 - 2024
N2 - Consider a large labeled graph (network), denoted the target. Subgraph matching is the problem of finding all instances of a small subgraph, denoted the query, in the target graph. Unlike the majority of existing methods that are restricted to graphs with labels solely on vertices, our proposed approach, named can effectively handle graphs with labels on both vertices and edges. ntroduces an efficient new vertex/edge domain data structure filtering procedure to speed up subgraph queries. The procedure, called path-based reduction, filters initial domains by scanning them for paths up to a specified length that appear in the query graph. Additionally, ncorporates existing techniques like variable ordering and parent selection, as well as adapting the core search process, to take advantage of the information within edge domains. Experiments in real scenarios such as protein–protein interaction graphs, co-authorship networks, and email networks, show that s faster than state-of-the-art systems varying the number of distinct vertex labels over the whole target graph and query sizes.
AB - Consider a large labeled graph (network), denoted the target. Subgraph matching is the problem of finding all instances of a small subgraph, denoted the query, in the target graph. Unlike the majority of existing methods that are restricted to graphs with labels solely on vertices, our proposed approach, named can effectively handle graphs with labels on both vertices and edges. ntroduces an efficient new vertex/edge domain data structure filtering procedure to speed up subgraph queries. The procedure, called path-based reduction, filters initial domains by scanning them for paths up to a specified length that appear in the query graph. Additionally, ncorporates existing techniques like variable ordering and parent selection, as well as adapting the core search process, to take advantage of the information within edge domains. Experiments in real scenarios such as protein–protein interaction graphs, co-authorship networks, and email networks, show that s faster than state-of-the-art systems varying the number of distinct vertex labels over the whole target graph and query sizes.
KW - Domain reduction
KW - Labelled graphs
KW - Path-based reduction
KW - Subgraph isomorphism
UR - http://www.scopus.com/inward/record.url?scp=85200692427&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85200692427&partnerID=8YFLogxK
U2 - 10.1007/s10618-024-01061-8
DO - 10.1007/s10618-024-01061-8
M3 - Article
AN - SCOPUS:85200692427
SN - 1384-5810
JO - Data Mining and Knowledge Discovery
JF - Data Mining and Knowledge Discovery
ER -