Abstract

Graphs have long been used to model relationships between entities. For some applications, a single graph is sufficient; for other problems, a collection of graphs may be more appropriate to represent the underlying data. Many contemporary problem domains, for which graphs are an ideal data model, contain an enormous amount of data (e.g., social networks). Hence, researchers frequently employ parallelized or distributed processing. The graph data must first be partitioned and assigned to the multiple processors in a way that the workload is balanced and inter-processor communication is minimized. The latter problem may be complicated by the existence of edges between vertices in a graph that have been assigned to different processors. Herein we introduce a strategy that combines vocabulary-based summarization of graphs (VoG) and detection of hotspots (i.e., vertices of high degree) to determine how a single undirected graph should be partitioned to optimize multi-processor load balancing and minimize the number of edges that exist between the partitioned subgraphs. We benchmark our method against another well-known partitioning algorithm (METIS) to demonstrate the benefits of our approach.

Department(s)

Computer Science

Keywords and Phrases

Graph data mining; Graph partitioning; Hotspot; Structures

International Standard Serial Number (ISSN)

2561-8156; 2561-8148

Document Type

Article - Journal

Document Version

Citation

File Type

text

Language(s)

English

Rights

© 2025 The Authors, All rights reserved.

Publication Date

01 Jun 2025

Share

 
COinS