PRISM: Practical In-Memory Acceleration for Subgraph Matching at Scale

PRISM is a scalable subgraph counting framework designed for the UPMEM Processing-In-Memory (PIM) architecture. It currently supports efficient triangle counting, and is extensible to other subgraph patterns.

The goal of PRISM is to accelerate subgraph counting tasks on large-scale graphs using UPMEM DPUs. It integrates asynchronous pipelining, bitmap-based set intersection, and other optimizations tailored for near-data processing architecture.

🚀 Key Features

Fast triangle counting for large-scale sparse graphs.
Skew-aware workload distribution for balanced execution across thousands of DPUs.
Asynchronous loader–worker pipeline using WRAM FIFO.
Bitmap-based set intersection acceleration on DPUs.
Lightweight performance profiling and cycle analysis tools.
CSR (Compressed Sparse Row) binary input format for fast loading.

📁 Directory Structure

PRISM/
├── host/         # Host-side logic (C)
├── dpu/          # DPU-side programs (C for UPMEM)
├── python_tool/  # Python scripts for preprocessing and profiling
├── include/      # Shared headers
├── makefile      # Compilation rules
└── README.md     # Project description

🛠 Requirements

Linux environment
UPMEM SDK v2025.1.0.
GNU Make, C compiler (e.g., gcc)
Python ≥ 3.8 (for analysis scripts)

⚙️ Build and Run Instructions

To match a pattern within a graph, run:

make clean
GRAPH=<graph_name> PATTERN=<pattern_name> make test

Example:

GRAPH=AM0312 PATTERN=CLIQUE3 make test

💡 The available values for GRAPH and PATTERN are defined in include/common.h.
To add new graphs or patterns, modify common.h and recompile.

📥 Input Format

PRISM accepts input graphs in binary CSR (Compressed Sparse Row) format.

To convert an edge list into CSR binary format, use python_tool/adjtsv2csrbin.py:

python3 python_tool/adjtsv2csrbin.py input.tsv --output graph.bin --header 1

Skips header line (if specified).
Ignores edge weights (only node pairs are used).
Outputs the following binary layout:
- node_num (4 bytes)
- edge_num (4 bytes)
- row_ptr[] (node_num × 4 bytes)
- col_idx[] (edge_num × 4 bytes)

Example:

mkdir -p ./data 
wget https://graphchallenge.s3.amazonaws.com/snap/amazon0312/amazon0312_adj.tsv 
python3 python_tool/adjtsv2csrbin.py amazon0312_adj.tsv
mv amazon0312_adj.bin ./data
GRAPH=AM0312 PATTERN=CLIQUE3 make test

🧩 Customized Graphs and Matching Patterns

PRISM supports flexible definitions of graph inputs and matching patterns.

All configuration entries are defined in include/common.h.

➕ Adding Custom Graphs

Place your input graph (in CSR binary format) into the ./data/ directory.
Add a macro definition in include/common.h:

#if defined(AM0312)
#define DATA_NAME "amazon0312_adj"
#define N (1<<20)
#define M (1<<23)
#endif

Build and test:

GRAPH=AM0312 PATTERN=CLIQUE3 make test

➕ Adding Custom Patterns

Define a new macro for your pattern kernel in include/common.h

#elif defined(TELE5)
#define KERNEL_FUNC tele5
#define PATTERN_NAME "tele5"
#endif

Implement the kernel function in dpu/ directory (e.g., in TELE5.c or new source file).
Build and run:

GRAPH=AM0312 PATTERN=TELE5 make test

📈 Scalability Testing

PRISM is designed to scale from hundreds to tens of thousands of DPUs.

🔧 Custom DPU Count

To run PRISM on a specific number of DPUs:

GRAPH=AM0312 PATTERN=CLIQUE3 EXTRA_FLAGS="-DV_NR_DPUS=5120" make test

📊 Full Scalability Sweep

To automatically benchmark PRISM from 640 to 40,960 DPUs:

GRAPH=AM0312 PATTERN=CLIQUE3 make test_sc

This script:

Compiles PRISM with various DPU counts.
Runs the benchmark for each configuration.

📊 Profiling & Visualization Tools

`analyze_csr_graph.py`

Analyzes CSR binary and outputs graph statistics:

python3 python_tool/analyze_csr_graph.py input/graph.bin

Outputs include:

Number of nodes and edges
Degree distribution (min/avg/max)

`show_cycle.py`

Visualizes DPU-level workload distribution:

python3 python_tool/show_cycle.py result.txt

Left plot: Max cycle per DPU
Right plot: Task count per DPU

🙏 Acknowledgments

We gratefully acknowledge the foundational contributions of PimPam [SIGMOD'24], which inspired and informed much of this work.
We thank the authors for advancing the state of graph pattern mining on real Processing-in-Memory hardware and for generously releasing their implementation, which has been invaluable to our research.

Reference:
Shuangyu Cai, Boyu Tian, Huanchen Zhang, and Mingyu Gao. PimPam: Efficient Graph Pattern Matching on Real Processing-in-Memory Hardware. In Proceedings of the ACM on Management of Data (SIGMOD '24), Volume 2, Issue 3, Article 161, Pages 1–25.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PRISM: Practical In-Memory Acceleration for Subgraph Matching at Scale

🚀 Key Features

📁 Directory Structure

🛠 Requirements

⚙️ Build and Run Instructions

📥 Input Format

🧩 Customized Graphs and Matching Patterns

➕ Adding Custom Graphs

➕ Adding Custom Patterns

📈 Scalability Testing

🔧 Custom DPU Count

📊 Full Scalability Sweep

📊 Profiling & Visualization Tools

`analyze_csr_graph.py`

`show_cycle.py`

🙏 Acknowledgments

About

Uh oh!

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
dpu		dpu
host		host
include		include
python_tool		python_tool
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
makefile		makefile

License

CGCL-codes/PRISM

Folders and files

Latest commit

History

Repository files navigation

PRISM: Practical In-Memory Acceleration for Subgraph Matching at Scale

🚀 Key Features

📁 Directory Structure

🛠 Requirements

⚙️ Build and Run Instructions

📥 Input Format

🧩 Customized Graphs and Matching Patterns

➕ Adding Custom Graphs

➕ Adding Custom Patterns

📈 Scalability Testing

🔧 Custom DPU Count

📊 Full Scalability Sweep

📊 Profiling & Visualization Tools

analyze_csr_graph.py

show_cycle.py

🙏 Acknowledgments

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

`analyze_csr_graph.py`

`show_cycle.py`

Packages