Historical Context Integration Module (HCIM)

A temporally weighted extension of the static SE-SGformer signed graph transformer for signed link prediction in dynamic networks.

Overview

This implementation enhances the SE-SGformer (Self-Explainable Signed Graph Transformer) with historical context awareness, enabling improved link prediction performance on temporal signed networks. The model integrates LSTM-based sequence modeling and temporal attention to adaptively aggregate node embeddings across time while preserving interpretability.

Key Features

  • Historical Context Integration: Uses LSTM and temporal attention to leverage previous timesteps
  • Recency Bias: Applies learnable decay factors to weight recent information more heavily
  • Confidence Gating: Adaptive mechanism to control the influence of historical context
  • Memory Efficient: Batch processing for large-scale networks
  • Comprehensive Evaluation: Multiple metrics including AUC, F1, and Precision@100

Architecture

Components

  1. Original SE-SGformer: Base model with centrality encoding, spatial features, and multi-head attention
  2. Historical Context Extractor: LSTM-based processor with temporal attention and recency weighting
  3. Temporal SE-SGformer: Enhanced model combining current and historical embeddings with confidence gating

Key Improvements

  • Temporal Modeling: Processes sequences of historical embeddings
  • Adaptive Weights: Supports either a fixed combination weight or MLP-based adaptive weighting
  • Confidence Assessment: Gates historical information based on reliability
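
A minimal sketch of how the confidence-gated combination in items 2 and 3 might look (the class name, layer layout, and gating formula here are illustrative assumptions, not the repository's actual code):

import torch
import torch.nn as nn

class GatedCombination(nn.Module):
    """Combine current and historical node embeddings with a confidence gate."""
    def __init__(self, node_dim):
        super().__init__()
        # The gate scores how reliable the historical context is for each node.
        self.gate = nn.Sequential(nn.Linear(2 * node_dim, 1), nn.Sigmoid())

    def forward(self, z_current, z_hist):
        g = self.gate(torch.cat([z_current, z_hist], dim=-1))  # (num_nodes, 1)
        # High confidence leans on history; low confidence falls back to the current embedding.
        return g * z_hist + (1 - g) * z_current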

Installation

Requirements

pip install torch torch-geometric numpy scipy pandas matplotlib scikit-learn

Dependencies

  • Python 3.7+
  • PyTorch 1.8+
  • PyTorch Geometric
  • NumPy, SciPy, Pandas
  • Matplotlib (for visualization)
  • scikit-learn (for metrics)

Usage

Basic Usage

from temporal_sgformer import compare_approaches

# Run comparison on Bitcoin OTC dataset
results = compare_approaches(
    file_path="bitcoin_otc.csv.gz",
    target_timestep=-1,  # Use last timestep as target
    num_time_bins=6,     # Split data into 6 temporal bins
    epochs=50,           # Training epochs
    title="Bitcoin OTC Analysis"
)

Configuration Options

# Model configuration
args = Args(
    num_layers=2,              # Number of transformer layers
    num_heads=4,               # Multi-head attention heads
    node_dim=128,              # Node embedding dimension
    max_degree=20,             # Maximum node degree for encoding
    use_adaptive_weights=True,  # Use MLP-based combination weights
    base_weights=0.3           # Fixed combination weight (if not adaptive)
)

# Create models
baseline_model = SE_SGformer(args)
temporal_model = Temporal_SE_SGformer(args)

Data Format

The code expects CSV data with columns:

  • source: Source node ID
  • target: Target node ID
  • rating: Edge weight/rating (positive/negative)
  • time: Timestamp

Bitcoin OTC Dataset Format: The Bitcoin OTC dataset follows this format where each line represents one rating:

SOURCE,TARGET,RATING,TIME
  • SOURCE: ID of the source node (rater)
  • TARGET: ID of the target node (ratee)
  • RATING: Rating score from -10 (total distrust) to +10 (total trust)
  • TIME: Unix timestamp of when the rating was given

Example:

1,2,8,1237462018
2,3,-5,1237465108
1,3,10,1237467209

The model converts ratings to binary signs: positive ratings (>0) become +1, negative ratings (<0) become -1.
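
For reference, a minimal sketch of loading the dataset and applying this sign conversion with pandas (the repository's actual loader may differ):

import pandas as pd

# The SNAP Bitcoin OTC dump is a headerless CSV: SOURCE,TARGET,RATING,TIME.
df = pd.read_csv("bitcoin_otc.csv.gz", names=["source", "target", "rating", "time"])

# Convert ratings to binary signs: positive (>0) -> +1, negative (<0) -> -1.
df["sign"] = df["rating"].apply(lambda r: 1 if r > 0 else -1)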

Evaluation Metrics

The framework evaluates models using:

  1. AUC (Area Under the ROC Curve): Binary classification performance
  2. F1 Score: Harmonic mean of precision and recall
  3. Precision@100: Precision of top-100 predicted links
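
A minimal sketch of how these metrics can be computed with scikit-learn and NumPy (variable names are illustrative: scores are predicted link probabilities, labels are binary ground truth):

import numpy as np
from sklearn.metrics import roc_auc_score, f1_score

def evaluate(scores, labels, k=100):
    auc = roc_auc_score(labels, scores)                 # ranking quality
    f1 = f1_score(labels, (scores > 0.5).astype(int))   # thresholded predictions
    top_k = np.argsort(scores)[::-1][:k]                # k highest-scoring links
    precision_at_k = labels[top_k].mean()               # fraction that are positive
    return auc, f1, precision_at_k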

Key Functions

Data Loading

timesteps, num_nodes = load_bitcoin_dataset_timesteps(
    file_path="data.csv.gz", 
    num_time_bins=10
)

Model Training

import torch

# Train baseline model
baseline_model = SE_SGformer(args)
optimizer = torch.optim.Adam(baseline_model.parameters(), lr=0.001)

for epoch in range(epochs):
    optimizer.zero_grad()  # clear gradients accumulated in the previous step
    z = baseline_model(x, pos_edge_index, neg_edge_index)
    loss = baseline_model.loss(z, pos_edge_index, neg_edge_index)
    loss.backward()
    optimizer.step()

Historical Context Extraction

# Extract embeddings from previous timesteps
historical_embeddings = []
for hist_data in historical_timesteps:
    z_hist = model(x_hist, pos_edges_hist, neg_edges_hist)
    historical_embeddings.append(z_hist)

# Use in temporal model
z_temporal = temporal_model(x, pos_edges, neg_edges, historical_embeddings)

Visualization

The framework generates comprehensive visualizations:

  1. Training Loss Curves: Comparison of baseline vs temporal training
  2. Loss Difference: Training improvement over epochs
  3. Performance Metrics: AUC, F1, and Precision@100 comparison
  4. Improvement Analysis: Absolute and percentage gains
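
As an example, the loss-curve comparison (item 1) could be reproduced with a few lines of matplotlib, assuming per-epoch loss lists collected from both training runs:

import matplotlib.pyplot as plt

plt.plot(baseline_losses, label="Baseline SE-SGformer")  # assumed list of per-epoch losses
plt.plot(temporal_losses, label="Temporal SE-SGformer")  # assumed list of per-epoch losses
plt.xlabel("Epoch")
plt.ylabel("Training loss")
plt.legend()
plt.show()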

Advanced Configuration

Temporal Parameters

# Configure historical context extractor
context_extractor = HistoricalContextExtractor(node_dim=128)
context_extractor.decay_factor = 0.7      # Decay for older timesteps
context_extractor.recency_strength = 1.5  # Recency bias strength
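
One plausible way these two parameters could interact, consistent with the description above (this exact formula is an illustrative assumption, not taken from the source):

import torch

def recency_weights(num_steps, decay_factor=0.7, recency_strength=1.5):
    # age 0 = most recent timestep; older steps decay exponentially,
    # and recency_strength sharpens the bias toward recent steps.
    ages = torch.arange(num_steps - 1, -1, -1, dtype=torch.float)
    w = decay_factor ** (ages * recency_strength)
    return w / w.sum()  # normalize into a weight distribution over timesteps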

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

  1. Fork the repository
  2. Create a feature branch
  3. Make your changes
  4. Add tests and documentation
  5. Submit a pull request

Acknowledgments

  • Original SE-SGformer: This work builds upon the SE-SGformer (Self-Explainable Signed Graph Transformer) framework proposed by Liu et al. in "Self-Explainable Graph Transformer for Link Sign Prediction" (arXiv:2408.08754, 2024). We extend their original model with temporal context awareness while preserving the core architectural innovations.
  • Bitcoin OTC Dataset: We use the Bitcoin OTC trust weighted signed network from the Stanford Network Analysis Project (SNAP). This dataset was introduced by Kumar et al. in "Edge weight prediction in weighted signed networks" (ICDM 2016) and represents a who-trusts-whom network of Bitcoin traders with ratings from -10 to +10.
