How do I convert Data to CSR format while still containing all nodes #7305

ProfDoof · 2023-05-05T20:05:57Z


pytorch_geometric/torch_geometric/data/graph_store.py

Lines 277 to 281 in fb1d855

    
    if attr.layout != EdgeLayout.CSR:  # COO->CSR
        num_rows = attr.size[0] if attr.size else int(row.max()) + 1
        row, perm = index_sort(row, max_value=num_rows)
        col = col[perm]
        row = index2ptr(row, num_rows)

TL;DR: The CSR conversion above produces a rowptr tensor that is too short when the graph has isolated nodes whose indices are higher than every source node in the edge index. As a result, the random_walk implementations in both pyg_lib and torch_cluster fail. Is there a way to get the CSR representation without losing these nodes?

More explanation:

So, I have been getting this fun runtime error in my implementation of node2vec that I have been writing (I have my reasons, but they aren't relevant to this story). It was a CUDA Runtime Error saying I was going out of bounds and accessing memory I should not be. However, as far as I could tell, I was not going out of bounds. Then, I looked deeper, and it turns out that given a graph like the following

the CSR representation is as follows

rowptr=tensor([0, 1, 1, 2, 3], device='cuda:0')

col=tensor([3, 1, 2], device='cuda:0')

This means that we are losing information about the graph because the CSR representation, which includes nodes 4 and 5, should look like

rowptr=tensor([0, 1, 1, 2, 3, 3, 3], device='cuda:0')

col=tensor([3, 1, 2], device='cuda:0')

This feels like it should be addressed in an issue if it's a problem, but maybe I'm missing something, so I want to ask first.
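For reference, the expected row pointer above can be reproduced with plain PyTorch once the node count is passed explicitly. This is a minimal sketch, not the PyG implementation: `coo_to_csr_with_size` is a hypothetical helper name, and the edge list (0->3, 2->1, 3->2) is read off the rowptr/col values shown above.

```python
import torch


def coo_to_csr_with_size(row, col, num_nodes):
    # Hypothetical helper: sort edges by source node, then count edges per
    # source over *all* num_nodes rows so isolated nodes get empty ranges.
    row, perm = torch.sort(row, stable=True)
    col = col[perm]
    counts = torch.bincount(row, minlength=num_nodes)
    rowptr = torch.zeros(num_nodes + 1, dtype=torch.long, device=row.device)
    rowptr[1:] = torch.cumsum(counts, dim=0)
    return rowptr, col


row = torch.tensor([0, 2, 3])
col = torch.tensor([3, 1, 2])

# Inferring the size from row.max() drops nodes 4 and 5.
short_rowptr, _ = coo_to_csr_with_size(row, col, int(row.max()) + 1)
# Passing the true node count keeps them as empty rows.
full_rowptr, full_col = coo_to_csr_with_size(row, col, 6)

print(short_rowptr.tolist())  # [0, 1, 1, 2, 3]
print(full_rowptr.tolist())   # [0, 1, 1, 2, 3, 3, 3]
```

The only difference between the two calls is the size argument, which is why the truncated pointer appears whenever the size falls back to `int(row.max()) + 1`.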

Answered by rusty1s

May 8, 2023

Thanks for the example, helped a lot. I fixed it in #7316. Sorry for any inconvenience.


grantnedwards · 2023-05-05T20:12:43Z


You have isolated nodes in your graph that are not being included in the CSR representation, which is what causes the issues in your node2vec implementation. To preserve these isolated nodes, you can modify the code that converts the COO format to CSR so that it takes the number of nodes explicitly.

row = torch.tensor([0, 2, 3], dtype=torch.long)
col = torch.tensor([3, 1, 2], dtype=torch.long)
num_nodes = 6  # Optional, include the isolated nodes

rowptr, col = coo_to_csr(row, col, num_nodes)

You could try the function below to handle the COO to CSR conversion; the snippet above shows how to call it and assign the result.

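import torch
from torch_geometric.utils import index_sort

def coo_to_csr(row, col, num_nodes=None):
    # Fall back to the largest source index if no node count is given.
    if num_nodes is None:
        num_nodes = int(row.max()) + 1

    # Sort edges by source node and reorder the targets to match.
    row, perm = index_sort(row, max_value=num_nodes)
    col = col[perm]

    # Count edges per source node over all num_nodes rows, then prefix-sum.
    # (Comparing row[:-1] with row[1:] yields E - 1 values and cannot fill rowptr[1:].)
    rowptr = torch.zeros(num_nodes + 1, dtype=row.dtype, device=row.device)
    rowptr[1:] = torch.cumsum(torch.bincount(row, minlength=num_nodes), dim=0)

    return rowptr, col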
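Whichever conversion is used, it can help to verify the result before handing it to a random-walk kernel. The following is a sketch of such a check, using a hypothetical helper name `csr_covers_all_nodes` and the two row pointers from the original question as inputs:

```python
import torch


def csr_covers_all_nodes(rowptr, col, num_nodes):
    # Hypothetical check: a CSR row pointer over num_nodes rows must have
    # num_nodes + 1 entries, start at 0, never decrease, and end at the
    # total number of edges.
    return (
        rowptr.numel() == num_nodes + 1
        and int(rowptr[0]) == 0
        and int(rowptr[-1]) == col.numel()
        and bool((rowptr[1:] >= rowptr[:-1]).all())
    )


col = torch.tensor([3, 1, 2])
truncated_ok = csr_covers_all_nodes(torch.tensor([0, 1, 1, 2, 3]), col, 6)
full_ok = csr_covers_all_nodes(torch.tensor([0, 1, 1, 2, 3, 3, 3]), col, 6)

print(truncated_ok)  # False
print(full_ok)       # True
```

A failing check on the host is much easier to diagnose than an out-of-bounds access inside a CUDA kernel.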

15 replies

ProfDoof May 7, 2023
Author

@rusty1s Here's the actual and expected behavior from me:

Actual behavior: data.csr()=({None: tensor([0, 2])}, {None: tensor([1, 0])}, {None: tensor([0, 1])})
Expected behavior: data.csr()=({None: tensor([0, 2, 2, 2, 2, 2, 2])}, {None: tensor([1, 0])}, {None: tensor([0, 1])})

Here's the code I used to generate the previous outputs:

from torch_geometric.data import Data
import torch
from local import my_edge_to_layout # This is a local folder with an __init__.py that contains the function I posted above.

data = Data(torch.tensor([0, 1, 2, 3, 4, 5]), torch.tensor([[0, 0], [1, 0]]))
data.num_nodes = 6
print(f'Actual behavior: {data.csr()=}')

data.coo()
Data._edge_to_layout = my_edge_to_layout
print(f'Expected behavior: {data.csr()=}')
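The gap between the actual and expected row pointers above is only trailing padding, so on an affected version the short tensor can also be extended after the fact. This is a sketch of that workaround, not the fix that landed in PyG; `pad_rowptr` is a hypothetical helper, and it assumes the missing rows are all trailing isolated nodes, as they are when the size was inferred from the largest source index.

```python
import torch


def pad_rowptr(rowptr, num_nodes):
    # Hypothetical workaround: repeat the final offset once per missing row,
    # so every trailing isolated node gets an empty edge range.
    missing = num_nodes + 1 - rowptr.numel()
    if missing <= 0:
        return rowptr
    tail = rowptr[-1:].expand(missing)
    return torch.cat([rowptr, tail])


padded = pad_rowptr(torch.tensor([0, 2]), num_nodes=6)
print(padded.tolist())  # [0, 2, 2, 2, 2, 2, 2]
```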

ProfDoof May 7, 2023
Author

The function I wrote isn't a complete solution but it works for my use case. I assume y'all would have to do more.

ProfDoof May 7, 2023
Author

Oh, and my version of pytorch_geometric is 2.3.0, I'm using torch version 2.0.0

rusty1s May 8, 2023
Maintainer

Thanks for the example, helped a lot. I fixed it in #7316. Sorry for any inconvenience.

Answer selected by ProfDoof

ProfDoof May 8, 2023
Author

Np, thanks for the fix.

ProfDoof May 11, 2023
Author

@rusty1s dumb question, when will that fix be integrated into the conda package?

rusty1s May 11, 2023
Maintainer

This will be integrated once we release a new version (we target June right now). Not sure if this issue warrants a hotfix to make it available faster. After all, you can also install PyG from master to have this fix integrated on your end.

ProfDoof May 11, 2023
Author

That's fine, I just wanted to check. I am trying to install it from main using Conda but having lots of fun issues. 😂 We'll see if I can get it working correctly.
