Replies: 1 comment 7 replies
-
The poor performance might be due to the imbalance of labels, it looks like you have a ratio of 1:15. To understand your code better, how is |
Beta Was this translation helpful? Give feedback.
7 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I am implementing a Hetero GNN by following the following link:
https://github.com/pyg-team/pytorch_geometric/blob/master/examples/hetero/hetero_conv_dblp.py
But in my data there are some nodes that don't have any label. Could you please help me as to how to random split the data to form train and test masks and train and test sets respectively.
Currently I am splitting it this way:
n = merc_x.shape[0]
idx0=list(y0[:,0]) #label 0 samples
idx1=list(y1[:,0]) #label 1 samples
random.shuffle(idx0)
random.shuffle(idx1)
train_mask_idx1 = idx1[:4000]
train_mask_idx0 = idx0[:600000]
But the model is giving very poor performance and I think it is because of this splitting strategy.
Thanks very much!
Beta Was this translation helpful? Give feedback.
All reactions