-
Dear all, I had some issues with some dimension mismatches and thereby saw that after determining the random split indices in the RandomLinkSplit finally the data is split not as I had expected. In the RandomLinkSplit class the data splits actually do not use the sampled validation and test edges but reuse the train edges for the validation data and a combination of the training and validation edges for the test data. I have copied the respective part of the code below which can be found here :
Could somebody explain why that is? Thanks a lot in advance! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
For evaluation, the So:
|
Beta Was this translation helpful? Give feedback.
data.edge_index
refers to the edges that are used for message passing. As such, during training and validation, you are allowed to propagate information based on the training edges, while during testing, you can propagate information based on the union of training and validation edges.For evaluation, the
data.edge_label_index
holds a batch of positive and negative samples that should be used to evaluate your model on.So:
data.edge_index
should be solely used for message passingdata.edge_label_index
should be used for evaluation