After running the code above, navigate to the respective dataset folder and look for `ogbl_citation2`; all the data you need can be found there. Below is the `metadata.yaml` file we're currently using:
```yaml
dataset_name:ogbl_citation2
graph:
nodes:
-num:2927963
edges:
-format:csv
path:edges/cite.csv
feature_data:
feature_data:
-domain:node
type:null
name:feat
format:numpy
in_memory:true
path:data/node-feat.npy
-domain:node
type:null
name:year
format:numpy
in_memory:true
path:data/node-year.npy
tasks:
-name:"link_prediction"
num_classes:2
train_set:
-type_name:null
data:
# (n, 2)
-name:node_pairs
format:numpy
path:set/train_node_pairs.npy
in_memory:true
validation_set:
-type_name:null
data:
-name:node_pairs
format:numpy
path:set/valid_node_pairs.npy
in_memory:true
-name:negative_dsts
format:numpy
path:set/valid_negative_dsts.npy
in_memory:true
test_set:
-type_name:null
data:
-name:node_pairs
format:numpy
path:set/test_node_pairs.npy
in_memory:true
-name:negative_dsts
format:numpy
path:set/test_negative_dsts.npy
in_memory:true
```
You'll need to convert the **raw dataset** into the corresponding structure and organize it into any folder of your choice. The final file structure should look like this: