torch_geometric.datasets.GEDDataset
class GEDDataset(root: str, name: str, train: bool = True, transform: Optional[Callable] = None, pre_transform: Optional[Callable] = None, pre_filter: Optional[Callable] = None, force_reload: bool = False)
Bases: InMemoryDataset

The GED datasets from the "Graph Edit Distance Computation via Graph Neural Networks" paper.
GEDs can be accessed via the global attributes ged and norm_ged for all train/train graph pairs and all train/test graph pairs:

    dataset = GEDDataset(root, name="LINUX")
    data1, data2 = dataset[0], dataset[1]
    ged = dataset.ged[data1.i, data2.i]  # GED between `data1` and `data2`.
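Note that GEDs are not available if both graphs are from the test set. For evaluation, it is recommended to pair up each graph from the test set with each graph in the training set.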
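The recommended evaluation pairing can be sketched in plain Python. This is only an illustration under stated assumptions: `toy_ged` is a made-up matrix standing in for `dataset.ged`, the helper names `evaluation_pairs` and `collect_geds` are hypothetical, and using `inf` to mark unavailable test/test entries is an assumption of this toy example, not a documented guarantee.

```python
from itertools import product
from math import inf, isinf


def evaluation_pairs(test_ids, train_ids):
    """Return every (test_id, train_id) combination."""
    return list(product(test_ids, train_ids))


def collect_geds(ged, pairs):
    """Map each pair to its GED, skipping entries marked unavailable (inf)."""
    return {(i, j): ged[i][j] for i, j in pairs if not isinf(ged[i][j])}


# Toy 4x4 matrix standing in for `dataset.ged` (all values are made up).
# Ids 0-1 play the role of training graphs, ids 2-3 of test graphs.
# Test/test entries use `inf` as a hypothetical "not available" marker.
toy_ged = [
    [0.0, 3.0, 2.0, 4.0],
    [3.0, 0.0, 5.0, 1.0],
    [2.0, 5.0, inf, inf],
    [4.0, 1.0, inf, inf],
]
train_ids, test_ids = [0, 1], [2, 3]

pairs = evaluation_pairs(test_ids, train_ids)
geds = collect_geds(toy_ged, pairs)

# With the real dataset, the same lookup would follow the documented indexing:
#   train_set = GEDDataset(root, name="LINUX", train=True)
#   test_set = GEDDataset(root, name="LINUX", train=False)
#   ged = train_set.ged[test_set[0].i, train_set[0].i]
```

Pairing only test graphs with training graphs keeps every lookup inside the part of the matrix for which GEDs are provided.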
Note
ALKANE is missing GEDs for train/test graph pairs since they are not provided in the official datasets.

Parameters:
- root (str) – Root directory where the dataset should be saved.
- name (str) – The name of the dataset (one of "AIDS700nef", "LINUX", "ALKANE", "IMDBMulti").
- train (bool, optional) – If True, loads the training dataset, otherwise the test dataset. (default: True)
- transform (callable, optional) – A function/transform that takes in a torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before every access. (default: None)
- pre_transform (callable, optional) – A function/transform that takes in a torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before being saved to disk. (default: None)
- pre_filter (callable, optional) – A function that takes in a torch_geometric.data.Data object and returns a boolean value, indicating whether the data object should be included in the final dataset. (default: None)
- force_reload (bool, optional) – Whether to re-process the dataset. (default: False)
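The callable parameters can be illustrated with a minimal sketch. The names `at_most_nodes` and `tag_with_size`, the attribute `size_tag`, and the threshold of 8 nodes are all hypothetical and chosen only for illustration. A `SimpleNamespace` with a `num_nodes` attribute stands in for a torch_geometric.data.Data object so that the snippet runs without downloading a dataset.

```python
from types import SimpleNamespace


def at_most_nodes(max_nodes):
    """Build a pre_filter that keeps graphs with at most `max_nodes` nodes."""
    def pre_filter(data):
        # A pre_filter returns True for objects that should be kept.
        return data.num_nodes <= max_nodes
    return pre_filter


def tag_with_size(data):
    """Example transform: attach a hypothetical `size_tag` and return the object."""
    data.size_tag = "small" if data.num_nodes < 8 else "large"
    return data


# Stand-ins for Data objects (only `num_nodes` is modelled here).
small_graph = SimpleNamespace(num_nodes=6)
large_graph = SimpleNamespace(num_nodes=12)

keep = at_most_nodes(10)
kept = [g for g in (small_graph, large_graph) if keep(g)]
tagged = tag_with_size(small_graph)

# Passed to the dataset, the callables would be used like this:
#   dataset = GEDDataset(root, name="AIDS700nef",
#                        pre_filter=at_most_nodes(10),
#                        transform=tag_with_size)
```

Since `pre_filter` and `pre_transform` are applied before the data is saved to disk, changing them for an already processed dataset would presumably need `force_reload=True` to take effect, whereas `transform` runs on every access.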
Statistics:

Name          #graphs   #nodes   #edges   #features   #classes
AIDS700nef    700       ~8.9     ~17.6    29          0
LINUX         1,000     ~7.6     ~13.9    0           0
ALKANE        150       ~8.9     ~15.8    0           0
IMDBMulti     1,500     ~13.0    ~131.9   0           0