torch_geometric.datasets.GEDDataset

class GEDDataset(root: str, name: str, train: bool = True, transform: Optional[Callable] = None, pre_transform: Optional[Callable] = None, pre_filter: Optional[Callable] = None, force_reload: bool = False)[source]

Bases: InMemoryDataset

The GED datasets from the "Graph Edit Distance Computation via Graph Neural Networks" paper.

GEDs can be accessed via the global attributes ged and norm_ged for all train/train graph pairs and all train/test graph pairs:

from torch_geometric.datasets import GEDDataset

dataset = GEDDataset(root, name="LINUX")
data1, data2 = dataset[0], dataset[1]
ged = dataset.ged[data1.i, data2.i]  # GED between `data1` and `data2`.

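Note that GEDs are not available if both graphs are from the test set. For evaluation, it is recommended to pair up each graph from the test set with each graph in the training set.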
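The recommended evaluation pairing can be sketched as follows. This is a minimal illustration, not part of the library: the helper name `pair_test_with_train` is hypothetical, and it assumes only that each graph exposes its global index as `i` and that the GED matrix can be indexed as `ged[row][col]`, which holds for nested lists and for 2-D tensors.

```python
from itertools import product


def pair_test_with_train(train_graphs, test_graphs, ged):
    """Yield (test_graph, train_graph, distance) for every test/train pair.

    Hypothetical helper: `ged` is any 2-D matrix indexed by the graphs'
    global indices `i`, such as the `ged` attribute of the dataset.
    """
    for g_test, g_train in product(test_graphs, train_graphs):
        # int() accepts both a plain int and a one-element tensor.
        yield g_test, g_train, ged[int(g_test.i)][int(g_train.i)]


# Intended use with the real datasets (not executed here, needs a download):
#   train_set = GEDDataset(root, name="LINUX", train=True)
#   test_set = GEDDataset(root, name="LINUX", train=False)
#   pairs = list(pair_test_with_train(train_set, test_set, train_set.ged))
```

Because test/test pairs are never generated, every distance looked up this way falls in the train/test block of the matrix.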

Note

ALKANE is missing GEDs for train/test graph pairs since they are not provided in the official datasets.
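One way to check which pairs actually carry a GED is sketched below. It assumes that unavailable entries are stored as non-finite values (such as `inf`) in the matrix; that storage convention is an assumption here and should be verified against the loaded data. The helper name `has_ged` and the toy matrix are hypothetical.

```python
import math


def has_ged(ged, i, j):
    """Return True if the (i, j) entry of the GED matrix is a finite number.

    Assumption: unavailable pairs are marked with a non-finite value
    (inf or NaN) rather than being absent from the matrix.
    """
    return math.isfinite(float(ged[int(i)][int(j)]))


# Toy 3x3 matrix: graphs 0 and 1 play the role of training graphs and
# graph 2 the role of a test graph whose GEDs are all missing.
inf = float("inf")
toy_ged = [
    [0.0, 2.0, inf],
    [2.0, 0.0, inf],
    [inf, inf, inf],
]
available = [(i, j) for i in range(3) for j in range(3) if has_ged(toy_ged, i, j)]
```

In the toy matrix only the four train/train entries pass the check, which mirrors the situation described for ALKANE.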

Parameters:
  • root (str) – Root directory where the dataset should be saved.

  • name (str) – The name of the dataset (one of "AIDS700nef", "LINUX", "ALKANE", "IMDBMulti").

  • train (bool, optional) – If True, loads the training dataset, otherwise the test dataset. (default: True)

  • transform (callable, optional) – A function/transform that takes in a torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before every access. (default: None)

  • pre_transform (callable, optional) – A function/transform that takes in a torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before being saved to disk. (default: None)

  • pre_filter (callable, optional) – A function that takes in a torch_geometric.data.Data object and returns a boolean value, indicating whether the data object should be included in the final dataset. (default: None)

  • force_reload (bool, optional) – Whether to re-process the dataset. (default: False)
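The callable parameters above can be illustrated with a short sketch. The helper names `max_nodes_filter` and `add_node_count` are hypothetical, as is the extra attribute `n`; the only attribute assumed on the data object is `num_nodes`, which torch_geometric.data.Data provides.

```python
def max_nodes_filter(max_nodes):
    """Build a pre_filter that keeps graphs with at most `max_nodes` nodes."""
    def pre_filter(data):
        # Must return a boolean: True keeps the graph, False drops it.
        return data.num_nodes <= max_nodes
    return pre_filter


def add_node_count(data):
    """Example transform: store the node count as an extra attribute `n`."""
    data.n = data.num_nodes
    return data


# Intended use (not executed here, since it downloads the dataset):
#   dataset = GEDDataset(root, name="LINUX",
#                        transform=add_node_count,
#                        pre_filter=max_nodes_filter(8))
```

Since pre_filter results are saved to disk with the processed data, changing the filter later would likely require force_reload=True to take effect.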

Statistics:

Name         #graphs  #nodes  #edges  #features  #classes
AIDS700nef   700      ~8.9    ~17.6   29         0
LINUX        1,000    ~7.6    ~13.9   0          0
ALKANE       150      ~8.9    ~15.8   0          0
IMDBMulti    1,500    ~13.0   ~131.9  0          0