torch_geometric.datasets.FB15k_237

class FB15k_237(root: str, split: str = 'train', transform: Optional[Callable] = None, pre_transform: Optional[Callable] = None, force_reload: bool = False)[source]

Bases: InMemoryDataset

来自“Translating Embeddings for Modeling Multi-Relational Data”论文的FB15K237数据集,包含14,541个实体,237个关系和310,116个事实三元组。

注意

原始的 FB15k 数据集由于逆关系存在严重的测试泄漏问题,其中大量测试三元组可以通过反转训练集中的三元组获得。为了创建一个没有这种特性的数据集,FB15k_237 描述了 FB15k 的一个子集,其中移除了逆关系。

Parameters:
  • root (str) – Root directory where the dataset should be saved.

  • split (str, optional) – If "train", loads the training dataset. If "val", loads the validation dataset. If "test", loads the test dataset. (default: "train")

  • transform (callable, optional) – A function/transform that takes in an torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before every access. (default: None)

  • pre_transform (callable, optional) – A function/transform that takes in an torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before being saved to disk. (default: None)

  • force_reload (bool, optional) – Whether to re-process the dataset. (default: False)