torch_geometric.datasets.FB15k_237
- class FB15k_237(root: str, split: str = 'train', transform: Optional[Callable] = None, pre_transform: Optional[Callable] = None, force_reload: bool = False)[source]
Bases:
InMemoryDataset来自“Translating Embeddings for Modeling Multi-Relational Data”论文的FB15K237数据集,包含14,541个实体,237个关系和310,116个事实三元组。
注意
原始的
FB15k数据集由于逆关系存在严重的测试泄漏问题,其中大量测试三元组可以通过反转训练集中的三元组获得。为了创建一个没有这种特性的数据集,FB15k_237描述了FB15k的一个子集,其中移除了逆关系。- Parameters:
root (str) – Root directory where the dataset should be saved.
split (str, optional) – If
"train", loads the training dataset. If"val", loads the validation dataset. If"test", loads the test dataset. (default:"train")transform (callable, optional) – A function/transform that takes in an
torch_geometric.data.Dataobject and returns a transformed version. The data object will be transformed before every access. (default:None)pre_transform (callable, optional) – A function/transform that takes in an
torch_geometric.data.Dataobject and returns a transformed version. The data object will be transformed before being saved to disk. (default:None)force_reload (bool, optional) – Whether to re-process the dataset. (default:
False)