torch_geometric.datasets.Wikidata5M

class Wikidata5M(root: str, setting: str = 'transductive', transform: Optional[Callable] = None, pre_transform: Optional[Callable] = None, force_reload: bool = False)[source]

Bases: InMemoryDataset

来自“KEPLER: 知识嵌入和预训练语言表示的统一模型”论文的Wikidata-5M数据集,包含4,594,485个实体,822个关系,20,614,279个训练三元组,5,163个验证三元组和5,133个测试三元组。

Wikidata-5M 是一个从Wikidata提取的大规模知识图谱数据集,包含对齐的语料库。

Parameters:
  • root (str) – Root directory where the dataset should be saved.

  • 设置 (str, 可选) – 如果为 "transductive",则加载传导数据集。 如果为 "inductive",则加载归纳数据集。 (默认: "transductive")

  • transform (callable, optional) – A function/transform that takes in an torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before every access. (default: None)

  • pre_transform (callable, optional) – A function/transform that takes in an torch_geometric.data.Data object and returns a transformed version. The data object will be transformed before being saved to disk. (default: None)

  • force_reload (bool, optional) – Whether to re-process the dataset. (default: False)