torch_geometric.datasets.OGB_MAG
- class OGB_MAG(root: str, preprocess: Optional[str] = None, transform: Optional[Callable] = None, pre_transform: Optional[Callable] = None, force_reload: bool = False)[source]
Bases:
InMemoryDataset来自“开放图基准:用于图机器学习的数据集”论文的
ogbn-mag数据集。ogbn-mag是由微软学术图(MAG)的一个子集组成的异构图。 它包含四种类型的实体——论文(736,389个节点)、作者(1,134,649个节点)、机构(8,740个节点)和研究领域(59,965个节点)——以及连接两种类型实体的四种有向关系。 每篇论文都与一个128维的word2vec特征向量相关联,而所有其他节点类型则没有与任何输入特征相关联。 任务是预测每篇论文的发表场所(会议或期刊)。总共有349个不同的场所。- Parameters:
root (str) – Root directory where the dataset should be saved.
preprocess (str, optional) – 通过添加结构特征(
"metapath2vec","TransE")来预处理原始数据集,以处理无特征的节点。(默认值:None)transform (callable, optional) – A function/transform that takes in an
torch_geometric.data.HeteroDataobject and returns a transformed version. The data object will be transformed before every access. (default:None)pre_transform (callable, optional) – A function/transform that takes in an
torch_geometric.data.HeteroDataobject and returns a transformed version. The data object will be transformed before being saved to disk. (default:None)force_reload (bool, optional) – Whether to re-process the dataset. (default:
False)