torch_geometric.datasets.IMDB

class IMDB(root: str, transform: Optional[Callable] = None, pre_transform: Optional[Callable] = None, force_reload: bool = False)[source]

Bases: InMemoryDataset

互联网电影数据库(IMDB)的一个子集,收集自 “MAGNN: Metapath Aggregated Graph Neural Network for Heterogeneous Graph Embedding” 论文。 IMDB 是一个包含三种类型实体的异构图 - 电影 (4,278 个节点)、演员(5,257 个节点)和导演(2,081 个节点)。 电影根据其类型分为三类(动作、喜剧、戏剧)。 电影特征对应于其情节关键词的词袋表示的元素。

Parameters:
  • root (str) – Root directory where the dataset should be saved.

  • transform (callable, optional) – A function/transform that takes in an torch_geometric.data.HeteroData object and returns a transformed version. The data object will be transformed before every access. (default: None)

  • pre_transform (callable, optional) – A function/transform that takes in an torch_geometric.data.HeteroData object and returns a transformed version. The data object will be transformed before being saved to disk. (default: None)

  • force_reload (bool, optional) – Whether to re-process the dataset. (default: False)