AIFBDataset
- class dgl.data.AIFBDataset(print_every=10000, insert_reverse=True, raw_dir=None, force_reload=False, verbose=True, transform=None)[source]
基础类:
RDFGraphDataset
用于节点分类任务的AIFB数据集
AIFB数据集是一个语义网(RDF)数据集,用作数据挖掘的基准。它记录了卡尔斯鲁厄大学AIFB的组织结构。
AIFB数据集统计:
节点数:7262
边数:48810(包括反向边)
目标类别:人员
班级数量:4
标签分割:
训练:140
测试: 36
- Parameters:
print_every (int) – 每X个元组预处理日志。默认值:10000。
insert_reverse (bool) – 如果为真,将反向边和反向关系添加到最终图中。默认值:True。
raw_dir (str) – Raw file directory to download/contains the input data directory. Default: ~/.dgl/
force_reload (bool) – Whether to reload the dataset. Default: False
verbose (bool) – Whether to print out progress information. Default: True.
transform (callable, optional) – A transform that takes in a
DGLGraph
object and returns a transformed version. TheDGLGraph
object will be transformed before every access.
示例
>>> dataset = dgl.data.rdf.AIFBDataset() >>> graph = dataset[0] >>> category = dataset.predict_category >>> num_classes = dataset.num_classes >>> >>> train_mask = g.nodes[category].data['train_mask'] >>> test_mask = g.nodes[category].data['test_mask'] >>> label = g.nodes[category].data['label']