AMDataset
- class dgl.data.AMDataset(print_every=10000, insert_reverse=True, raw_dir=None, force_reload=False, verbose=True, transform=None)[source]
Bases:
RDFGraphDataset
AM数据集。用于节点分类任务
命名空间约定:
实例:
http://purl.org/collections/nl/am/
- 关系:
http://purl.org/collections/nl/am/
我们在输出图中忽略了所有字面节点以及连接它们的关系。
AM 数据集统计:
节点数:881680
边数:5668682(包括反向边)
目标类别:代理
班级数量:11
标签分割:
列车:802
测试: 198
- Parameters:
print_every (int) – Preprocessing log for every X tuples. Default: 10000.
insert_reverse (bool) – If true, add reverse edge and reverse relations to the final graph. Default: True.
raw_dir (str) – Raw file directory to download/contains the input data directory. Default: ~/.dgl/
force_reload (bool) – Whether to reload the dataset. Default: False
verbose (bool) – Whether to print out progress information. Default: True.
transform (callable, optional) – A transform that takes in a
DGLGraph
object and returns a transformed version. TheDGLGraph
object will be transformed before every access.
示例
>>> dataset = dgl.data.rdf.AMDataset() >>> graph = dataset[0] >>> category = dataset.predict_category >>> num_classes = dataset.num_classes >>> >>> train_mask = g.nodes[category].data['train_mask'] >>> test_mask = g.nodes[category].data['test_mask'] >>> label = g.nodes[category].data['label']