字典编码#

group dictionary_encode

函数

通过字典编码现有列来构建字典列。

输出列是一个DICTIONARY类型，其键列包含非空、唯一的值，这些值按照严格的、全序排列。这意味着，对于所有i in [0,n-1)，keys[i] 在 keys[i+1] 之前有序，其中 n 是键的数量。

输出列有一个子索引列，该列是整数类型，并且与输入列的大小相同。

空掩码和空计数从输入列复制到输出列。

c = [429, 111, 213, 111, 213, 429, 213]
d = encode(c)
d now has keys [111, 213, 429] and indices [2, 0, 1, 0, 1, 2, 1]

Throws:

Parameters:

Returns:

返回一个字典列

通过从提供的 dictionary_column 中收集键，并使用该列中的索引，创建一个新列。

d1 = {["a", "c", "d"], [2, 0, 1, 0]}
s = decode(d1)
s is now ["d", "a", "c", "a"]

Parameters:

Returns:

新列的类型与字典列的键匹配