cudf.core.column.string.StringMethods.character_tokenize#

StringMethods.character_tokenize() → SeriesOrIndex[source]#

每个字符串被分割成单个字符。返回的序列包含每个字符作为单独的字符串。

Returns:

Series or Index of object.

示例

>>> import cudf
>>> data = ["hello world", None, "goodbye, thank you."]
>>> ser = cudf.Series(data)
>>> ser.str.character_tokenize()
  h
  e
  l
  l
  o
0
  w
  o
  r
  l
  d
  g
  o
  o
  d
  b
  y
  e
  ,
2
  t
  h
  a
  n
  k
2
  y
  o
  u
  .
dtype: object