pyspark.sql.Column.over

Column.over(window: WindowSpec) → Column

Define a windowing column.

New in version 1.4.0.

Changed in version 3.4.0: Supports Spark Connect.

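Parameters
window : WindowSpec
    The window specification (partitioning, ordering, and frame) over which the column is evaluated.
Returns
Column
    The column evaluated over the given window.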

Examples

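>>> from pyspark.sql import Window
>>> # Note: importing min here shadows Python's built-in min in this session.
>>> from pyspark.sql.functions import rank, min, desc
>>> # Partition by name, order by age, and use a running frame that spans
>>> # from the first row of the partition up to the current row.
>>> window = (
...     Window.partitionBy("name")
...     .orderBy("age")
...     .rowsBetween(Window.unboundedPreceding, Window.currentRow)
... )
>>> # `spark` is assumed to be an active SparkSession.
>>> df = spark.createDataFrame(
...     [(2, "Alice"), (5, "Bob")], ["age", "name"])
>>> # Evaluate rank() and min("age") over the window, one result per input row.
>>> df.withColumn(
...     "rank", rank().over(window)
... ).withColumn(
...     "min", min("age").over(window)
... ).sort(desc("age")).show()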
+---+-----+----+---+
|age| name|rank|min|
+---+-----+----+---+
|  5|  Bob|   1|  5|
|  2|Alice|   1|  2|
+---+-----+----+---+
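
In the example above each name appears once, so every partition holds a single row and both window columns simply echo that row. The following pure-Python sketch shows what the same window would compute on a larger input, without needing a Spark session. It is an illustration of the semantics only, not Spark's implementation. The helper `running_window` and the two extra "Bob" rows are hypothetical and were added for illustration.

```python
from itertools import groupby
from operator import itemgetter


def running_window(rows, partition_key, order_key):
    """Emulate rank() and min(order_key) over a window that is partitioned by
    partition_key, ordered by order_key, and framed from the start of the
    partition to the current row. Hypothetical helper, not a PySpark API."""
    result = []
    # Sort so that rows of one partition are adjacent and ordered inside it.
    ordered = sorted(rows, key=lambda r: (r[partition_key], r[order_key]))
    for _, group in groupby(ordered, key=itemgetter(partition_key)):
        running_min = None
        rank = 0
        previous = object()  # sentinel that never equals a real value
        for position, row in enumerate(group, start=1):
            value = row[order_key]
            # rank(): tied rows share a rank; the next distinct value
            # takes its row position, which leaves gaps after ties.
            if value != previous:
                rank = position
                previous = value
            # min() over the frame [partition start, current row].
            running_min = value if running_min is None else min(running_min, value)
            result.append({**row, "rank": rank, "min": running_min})
    return result


# Illustrative input: the two rows from the example plus two made-up rows.
rows = [
    {"age": 2, "name": "Alice"},
    {"age": 5, "name": "Bob"},
    {"age": 7, "name": "Bob"},
    {"age": 7, "name": "Bob"},
]
result = running_window(rows, "name", "age")
for row in result:
    print(row)
```

Within the "Bob" partition the two rows with age 7 tie and share a rank, while the running minimum stays at the smallest age seen so far in that partition.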