2015-04-06 87 views

回答

11

假設你想添加一個名爲new通過調用numpy.random.normal反覆構建了您的數據幀df欄,你可以這樣做:

import numpy 
from pyspark.sql.functions import UserDefinedFunction 
from pyspark.sql.types import DoubleType 

udf = UserDefinedFunction(numpy.random.normal, DoubleType()) 

df_with_new_column = df.withColumn('new', udf())