3
我想運行在pyspark代碼(火花2.1.1):趴趴的類型必須爲org.apache.spark.ml.linalg.VectorUDT
from pyspark.ml.feature import PCA
bankPCA = PCA(k=3, inputCol="features", outputCol="pcaFeatures")
pcaModel = bankPCA.fit(bankDf)
pcaResult = pcaModel.transform(bankDF).select("label", "pcaFeatures")
pcaResult.show(truncate= false)
但我得到這個錯誤:
requirement failed: Column features must be of type
org.apache.spark.ml.linalg.Vect [email protected]
but was actually[email protected]
.