2016-10-27 70 views
2

這個代碼是從給定的名單建立數據框:應該是什麼sqlContext.createDataFrame參數()?

sample_one = [(0, 'mouse'), (1, 'black')] 
sample_two = [(0, 'cat'), (1, 'tabby'), (2, 'mouse')] 
sample_three = [(0, 'bear'), (1, 'black'), (2, 'salmon')] 
sample_data_df = sqlContext.createDataFrame([(sample_one,), (sample_two,),(sample_three,)], ['features']) 

在createDataFrame(),爲什麼多餘的逗號sample_one後給予(sample_one)?

回答

1

這個語法是創建一個元組。您可以嘗試以下操作:

>>> sample_one = [(0, 'mouse'), (1, 'black')] 
>>> type((sample_one)) 
<type 'list'> 
>>> type((sample_one,)) 
<type 'tuple'> 
+0

@ Anthony..Thanks ...這是因爲我們需要給出一個元組列表來創建數據幀()..對嗎? – Neo

相關問題