我上車計算結果爲數據集中工作機器學習和數據集是這樣編碼列標籤在熊貓的機器學習
buying,maint,doors,persons,lug_boot,safety,class
vhigh,vhigh,2,2,small,low,unacc
vhigh,vhigh,2,2,small,med,unacc
vhigh,vhigh,2,2,small,high,unacc
vhigh,vhigh,2,2,med,low,unacc
vhigh,vhigh,2,2,med,med,unacc
vhigh,vhigh,2,2,med,high,unacc
我想這些字符串轉換爲唯一枚舉整數縱列。我看到pandas.factorize()是要走的路,但它只能在一列上工作。如何用一個命令一次性分解數據幀。
我試過lambda函數,它不工作。
df.apply(拉姆達C:pd.factorize(c)中,軸= 1)
輸出:
0 ([0, 0, 1, 1, 2, 3, 4], [vhigh, 2, small, low,...
1 ([0, 0, 1, 1, 2, 3, 4], [vhigh, 2, small, med,...
2 ([0, 0, 1, 1, 2, 3, 4], [vhigh, 2, small, high...
3 ([0, 0, 1, 1, 2, 3, 4], [vhigh, 2, med, low, u...
4 ([0, 0, 1, 1, 2, 2, 3], [vhigh, 2, med, unacc])
5 ([0, 0, 1, 1, 2, 3, 4], [vhigh, 2, med, high, ...
我看到編碼值,但不能拔出了這一點從上面的陣列
難道你不想做'df.apply(pd.factorize)'嗎? – EdChum 2014-08-27 15:17:20