我想在應用groupby函數後使用列變量的標準偏差從熊貓數據框中刪除異常值。應用.groupby()爭論後用熊貓數據框中的NaN代替異常值
這是我的數據幀:
ARI Flesch Kincaid Speaker Score
0 -2.090000 121.220000 -3.400000 NaN NaN
1 8.276460 64.478573 9.034156 William Dudley 1.670275
2 19.570911 27.362067 17.253580 Janet Yellen -0.604757
3 -2.090000 121.220000 -3.400000 NaN NaN
4 -2.090000 121.220000 -3.400000 NaN NaN
5 20.643483 17.069411 18.394178 Lael Brainard 0.215396
6 -2.090000 121.220000 -3.400000 NaN NaN
7 -2.090000 121.220000 -3.400000 NaN NaN
8 12.624198 52.220468 11.403157 Jerome H. Powell -1.350798
9 18.466305 35.186261 16.205693 Stanley Fischer 0.522121
10 -2.090000 121.220000 -3.400000 NaN NaN
11 16.953460 36.246573 15.323457 Lael Brainard -0.217779
12 -2.090000 121.220000 -3.400000 NaN NaN
13 -2.090000 121.220000 -3.400000 NaN NaN
14 17.066088 32.592551 16.108486 Stanley Fischer 0.642245
15 -2.090000 121.220000 -3.400000 NaN NaN
我想第一組數據幀由「揚聲器」,然後除去「ARI」,「弗萊士」和「金凱德」值異常值所界定與特定特徵的平均值相比超過3個標準偏差。
請讓我知道這是否可能。謝謝!
你可以把你的數據的片段,而不是附加圖像?人們更容易複製它。 – titipata
更好嗎?謝謝! –
完美,謝謝格雷厄姆。有人會很快解決它:) – titipata