Q

Analyzing confusion matrices in WEKA

2014-04-16 306 views -1 likes


Hi, I have another question about comparing confusion matrices. Below are two confusion matrices.

      a   b   classified as
    349  58   a tested_negative
     93 124   b tested_positive

      a   b   classified as
    346  61   a tested_negative
     90 127   b tested_positive

I know that the diagonal from top-left to bottom-right holds the correctly classified instances, but here the diagonal sum is the same for both matrices, so how can I decide which one is best?

2014-04-16 Mohammad Hasan

A

Answers

1

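It really depends on your specific application. Suppose you want to minimize the number of false alarms (because dealing with the consequences of any false alarm would cost you a lot of money).

In that case, choose the first classifier, because a smaller share of its positive predictions are false alarms:

58 / (58 + 124) < 61 / (61 + 127), i.e. 0.3186813 < 0.3244681

Strictly speaking, this ratio, FP / (FP + TP), is the false discovery rate. The false positive rate, FP / (FP + TN), gives the same ranking: 58 / 407 ≈ 0.1425 for the first classifier versus 61 / 407 ≈ 0.1499 for the second.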

See here: http://en.wikipedia.org/wiki/Accuracy_and_precision

and here: http://en.wikipedia.org/wiki/Sensitivity_and_specificity

If you just want "the best classifier", you have a problem, because both classifiers have the same accuracy:

A1 = (349 + 124) / (349 + 124 + 58 + 93) = 0.7580128

A2 = (346 + 127) / (346 + 127 + 61 + 90) = 0.7580128

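So you need to analyze your domain or industry and decide whether you want to:

1) get as few false alarms as possible - then choose the classifier with the lowest false alarm (false positive) rate;

2) get as few missed cases as possible - then choose the classifier with the lowest false negative rate;

3) get as many hits as possible - then choose the classifier with the highest true positive rate;

4) get more correct rejections - then choose the classifier with the highest true negative rate.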
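To compare the two classifiers on all four criteria at once, the rates can be computed directly from the matrices. Below is a minimal sketch in plain Python, not WEKA output. It assumes tested_positive is the positive class and that rows are actual classes and columns are predicted classes; the helper name `rates` and the dictionary keys are made up for this illustration.

```python
def rates(tn, fp, fn, tp):
    """Common metrics for a 2x2 confusion matrix (assumed positive class: tested_positive)."""
    total = tn + fp + fn + tp
    return {
        "accuracy": (tp + tn) / total,
        "false_positive_rate": fp / (fp + tn),   # false alarms among actual negatives
        "false_negative_rate": fn / (fn + tp),   # missed cases among actual positives
        "true_positive_rate": tp / (tp + fn),    # hits among actual positives
        "true_negative_rate": tn / (tn + fp),    # correct rejections among actual negatives
        "false_discovery_rate": fp / (fp + tp),  # false alarms among positive predictions
    }

# Assumed layout: rows are actual classes, columns are predicted classes.
first = rates(tn=349, fp=58, fn=93, tp=124)
second = rates(tn=346, fp=61, fn=90, tp=127)

# Print both classifiers side by side, one metric per line.
for name in first:
    print(f"{name:22s} {first[name]:.4f} {second[name]:.4f}")
```

Under these assumptions the first classifier comes out ahead on criteria 1 and 4 (fewer false alarms, more correct rejections), while the second comes out ahead on criteria 2 and 3 (fewer missed cases, more hits), so the choice depends on which kind of error costs more in your application.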

2014-04-16 18:48:13
