2016-08-25 72 views
3

Different maximum values in R when using the by command

I have the following R code:

library(ggplot2) 
data(diamonds) 
by(diamonds$price, diamonds$cut, summary) 
by(diamonds$price, diamonds$cut, max) 

This gives me:

diamonds$cut: Fair
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
    337    2050    3282    4359    5206   18570
----------------------------------------------
diamonds$cut: Good
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
    327    1145    3050    3929    5028   18790
----------------------------------------------
diamonds$cut: Very Good
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
    336     912    2648    3982    5373   18820
----------------------------------------------
diamonds$cut: Premium
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
    326    1046    3185    4584    6296   18820
----------------------------------------------
diamonds$cut: Ideal
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max.
    326     878    1810    3458    4678   18810

diamonds$cut: Fair
[1] 18574
----------------------------------------------
diamonds$cut: Good
[1] 18788
----------------------------------------------
diamonds$cut: Very Good
[1] 18818
----------------------------------------------
diamonds$cut: Premium
[1] 18823
----------------------------------------------
diamonds$cut: Ideal
[1] 18806

I don't understand why the maximum values differ between the two tables. Shouldn't the maximum be the same in both?

Answers

5

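The cause is the digits argument of summary (see ?summary). In the R version used here, summary rounds its results to four significant digits by default, so a maximum of 18574 is printed as 18570. Requesting more digits:

by(diamonds$price, diamonds$cut, function(x) summary(x, digits = 10))

gives the same maximum values as max.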
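The rounding can be checked without the data set. The following is a minimal sketch, assuming summary rounds to four significant digits as it appears to in the output above (newer R releases may format the result differently). The values are copied from the max table in the question; the variable names max_price and rounded are only illustrative.

```r
# Maximum prices per cut, as reported by max() in the question
max_price <- c(Fair = 18574, Good = 18788, `Very Good` = 18818,
               Premium = 18823, Ideal = 18806)

# Rounding to four significant digits reproduces the Max. column
# shown by summary() in the question
rounded <- signif(max_price, digits = 4)
print(rounded)
```

Comparing the printed values with the Max. column of the first table shows that the two tables agree once the rounding is taken into account.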


digits = 5 would be enough here. – mtoto


I find the behaviour of the first line, "by(diamonds$price, diamonds$cut, summary)", very strange. How does this line compute the maximum in each case? – Ohumeronen