我有這個數據幀調用mydf
。 Sample
列中有重複的樣本。我想提取具有最大值total_reads
的唯一樣品行並獲取result
。在R數據幀的另一列中提取具有最大值的唯一行
mydf<-structure(list(Sample = c("AOGC-02-0188", "AOGC-02-0191", "AOGC-02-0191",
"AOGC-02-0191", "AOGC-02-0194", "AOGC-02-0194", "AOGC-02-0194"
), total_reads = c(27392583, 19206920, 34462563, 53669483, 24731988,
43419826, 68151814), Lane = c("4", "5", "4", "4;5", "5", "4",
"4;5")), .Names = c("Sample", "total_reads", "Lane"), row.names = c("166",
"169", "170", "171", "173", "174", "175"), class = "data.frame")
結果
Sample total_reads Lane
AOGC-02-0188 27392583 4
AOGC-02-0191 53669483 4;5
AOGC-02-0194 68151814 4;5
的可能的複製[集合在給定的列中的數據幀,並顯示另一列(http://stackoverflow.com/questions/6289538/aggregate-a-dataframe-on-a-given-column-and -display-另一個列) – Bulat