2017-04-16 46 views
1

我有一個數據幀p1。我想轉移列a。查找每行的最小值並返回具有最小值的列名稱。按列轉置數據幀,找到列最小值並返回索引

a=c(0,1,2,3,4,0,1,2,3,4) 
b=c(10,20,30,40,50,9,8,7,6,5) 
p1=data.frame(a,b) 
p1 


> p1 
    a b 
1 0 10 
2 1 20 
3 2 30 
4 3 40 
5 4 50 
6 0 9 
7 1 8 
8 2 7 
9 3 6 
10 4 5 

最終所需的答案

0 1 2 3 4 row_minimum column_index_of_minimum 
10 20 30 40 50 10    0 
9 8 7 6 5 5    4 

回答

2

我用過很多事情,但主要是ave(p1$a, p1$a, FUN = seq_along)這讓我的b成組基礎上,他們與a

共伴生的次數分開
myans = setNames(data.frame(do.call(rbind, lapply(split(p1, ave(p1$a, p1$a, FUN = seq_along)), 
      function(x) x[,2]))), nm = rbind(p1$a[ave(p1$a, p1$a, FUN = seq_along) == 1])) 
minimum = apply(myans, 1, min) 
index = colnames(myans)[apply(myans, 1, which.min)] 
myans$min = minimum 
myans$index = index 
myans 
# 0 1 2 3 4 min index 
#1 10 20 30 40 50 10  0 
#2 9 8 7 6 5 5  4 
1

考慮使用運行組計數,然後進行聚合和重塑:

# RUNNING GROUP COUNT 
p1$grpcnt <- sapply(seq(nrow(p1)), function(i) sum(p1[1:i, c("a")]==p1$a[[i]])) 

# MINIMUM OF B BY GROUP COUNT MERGING TO RETRIEVE A VALUE 
aggdf <- setNames(merge(aggregate(b~grpcnt, p1, FUN=min),p1,by="b")[c("grpcnt.x","b","a")], 
        c("grpcnt", "row_minimum", "column_index_of_minimum")) 

# RESHAPE/TRANSPOSE LONG TO WIDE 
reshapedf <- setNames(reshape(p1, timevar=c("a"), idvar=c("grpcnt"), direction="wide"), 
         c("grpcnt", paste(unique(p1$a)))) 
# FINAL MERGE 
finaldf <- merge(reshapedf, aggdf, by="grpcnt")[-1] 
finaldf 

# 0 1 2 3 4 row_minimum column_index_of_minimum 
# 1 10 20 30 40 50   10      0 
# 2 9 8 7 6 5   5      4 
相關問題