2011-12-28 162 views
0

我有一個由時間組成的因子變量。當我試圖把它變成一個因素時,R按從低到高的順序排列它們。 Unforunately,這意味着我最終像下面這樣:按時間排序矢量

1:48:55 1:53:11 1:58:18 10:04:01 10:12:45 10:14:16 
10:20:24 10:21:59 10:23:11 10:23:40 10:30:40 10:34:13 
10:39:47 10:54:33 10:58:43 3:21:59 3:27:38 3:37:58 
3:49:00 4:20:42 4:30:52 4:37:39 4:40:08 4:40:55 
4:42:25 5:03:25 5:10:30 5:10:49 5:12:39 5:14:20 

因此,而不是打印爲了與時俱進,與10點00分時間7點之前來了。

有沒有一種快速的方法來按實際時間排序?

time = c("1:48:55", "1:53:11", "1:58:18", "10:04:01", "10:12:45", "10:14:16", "10:20:24", "10:21:59", "10:23:11", "10:23:40", "10:30:40", "10:34:13", "10:39:47", "10:54:33", "10:58:43", "3:21:59", "3:27:38", "3:37:58", "3:49:00", "4:20:42", "4:30:52", "4:37:39", "4:40:08", "4:40:55", "4:42:25", "5:03:25", "5:10:30", "5:10:49", "5:12:39", "5:14:20") 

time = factor(time) 

回答

3

試試這個:

library(chron) 
sort(times(time)) 

或者這個,如果你想排序的載體仍然是一個因素:

time[order(times(time))] 
2

的項目都在他們的順序,所以你用factor,他們將留在該訂單,但你可以設置的方式水平,將讓迴歸報告這些「正常」。如果你想要的時間值,你需要選擇一個時間類的工作他們得到整理

require(gtools) 
mixedsort(time) 
[1] "1:48:55" "1:53:11" "1:58:18" "3:21:59" "3:27:38" "3:37:58" "3:49:00" "4:20:42" "4:30:52" 
[10] "4:37:39" "4:40:08" "4:40:55" "4:42:25" "5:03:25" "5:10:30" "5:10:49" "5:12:39" "5:14:20" 
[19] "10:04:01" "10:12:45" "10:14:16" "10:20:24" "10:21:59" "10:23:11" "10:23:40" "10:30:40" "10:34:13" 
[28] "10:39:47" "10:54:33" "10:58:43" 

> time=factor(time, levels= mixedsort(time)) 
> levels(time) 
[1] "1:48:55" "1:53:11" "1:58:18" "3:21:59" "3:27:38" "3:37:58" "3:49:00" "4:20:42" "4:30:52" 
[10] "4:37:39" "4:40:08" "4:40:55" "4:42:25" "5:03:25" "5:10:30" "5:10:49" "5:12:39" "5:14:20" 
[19] "10:04:01" "10:12:45" "10:14:16" "10:20:24" "10:21:59" "10:23:11" "10:23:40" "10:30:40" "10:34:13" 
[28] "10:39:47" "10:54:33" "10:58:43" 
> time 
[1] 1:48:55 1:53:11 1:58:18 10:04:01 10:12:45 10:14:16 10:20:24 10:21:59 10:23:11 10:23:40 10:30:40 
[12] 10:34:13 10:39:47 10:54:33 10:58:43 3:21:59 3:27:38 3:37:58 3:49:00 4:20:42 4:30:52 4:37:39 
[23] 4:40:08 4:40:55 4:42:25 5:03:25 5:10:30 5:10:49 5:12:39 5:14:20 
30 Levels: 1:48:55 1:53:11 1:58:18 3:21:59 3:27:38 3:37:58 3:49:00 4:20:42 4:30:52 4:37:39 ... 10:58:43 

:如預期gtools的mixedsort功能將這樣的特徵向量排序。該代下包是一個可能的選擇:

time = c("1:48:55", "1:53:11", "1:58:18", "10:04:01", "10:12:45", "10:14:16", "10:20:24", "10:21:59", "10:23:11", "10:23:40", "10:30:40", "10:34:13", "10:39:47", "10:54:33", "10:58:43", "3:21:59", "3:27:38", "3:37:58", "3:49:00", "4:20:42", "4:30:52", "4:37:39", "4:40:08", "4:40:55", "4:42:25", "5:03:25", "5:10:30", "5:10:49", "5:12:39", "5:14:20") 

> chron(times=time) 
[1] 01:48:55 01:53:11 01:58:18 10:04:01 10:12:45 10:14:16 10:20:24 10:21:59 10:23:11 10:23:40 10:30:40 
[12] 10:34:13 10:39:47 10:54:33 10:58:43 03:21:59 03:27:38 03:37:58 03:49:00 04:20:42 04:30:52 04:37:39 
[23] 04:40:08 04:40:55 04:42:25 05:03:25 05:10:30 05:10:49 05:12:39 05:14:20 
> sort(chron(times=time)) 
[1] 01:48:55 01:53:11 01:58:18 03:21:59 03:27:38 03:37:58 03:49:00 04:20:42 04:30:52 04:37:39 04:40:08 
[12] 04:40:55 04:42:25 05:03:25 05:10:30 05:10:49 05:12:39 05:14:20 10:04:01 10:12:45 10:14:16 10:20:24 
[23] 10:21:59 10:23:11 10:23:40 10:30:40 10:34:13 10:39:47 10:54:33 10:58:43 
+0

短另外:注意因素無定義!但是,由於R具有您需要的所有內容,因此數據類型爲「ordered」,這只是定義順序的一個因素。如果你在DWINS的答案中用'ordered'替換'factor',你的數據類型將會更有意義(但是,如果你只顯示這個因子,那麼它們之間沒有明顯的差別,但是在某些統計方法中它會。 – Thilo 2011-12-28 19:44:37

+1

我對有序因素的經驗是,他們在任何地方播下了混亂,所以我不推薦他們去新手。 – 2011-12-28 19:47:35