2016-07-30

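How to compute a cumulative sum by time "bin" in R, with ggplot

I would like to create a cumulative graph as shown here, with one additional caveat. The steps should be based on 2-minute time intervals, so an interval may contain several entries or none at all.

I used rowSums to create the columns of values that are passed to cumsum, for example: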

df_so$intraverbal <- rowSums(df_so[-1] == "intraverbal")
df_so$tact <- rowSums(df_so[-1] == "tact")
df_so$mand <- rowSums(df_so[-1] == "mand")
df_so$echoic <- rowSums(df_so[-1] == "echoic")

The chart comes out well enough using plot:

plot(cumsum(df_so$intraverbal), type="s") 

A cumulative graph - not by 2 minute intervals.

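However, this approach falls short in a few ways. Ideally, the data would be tallied and labelled by "time bin". At a minimum, the time bins should appear as the x-axis labels, but the increments are not continuous, because bins with no entries are missing from the data. Presumably I should use dplyr or lapply to melt and merge them, but I don't know how. Perhaps as described here.

It would be nice to do this with ggplot, so that the different cumsums can go on the same chart, for example as here, or perhaps with stat_bin as here.

Here is a small working example of the data: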

df_so <- structure(list(time.bin = structure(c(1L, 1L, 1L, 1L, 1L, 1L,1L, 124L, 124L, 124L), .Label = c("0:00:00", "0:02:00", "0:04:00","0:06:00", "0:08:00", "0:10:00", "0:12:00", "0:14:00", "0:16:00","0:18:00", 
     "0:20:00", "0:22:00", "0:24:00", "0:26:00", "0:28:00","0:30:00", "0:32:00", "0:34:00", "0:36:00", "0:38:00", "0:40:00","0:42:00", "0:44:00", "0:46:00", "0:48:00", "0:50:00", "0:52:00","0:54:00", "0:56:00", "0:58:00", 
     "1:00:00", "1:02:00", "1:04:00","1:06:00", "1:08:00", "1:10:00", "1:12:00", "1:14:00", "1:16:00","1:18:00", "1:20:00", "1:22:00", "1:24:00", "1:26:00", "1:28:00","1:30:00", "1:32:00", "1:34:00", "1:36:00", "1:38:00", 
     "1:40:00","1:42:00", "1:44:00", "1:46:00", "1:48:00", "1:50:00", "1:52:00","1:54:00", "1:56:00", "1:58:00", "2:00:00", "2:02:00", "2:04:00","2:06:00", "2:08:00", "2:10:00", "2:12:00", "2:14:00", "2:16:00","2:18:00", 
     "2:20:00", "2:22:00", "2:24:00", "2:26:00", "2:28:00","2:30:00", "2:32:00", "2:34:00", "2:36:00", "2:38:00", "2:40:00","2:42:00", "2:44:00", "2:46:00", "2:48:00", "2:50:00", "2:52:00","2:54:00", "2:56:00", "2:58:00", 
     "3:00:00", "3:02:00", "3:04:00","3:06:00", "3:08:00", "3:10:00", "3:12:00", "3:14:00", "3:16:00","3:18:00", "3:20:00", "3:22:00", "3:24:00", "3:26:00", "3:28:00","3:30:00", "3:32:00", "3:34:00", "3:36:00", "3:38:00", "3:40:00","3:42:00", "3:44:00", "3:48:00", "3:50:00", "3:52:00", "3:54:00","3:56:00", "3:58:00", "4:00:00", "4:02:00", "4:04:00", "4:06:00","4:08:00"), class = "factor"), 
     Primary.VB = structure(c(1L,3L, 1L, 3L, 1L, 3L, 1L, 1L, 1L, 1L), .Label = c("", "echoic","intraverbal", "mand", "tact"), class = "factor"), 
     Secondary.VB = structure(c(1L,1L, 1L, 5L, 1L, 1L, 1L, 1L, 1L, 1L), .Label = c("", "echoic","intraverbal", "mand", "tact"), class = "factor"), 
     Tertiary.VB = structure(c(1L,1L, 1L, 3L, 1L, 1L, 1L, 1L, 1L, 1L), .Label = c("", "intraverbal","mand", "tact"), class = "factor"), intraverbal = c(0, 1, 0,1, 0, 1, 0, 0, 0, 0), 
     tact = c(0, 0, 0, 1, 0, 0, 0, 0, 0, 0),mand = c(0, 0, 0, 1, 0, 0, 0, 0, 0, 0), 
     echoic = c(0, 0,0, 0, 0, 0, 0, 0, 0, 0)), .Names = c("time.bin", "Primary.VB","Secondary.VB","Tertiary.VB","intraverbal", 
     "tact", "mand", "echoic"), row.names = c(1L, 2L,3L, 4L, 5L, 6L, 7L, 1648L, 1649L, 1650L), class = "data.frame") 

Answers

Not an answer, just an extended comment, which I will delete. If we ignore for the moment that the x-axis shows factor levels... does it look okay?
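
A minimal sketch of the binning step being discussed, not necessarily the code this answer used. It assumes each row carries a `time.bin` factor whose levels list every 2-minute bin, plus a 0/1 indicator column. The `toy` data frame and its values are made up for illustration and are not the asker's data. It uses base R for the counting, so empty bins show up as zero-height steps, and only draws with ggplot2 if that package is installed.

```r
# Toy data (hypothetical, for illustration only): one row per observation.
# The factor levels list every 2-minute bin, including bins with no rows.
all_bins <- c("0:00:00", "0:02:00", "0:04:00", "0:06:00")
toy <- data.frame(
  time.bin    = factor(c("0:00:00", "0:00:00", "0:00:00", "0:06:00", "0:06:00"),
                       levels = all_bins),
  intraverbal = c(0, 1, 1, 0, 1)
)

# Sum the indicator within each bin. tapply() returns one entry per factor
# level, so bins without rows come back as NA and are set to 0 here.
per_bin <- tapply(toy$intraverbal, toy$time.bin, sum)
per_bin[is.na(per_bin)] <- 0

# One row per bin, in level order, with the running total.
binned <- data.frame(
  time.bin = factor(names(per_bin), levels = all_bins),
  count    = as.numeric(per_bin)
)
binned$cumulative <- cumsum(binned$count)

# Step plot with the bin labels on the x-axis (skipped if ggplot2 is absent).
if (requireNamespace("ggplot2", quietly = TRUE)) {
  library(ggplot2)
  p <- ggplot(binned, aes(x = time.bin, y = cumulative, group = 1)) +
    geom_step() +
    labs(x = "2-minute time bin", y = "Cumulative count")
  print(p)
}
```

Because the x variable is a factor, `group = 1` is needed so that the steps are joined across bins rather than treated as separate groups.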


I added a picture. This is a cumulative count, so the values should keep rising. – Donnied


@Donnied edited – Dambo


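Looks great. I'm not familiar with these functions (tbl_df, group_by). I think it is working by time bin, as intended. Since it's ggplot, the x labels should be simple (?). Bonus points for intraverbal and echoic together... – Donnied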
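
One possible way to get intraverbal and echoic onto the same chart, reading "together" as "two lines on one plot". This is a sketch under assumptions: the `toy` data frame is made up for illustration, and the long-format reshaping is done in base R rather than with whatever the answer used. The x labels are rotated so that many 2-minute bins remain readable.

```r
# Toy data (hypothetical, for illustration only).
all_bins <- c("0:00:00", "0:02:00", "0:04:00")
toy <- data.frame(
  time.bin    = factor(c("0:00:00", "0:00:00", "0:04:00"), levels = all_bins),
  intraverbal = c(1, 1, 1),
  echoic      = c(0, 1, 0)
)
behaviours <- c("intraverbal", "echoic")

# Build one cumulative series per behaviour and stack them in long format.
long <- do.call(rbind, lapply(behaviours, function(b) {
  per_bin <- tapply(toy[[b]], toy$time.bin, sum)
  per_bin[is.na(per_bin)] <- 0  # bins with no rows count as zero
  data.frame(
    time.bin   = factor(names(per_bin), levels = all_bins),
    behaviour  = b,
    cumulative = cumsum(as.numeric(per_bin))
  )
}))

# One step line per behaviour (skipped if ggplot2 is absent).
if (requireNamespace("ggplot2", quietly = TRUE)) {
  library(ggplot2)
  p <- ggplot(long, aes(x = time.bin, y = cumulative,
                        colour = behaviour, group = behaviour)) +
    geom_step() +
    labs(x = "2-minute time bin", y = "Cumulative count") +
    theme(axis.text.x = element_text(angle = 90, vjust = 0.5))
  print(p)
}
```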