我期望通過app_name版本獲得計數的差異。我的數據集是這樣的:APP_NAME,VERSION_ID,計數,[差異]按列分組的R中的行之間的差異
這裏是集
data = structure(list(app_name = structure(c(1L, 1L, 1L, 2L, 2L, 2L,
2L, 3L, 3L), .Label = c("a", "b", "c"), class = "factor"), version_id = c(1,
1.1, 2.3, 2, 3.1, 3.3, 4, 1.1, 2.4), count = c(600L, 620L, 620L,
200L, 200L, 250L, 250L, 15L, 36L)), .Names = c("app_name", "version_id",
"count"), class = "data.frame", row.names = c(NA, -9L))
鑑於這種data.frame,我怎麼能雙方APP_NAME & VERSION_ID得到計數的滯後差?每個應用程序的初始(第一)版本差異將爲零,因爲沒有區別。
這裏是最後的結果會是什麼樣子與最後的「差異」列
structure(list(app_name = structure(c(1L, 1L, 1L, 2L, 2L, 2L,
2L, 3L, 3L), .Label = c("a", "b", "c"), class = "factor"), version_id = c(1,
1.1, 2.3, 2, 3.1, 3.3, 4, 1.1, 2.4), count = c(600L, 620L, 620L,
200L, 200L, 250L, 250L, 15L, 36L), diff = c(0, 20, 0, 0, 0, 1.25,
0, 0, 2.4)), .Names = c("app_name", "version_id", "count", "diff"
), class = "data.frame", row.names = c(NA, -9L))
到目前爲止您嘗試了什麼? –
@Pascal我一直在嘗試使用mutate()無濟於事。以下線程:http://stackoverflow.com/questions/31362397/calculating-the-difference-between-rows-in-a-dataframe-using-dplyr –