0
我有一個包含40列,每行100.000行的數據集,我需要過濾/減少/精簡:因此,我想刪除2014年10月1日之前以及20.8之後所做的所有訂單0.2016(時間跨度我要保持在表1.10.2104-20.8.2016)我怎樣才能做到這一點(和剛刪除不需要的舊數據出表)Here's一個例子:刪除沒有特定時間跨度的行
DB <- data.frame(orderID = c(1,2,3,4,5,6,7,8,9,10),
orderDate = c("01.07.2014 05:11","12.08.2014 12:39","09.09.2015 09:14","04.10.2014 16:15","02.11.2015 07:04", "10.11.2015 16:52","20.02.2016 08:08","12.04.2016 14:07","24.07.2016 17:04","09.09.2016 06:04"),
itemID = c(2,3,2,5,12,4,2,3,1,5),
size = c("m", "l", 42, "xxl", "m", 42, 39, "m", "m", 44),
color = c("green", "red", "blue", "yellow", "red", "yellow", "blue", "red", "green", "black"),
manufacturer = c("11", "12", "13", "12", "13", "13", "12", "11", "11", "13")
customerID = c(1, 2, 3, 1, 1, 3, 2, 2, 1, 1)
預期結果:
DB <- data.frame(orderID = c(3,4,5,6,7,8,9),
orderDate = c("09.09.2015 09:14","04.10.2014 16:15","02.11.2015 07:04", "10.11.2015 16:52","20.02.2016 08:08","12.04.2016 14:07","24.07.2016 17:04"),
itemID = c(2,5,12,4,2,3,1),
size = c(42, "xxl", "m", 42, 39, "m", "m"),
color = c("blue", "yellow", "red", "yellow", "blue", "red", "green"),
manufacturer = c("13", "12", "13", "13", "12", "11", "11")
customerID = c(3, 1, 1, 3, 2, 2, 1)
看看[此篇](HTTP:/ /stackoverflow.com/questions/23622338/subset-a-dataframe-between-2-dates-in-r-better-way)。您也可以使用lubridate和'dmy_hm'來格式化日期 – etienne