2013-07-26 38 views
2

我有以下數據框;使用python熊貓查找'Time Delayed'

Group  Deadline Time Deadline Date Task Completed Date Task Completed Time 
Group 1 20:00:00   17-07-2012  17-07-2012    20:34:00 
Group 2 20:15:00   17-07-2012  17-07-2012    20:39:00 
Group 3 22:00:00   17-07-2012  17-07-2012    22:21:00 
Group 4 23:50:00   17-07-2012  18-07-2012    00:09:00 
Group 5 20:00:00   18-07-2012  18-07-2012    20:37:00 
Group 6 20:15:00   18-07-2012  18-07-2012    21:13:00 
Group 7 22:00:00   18-07-2012  18-07-2012    22:56:00 
Group 8 23:50:00   18-07-2012  19-07-2012    00:01:00 
Group 9 20:15:00   19-07-2012  19-07-2012    20:34:00 
Group 10 20:00:00   19-07-2012  19-07-2012    20:24:00 

如何計算時間延遲爲;

Time Delay (mins) 
00:34:00 
00:24:00 
00:21:00 
00:19:00 
00:37:00 
00:58:00 
00:56:00 
00:11:00 
00:19:00 
00:24:00 

我試過沒有成功;

  1. 結合的「截止日期」「日期」 &「時間」欄和「任務完成」「日期」 &「時間」列和

  2. 尋找「完成任務」的區別是 - 「截止日期' 時間。

回答

3

它們合併爲字符串(「加法」的作品),並將其轉換爲datetime類型,然後減去,這給timedelta類型的系列。

In [14]: deadline = pd.to_datetime(df['Deadline Date'] + ' ' + df['Deadline Time']) 

In [15]: completed = pd.to_datetime(df['Task Completed Date'] + ' ' + df['Task Completed Time']) 

In [16]: completed - deadline 
Out[16]: 
0 00:34:00 
1 00:24:00 
2 00:21:00 
3 00:19:00 
4 00:37:00 
5 00:58:00 
6 00:56:00 
7 00:11:00 
8 00:19:00 
9 00:24:00 
dtype: timedelta64[ns] 
+0

很酷!謝謝丹 – richie