我有一個事件記錄,當一個設備開始或停止失敗代碼,我試圖計算失敗和開始之間的平均和平均時間。下面是數據的一個很簡單的例子表:MySQL group with with a lookahead?
+----+-----------+---------------------+
| id | eventName | eventTime |
+----+-----------+---------------------+
| 1 | start | 2012-11-01 14:25:20 |
| 2 | fail A | 2012-11-01 14:27:45 |
| 3 | start | 2012-11-01 14:30:49 |
| 4 | fail B | 2012-11-01 14:32:54 |
| 5 | start | 2012-11-01 14:35:59 |
| 6 | fail A | 2012-11-01 14:37:02 |
| 7 | start | 2012-11-01 14:38:05 |
| 8 | fail A | 2012-11-01 14:40:09 |
| 9 | start | 2012-11-01 14:41:11 |
| 10 | fail C | 2012-11-01 14:43:14 |
+----+-----------+---------------------+
創建代碼:
CREATE TABLE `test` (
`id` int(10) unsigned NOT NULL AUTO_INCREMENT,
`eventName` varchar(50) NOT NULL,
`eventTime` datetime NOT NULL,
PRIMARY KEY (`id`)
);
INSERT INTO `test` (`id`, `eventName`, `eventTime`) VALUES (1,'start','2012-11-01 14:25:20'),(2,'fail A','2012-11-01 14:27:45'),(3,'start','2012-11-01 14:30:49'),(4,'fail B','2012-11-01 14:32:54'),(5,'start','2012-11-01 14:35:59'),(6,'fail A','2012-11-01 14:37:02'),(7,'start','2012-11-01 14:38:05'),(8,'fail A','2012-11-01 14:40:09'),(9,'start','2012-11-01 14:41:11'),(10,'fail C','2012-11-01 14:43:14');
我可以得到啓動和使用這樣的一個失敗的次數:
SET @time_prev := -1;
SELECT
*
FROM
(
SELECT
eventName
, eventTime
, @ts := UNIX_TIMESTAMP(eventTime) AS ts
, @started := IF(eventName = 'start', 1, 0) AS started
, @failed := IF(eventName <> 'start', 1, 0) AS failed
, @time_diff := IF(@time_prev > -1, @ts - @time_prev, 0) AS time_diff
, @time_prev := @ts AS time_prev
, @time_to_fail := IF(@failed, @time_diff, 0) AS time_to_fail
, @time_to_start := IF(@started, @time_diff, 0) AS time_to_start
FROM
test
) AS t1;
+-----------+---------------------+------------+---------+--------+-----------+------------+--------------+---------------+
| eventName | eventTime | ts | started | failed | time_diff | time_prev | time_to_fail | time_to_start |
+-----------+---------------------+------------+---------+--------+-----------+------------+--------------+---------------+
| start | 2012-11-01 14:25:20 | 1351805120 | 1 | 0 | 0 | 1351805120 | 0 | 0 |
| fail A | 2012-11-01 14:27:45 | 1351805265 | 0 | 1 | 145 | 1351805265 | 0 | 145 |
| start | 2012-11-01 14:30:49 | 1351805449 | 1 | 0 | 184 | 1351805449 | 184 | 0 |
| fail B | 2012-11-01 14:32:54 | 1351805574 | 0 | 1 | 125 | 1351805574 | 0 | 125 |
| start | 2012-11-01 14:35:59 | 1351805759 | 1 | 0 | 185 | 1351805759 | 185 | 0 |
| fail A | 2012-11-01 14:37:02 | 1351805822 | 0 | 1 | 63 | 1351805822 | 0 | 63 |
| start | 2012-11-01 14:38:05 | 1351805885 | 1 | 0 | 63 | 1351805885 | 63 | 0 |
| fail A | 2012-11-01 14:40:09 | 1351806009 | 0 | 1 | 124 | 1351806009 | 0 | 124 |
| start | 2012-11-01 14:41:11 | 1351806071 | 1 | 0 | 62 | 1351806071 | 62 | 0 |
| fail C | 2012-11-01 14:43:14 | 1351806194 | 0 | 1 | 123 | 1351806194 | 0 | 123 |
+-----------+---------------------+------------+---------+--------+-----------+------------+--------------+---------------+
但爲了在失敗和開始之間獲得時間,我必須前進到下一個記錄並丟失該失敗代碼的分組。我怎樣才能將其移動到下一個級別,並讓未來的時間開始合併到失敗的記錄中,以便將其分組?
最終,計算平均值和中位數後,我最終會設置這樣的結果:
+-----------+-------------+----------------+--------------+-----------------+
| eventName | avg_to_fail | median_to_fail | avg_to_start | median_to_start |
+-----------+-------------+----------------+--------------+-----------------+
| fail A | 110.66 | 124.00 | 103.00 | 63.00 |
| fail B | 125.00 | 125.00 | 185.00 | 185.00 |
+-----------+-------------+----------------+--------------+-----------------+
我刪除位數從標題,這不是問題。問題是根據下一行數據計算第二個avg/median。 – fwrawx
@fwrawx - 我已經根據您的規範更新它,以提供avg_to_fail。適應avg_to_start很容易。然後,您可以完全外連接EventName上的兩個結果集。 – Laurence
* _to_fail是很容易的部分,獲取* _to_start和合並是困難的部分,因爲1)eventName對於所有記錄是相同的,並且2)時間是從前一記錄計算的 – fwrawx