2016-06-21 57 views
0

我有一個ID,交易,這些交易的日期和這些交易的類別的列表。我想每個ID中創建的每個不同種類的計統計不同和窗口函數

出發表我是看起來像這樣:

id trxn_dt  trxn_amt trxn_category 
1 10/31/2014 58  apple 
1 11/9/2014 34  banana 
1 12/10/2014 12  apple 
2 7/8/2014 78  banana 
2 11/20/2014 99  banana 
3 1/5/2014 120  orange 
4 2/17/2014 588  apple 
4 2/18/2014 8  banana 
4 3/9/2014 65  orange 
4 4/25/2014 74  apple 

而且我想最終的結果看起來是這樣的:

id trxn_dt  trxn_amt trxn_category number_category 
1 10/31/2014 58  apple   2 
1 11/9/2014 34  banana   2 
1 12/10/2014 12  apple   2 
2 7/8/2014 78  banana   1 
2 11/20/2014 99  banana   1 
3 1/5/2014 120  orange   1 
4 2/17/2014 588  apple   3 
4 2/18/2014 8  banana   3 
4 3/9/2014 65  orange   3 
4 4/25/2014 74  apple   3 

我一直在使用count(distinct(trxn_category)) over(partition by id,trxn_category order by id) as number_category嘗試,但我得到一個錯誤有關使用「獨立的」

+1

是什麼***確切的***錯誤信息?你正在使用哪些DBMS? (Postgres?Oracle?SQL Server?)。另外:'distinct'是**不是**的函數。 –

回答

0

您可以使用相關子查詢此:

SELECT id, trxn_dt, trxn_amt, trxn_category, 
     (SELECT COUNT(DISTINCT trxn_category) 
     FROM mytable AS t2 
     WHERE t2.id = t1.id) AS cnt 
FROM mytable AS t1 

Demo here

0
;WITH cteCounts AS (
    SELECT 
     id 
     ,COUNT(DISTINCT trxn_category) as CategoryCount 
    FROM 
     table 
    GROUP BY 
     id 
) 


SELECT 
    t.id 
    ,trxn_dt 
    ,trxn_amt 
    ,trxn_category 
    ,c.CategoryCount 
FROM 
    table t 
    INNER JOIN cteCounts c 
    ON t.id = c.id 

不能在分區聚集使用DISTINCT。

0

大多數的DBMS不支持窗口化功能DISTINCT,但你可以模擬使用兩個DENSE_RANKs COUNT(DISTINCT):

DENSE_RANK() over (partition by id,trxn_category order by id ASC)- 
DENSE_RANK() over (partition by id,trxn_category order by id DESC) 

或嵌套MAX(DENSE_RANK):

select 
    MAX(dr) over (partition by id,trxn_category) 
from 
(
    select 
     DENSE_RANK() over (partition by id,trxn_category order by id DESC) as dr 
)