假設我有一個別名transactions
與此數據:計數不同的元素
person store spent
A S 3.3
A S 4.7
B S 1.2
B T 3.4
我想找出許多不同的人如何去每家店和多少,他們在那裏度過的:
store visitors revenue
S 2 9.2
T 1 3.4
我希望我能做到一步到位:
stores = foreach (group transactions by store) generate
group as store, SUM(transactions.spent) as revenue,
COUNT(UNIQUE(transactions.person)) as visitors;
,但它並不像有這樣事情爲UNIQUE
。
我堅持兩個步驟?
tr1 = foreach (group transactions by (store,person)) generate
group.store as store, SUM(spent) as revenue;
stores = foreach (group tr1 by store) generate
group as store, COUNT(tr1) as visitors, SUM(revenue) as revenue;
你是說「DISTINCT」可以放在包裏嗎? – sds
你會想要在嵌套的foreach中執行DISTINCT。 – Metropolis