我想提取一組交易的關聯規則與下面的代碼火花斯卡拉:協會規則,頻繁模式挖掘
val fpg = new FPGrowth().setMinSupport(minSupport).setNumPartitions(10)
val model = fpg.run(transactions)
model.generateAssociationRules(minConfidence).collect()
但產品數量都超過10K所以提取的規則對所有組合具有計算表現力,我也不需要它們。所以我想只提取成對:
Product 1 ==> Product 2
Product 1 ==> Product 3
Product 3 ==> Product 1
,我不關心其他組合,如:
[Product 1] ==> [Product 2, Product 3]
[Product 3,Product 1] ==> Product 2
有沒有辦法做到這一點?
感謝, 阿米爾
順便說一句,我正在做Spark-Scala – Amir