2014-05-02 47 views
3

我有3 Cassandra nodes可以說c1,c2 and c3。我想將Hadoop與Cassandra集成,以便我可以在Hadoop上運行我的豬腳本以從Cassandra讀取數據並分析它。所以我有hadoop像這樣設置h1 as name-node , h2 as data-node, c1 as data-node and c3 as data-node. Here h2 node is a only hadoop data-node and not with the any Cassandra node。我的問題是while reading and processing data through pig/mapredude does it uses h2 data-node?Hadoop Cassandra集成設計

回答

0

糾正我,如果我錯了,但是你不需要在所有cassandra節點上安裝hadoop datanode? 我的理解是map-reduce在減少數據之前使用HDFS datanodes存儲中間結果。所以我認爲H2很有可能被使用。這是我的猜測,我期待着更正