2013-12-18 129 views
1

在我的設置中,由於電源中斷我的虛擬機在重啓所有虛擬機後出現故障 我無法啓動Hbase,所有其他服務都可以重新啓動/停止/啓動,但在UI CM 在HBase的操作選項卡的所有選項都隱藏..無法啓動Cloudera Manager中的HBase

以下是我的HBase的主日誌,現在它的退出模式

6:29:05.880 AM ERROR org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper 
ZooKeeper exists failed after 3 retries 
6:29:05.880 AM WARN org.apache.hadoop.hbase.zookeeper.ZKUtil  
hconnection Unable to set watcher on znode /hbase/master 
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase/master 
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) 
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) 
    at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041) 
    at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:172) 
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndCheckExists(ZKUtil.java:420) 
    at org.apache.hadoop.hbase.zookeeper.ZooKeeperNodeTracker.start(ZooKeeperNodeTracker.java:76) 
    at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.ensureZookeeperTrackers(HConnectionManager.java:621) 
    at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getMaster(HConnectionManager.java:684) 
    at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:126) 
    at org.apache.hadoop.hbase.master.HMasterCommandLine.stopMaster(HMasterCommandLine.java:175) 
    at org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCommandLine.java:106) 
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) 
    at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:76) 
    at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:1962) 
6:29:05.880 AM ERROR org.apache.hadoop.hbase.zookeeper.ZooKeeperWatcher 
hconnection Received unexpected KeeperException, re-throwing exception 
org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase/master 
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:99) 
    at org.apache.zookeeper.KeeperException.create(KeeperException.java:51) 
    at org.apache.zookeeper.ZooKeeper.exists(ZooKeeper.java:1041) 
    at org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper.exists(RecoverableZooKeeper.java:172) 
    at org.apache.hadoop.hbase.zookeeper.ZKUtil.watchAndCheckExists(ZKUtil.java:420) 
    at org.apache.hadoop.hbase.zookeeper.ZooKeeperNodeTracker.start(ZooKeeperNodeTracker.java:76) 
    at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.ensureZookeeperTrackers(HConnectionManager.java:621) 
    at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation.getMaster(HConnectionManager.java:684) 
    at org.apache.hadoop.hbase.client.HBaseAdmin.<init>(HBaseAdmin.java:126) 
    at org.apache.hadoop.hbase.master.HMasterCommandLine.stopMaster(HMasterCommandLine.java:175) 
    at org.apache.hadoop.hbase.master.HMasterCommandLine.run(HMasterCommandLine.java:106) 
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70) 
    at org.apache.hadoop.hbase.util.ServerCommandLine.doMain(ServerCommandLine.java:76) 
    at org.apache.hadoop.hbase.master.HMaster.main(HMaster.java:1962) 
6:29:05.881 AM INFO org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation 
This client just lost it's session with ZooKeeper, will automatically reconnect when needed. 
6:29:06.880 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderaalpha.alp.com/10.0.0.13:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:06.882 AM WARN org.apache.zookeeper.ClientCnxn 
Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect 
java.net.ConnectException: Connection refused 
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) 
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:735) 
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) 
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) 
6:29:06.983 AM WARN org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper 
Possibly transient ZooKeeper exception: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase/root-region-server 
6:29:06.983 AM INFO org.apache.hadoop.hbase.util.RetryCounter 
Sleeping 2000ms before retry #1... 
6:29:06.983 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderadn1.alp.com/10.0.0.14:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:06.984 AM WARN org.apache.zookeeper.ClientCnxn 
Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect 
java.net.ConnectException: Connection refused 
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) 
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:735) 
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) 
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) 
6:29:07.084 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderadn3.alp.com/10.0.0.16:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:27.103 AM INFO org.apache.zookeeper.ClientCnxn 
Client session timed out, have not heard from server in 20019ms for sessionid 0x0, closing socket connection and attempting reconnect 
6:29:27.204 AM WARN org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper 
Possibly transient ZooKeeper exception: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase/root-region-server 
6:29:27.204 AM INFO org.apache.hadoop.hbase.util.RetryCounter 
Sleeping 4000ms before retry #2... 
6:29:28.204 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderaalpha.alp.com/10.0.0.13:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:28.206 AM WARN org.apache.zookeeper.ClientCnxn 
Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect 
java.net.ConnectException: Connection refused 
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) 
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:735) 
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) 
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) 
6:29:28.306 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderadn1.alp.com/10.0.0.14:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:28.307 AM WARN org.apache.zookeeper.ClientCnxn 
Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect 
java.net.ConnectException: Connection refused 
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) 
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:735) 
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) 
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) 
6:29:28.408 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderadn3.alp.com/10.0.0.16:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:48.425 AM INFO org.apache.zookeeper.ClientCnxn 
Client session timed out, have not heard from server in 20017ms for sessionid 0x0, closing socket connection and attempting reconnect 
6:29:48.526 AM WARN org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper 
Possibly transient ZooKeeper exception: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase/root-region-server 
6:29:48.526 AM INFO org.apache.hadoop.hbase.util.RetryCounter 
Sleeping 8000ms before retry #3... 
6:29:49.526 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderaalpha.alp.com/10.0.0.13:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:49.529 AM WARN org.apache.zookeeper.ClientCnxn 
Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect 
java.net.ConnectException: Connection refused 
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) 
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:735) 
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) 
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) 
6:29:49.630 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderadn1.alp.com/10.0.0.14:2181. Will not attempt to authenticate using SASL (unknown error) 
6:29:49.630 AM WARN org.apache.zookeeper.ClientCnxn 
Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect 
java.net.ConnectException: Connection refused 
    at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) 
    at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:735) 
    at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:350) 
    at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1068) 
6:29:49.731 AM INFO org.apache.zookeeper.ClientCnxn 
Opening socket connection to server cderadn3.alp.com/10.0.0.16:2181. Will not attempt to authenticate using SASL (unknown error) 
6:30:09.744 AM INFO org.apache.zookeeper.ClientCnxn 
Client session timed out, have not heard from server in 20013ms for sessionid 0x0, closing socket connection and attempting reconnect 
6:30:09.845 AM WARN org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper 
Possibly transient ZooKeeper exception: org.apache.zookeeper.KeeperException$ConnectionLossException: KeeperErrorCode = ConnectionLoss for /hbase/root-region-server 
6:30:09.845 AM ERROR org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper 
ZooKeeper exists failed after 3 retries 

感謝您的關注...

回答

0

從日誌中我們可以看到它失敗了連接ZooKeeper服務器。您可能需要啓動ZooKeeper的服務,並嘗試作爲zhutoulala之前所說的重新啓動HBase的

0

,我建議過,那麼,如果它仍然沒有工作,你或許應該刪除的HBase的Z序節點:

使用來自運行HBase服務的服務器之一的'hbase zkcli'命令。一旦進入ZooKeeper CLI,您可以運行'rmr/hbase'來刪除znode。