2013-08-05 47 views

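How to dump clusters to the local file system using the Mahout k-means algorithm

I followed this link (http://sujitpal.blogspot.in/2012/09/learning-mahout-clustering.html) to create the initial clusters and the k-means clusters as described there, but when I try to dump the clusters to the local file system I get the following error: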

hadoop binary is not in PATH,HADOOP_HOME/bin,HADOOP_PREFIX/bin, running locally 
SLF4J: Class path contains multiple SLF4J bindings. 
SLF4J: Found binding in [jar:file:/opt/mahout/mahout-examples-0.8-job.jar!/org/slf4j/impl/StaticLoggerBinder.class] 
SLF4J: Found binding in [jar:file:/opt/mahout/lib/slf4j-jcl-1.7.5.jar!/org/slf4j/impl/StaticLoggerBinder.class] 
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation. 
SLF4J: Actual binding is of type [org.slf4j.impl.JCLLoggerFactory] 
Aug 5, 2013 1:21:51 PM org.slf4j.impl.JCLLoggerAdapter info 
INFO: Command line arguments: {--dictionary=[vectorfiles/dictionary.file-0], --dictionaryType=[seqfiles], --distanceMeasure=[org.apache.mahout.common.distance.SquaredEuclideanDistanceMeasure], --endPhase=[2147483647], --input=[kmeans-clusters], --numWords=[10], --output=[cluster.txt], --outputFormat=[TEXT], --startPhase=[0], --tempDir=[temp]} 
Exception in thread "main" java.lang.IllegalArgumentException: Invalid dictionary format 
     at org.apache.mahout.utils.clustering.ClusterDumper.printClusters(ClusterDumper.java:169) 
     at org.apache.mahout.utils.clustering.ClusterDumper.run(ClusterDumper.java:156) 
     at org.apache.mahout.utils.clustering.ClusterDumper.main(ClusterDumper.java:100) 
     at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
     at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) 
     at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) 
     at java.lang.reflect.Method.invoke(Method.java:597) 
     at org.apache.hadoop.util.ProgramDriver$ProgramDescription.invoke(ProgramDriver.java:68) 
     at org.apache.hadoop.util.ProgramDriver.driver(ProgramDriver.java:139) 
     at org.apache.mahout.driver.MahoutDriver.main(MahoutDriver.java:194) 

The command I used to dump the clusters:

mahout clusterdump -i kmeans-clusters -d vectorfiles/dictionary.file-* -dt seqfiles -n 10 -o cluster.txt 

Please suggest how I can get this to work.

Thanks

Answer

I used the following command and it worked:

mahout clusterdump -dt sequencefile -d vectorfiles/dictionary.file-0 -i kmeans-clusters/clusters-1-final -o Result.txt -b 10 -n 10
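
The log in the question shows --dictionaryType=[seqfiles], while the command that worked passes sequencefile. As far as I can tell, ClusterDumper in this Mahout version accepts only text or sequencefile for -dt and throws "Invalid dictionary format" for anything else. Below is a minimal sketch of a guard that checks the value before calling mahout. The function name valid_dict_type and the DICT_TYPE variable are hypothetical, and the list of accepted values is an assumption based on that reading of the source.

```shell
# Hypothetical helper: succeed only for dictionary types that clusterdump
# is assumed to accept ("text" or "sequencefile").
valid_dict_type() {
  case "$1" in
    text|sequencefile) return 0 ;;
    *) return 1 ;;
  esac
}

DICT_TYPE="sequencefile"

if valid_dict_type "$DICT_TYPE"; then
  # Safe to pass this value on, e.g.: mahout clusterdump -dt "$DICT_TYPE" ...
  echo "dictionary type accepted: $DICT_TYPE"
else
  echo "unsupported dictionary type: $DICT_TYPE" >&2
fi
```

With DICT_TYPE set to seqfiles, as in the question, the guard takes the else branch instead of letting the job fail with a stack trace.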
