2013-05-16 17 views
1

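Error when running Nutch 1.6 with Hadoop 0.20.2 in Eclipse on Windows

I followed this page to set up my Nutch code in Eclipse: http://wiki.apache.org/nutch/RunNutchInEclipse but got the error "java.io.IOException: Failed to set permissions of path: \tmp\hadoop". That made me switch to Hadoop 0.20.2, but after changing the Hadoop jar I am stuck on this error: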

2013-05-17 00:44:17,742 WARN crawl.Crawl (Crawl.java:run(97)) - solrUrl is not set, indexing will be skipped... 
2013-05-17 00:44:17,876 INFO crawl.Crawl (Crawl.java:run(108)) - crawl started in: crawl 
2013-05-17 00:44:17,876 INFO crawl.Crawl (Crawl.java:run(109)) - rootUrlDir = urls 
2013-05-17 00:44:17,876 INFO crawl.Crawl (Crawl.java:run(110)) - threads = 10 
2013-05-17 00:44:17,876 INFO crawl.Crawl (Crawl.java:run(111)) - depth = 3 
2013-05-17 00:44:17,877 INFO crawl.Crawl (Crawl.java:run(112)) - solrUrl=null 
2013-05-17 00:44:17,877 INFO crawl.Crawl (Crawl.java:run(114)) - topN = 50 
2013-05-17 00:44:17,888 INFO crawl.Injector (Injector.java:inject(257)) - Injector: starting at 2013-05-17 00:44:17 
2013-05-17 00:44:17,888 INFO crawl.Injector (Injector.java:inject(258)) - Injector: crawlDb: crawl/crawldb 
2013-05-17 00:44:17,888 INFO crawl.Injector (Injector.java:inject(259)) - Injector: urlDir: urls 
2013-05-17 00:44:17,936 INFO crawl.Injector (Injector.java:inject(269)) - Injector: Converting injected urls to crawl db entries. 
2013-05-17 00:44:17,961 INFO jvm.JvmMetrics (JvmMetrics.java:init(71)) - Initializing JVM Metrics with processName=JobTracker, sessionId= 
2013-05-17 00:44:18,144 WARN mapred.JobClient (JobClient.java:configureCommandLineOptions(661)) - No job jar file set. User classes may not be found. See JobConf(Class) or JobConf#setJar(String). 
2013-05-17 00:44:18,176 INFO mapred.FileInputFormat (FileInputFormat.java:listStatus(192)) - Total input paths to process : 1 
2013-05-17 00:44:18,519 INFO mapred.JobClient (JobClient.java:monitorAndPrintJob(1275)) - Running job: job_local_0001 
2013-05-17 00:44:18,521 INFO mapred.FileInputFormat (FileInputFormat.java:listStatus(192)) - Total input paths to process : 1 
2013-05-17 00:44:18,573 INFO mapred.MapTask (MapTask.java:runOldMapper(347)) - numReduceTasks: 1 
2013-05-17 00:44:18,578 INFO mapred.MapTask (MapTask.java:<init>(776)) - io.sort.mb = 100 
2013-05-17 00:44:18,601 INFO mapred.MapTask (MapTask.java:<init>(788)) - data buffer = 79691776/99614720 
2013-05-17 00:44:18,601 INFO mapred.MapTask (MapTask.java:<init>(789)) - record buffer = 262144/327680 
2013-05-17 00:44:18,611 WARN plugin.PluginRepository (PluginManifestParser.java:getPluginFolder(123)) - Plugins: directory not found: plugins 
2013-05-17 00:44:18,612 INFO plugin.PluginRepository (PluginRepository.java:displayStatus(313)) - Plugin Auto-activation mode: [true] 
2013-05-17 00:44:18,612 INFO plugin.PluginRepository (PluginRepository.java:displayStatus(314)) - Registered Plugins: 
2013-05-17 00:44:18,612 INFO plugin.PluginRepository (PluginRepository.java:displayStatus(317)) - NONE 
2013-05-17 00:44:18,613 INFO plugin.PluginRepository (PluginRepository.java:displayStatus(324)) - Registered Extension-Points: 
2013-05-17 00:44:18,613 INFO plugin.PluginRepository (PluginRepository.java:displayStatus(326)) - NONE 
2013-05-17 00:44:18,615 WARN mapred.LocalJobRunner (LocalJobRunner.java:run(256)) - job_local_0001 
java.lang.RuntimeException: Error in configuring object 
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:93) 
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64) 
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117) 
    at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:354) 
    at org.apache.hadoop.mapred.MapTask.run(MapTask.java:307) 
    at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:177) 
Caused by: java.lang.reflect.InvocationTargetException 
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
    at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) 
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) 
    at java.lang.reflect.Method.invoke(Unknown Source) 
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88) 
    ... 5 more 
Caused by: java.lang.RuntimeException: Error in configuring object 
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:93) 
    at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:64) 
    at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117) 
    at org.apache.hadoop.mapred.MapRunner.configure(MapRunner.java:34) 
    ... 10 more 
Caused by: java.lang.reflect.InvocationTargetException 
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) 
    at sun.reflect.NativeMethodAccessorImpl.invoke(Unknown Source) 
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(Unknown Source) 
    at java.lang.reflect.Method.invoke(Unknown Source) 
    at org.apache.hadoop.util.ReflectionUtils.setJobConf(ReflectionUtils.java:88) 
    ... 13 more 
Caused by: java.lang.RuntimeException: x point org.apache.nutch.net.URLNormalizer not found. 
    at org.apache.nutch.net.URLNormalizers.<init>(URLNormalizers.java:123) 
    at org.apache.nutch.crawl.Injector$InjectMapper.configure(Injector.java:74) 
    ... 18 more 
2013-05-17 00:44:19,520 INFO mapred.JobClient (JobClient.java:monitorAndPrintJob(1288)) - map 0% reduce 0% 
2013-05-17 00:44:19,523 INFO mapred.JobClient (JobClient.java:monitorAndPrintJob(1343)) - Job complete: job_local_0001 
2013-05-17 00:44:19,524 INFO mapred.JobClient (Counters.java:log(514)) - Counters: 0 
Exception in thread "main" java.io.IOException: Job failed! 
    at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:1252) 
    at org.apache.nutch.crawl.Injector.inject(Injector.java:281) 
    at org.apache.nutch.crawl.Crawl.run(Crawl.java:132) 
    at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65) 
    at org.apache.nutch.crawl.Crawl.main(Crawl.java:55) 

I have searched a lot but have not found a workable solution. Please advise.

+0

What happened with this? –

+0

Are you trying to run this on Windows? The latest Nutch version for Windows is 1.2. Try this tutorial: http://wiki.apache.org/nutch/FabioGiavazzi/HowtoGettingNutchRunningonWindows – nimeshjm

+0

Have you checked the Nutch plugins? Did your build complete correctly? – cguzel

Answers

4

2013-05-17 00:44:18,611 WARN plugin.PluginRepository (PluginManifestParser.java:getPluginFolder(123)) - Plugins: directory not found: plugins

Check your plugins directory, and set the path to the plugins folder in nutch-site.xml. The path goes inside the <value></value> tag, as shown below:

<property> 
    <name>plugin.folders</name> 
    <value>/home/YOUR-USER/nutch/build/plugins</value> 
</property> 
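
Before rerunning the crawl, it may help to confirm that the folder named in plugin.folders actually exists and holds plugins. The following is only a minimal sketch, not part of Nutch: the class name PluginFolderCheck and the method findPlugins are hypothetical, and it assumes Java 7 or later and that each plugin sits in its own subdirectory containing a plugin.xml manifest. A relative value such as "plugins" is resolved against the working directory of the Eclipse run configuration, which is a likely reason for the "directory not found" warning.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical helper (not a Nutch class): lists plugin subdirectories under a folder.
class PluginFolderCheck {

    // Returns the sorted names of immediate subdirectories that contain a plugin.xml file.
    // Returns an empty list if pluginRoot does not exist or is not a directory.
    static List<String> findPlugins(Path pluginRoot) throws IOException {
        List<String> found = new ArrayList<>();
        if (!Files.isDirectory(pluginRoot)) {
            return found;
        }
        try (DirectoryStream<Path> entries = Files.newDirectoryStream(pluginRoot)) {
            for (Path entry : entries) {
                if (Files.isDirectory(entry)
                        && Files.isRegularFile(entry.resolve("plugin.xml"))) {
                    found.add(entry.getFileName().toString());
                }
            }
        }
        Collections.sort(found);
        return found;
    }

    public static void main(String[] args) throws IOException {
        // Assumption: the first argument is the same value you put in plugin.folders.
        Path root = Paths.get(args.length > 0 ? args[0] : "plugins");
        System.out.println("Resolved path: " + root.toAbsolutePath());
        List<String> plugins = findPlugins(root);
        if (plugins.isEmpty()) {
            System.out.println("No plugin subdirectories with plugin.xml found.");
        } else {
            System.out.println("Found " + plugins.size() + " plugin(s): " + plugins);
        }
    }
}
```

If the list comes back empty for the path you configured, the plugins were probably not built into that folder, or the path points somewhere other than where the build placed them.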
+1

I have an exploded WAR directory where nutch.2.1.1.jar resides in WEB-INF/lib. I have set my plugin directory to /home/USER/apache-tomcat/webapps/myapp-war/WEB-INF/lib. It still fails to pick up the plugins directory, as shown below:
WARN org.apache.nutch.plugin.PluginRepository - Plugins: directory not found: plugins
INFO org.apache.nutch.plugin.PluginRepository - Plugin Auto-activation mode: [true]
INFO org.apache.nutch.plugin.PluginRepository - Registered Plugins:
INFO org.apache.nutch.plugin.PluginRepository - NONE
Any suggestions would be appreciated. – Kartik

+0

The problem still persists. I need advice. – Kumar