2010-09-11

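Problem integrating Apache Nutch (version 1.2) with Apache Solr (trunk): getting a Solr exception

I have configured solrindex-mapping.xml (Nutch) as well as my Solr schema.xml and solrconfig.xml. Each of the two works fine when run on its own, but when I run bin/nutch solrindex ... I get an exception: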

org.apache.solr.common.SolrException: Document [null] missing required field: id 

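I have configured the id field in all of the configuration files. In solrindex-mapping.xml it is mapped from url to id, and I have configured id in Solr as well. I don't know what is wrong. I added some log output to org.apache.nutch.indexer.solr.SolrWriter.java: a log.info call at the lines where the fields that were read are added to the SolrInputDocument. After building and running, the result is: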

2010-09-11 21:31:06,326 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - Key: segment, value: 20100911212934 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - Key: digest, value: bc315927b7c01c7a2905d5b6872bc35b 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - close() 

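As you can see, only 3 fields are read O_o. Does anyone know whether something is wrong with my configuration? I need to get Nutch running really quickly, because I am currently writing my bachelor thesis on this :/ (information integration of heterogeneous data sources in a local network)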

Regards,
perm =)

The rest of the log:

2010-09-11 21:31:06,079 INFO solr.SolrWriter - open() 
2010-09-11 21:31:06,280 INFO solr.SolrMappingReader - source: content dest: content 
2010-09-11 21:31:06,280 INFO solr.SolrMappingReader - source: site dest: site 
2010-09-11 21:31:06,280 INFO solr.SolrMappingReader - source: title dest: metadata_title 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: host dest: host 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: segment dest: segment 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: boost dest: boost 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: digest dest: digest 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: tstamp dest: metadata_last_modified 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: lastModified dest: metadata_last_modified 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: url dest: url 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: url dest: id 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: url dest: id 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - source: url dest: url 
2010-09-11 21:31:06,281 INFO solr.SolrMappingReader - uniqueKey = id 
2010-09-11 21:31:06,291 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,294 INFO solr.SolrWriter - Key: segment, value: 20100911212934 
2010-09-11 21:31:06,294 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,294 INFO solr.SolrWriter - Key: digest, value: 18abadd34a2bd71a8336fa5e8c6dbedb 
2010-09-11 21:31:06,306 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,306 INFO solr.SolrWriter - Key: segment, value: 20100911212934 
2010-09-11 21:31:06,306 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,306 INFO solr.SolrWriter - Key: digest, value: 3267fd5ea03852cdc83383635d133fad 
2010-09-11 21:31:06,310 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,310 INFO solr.SolrWriter - Key: segment, value: 20100911212934 
2010-09-11 21:31:06,310 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,311 INFO solr.SolrWriter - Key: digest, value: b61607602ab99eda5684adc9966349d6 
2010-09-11 21:31:06,314 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,314 INFO solr.SolrWriter - Key: segment, value: 20100911212851 
2010-09-11 21:31:06,314 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,314 INFO solr.SolrWriter - Key: digest, value: 9bdb8df3d1addf254203542dd22096d3 
2010-09-11 21:31:06,316 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,316 INFO solr.SolrWriter - Key: segment, value: 20100911212934 
2010-09-11 21:31:06,316 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,317 INFO solr.SolrWriter - Key: digest, value: 66eb3639ae15655bf91dc53208f95167 
2010-09-11 21:31:06,319 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,319 INFO solr.SolrWriter - Key: segment, value: 20100911212934 
2010-09-11 21:31:06,319 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,319 INFO solr.SolrWriter - Key: digest, value: 6e0501b52e204c2a68d9caa70dd0dfa9 
2010-09-11 21:31:06,326 INFO solr.SolrWriter - write() 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - Key: segment, value: 20100911212934 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - Key: boost, value: 1.0 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - Key: digest, value: bc315927b7c01c7a2905d5b6872bc35b 
2010-09-11 21:31:06,327 INFO solr.SolrWriter - close() 
2010-09-11 21:31:06,687 WARN mapred.LocalJobRunner - job_local_0001 
org.apache.solr.common.SolrException: Document [null] missing required field: id 
Document [null] missing required field: id 
request: http://127.0.0.1:8983/solr/update?wt=javabin&version=1 
     at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:424) 
     at org.apache.solr.client.solrj.impl.CommonsHttpSolrServer.request(CommonsHttpSolrServer.java:243) 
     at org.apache.solr.client.solrj.request.AbstractUpdateRequest.process(AbstractUpdateRequest.java:105) 
     at org.apache.solr.client.solrj.SolrServer.add(SolrServer.java:49) 
     at org.apache.nutch.indexer.solr.SolrWriter.close(SolrWriter.java:98) 
     at org.apache.nutch.indexer.IndexerOutputFormat$1.close(IndexerOutputFormat.java:48) 
     at org.apache.hadoop.mapred.ReduceTask.runOldReducer(ReduceTask.java:474) 
     at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:411) 
     at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216) 
2010-09-11 21:31:07,556 ERROR solr.SolrIndexer - java.io.IOException: Job failed! 

Answer

Nutch 1.2 does not work with Solr trunk...

From the Nutch mailing list (original post here)...

Does anyone know whether 1.2 works with the current Solr trunk?

It does not; it uses Solr 1.4.x. Solr trunk uses an incompatible API.
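
A note on reading the log in the question: the mapping lists url -> id, yet every write() shows only segment, boost and digest, so there is no url value that could be copied into id. The following is a minimal Java sketch of that check. It is not Nutch's or Solr's actual code. The class and method names are hypothetical, the mapping is a subset of the SolrMappingReader lines in the log, and passing unmapped fields through unchanged is an assumption made for this sketch.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class MappingCheck {

    // Hypothetical stand-in for the source -> destination mapping reported by
    // the SolrMappingReader log lines (only a subset is listed here).
    static Map<String, List<String>> exampleMapping() {
        Map<String, List<String>> mapping = new LinkedHashMap<>();
        mapping.put("segment", Collections.singletonList("segment"));
        mapping.put("boost", Collections.singletonList("boost"));
        mapping.put("digest", Collections.singletonList("digest"));
        mapping.put("url", Arrays.asList("url", "id")); // url feeds both url and id
        return mapping;
    }

    // Copy each source field that is present to all of its destination names.
    // Assumption for this sketch: unmapped fields keep their own name.
    static Map<String, String> applyMapping(Map<String, List<String>> mapping,
                                            Map<String, String> sourceFields) {
        Map<String, String> out = new LinkedHashMap<>();
        for (Map.Entry<String, String> field : sourceFields.entrySet()) {
            List<String> dests = mapping.getOrDefault(
                    field.getKey(), Collections.singletonList(field.getKey()));
            for (String dest : dests) {
                out.put(dest, field.getValue());
            }
        }
        return out;
    }

    // True if the mapped document carries the field Solr requires as unique key.
    static boolean hasUniqueKey(Map<String, String> doc, String uniqueKey) {
        return doc.containsKey(uniqueKey);
    }

    public static void main(String[] args) {
        // The three fields that the last write() in the log actually shows.
        Map<String, String> fromLog = new LinkedHashMap<>();
        fromLog.put("segment", "20100911212934");
        fromLog.put("boost", "1.0");
        fromLog.put("digest", "bc315927b7c01c7a2905d5b6872bc35b");

        Map<String, String> doc = applyMapping(exampleMapping(), fromLog);
        System.out.println("id present: " + hasUniqueKey(doc, "id"));
    }
}
```

With only the three logged fields, the sketch reports that id is absent, which matches the "missing required field: id" message. It says nothing about why the url field never reaches the writer; the mailing list reply above points to the Nutch 1.2 / Solr trunk version mismatch.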