2013-12-16 53 views
0

我想用TikaEntityProcessor安排Delta導入。完全導入工作正常,但Delta導入沒有更新任何東西。也沒有錯誤。 這多少服務器日誌顯示得,我無法找出什麼地方出了錯:solr delta導入不與TikaEntityProcessor配合使用

 
121151 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Starting delta collection. 
121155 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Running ModifiedRowKey() for Entity: message 
121156 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Completed ModifiedRowKey for Entity: message rows obtained : 0 
121156 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Completed DeletedRowKey for Entity: message rows obtained : 0 
121156 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Completed parentDeltaQuery for Entity: message 
121156 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Running ModifiedRowKey() for Entity: messages 
121157 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.JdbcDataSource û Creating a connection for entity messages with URL: jdbc:oracle:thin:@//172.16.29.92:1521/d11gr21 
121176 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.JdbcDataSource û Time taken for getConnection(): 19 
121182 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Completed ModifiedRowKey for Entity: messages rows obtained : 1 
121182 [qtp966396367-15] INFO org.apache.solr.handler.dataimport.DocBuilder û Completed DeletedRowKey for Entity: messages rows obtained : 0 

我dataconfig.xml如下:

<document> 

    <entity name="messages" pk="BLOB_PK" transformer='DateFormatTransformer' 
    query="select * from BLOB_TEST" 
    deltaImportQuery="select * from BLOB_TEST where BLOB_PK='${dataimporter.delta.id}'" 
    deltaQuery="select BLOB_PK from BLOB_TEST where to_char(last_modified,'YYYY-MM-DD HH24:MI:SS') &gt; '${dataimporter.last_index_time}' " 
    dataSource="db"> 
    <field column ="BLOB_PK" name ="id" /> 
    <field column="last_modified" dateTimeFormat="YYYY-MM-DD HH24:MI:SS" locale="en" /> 
    <entity 
     name="message" 
     dataSource="dastream" 
      processor="TikaEntityProcessor" 
     url="message" 
     dataField="messages.MESSAGE" 
     format="text"> 

     <field column="text" name="mxMsg" blob="true" /> 
     </entity> 
    </entity> 

</document> 

當我從手動運行增量導入Web的客戶端狀態顯示是這樣的:

"statusMessages": { "Total Requests made to DataSource": "4", "Total Rows Fetched": "3", "Total Documents Skipped": "0", "Delta Dump started": "2013-12-16 14:48:28", "Identifying Delta": "2013-12-16 14:48:28", "Deltas Obtained": "2013-12-16 14:48:28", "Building documents": "2013-12-16 14:48:28", "Total Changed Documents": "3", "Total Documents Processed": "0", "Time taken": "0:0:0.50" }

回答

0

我能夠使它發揮作用。 我不得不刪除從數據-config.xml中如下:其中BLOB_PK ='$ {} dataimporter.delta.id

我有$ {dataimporter沒有配置從BLOB_TEST

deltaImportQuery =「SELECT *。 delta.id}因此可能是因爲即使在檢測到添加的行的正確數量之後,也沒有任何內容被索引。

相關問題