2011-02-24 158 views
9

爲了獲得更多上下文相關的搜索結果,我決定與lucene.net玩一玩,雖然我對它很陌生,但我發現它並不是我遇到的最直觀的圖書館。這並不是因爲缺乏相關的例子來幫助我解決問題。Lucene.net多字段搜索

我使用simple lucene建立我的指數,這似乎是完美的工作:

Field f = null; 
Document document = new Document(); 

document.Add(new Field("id", dl.Id.ToString(), Field.Store.YES, Field.Index.NOT_ANALYZED)); 

f = new Field("category", dl.CategoryName.ToLowerInvariant(), Field.Store.YES, Field.Index.ANALYZED, Field.TermVector.WITH_POSITIONS_OFFSETS); 
f.SetBoost(5); 
document.Add(f); 

f = new Field("company_name", dl.CompanyName.ToLowerInvariant(), Field.Store.YES, Field.Index.ANALYZED, Field.TermVector.WITH_POSITIONS_OFFSETS); 
f.SetBoost(2); 
document.Add(f); 

document.Add(new Field("description", dl.Description.ToLowerInvariant(), Field.Store.YES, Field.Index.ANALYZED, Field.TermVector.WITH_POSITIONS_OFFSETS)); 
document.Add(new Field("meta_keywords", dl.Meta_Keywords.ToLowerInvariant(), Field.Store.YES, Field.Index.ANALYZED, Field.TermVector.WITH_POSITIONS_OFFSETS)); 
document.Add(new Field("meta_description", dl.Meta_Description.ToLowerInvariant(), Field.Store.YES, Field.Index.ANALYZED, Field.TermVector.WITH_POSITIONS_OFFSETS)); 

//And a few more fields 

基於這個指數我第一次嘗試沿着這些線路查詢:

var whatParser = new MultiFieldQueryParser(
    global::Lucene.Net.Util.Version.LUCENE_29, 
    new string[] { "company_name", "description", "meta_keywords", "meta_description", "category" }, 
    analyzer); 

whatQuery = whatParser.Parse("search".ToLowerInvariant()); 

這個工作直到搜索詞變得超過1個單詞爲止。接下來是一個詞組查詢。

whatQuery = new PhraseQuery(); 
whatQuery.Add(new Term("company_name", what)); 
whatQuery.Add(new Term("description", what)); 
whatQuery.Add(new Term("meta_keywords", what)); 
whatQuery.Add(new Term("meta_description", what)); 
whatQuery.Add(new Term("category", what)); 

然後我發現扔了錯誤:All phrase terms must be in the same field

所以,我要去哪裏錯了?你有什麼建議如何解決它?如果有更好的建議,我可以完全改變搜索技術。

這可能是有用的一些額外的信息

  • 所有的結果都在一端通過排序new Sort(new SortField[] {new SortField("is_featured", SortField.STRING, true),SortField.FIELD_SCORE})
  • 有一些額外的搜索條件,以便每個查詢被添加到與發生設定爲SHOULD
  • 一個布爾查詢

感謝您的幫助。

+0

Hawxby,請你寫在這裏微調的代碼以供參考 – 2011-06-17 18:36:37

回答

3

我認爲這個問題是BooleanClause.Occur.SHOULD。我們用這樣的:

string[] fieldList = { "field1", "field2", "field3"; 

//for us the field list varies .. there are other ways to create this array of course 
List<BooleanClause.Occur> occurs = new List<BooleanClause.Occur>(); 
foreach (string field in fieldList) 
    occurs.Add(BooleanClause.Occur.SHOULD); 

if(!string.IsNullOrEmpty(multiWordPhrase)) 
{ 
    Query q = MultiFieldQueryParser.Parse(multiWordPhrase, fieldList, occurs.ToArray(), new StandardAnalyzer()); 
    return q; 
} 
+0

了一些調整這個工作完全之後。謝謝! – Hawxby