What is an efficient way to compare two lists in Java?

-1

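I have a program that fetches a list of Java objects from the DB, compares it with the old list retrieved earlier, finds the delta (the differing elements), and returns them. I would like to know whether there is a better way to do this than simply using Set operations such as Union() and Intersection(), and one that avoids out-of-memory errors. The lists can hold up to 200k elements. I am using Spring 3.2.8.RELEASE in my project.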

public class Tester { 

    private List<AddressInfo> oldListOfAddresses; 

    @Scheduled(cron="0 1 6 * * ?") // 6:01 AM every day 
    public Map<String, AddressInfo> getCompany() { 
     try { 
      Map<String, AddressInfo> companyMap = new HashMap<>(); 
      String sql = "Some sql query which return Address Info."; 
      List<AddressInfo> newListOfAddresses = jdbcTemplate.query(sql, new Object[0], 
        new FacilityNewMapper()); 
      if (newListOfAddresses == null || newListOfAddresses.isEmpty()) { 
       throw new FacilityLookUpException("List of clinic Info from facilities is empty..."); 
      } else { 

       // I have to find the delta of new list and old list here. 
       // I need an efficient (Space and Time) way of finding delta. 
       List<AddressInfo> deltaList = newListOfAddresses - oldListOfAddresses; //Something like this 

       for (AddressInfo comp : deltaList) { 
        if (comp != null) { 
         companyMap.put(comp.getLocationId(), comp); 
        } 
       } 
       oldListOfAddresses = newListOfAddresses; 
      } 
      return companyMap; 
     } catch (Exception e) { 
      throw new CompanyLookUpException(
        "List of company addresses is empty..." + e.getMessage()); 
     } 
    } 
}

The AddressInfo bean:

public class AddressInfo{ 

    private String locationId; 
    private String streetName; 
    private String city; 
    private String state; 
    private String country; 

    public String getLocationId() { 
     return locationId; 
    } 
    public void setLocationId(String locationId) { 
     this.locationId = locationId; 
    } 
    public String getStreetName() { 
     return streetName; 
    } 
    public void setStreetName(String streetName) { 
     this.streetName = streetName; 
    } 
    public String getCity() { 
     return city; 
    } 
    public void setCity(String city) { 
     this.city = city; 
    } 
    public String getState() { 
     return state; 
    } 
    public void setState(String state) { 
     this.state = state; 
    } 
    public String getCountry() { 
     return country; 
    } 
    public void setCountry(String country) { 
     this.country = country; 
    } 
    @Override 
    public int hashCode() { 
     final int prime = 31; 
     int result = 1; 
     result = prime * result + ((city == null) ? 0 : city.hashCode()); 
     result = prime * result + ((country == null) ? 0 : country.hashCode()); 
     result = prime * result + ((locationId == null) ? 0 : locationId.hashCode()); 
     result = prime * result + ((state == null) ? 0 : state.hashCode()); 
     result = prime * result + ((streetName == null) ? 0 : streetName.hashCode()); 
     return result; 
    } 
    @Override 
    public boolean equals(Object obj) { 
     if (this == obj) 
      return true; 
     if (obj == null) 
      return false; 
     if (getClass() != obj.getClass()) 
      return false; 
     AddressInfo other = (AddressInfo) obj; 
     if (city == null) { 
      if (other.city != null) 
       return false; 
     } else if (!city.equals(other.city)) 
      return false; 
     if (country == null) { 
      if (other.country != null) 
       return false; 
     } else if (!country.equals(other.country)) 
      return false; 
     if (locationId == null) { 
      if (other.locationId != null) 
       return false; 
     } else if (!locationId.equals(other.locationId)) 
      return false; 
     if (state == null) { 
      if (other.state != null) 
       return false; 
     } else if (!state.equals(other.state)) 
      return false; 
     if (streetName == null) { 
      if (other.streetName != null) 
       return false; 
     } else if (!streetName.equals(other.streetName)) 
      return false; 
     return true; 
    } 

}

Source

2017-09-03 Gautham.R

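Please explain *I would like to know whether there is a better way to do this than simply using Set operations such as Union() and Intersection(), and one that avoids out-of-memory errors?* – nullpointer

There is no "best" way. There are good approaches for different situations, depending on many factors (the size of the lists, the time needed to retrieve them, how many times the comparison is performed, and so on). – biziclop

Your question is incomplete. You have not specified what "comparing" two lists means, nor what "delta" means. First and foremost, note that your 'AddressInfo' class does not define an 'equals()' method. That means you cannot meaningfully compare two objects of this class, so what you are asking is impossible even in principle. Assuming you do provide an 'equals()', the next question is whether the lists can contain duplicates (based on 'equals()'). Then you must tell us whether the order of the elements matters for the comparison. –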

-1

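The best approach is indeed to use set operations. Add the old list to a set, then iterate over the new list and, for each item, check whether the set contains it; if it does not, add the item to the result. This gives you a running time of O(n*log(n)), as opposed to O(n^2) for the brute-force approach.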
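The steps described in this answer can be sketched as follows. This is only an illustration under stated assumptions: the class and method names (`SetLookupDelta`, `delta`) are made up for the example, and the element type is assumed to have consistent `equals()` and `hashCode()`. Note that with a `HashSet` each lookup is expected constant time, so the whole pass is roughly linear; the O(n*log(n)) figure would fit a tree-based set such as `TreeSet`.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper class, not part of the original question or answer.
class SetLookupDelta {

    // Returns the elements of newList that do not occur in oldList.
    // Relies on T having consistent equals() and hashCode().
    static <T> List<T> delta(List<T> newList, List<T> oldList) {
        Set<T> seen = new HashSet<>(oldList);   // one pass to index the old list
        List<T> result = new ArrayList<>();
        for (T item : newList) {
            if (!seen.contains(item)) {         // expected O(1) per lookup
                result.add(item);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> oldList = Arrays.asList("a", "b", "c");
        List<String> newList = Arrays.asList("b", "c", "d", "e");
        System.out.println(delta(newList, oldList)); // prints [d, e]
    }
}
```

Only the old list is copied into an extra structure here, so the additional memory is one hash set of references plus the result list.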

Source

2017-09-03 13:45:59 NiVeR

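What about using the 'removeAll' method of 'Collection'? I think it is very efficient. Or is that one of the approaches you are talking about? –

Well, you would apply that method to a set and probably get the same complexity. Going below that complexity is not possible. – NiVeR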
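A minimal sketch of the `removeAll` variant discussed in these comments, with hypothetical names (`RemoveAllDelta`, `delta`). One detail worth hedging on: the Javadoc for `AbstractSet.removeAll` says the implementation iterates over whichever of the two collections is smaller, and when it iterates over the set it calls `contains` on the argument. If the argument is a plain `List`, each of those calls is a linear scan, so the sketch below wraps the old list in a `HashSet` first, at the cost of extra memory.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper class, not part of the original thread.
class RemoveAllDelta {

    // Elements of newList that are absent from oldList, via Set.removeAll.
    static <T> Set<T> delta(List<T> newList, List<T> oldList) {
        Set<T> result = new HashSet<>(newList);
        // Wrapping oldList in a HashSet keeps each membership test cheap,
        // whichever side removeAll decides to iterate over.
        result.removeAll(new HashSet<>(oldList));
        return result;
    }

    public static void main(String[] args) {
        List<String> oldList = Arrays.asList("a", "b", "c");
        List<String> newList = Arrays.asList("b", "c", "d");
        System.out.println(delta(newList, oldList)); // prints [d]
    }
}
```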

-1

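I don't think so (note: I am assuming the order of the lists does not matter). For example, the fastest way without a set is to sort both lists, which costs O(n log n), then iterate over them together, comparing elements and keeping those that have no partner. With a Set, you basically iterate over every element and look it up in the second collection, so the iteration is O(n) and each lookup is O(1). In the end, O(n log n) > O(n), so the set wins.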
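For comparison, the sort-based alternative mentioned in this answer might look like the sketch below. It is an illustration only: the names are hypothetical, it keeps just the unpaired elements of the new list, and it assumes the elements are `Comparable` with an ordering consistent with `equals()`. `AddressInfo` as posted is not `Comparable`, so a comparator (for instance on `locationId` plus the other fields) would have to be supplied instead.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical helper class, not part of the original thread.
class SortedMergeDelta {

    // Elements of newList with no equal partner in oldList, found by
    // sorting copies of both lists and walking them with two indices.
    static <T extends Comparable<? super T>> List<T> delta(List<T> newList, List<T> oldList) {
        List<T> a = new ArrayList<>(newList);
        List<T> b = new ArrayList<>(oldList);
        Collections.sort(a);                 // O(n log n)
        Collections.sort(b);

        List<T> result = new ArrayList<>();
        int i = 0;
        int j = 0;
        while (i < a.size() && j < b.size()) {
            int cmp = a.get(i).compareTo(b.get(j));
            if (cmp < 0) {
                result.add(a.get(i++));      // only in the new list
            } else if (cmp > 0) {
                j++;                         // only in the old list; skip it
            } else {
                i++;                         // present in both lists
                j++;
            }
        }
        while (i < a.size()) {
            result.add(a.get(i++));          // leftovers have no partner
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> oldList = Arrays.asList("c", "a", "b");
        List<String> newList = Arrays.asList("e", "b", "d", "c");
        System.out.println(delta(newList, oldList)); // prints [d, e]
    }
}
```

Unlike the hash-based variants, this pairs duplicates one-to-one, and it copies both lists so the callers' lists are left unsorted.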

Source

2017-09-03 13:46:56 urag

-1

Assuming AddressInfo implements equals and hashCode properly, and that the items in each list are unique, the following function finds the delta in linear time:

Set<AddressInfo> findDiff(final List<AddressInfo> newListOfAddresses, final List<AddressInfo> oldListOfAddresses) { 
    Map< AddressInfo, Boolean > map = new HashMap<>(newListOfAddresses.size()); 

    for (AddressInfo addressInfo : newListOfAddresses) { 
     map.put(addressInfo, Boolean.TRUE); 
    } 

    for (AddressInfo addressInfo : oldListOfAddresses) { 
     map.remove(addressInfo); 
    } 

    return map.keySet(); 
}

Source

2017-09-03 13:56:25 alirabiee

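I agree; I think using a Set together with equals is a good way to solve the problem. –

You are creating a Map. Creating a map for a large number of objects may be overkill. A Set by itself should be fine. – nagendra547

@nagendra547 It really makes no difference; the internal implementation of HashSet is exactly the same. – alirabiee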

-1

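This should work for computing the difference between the two lists.

Here I create a set and add all the elements of newList to it. Then I remove every element that is also part of oldList.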

Set<AddressInfo> findDiffOfTwoList(List<AddressInfo> newList, List<AddressInfo> oldList) { 
    Set<AddressInfo> set = new HashSet<>(); 
    set.addAll(newList); 
    for(AddressInfo address:oldList){ 
     set.remove(address); 
    } 
    return set; 
}
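A small usage sketch may help show what this kind of delta does and does not report. Everything here is an assumption made for illustration: `Addr` is a hypothetical two-field stand-in for `AddressInfo`, and `findDiff` repeats the shape of the method above against that stand-in. Because equality covers every field, a changed address counts as new, while an entry that exists only in the old list is not reported at all.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Objects;
import java.util.Set;

// Hypothetical demo class, not part of the original thread.
class DiffUsageDemo {

    // Hypothetical two-field stand-in for AddressInfo.
    static final class Addr {
        final String locationId;
        final String city;

        Addr(String locationId, String city) {
            this.locationId = locationId;
            this.city = city;
        }

        @Override
        public boolean equals(Object o) {
            if (this == o) return true;
            if (!(o instanceof Addr)) return false;
            Addr other = (Addr) o;
            return Objects.equals(locationId, other.locationId)
                    && Objects.equals(city, other.city);
        }

        @Override
        public int hashCode() {
            return Objects.hash(locationId, city);
        }

        @Override
        public String toString() {
            return locationId + ":" + city;
        }
    }

    // Same approach as the answer: start from the new list, remove the old one.
    static Set<Addr> findDiff(List<Addr> newList, List<Addr> oldList) {
        Set<Addr> set = new HashSet<>(newList);
        for (Addr address : oldList) {
            set.remove(address);
        }
        return set;
    }

    public static void main(String[] args) {
        List<Addr> oldList = Arrays.asList(
                new Addr("1", "Pune"), new Addr("2", "Delhi"), new Addr("3", "Goa"));
        List<Addr> newList = Arrays.asList(
                new Addr("1", "Pune"), new Addr("2", "Mumbai"), new Addr("4", "Agra"));
        // Contains 2:Mumbai (changed) and 4:Agra (added); 3:Goa (removed) is absent.
        System.out.println(findDiff(newList, oldList));
    }
}
```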

Source

2017-09-03 14:47:16 nagendra547

Why the downvote? – nagendra547
