0
在sbt控制檯的工程中使用Crawler4j。當使用SBT-組件來創建一個fatjar提卡(?)似乎不再與Crawler4J在FatJar中沒有檢測到Tika與sbt-assembly的編碼
java -jar crawler.jar
什麼提卡缺少檢測編碼啓動時能夠檢測到的頁面編碼?
ERROR edu.uci.ics.crawler4j.parser.Parser - Failed to detect the character
encoding of a document, while parsing
合併策略是
assemblyMergeStrategy in assembly := {
case PathList("META-INF", xs @ _*) => MergeStrategy.discard
case _ => MergeStrategy.first
}