在Haskell中解壓縮GZip

haskell

2012-04-09 200 views 5 likes

我很難搞清楚這一點。這是我想要的：在Haskell中解壓縮GZip

ghci> :m +System.FileArchive.GZip -- From the "MissingH" package 
ghci> fmap decompress $ readFile "test.html.gz" 
*** Exception: test.html.gz: hGetContents: invalid argument (invalid byte sequence)

爲什麼我會得到那個異常？

我也試過Codec.Compression.GZip.decompress從zlib package，但我不能得到類型String而不是ByteString。

來源

2012-04-09 Snowball

這不是一個完整的答案，但可能'readFile'試圖解碼'test.html.gz'，就好像它是在你的系統編碼中的文本編碼一樣。改用二進制讀取。 – 2012-04-10 00:55:45

回答

從ByteString到String的轉換取決於壓縮文件的字符編碼，但假設它是ASCII或Latin-1的，這應該工作：

import Codec.Compression.GZip (decompress) 
import qualified Data.ByteString.Lazy as LBS 
import Data.ByteString.Lazy.Char8 (unpack) 

readGZipFile :: FilePath -> IO String 
readGZipFile path = fmap (unpack . decompress) $ LBS.readFile path

如果你需要一些其他編碼類似的工作UTF-8，用適當的解碼功能代替unpack，例如Data.ByteString.Lazy.UTF8.toString。

當然，如果您正在解壓縮的文件不是文本文件，最好將其保存爲ByteString。

來源

2012-04-10 01:15:42 hammar

如果是，解壓縮然後解碼爲文本 – alternative 2012-04-10 01:23:31

相關問題

11. 在Dynamics AX X ++中解壓縮GZip流
12. 在解析中Gzip壓縮PDF文件
13. 在Racket中解壓縮gzip html
14. 本地gzip解壓縮鉻在javascript中
15. Vala解壓縮gzip數據
16. 爲silverlight解壓縮gzip流
17. 解壓縮gzip http請求
18. 解壓縮gzip http響應
19. node.js gzip解壓縮xmlhttprequesr.responseText
20. 如何在內存中解壓縮GZip壓縮文件？
21. GZIP串壓縮不解壓「£」字符
22. AppEngine gzip壓縮
23. TYPO3 gzip壓縮
24. javascript gzip壓縮
25. gzip壓縮
26. WP8 Gzip壓縮
27. 解壓縮GZIP字符串中的Java
28. GZip解壓縮停止在任意點
29. Java-在gzip解壓縮上的說明
30. 在Hadoop/PIG中壓縮/解壓gzip數據是否透明？