Garbage values while reading an HTML body using C#

2014-10-29 50 views 0 likes

I have an HTML file whose content looks like this:

<HTML> 
<BODY> 
... 
........ company's Chief Financial Officer. Now the....... 
... 
</BODY> 
</HTML>

I read the contents of this file with:

StringBuilder stringBuilder = new StringBuilder(); 
using (StreamReader sr = new StreamReader(filePath)) 
{ 
    String line = sr.ReadToEnd(); 
    stringBuilder.Append(line); 
} 
strFileContent = stringBuilder.ToString();

But it returns the string as:

........company sChief FinancialOfficer. Now..... ..

The HTML file is on my local system.


2014-10-29 Aquarius24

What is the file's encoding? Try specifying the encoding explicitly; otherwise 'StreamReader' defaults to 'UTF8'. – 2014-10-29 06:43:02

@Sriram, the current encoding is charset=windows-1252. I think that is what causes the problem – Aquarius24 2014-10-29 06:50:32

Answers

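You need to read the file with the same encoding that was used to create it. StreamReader uses UTF8 by default and tries to decode the file with that encoding, but the file was actually written as windows-1252 (as you said in the comments). Bytes that are valid in windows-1252 but do not form a valid UTF8 sequence cannot be decoded, so reading with the wrong encoding produces garbage.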
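To make the mismatch concrete, here is a minimal sketch. It assumes the apostrophe in "company's" is the curly quote U+2019, which windows-1252 stores as the single byte 0x92; the question does not show the raw bytes, so that is a guess. The class and method names are made up for illustration, and the `RegisterProvider` call assumes .NET Core / .NET 5+, where code page 1252 is not available until the code pages provider is registered.

```csharp
using System;
using System.Text;

public static class EncodingMismatchDemo
{
    // Assumed bytes for "company’s" as windows-1252 would store them:
    // the curly apostrophe (U+2019) is the single byte 0x92.
    public static readonly byte[] Sample =
    {
        0x63, 0x6F, 0x6D, 0x70, 0x61, 0x6E, 0x79, 0x92, 0x73
    };

    // Decode with UTF-8, which is what StreamReader does by default.
    // A lone 0x92 is not a valid UTF-8 sequence, so it is replaced with U+FFFD.
    public static string DecodeAsUtf8(byte[] bytes)
    {
        return Encoding.UTF8.GetString(bytes);
    }

    // Decode with windows-1252, the encoding the file was written in.
    public static string DecodeAs1252(byte[] bytes)
    {
        // Needed on .NET Core / .NET 5+ before code page 1252 can be looked up.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);
        return Encoding.GetEncoding(1252).GetString(bytes);
    }

    public static void Main()
    {
        Console.WriteLine(DecodeAsUtf8(Sample));  // apostrophe becomes U+FFFD
        Console.WriteLine(DecodeAs1252(Sample));  // apostrophe decoded as U+2019
    }
}
```

The replacement character U+FFFD often renders as a blank or a box, which would be consistent with the "company s" output in the question.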

You should state the file's encoding explicitly. Here is how to do it.

var encoding = Encoding.GetEncoding(1252);//windows-1252 
using (StreamReader sr = new StreamReader(filePath, encoding)) 
...
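For completeness, the snippet above could be filled out roughly as follows. This is only a sketch: `Windows1252Reader` and `ReadAllText1252` are hypothetical names, the file path is taken from the command line, and the `RegisterProvider` line assumes .NET Core / .NET 5+ (I don't think it is needed on the classic .NET Framework, where code page 1252 is built in).

```csharp
using System;
using System.IO;
using System.Text;

public static class Windows1252Reader
{
    // Reads the whole file, decoding its bytes as windows-1252 (code page 1252).
    public static string ReadAllText1252(string filePath)
    {
        // Assumption: running on .NET Core / .NET 5+, where legacy code pages
        // must be registered before Encoding.GetEncoding(1252) will succeed.
        Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);
        Encoding encoding = Encoding.GetEncoding(1252);

        using (StreamReader sr = new StreamReader(filePath, encoding))
        {
            return sr.ReadToEnd();
        }
    }

    public static void Main(string[] args)
    {
        if (args.Length != 1)
        {
            Console.Error.WriteLine("usage: Windows1252Reader <path-to-html-file>");
            return;
        }

        string strFileContent = ReadAllText1252(args[0]);
        Console.WriteLine(strFileContent);
    }
}
```

Since `ReadToEnd` already returns the full text, the StringBuilder from the question is not needed here.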



2014-10-29 06:58:56

Thanks! Worth learning (Y) – Aquarius24 2014-10-29 07:04:43

You have to set the encoding in the StreamReader, like this:

using (StreamReader sr = new StreamReader(filePath, Encoding.UTF8))


2014-10-29 06:55:48
