2013-07-08 100 views
1

我無法通過ftp從此html文件中獲取html文本。我使用美麗的湯通過http/https閱讀html文件,但由於某些原因,我無法從ftp下載/讀取。請幫忙!如何通過ftp url下載/閱讀html文件?

這是網址。 a link

這是我的代碼到目前爲止。

BufferedReader reader = null; 
String total = ""; 
String line; 
ur = "ftp://ftp.legis.state.tx.us/bills/832/billtext/html/house_resolutions/HR00001_HR00099/HR00014I.htm" 
try { 
    URL url = new URL(ur); 
    URLConnection urlc = url.openConnection(); 
    InputStream is = urlc.getInputStream(); // To download 
    reader = new BufferedReader(new InputStreamReader(is, "UTF-8")); 
     while ((line = reader.readLine()) != null) 
      total += reader.readLine(); 

} finally { 
    if (reader != null) 
     try { reader.close(); 
     } catch (IOException logOrIgnore) {} 
} 
+1

你可以發佈錯誤的堆棧跟蹤? –

回答

0

首先想到這是關係到一個錯誤的道路分辨率discussed here但這並沒有幫助。

我不知道這裏究竟發生了什麼問題,但是我只能在這個ftp-server和MacOS Java 1.6.0_33-b03-424上重現這個錯誤。我無法用Java 1.7.0_25重現它。所以也許你會檢查一下Java更新。

或者你可以使用commons FTPClient檢索文件:

FTPClient client = new FTPClient(); 
client.connect("ftp.legis.state.tx.us"); 
client.enterLocalPassiveMode(); 
client.login("anonymous", ""); 
client.changeWorkingDirectory("bills/832/billtext/html/house_resolutions/HR00001_HR00099"); 
InputStream is = client.retrieveFileStream("HR00014I.htm"); 
1

此代碼爲我工作,爪哇1.7.0_25。請注意,您正在存儲每兩行中的一行,在條件和while循環的主體中都調用reader.readLine()

public static void main(String[] args) throws MalformedURLException, IOException { 
    BufferedReader reader = null; 
    String total = ""; 
    String line; 
    String ur = "ftp://ftp.legis.state.tx.us/bills/832/billtext/html/house_resolutions/HR00001_HR00099/HR00014I.htm"; 
    try { 
     URL url = new URL(ur); 
     URLConnection urlc = url.openConnection(); 
     InputStream is = urlc.getInputStream(); // To download 
     reader = new BufferedReader(new InputStreamReader(is, "UTF-8")); 
     while ((line = reader.readLine()) != null) { 
      total += line; 
     } 
    } finally { 
     if (reader != null) { 
      try { 
       reader.close(); 
      } catch (IOException logOrIgnore) { 
      } 
     } 
    } 
}