無法下載HTML

我想下載這個網站無法下載HTML

我使用這個代碼：

  Document doc = null; 
     try { 
      doc =Jsoup.connect(link).userAgent("Mozilla").get(); 
     } catch (IOException e) { 
      // TODO Auto-generated catch block 
      e.printStackTrace(); 
     } 
Log.i ("html", doc.toString());

更新： ASLO試圖使用它：

HttpClient client = new DefaultHttpClient(); 
     HttpGet request = new HttpGet(link); 
     HttpResponse response = null; 
     try { 
      response = client.execute(request); 
     } catch (ClientProtocolException e1) { 
      // 
      e1.printStackTrace(); 
     } catch (IOException e1) { 
      // 
      e1.printStackTrace(); 
     } 


     InputStream in = null; 
     try { 
      in = response.getEntity().getContent(); 
     } catch (IllegalStateException e1) { 
      // 
      e1.printStackTrace(); 
     } catch (IOException e1) { 
      // 
      e1.printStackTrace(); 
     } 
     BufferedReader reader = null; 
     try { 
      reader = new BufferedReader(new InputStreamReader(in, "UTF-8")); 
     } catch (UnsupportedEncodingException e) { 
      // 
      e.printStackTrace(); 
     } 
     StringBuilder str = new StringBuilder(); 
     String line = null; 
     try { 
      while((line = reader.readLine()) != null) 
      { 
       str.append(line); 
      } 
     } catch (IOException e1) { 
      // 
      e1.printStackTrace(); 
     } 
     try { 
      in.close(); 
     } catch (IOException e1) { 
      // 
      e1.printStackTrace(); 
     } 
     String html = str.toString(); 
     Log.e("html", html);

再次響應像這樣：

  <html> 
<body> 
<script>document.cookie="BPC=f563534535121d5a1ba5bd1e153b"; 
    document.location.href="http://...link.../all?attempt=1";</script> 
</body> 
</html>

我找不到任何解決方案...頁面無法下載也許是因爲沒有cookie ......或者是什麼？

來源

2014-01-24 user36603

需要更多信息。「使用這個代碼」 - 爲什麼？那是什麼語言？看起來像一些Javascript，但是再次..它不。 – MortenMoulder

@Snorlax java和android見標籤 – user36603

是的，我明白了。它仍然沒有意義。「試圖下載這個html」是什麼意思？ - 你想用Java顯示它嗎？您是否嘗試使用Java下載並將其存儲在SDCard上？ – MortenMoulder

在腳本標籤，你有這樣的語句：

document.location.href="....link..../all?attempt=1";

通常它迫使瀏覽器的位置重新加載頁面。我認爲它實際上是你想要下載的頁面「.... link ...？attempt = 1」。

如果你不使用腳本中定義的cookie，它不能確定它會工作，但它值得一試。

來源

2014-01-24 10:09:24 Julien

WTF是這個網站！ ;）它在瀏覽器中的位置？ – Julien

感謝您的好建議，我只是重新連接到Cookie BPC = f563534535121d5a1ba5bd1e153b – user36603

回答

相關問題