我想下載一些遠程頁面。源代碼中有很長的一行。 curl和wget都下載文件,但決定錯過這一行。是否有另一個命令行實用程序,我可以使用和/或沒有人知道我可以如何解決這個問題。使用wget和捲曲的網頁奇怪的下載
編輯: 我可以澄清,我已經試過wget和捲曲,並且這兩個文件都錯過了這一行。
編輯:
[[email protected] scripts]$ curl --version
curl 7.15.5 (x86_64-redhat-linux-gnu) libcurl/7.15.5 OpenSSL/0.9.8b zlib/1.2.3 libidn/0.6.5
Protocols: tftp ftp telnet dict ldap http file https ftps
Features: GSS-Negotiate IDN IPv6 Largefile NTLM SSL libz
[[email protected] scripts]$ wget --version
GNU Wget 1.11.4 Red Hat modified
Copyright (C) 2008 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later
<http://www.gnu.org/licenses/gpl.html>.
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law.
Originally written by Hrvoje Niksic <[email protected]>.
Currently maintained by Micah Cowan <[email protected]>.
你怎麼知道這條線真的存在? – recursive 2010-09-09 14:11:34
我轉到http://mysite.com/page.html ...查看源代碼... – Simon 2010-09-09 14:14:30
如果您使用瀏覽器查看DOM源,它可能與實際中的內容不一樣源從服務器下載。例如,Javascript可以在文檔下載完成後對其進行轉換。 – recursive 2010-09-09 14:16:56