2011-08-21 96 views
2

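Is it possible to extract only the text from an HTML file?

I want to strip all formatting and keep only the text, just as if a user opened any page of a website, pressed Ctrl + A and Ctrl + C, and then pasted everything into Notepad with Ctrl + V. In other words, I want to extract only the text. Let me use a website for a better explanation. This site: https://developer.palm.com/content/resources/develop/quick_start_ios.html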

And this is the output I want:

jump to navigation 
jump to content 

Showcase 
Why webOS 
The Opportunity 
Innovative Platform 
Cross-Platform 
HP Reach 
Vibrant Community 
Showcase 
Device Showcase 
App Showcase 
Developer Voices 
My Apps 
Resources 
Design 
Enyo Design Guide 
Advanced Application Guidelines 
webOS and Game Development 
Development 
Download the SDK 
Enyo from the Ground Up 
Enyo Tutorial 
Third-party Tools 
Developer Device Program 
PDK Development 
Unactivated Devices 
Glossary 
Distribution and Promotion 
Distributing with HP 
App Content Criteria 
App Submission Checklist 
International e-commerce FAQ 
Submit Your Enyo App 
Market Your App 
Promo codes 
In-App Purchase 
FAQs 
Developer Program FAQ 
International e-commerce FAQ 
PDK Technical FAQ 
Videos 
View All 
Community 
Connect 
Forums 
Developer Blog 
Events 
Twitter 
IRC 
RSS 
Resources 
Third-party Developers 
webOS on github 
Guide to Custom Feeds 
webOS101 (external) 
Community Sites 
mobspot 
Cyrket 
PreCentral 
webOS Roundup 
Documentation 
SDK Documentation 
Index 
Developer Guide 
API Reference 
Sign In Sign Up Search Form 
Search 
HomeResourcesQuick Start iOS 
Quick Start - iOS Developers 
Print 
Email 
Share 
If you've been developing for iOS® and are looking to expand your audience, we're here to help. Getting started with webOS is easy! If your current focus is OpenGL/SDL, then the transition will be simplicity itself. We have lots of great stories of developers porting their OpenGL apps very quickly. You can use the publicly available 3.0 SDK to do OpenGL/SDL development now with the included Plug-in Development Kit (PDK). Best of all, the PDK integrates nicely with Xcode. 

If your focus is web app development, you'll want to look at Enyo, our next-generation JavaScript framework, which is included in the 3.0 SDK. 

Ready to get started? 

Download the SDK 
It's free! (While you're at it, sign up for the Developer Program.) 

Try the Enyo tutorial or the OpenGL sample app 
Choose the sample that's most appropriate for your skill set. 

Check out our Resources pages 
Get more information on developing for webOS. Or go straight to the Reference section to get all the details. 
Quick Start Guide 

iOS Developers 
Web Developers 
C/C++ Developers 
Next Steps 

Sign up! 
Become a member of the webOS developer community 
Watch Dev Day videos 
See the talks from the NYC Dev Day 
Find a Developer 
Check out our list of third-party developers and designers 
Support 
We are here to help! 
Why webOS 
Business Case for webOS 
Success Stories 
App Showcase 
Contact Us 
Getting Started 
Join the HP webOS Developer Program 
Download the SDK/PDK 
Developing Your First App 
Videos 
webOS CONNECT Events 
MWC Developer Conference 
NYC Developer Day 
Podcasts 
Support 
Help 
FAQs 
Stay up to date 
About RSS Feeds 
Developer Blog 
© 2011 Hewlett-Packard Development Company, L.P. 
The information contained herein is subject to change without notice. All screen images simulated. HP Pre 3 planned availability this summer. Privacy Statement 
Supported browsers: Firefox 3.6+; Google Chrome 10+; Safari 5+; Internet Explorer 8+ 
Palm.comLegal NoticesContact Us 

Answers

3

I use lynx. Try this from a terminal:

lynx -dump http://www.google.com 
+0

This only works on Linux, and only if you have lynx installed. – Shoan

6

This should work:

<?php 

echo strip_tags(file_get_contents("https://developer.palm.com/content/resources/develop/quick_start_ios.html")); 

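That's the general idea. To get better formatting, you can do something like str_replace('<br/>', "\n", $html) on the fetched HTML before passing it to strip_tags. Note the double quotes: in PHP, '\n' in single quotes is a literal backslash followed by n, not a newline.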
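A minimal sketch of that idea, assuming the HTML is already in a string. The helper name html_to_text and the sample markup are hypothetical and not part of the original answer; the choice of which closing tags become line breaks is also an assumption. It turns <br> and a few block-level closing tags into newlines and removes script/style blocks before calling strip_tags, because strip_tags removes only the tags and would leave their contents in the output.

```php
<?php
// Hypothetical helper for illustration; not part of the original answer.
function html_to_text(string $html): string
{
    // Drop <script> and <style> blocks entirely, including their contents.
    $html = preg_replace('#<(script|style)\b[^>]*>.*?</\1>#is', '', $html);
    // Turn <br> and the closing tags of some block elements into newlines.
    $html = preg_replace('#<br\s*/?>|</(p|div|li|h[1-6]|tr)>#i', "\n", $html);
    // Remove the remaining tags, then decode entities such as &#39;.
    $text = html_entity_decode(strip_tags($html), ENT_QUOTES | ENT_HTML5, 'UTF-8');
    // Trim every line and drop the empty ones.
    $lines = array_filter(array_map('trim', explode("\n", $text)), 'strlen');
    return implode("\n", $lines);
}

// Made-up sample input; a real page would come from file_get_contents().
$sample = '<html><head><style>p{color:red}</style></head><body>'
    . '<h1>Quick Start</h1><p>Download the SDK<br/>It&#39;s free!</p>'
    . '<script>var x = 1;</script></body></html>';

echo html_to_text($sample), "\n";
```

Regular expressions are only a rough fit for HTML, so this is best treated as a quick cleanup step rather than a robust parser.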

+3

Why the downvote? The question is tagged php, and the code above works... Whoever downvoted, please explain. – arunkumar

0

Another way to do this is to retrieve the value of the page's body tag:

$html = new DOMDocument(); 

$html->loadHTMLFile("https://developer.palm.com/content/resources/develop/quick_start_ios.html"); 

$body = $html->getElementsByTagName("body"); 
$body = $body->item(0); 

echo $body->nodeValue; 
0

You can use the document tree to do this: just keep all the text nodes and discard all the element nodes.

You can implement this with JavaScript, or with C++ and WebKit.
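The same text-node idea can also be sketched in PHP, the language the question is tagged with, using DOMDocument and DOMXPath instead of JavaScript or WebKit. The function name text_nodes and the sample markup are hypothetical. Skipping text inside script and style elements is an added assumption about what counts as visible text.

```php
<?php
// Hypothetical helper for illustration; not taken from the original answers.
function text_nodes(string $html): array
{
    $doc = new DOMDocument();
    // Collect parser warnings internally instead of printing them.
    libxml_use_internal_errors(true);
    $doc->loadHTML($html);
    libxml_clear_errors();

    $xpath = new DOMXPath($doc);
    // Every text node under <body>, except those inside <script> or <style>.
    $nodes = $xpath->query(
        '//body//text()[not(ancestor::script) and not(ancestor::style)]'
    );

    $texts = [];
    foreach ($nodes as $node) {
        $value = trim($node->nodeValue);
        if ($value !== '') {
            $texts[] = $value; // keep only non-empty text
        }
    }
    return $texts;
}

// Made-up sample input; a real page could be loaded with loadHTMLFile().
$sample = '<html><body><ul><li>Showcase</li><li>Why webOS</li></ul>'
    . '<script>var x = 1;</script><p>Ready to <b>get started</b>?</p></body></html>';

echo implode("\n", text_nodes($sample)), "\n";
```

Each text node becomes its own line here, so inline elements such as <b> split a sentence across lines. Joining the pieces with a space instead would keep sentences together at the cost of losing the page's line structure.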
