Removing leftover HTML tags

I have a lab assignment and I'm stuck on a problem with removing HTML tags. Here is my method for removing the tags:
public String getFilteredPageContents() {
    String str = getUnfilteredPageContents();
    String temp = "";
    boolean b = false;
    for(int i = 0; i<str.length(); i++) {
        if(str.charAt(i) == '&' || str.charAt(i) == '<') {
            b = true;
        }
        if(b == false) {
            temp += str.charAt(i);
        }
        if(str.charAt(i) == '>' || str.charAt(i) == ';') {
            b = false;
        }
    }
    return temp;
}
Here is my text in its original form:
<!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">
<html>
<head>
<meta http-equiv="Content-Type"
content="text/html; charset=iso-8859-1">
<meta name="GENERATOR" content="Microsoft FrontPage 2.0">
<title>A Shropshire Lad</title>
</head>
<body bgcolor="#008000" text="#FFFFFF" topmargin="10"
leftmargin="20">
<p align="center"><font size="6"><strong></strong></font> </p>
<div align="center"><center>
<pre><font size="7"><strong>A Shropshire Lad
</strong></font><strong>
by A.E. Housman
Published by Dover 1990</strong></pre>
</center></div>
<p><strong>This collection of sixty three poems appeared in 1896.
Many of them make references to Shrewsbury and Shropshire,
however, Housman was not a native of the county. The Shropshire
of his book is a mindscape in which he blends old ballad meters,
classical reminiscences and intense emotional experiences
"recollected in tranquility." Although they are not
particularly to my taste, their style, simplicity and
timelessness are obvious even to me. Below are two short poems
which amused me, I hope you find them interesting too.</strong></p>
<hr size="8" width="80%" color="#FFFFFF">
<div align="left">
<pre><font size="5"><strong><u>
XIII</u></strong></font><font size="4"><strong>
When I was one-and-twenty
I heard a wise man say,
'Give crowns and pounds and guineas
But not your heart away;</strong></font></pre>
</div><div align="left">
<pre><font size="4"><strong>Give pearls away and rubies
But keep your fancy free.
But I was one-and-twenty,
No use to talk to me.</strong></font></pre>
</div><div align="left">
<pre><font size="4"><strong>When I was one-and-twenty
I heard him say again,
'The heart out of the bosom
Was never given in vain;
'Tis paid with sighs a plenty
And sold for endless rue'
And I am two-and-twenty,
And oh, 'tis true 'tis true.
</strong></font><strong></strong></pre>
</div>
<hr size="8" width="80%" color="#FFFFFF">
<pre><font size="5"><strong><u>LVI . The Day of Battle</u></strong></font><font
size="4"><strong>
'Far I hear the bugle blow
To call me where I would not go,
And the guns begin the song,
"Soldier, fly or stay for long."</strong></font></pre>
<pre><font size="4"><strong>'Comrade, if to turn and fly
Made a soldier never die,
Fly I would, for who would not?
'Tis sure no pleasure to be shot.</strong></font></pre>
<pre><font size="4"><strong>'But since the man that runs away
Lives to die another day,
And cowards' funerals, when they come,
Are not wept so well at home,</strong></font></pre>
<pre><font size="4"><strong>'Therefore, though the best is bad,
Stand and do the best, my lad;
Stand and fight and see your slain,
And take the bullet in your brain.'</strong></font></pre>
<hr size="8" width="80%" color="#FFFFFF">
</body>
</html>
And this is the text after my method has been applied to it:
charset=iso-8859-1">
A Shropshire Lad
A Shropshire Lad
by A.E. Housman
Published by Dover 1990
This collection of sixty three poems appeared in 1896.
Many of them make references to Shrewsbury and Shropshire,
however, Housman was not a native of the county. The Shropshire
of his book is a mindscape in which he blends old ballad meters,
classical reminiscences and intense emotional experiences
recollected in tranquility. Although they are not
particularly to my taste, their style, simplicity and
timelessness are obvious even to me. Below are two short poems
which amused me, I hope you find them interesting too.
.
.
.
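My question is: how can I get rid of the small piece of code, charset=iso-8859-1">, at the beginning of the text? I can't get rid of that bit of code. Thanks...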
You could start by avoiding FrontPage. Tools like that trade correct code for convenience. – Joseph 2012-03-04 00:48:36
Avoiding FrontPage is probably a good idea. But I think the assignment is to process HTML code no matter where it comes from? – Nayuki 2012-03-04 00:51:51
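The stray charset=iso-8859-1"> seems to come from the ';' inside the <meta ...> tag (content="text/html; charset=iso-8859-1"). The method uses a single flag b for both tags and entities, so that ';' clears the flag while the loop is still inside the tag, and the rest of the tag is copied to the output. Below is a minimal sketch of one possible fix, assuming it is enough to track "inside a tag" and "inside an entity" separately. The class name HtmlTagStripper and the method stripTags are hypothetical names used only for illustration, and the input is passed in as a parameter instead of coming from getUnfilteredPageContents().

```java
// Hypothetical helper class, not part of the original assignment code.
class HtmlTagStripper {

    // Removes <...> tags and &...; entities using two separate state flags,
    // so a ';' inside a tag no longer ends the "skip" state early.
    static String stripTags(String html) {
        StringBuilder out = new StringBuilder();
        boolean inTag = false;     // true between '<' and the next '>'
        boolean inEntity = false;  // true between '&' and the next ';'
        for (int i = 0; i < html.length(); i++) {
            char c = html.charAt(i);
            if (inTag) {
                // Inside a tag only '>' matters; ';' is ignored here.
                if (c == '>') {
                    inTag = false;
                }
            } else if (inEntity) {
                // Inside an entity only ';' matters.
                if (c == ';') {
                    inEntity = false;
                }
            } else if (c == '<') {
                inTag = true;
            } else if (c == '&') {
                inEntity = true;
            } else {
                out.append(c);
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        String html = "<meta http-equiv=\"Content-Type\"\n"
                + "content=\"text/html; charset=iso-8859-1\">"
                + "<title>A Shropshire Lad</title>";
        System.out.println(stripTags(html)); // prints "A Shropshire Lad"
    }
}
```

This sketch still has limits: a '>' inside a quoted attribute value would end the tag too early, and a bare '&' that is never followed by ';' would swallow the rest of the text. For HTML from arbitrary sources, a dedicated HTML parser is likely to be more robust than a character-by-character loop.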