如何檢測空格，特殊字符的HTML標籤蟒蛇

-4

I/O 1< img > '< input > 
I/O 1<' img > '< input >

我想如下所需的輸出，如果<後跟空間存在發生這種情況。

I/O 1<img>'<input>

任何人都可以用正則表達式來幫助我嗎？

2016-07-22 Venu

嘗試<\s+，\s+>，並且>\s+：

import re 

s = "I/O 1< img > '< input >" 
s = re.sub(r"<\s+", "<", s) 
s = re.sub(r"\s+>", ">", s) 
s = re.sub(r">\s+", ">", s) 
print(s)

輸出：

I/O 1<img>'<input>

2016-07-22 07:31:06

我已經定義我的starttagopen = re.compile（' <[> A-ZA-Z]'），我該怎麼修改有上面的代碼 – Venu

@Venu我不明白。如果您已經有試過的代碼，請編輯您的問題並將其包含在其中。 – 2016-07-22 07:39:29

s= "I/O 1< img > '< input >"

使用s.find找到HTML標籤的啓動（ '<'）

s [0：s.find（'<'）] will s在html標記開始之前選擇從0到索引的子字符串

s [s.find（'<'）：]將選擇從html標記開始到結束的子字符串。

s.replace（ ' ' ''）將取代no_spaces空間

(s[0:s.find('<')]) + (s[s.find('<'):].replace(' ',''))

2016-07-25 07:26:58

嗨，空格，特殊字符可以是任意長度。感謝您的迴應。主要目標是修剪空格，「<」 – Venu

之後的特殊字符嗨，我只是做了一個編輯。從'<'的索引開始使用s.find（'<'） –

請[edit]（http://stackoverflow.com/posts/38561914/edit）解釋這是如何幫助解決用戶的問題。 – Pureferret

回答