2015-10-01 21 views
0

我有這樣的字符串中VAR1如何改變一個Python 3字符串轉換成「可讀」文本

var1 = '$a=1%7Cscroll%20on%20%22Page%3A%20Generator-Sets-Construction%3Fid%3Dci%26s%3DY2l8Tj00Mjk0NzQ4MDY5KzQyOTQ5NjM4OTY%3D%22%7C-%7Cscroll%7C1443616500011%7C1443616500586%7C3774$fId=16440287_806$rId=RID_-62268720$rpId=1762047089$domR=1443616443684$time=1443616500588' 

如何更改該字符串的內容成編碼「可讀」文本即非URL。

從研究,這裏是我嘗試過的代碼,但它仍然保留了URL編碼項目,例如%20等

import html 
print(html.unescape('$a=1%7Cscroll%20on%20%22Page%3A%20Generator-Sets-  Construction%3Fid%3Dci%26s%3DY2l8Tj00Mjk0NzQ4MDY5KzQyOTQ5NjM4OTY%3D%22%7C-%7Cscroll%7C1443616500011%7C1443616500586%7C3774$fId=16440287_806$rId=RID_-62268720$rpId=1762047089$domR=1443616443684$time=1443616500588')) 

所有幫助表示讚賞,或者如果有一個現有的模塊這樣做。

+0

重複此[問題](http://stackoverflow.com/questions/8136788/decode-escaped-characters-in-url)? – strubbly

回答

1

你試圖做的是取消參數字符串的引用,而不是忽略html。以下應該工作 -

import urllib.parse 
print(urllib.parse.unquote('$a=1%7Cscroll%20on%20%22Page%3A%20Generator-Sets-  Construction%3Fid%3Dci%26s%3DY2l8Tj00Mjk0NzQ4MDY5KzQyOTQ5NjM4OTY%3D%22%7C-%7Cscroll%7C1443616500011%7C1443616500586%7C3774$fId=16440287_806$rId=RID_-62268720$rpId=1762047089$domR=1443616443684$time=1443616500588')) 
+0

是的,這個工程!非常感謝你Vikas。 –

相關問題