Python重新排列＆從html頁面標題中刪除字符

我正在運行Python 2.7.11 |在Windows 10上使用beautifulsoup4和lxml。Python重新排列＆從html頁面標題中刪除字符

import urllib2 
import re 
from bs4 import BeautifulSoup 

soup = BeautifulSoup(urllib2.urlopen("http://www.daisuki.net/us/en/anime/watch.GUNDAMUNICORNRE0096.13142.html"), "lxml") 
Name = soup.title.string 

print(Name.replace('#', ""))

輸出：

01 DEPARTURE 0096 - 機動戰士UNICORN RE：0096 - DAISUKI

所需的輸出：

機動戰士UNICORN RE：0096 - 01出發0096

我將如何去除最後的「 - DAISUKI」並重新排序字符串？標題的

來源

2016-04-28 Yami

拆分通過-和重新排列部分：

>>> import urllib2 
>>> from bs4 import BeautifulSoup 
>>> 
>>> soup = BeautifulSoup(urllib2.urlopen("http://www.daisuki.net/us/en/anime/watch.GUNDAMUNICORNRE0096.13142.html"), "lxml") 
>>> Name = soup.title.string 
>>> 
>>> " - ".join(Name.replace('#', "").split(" - ")[1::-1]) 
u'MOBILE SUIT GUNDAM UNICORN RE:0096 - 01 DEPARTURE 0096'

來源

2016-04-28 20:31:53 alecxe

哈克溶液傳入：

Name = "01 DEPARTURE 0096 - MOBILE SUIT GUNDAM UNICORN RE:0096 - DAISUKI" 
print ("- ".join(reversed(Name.split('-')[:2]))).strip()

來源

2016-04-28 20:36:03 taesu

Python重新排列＆從html頁面標題中刪除字符

回答

相關問題