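Printing special characters in Python

I have a list containing special characters (for example é or a space). When I print the list, these characters are printed as their Unicode codes, whereas they are printed correctly when I print the individual list elements: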

#!/usr/bin/env python 
# -*- coding: utf-8 -*- 

my_list = ['éléphant', 'Hello World'] 
print(my_list) 
print(my_list[0]) 
print(my_list[1])

The output of this code is:

['\xc3\xa9l\xc3\xa9phant', 'Hello World']

éléphant

Hello World

I would like to get ['éléphant', 'Hello World'] as the first output. What should I change?

Source

2017-07-09 fonfonx

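You can do something like this to encode the output of your 'print' statement correctly: print repr(my_list).decode('unicode-escape').encode('latin-1'). I posted this as an answer but deleted it because I had only tested it in python2, so I am commenting instead. –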

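@ViníciusAguiar I am indeed using python2. Your answer works well if I replace 'latin-1' with 'utf-8'. – fonfonx

Oh, that's nice! I won't undelete it, since it looks like there are already several good answers. Thanks for letting me know! =) –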

If possible, switch to Python 3 and you will get the expected result.
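As a rough illustration of why Python 3 avoids the problem (a sketch, assuming a Python 3 interpreter and a terminal that understands UTF-8): str is a Unicode type there, and repr() leaves printable non-ASCII characters such as é unescaped.

```python
# Python 3 sketch: repr() of a str keeps printable characters like é as they are.
my_list = ['éléphant', 'Hello World']

list_text = str(my_list)      # the text that print(my_list) writes out
print(list_text)
print(repr(my_list[0]))       # the element's repr is not escaped either
```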

If you have to make it work in Python 2, then use unicode strings:

my_list = [u'éléphant', u'Hello World']

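As you have it now, Python interprets the first string as a sequence of bytes with the value '\xc3\xa9l\xc3\xa9phant', which becomes Unicode code points only after it is correctly decoded as UTF-8: '\xc3\xa9l\xc3\xa9phant'.decode('utf8') == u'\xe9l\xe9phant'.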
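The difference between the bytes and the decoded code points can be sketched as follows. This uses Python 3 syntax (an assumption, chosen so the snippet runs on a current interpreter); the b'...' literal stands in for the plain byte string that Python 2 stores.

```python
# Python 3 sketch mimicking the Python 2 byte string from the question.
raw = b'\xc3\xa9l\xc3\xa9phant'   # UTF-8 bytes of the word
text = raw.decode('utf-8')        # bytes -> Unicode code points

print(len(raw))                   # number of bytes; each é takes two in UTF-8
print(len(text))                  # number of code points
print(text == '\xe9l\xe9phant')   # same comparison as in the answer
```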

If you want to print the list's repr and get "unicode" output, you have to encode it to UTF-8 manually (if that is what your terminal understands).

>>> print repr(my_list).decode('unicode-escape').encode('utf8') 
[u'éléphant', u'Hello World']
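To see why this round trip works, here is a sketch in Python 3 syntax (an assumption; the line above targets Python 2). The built-in ascii() stands in for Python 2's escaping repr() of unicode strings: the escaped text is pure ASCII, and the unicode_escape codec turns the backslash sequences back into characters.

```python
my_list = ['éléphant', 'Hello World']

escaped = ascii(my_list)   # ASCII-only text containing \xe9 escapes
restored = escaped.encode('ascii').decode('unicode_escape')

print(escaped)
print(restored)
```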

But it is easier to format the output manually:

>>> print ", ".join(my_list) 
éléphant, Hello World

Source

2017-07-09 23:35:17 randomir

What exactly is 'repr'? – fonfonx

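'repr' (https://docs.python.org/2/library/functions.html#repr) returns a printable representation of an object, which can usually be passed to 'eval' (https://docs.python.org/2/library/functions.html#eval). When you call 'print my_list', Python 2 actually prints 'str(my_list)', which for a list is equivalent to 'repr(my_list)', and that is built from the 'repr' of each individual element. Since 'unicode' strings are not really native in Python 2, what we get from 'repr(my_list)' is "[u'\\xe9l\\xe9phant', u'Hello World']", with the unicode code points escaped. – randomir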
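A small sketch of the repr round trip described in this comment. It uses ast.literal_eval as a safer stand-in for eval (a swap made here for illustration, not something the comment suggests), and the sample values are made up.

```python
import ast

value = ['éléphant', 42, None]     # hypothetical sample data
text = repr(value)                 # printable representation of the list
rebuilt = ast.literal_eval(text)   # parse the representation back into an object

print(text)
print(rebuilt == value)
```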

Short answer: you have to implement it yourself if you want to keep this output format:

#!/usr/bin/env python 
# -*- coding: utf-8 -*- 

my_list = ['éléphant', 'Hello World'] 

def print_list (l): 
    print ("[" + ", ".join(["'%s'" % str(x) for x in l]) + "]") 

print_list (my_list)

This produces the expected output:

['éléphant', 'Hello World']

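However, note that it puts every element in quotes (even numbers, for example), so if you expect anything other than strings in the list, you may need a more elaborate implementation.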
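One possible shape for such a more elaborate implementation, as a minimal sketch: the helper name format_list is hypothetical, and only str elements are quoted while everything else keeps its normal repr. Strings that themselves contain quotes are not escaped here.

```python
def format_list(items):
    """Return a list-like string, quoting only the string elements."""
    parts = []
    for item in items:
        if isinstance(item, str):
            parts.append("'%s'" % item)   # show the text as-is, in quotes
        else:
            parts.append(repr(item))      # numbers, None, etc. keep their repr
    return "[" + ", ".join(parts) + "]"

mixed = ['éléphant', 2, 3.5, None]        # made-up sample data
print(format_list(mixed))
```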


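The problem is that Python runs str(my_list) under the hood before printing it. That, in turn, runs repr() on each element of the list.

Now, repr() on a string returns an ASCII-only representation of the string. That is, the '\xc3' you see is an actual backslash, an actual 'x', an actual 'c' and an actual '3' character.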

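There is no way around this when printing the list directly, because the behavior comes from the list.__str__() implementation.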

Below is a sample program that demonstrates this.

#!/usr/bin/env python 
# -*- coding: utf-8 -*- 

# vi: ai sts=4 sw=4 et 

import pprint 

my_list = ['éléphant', 'Hello World'] 

# under the hood, python first runs str(my_list), before printing it 
my_list_as_string = str(my_list) 

# str() on a list runs repr() on each of the elements. 
# However, it seems that __repr__ on a string transforms it to an 
# ASCII-only representation 
print ('str(my_list) = %s' % str(my_list)) 
for c in my_list_as_string: 
    print c 
print ('len(str(my_list)) = %s' % len(str(my_list))) 
print ("\n") 

# Which we can confirm here, where we can see that it also adds the quotes: 
print ('repr("é") == %s' % repr("é")) 
for c in repr("é"): 
    print c 
print ('len(repr("é")) == %s' % len(repr("é"))) 
print ("\n") 

# Even pprint fails 
print ("pprint gives the same results") 
pprint.pprint(my_list) 

# It's useless to try to encode it, since all data is ASCII 
print "Trying to encode" 
print (my_list_as_string.encode ("utf8"))

Which produces this:

str(my_list) = ['\xc3\xa9l\xc3\xa9phant', 'Hello World'] 
[ 
' 
\ 
x 
c 
3 
\ 
x 
a 
9 
l 
\ 
x 
c 
3 
\ 
x 
a 
9 
p 
h 
a 
n 
t 
' 
, 

' 
H 
e 
l 
l 
o 

W 
o 
r 
l 
d 
' 
] 
len(str(my_list)) = 41 


repr("é") == '\xc3\xa9' 
' 
\ 
x 
c 
3 
\ 
x 
a 
9 
' 
len(repr("é")) == 10 


pprint gives the same results 
['\xc3\xa9l\xc3\xa9phant', 'Hello World'] 
Trying to encode 
['\xc3\xa9l\xc3\xa9phant', 'Hello World']

Source

2017-07-10 00:17:36 caxcaxcoatl
