python的Jupyter字符串標記化

我想實現simple_tokenize使用字典作爲我以前的代碼的輸出，但我收到一條錯誤消息。對以下代碼的任何幫助將不勝感激。我使用Python 2.7 Jupyterpython的Jupyter字符串標記化

import csv 
reader = csv.reader(open('data.csv')) 

dictionary = {} 
for row in reader: 
    key = row[0] 
    dictionary[key] = row[1:] 
print dictionary

上述工作得很好，但問題是有以下幾點：

import re 

words = dictionary 
split_regex = r'\W+' 

def simple_tokenize(string): 

    for i in rows: 
     word = words.split 
    #pass 

print word

我得到這個錯誤：

NameError         Traceback (most recent call last) 
<ipython-input-2-0d0e05fb1556> in <module>() 
     1 import re 
     2 
----> 3 words = dictionary 
     4 split_regex = r'\W+' 
     5 

NameError: name 'dictionary' is not defined

來源

2016-03-31 sebastian

變量不保存之間Jupyter會議，除非你explicitly do so yourself。因此，如果您運行了第一個代碼段，然後退出Jupyter會話，啓動一個新的Jupyter會話並運行第二個代碼塊，dictionary不會從第一個會話保留下來，因此將不確定，如錯誤所示。

如果您以不同的方式運行上述代碼塊（例如，不跨Jupyter會話），應該指出這一點，但標籤和回溯表明這是您的操作。

來源

2016-12-07 23:41:44 Evert

python的Jupyter字符串標記化

回答

相關問題