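Problems with spaCy in Python
2017-04-27

I am trying my hand at spaCy, but the documentation seems flawed, and just getting started took me a long time. The first problem I ran into was: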

import spacy 
nlp = spacy.load("en") 

Warning: no model found for 'en' 
Only loading the 'en' tokenizer. 

I solved that by importing the model package directly:

import en_core_web_sm as en_core 
nlp=en_core.load() 

But now, when I try to run this code:

from numpy import dot 
from numpy.linalg import norm 
from spacy.en import English 
parser = English() 

#Generate word vector of the word - apple 
apple = parser.vocab[u'apple'] 

#Cosine similarity function 
cosine = lambda v1, v2: dot(v1, v2)/(norm(v1) * norm(v2)) 
others = list({w for w in parser.vocab if w.has_vector and w.orth_.islower() and w.lower_ != unicode("apple")}) 

# sort by similarity score 
others.sort(key=lambda w: cosine(w.vector, apple.vector)) 
others.reverse() 


print "top most similar words to apple:" 
for word in others[:10]: 
    print word.orth_ 

I get only:

>>top most similar words to apple: 

although I should get:

>> top most similar words to apple: 
>> apples iphone fruit juice cherry lemon banana pie mac orange 

This works fine for me. – DhruvPathak

Answers

Running python -m spacy.en.download all as administrator solved the problem.
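
A plausible reason the download helps, offered as an assumption rather than something confirmed in this thread: without the full data, no entry in the vocabulary carries a word vector, so the has_vector filter in the question discards every candidate and only the header line is printed. The sketch below imitates that behaviour in plain Python, with no spaCy involved. The Lexeme tuple, the has_vector and most_similar helpers, and the three-dimensional vectors are all made up for illustration.

```python
import math
from collections import namedtuple

# Hypothetical stand-in for a spaCy lexeme: its text plus an optional vector.
Lexeme = namedtuple("Lexeme", ["text", "vector"])


def has_vector(lex):
    """True only if the lexeme carries a vector with a non-zero component."""
    return lex.vector is not None and any(x != 0.0 for x in lex.vector)


def cosine(v1, v2):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(v1, v2))
    norms = math.sqrt(sum(a * a for a in v1)) * math.sqrt(sum(b * b for b in v2))
    return dot / norms


def most_similar(target, vocab, n=10):
    """Rank lowercase entries that have vectors by similarity to target."""
    candidates = [
        w for w in vocab
        if has_vector(w) and w.text.islower() and w.text != target.text
    ]
    candidates.sort(key=lambda w: cosine(w.vector, target.vector), reverse=True)
    return [w.text for w in candidates[:n]]


# Made-up vectors, only to show the ranking step working.
with_vectors = [
    Lexeme("apple", [1.0, 0.9, 0.1]),
    Lexeme("fruit", [0.9, 1.0, 0.2]),
    Lexeme("car", [0.1, 0.2, 1.0]),
]
# The same vocabulary as it might look when no vector data is installed.
without_vectors = [Lexeme(w.text, None) for w in with_vectors]

print(most_similar(with_vectors[0], with_vectors))        # ['fruit', 'car']
print(most_similar(without_vectors[0], without_vectors))  # []
```

If the second call matches what you see, the ranking code itself is fine and the missing piece is the vector data. Checking how many vocabulary entries pass the has_vector filter before sorting is a quick way to confirm this.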