1
我試着在spacy手中,但似乎文檔有缺陷。剛開始使用它花了我很長時間。我面臨的第一個問題是: -蟒蛇中的Spacy的問題
import spacy
nlp = spacy.load("en")
Warning: no model found for 'en'
Only loading the 'en' tokenizer.
其中我解決了導入模塊
import en_core_web_sm as en_core
nlp=en_core.load()
但現在當我試着運行這段代碼
from numpy import dot
from numpy.linalg import norm
from spacy.en import English
parser = English()
#Generate word vector of the word - apple
apple = parser.vocab[u'apple']
#Cosine similarity function
cosine = lambda v1, v2: dot(v1, v2)/(norm(v1) * norm(v2))
others = list({w for w in parser.vocab if w.has_vector and w.orth_.islower() and w.lower_ != unicode("apple")})
# sort by similarity score
others.sort(key=lambda w: cosine(w.vector, apple.vector))
others.reverse()
print "top most similar words to apple:"
for word in others[:10]:
print word.orth_
即時得到
>>top most similar words to apple:
雖然我應該得到
>> top most similar words to apple:
>> apples iphone fruit juice cherry lemon banana pie mac orange
這對我來說很好。 – DhruvPathak