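Finding the index of the sentence containing a specific word (in a list of sentences) in Python
I currently have a list of sentences that looks like this: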
example = ['Mary had a little lamb' ,
'Jack went up the hill' ,
'Jill followed suit' ,
'i woke up suddenly' ,
'it was a really bad dream...']
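I want to find the index of the sentence in the list that contains a given word, for example "woke". In this example the answer should be f('woke') = 3, where f is a function.
I tried tokenizing each sentence first so that I could find the index of the word, like this: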
>>> from nltk.tokenize import word_tokenize
>>> example = ['Mary had a little lamb' ,
... 'Jack went up the hill' ,
... 'Jill followed suit' ,
... 'i woke up suddenly' ,
... 'it was a really bad dream...']
>>> tokenized_sents = [word_tokenize(i) for i in example]
>>> for i in tokenized_sents:
...     print(i)
...
['Mary', 'had', 'a', 'little', 'lamb']
['Jack', 'went', 'up', 'the', 'hill']
['Jill', 'followed', 'suit']
['i', 'woke', 'up', 'suddenly']
['it', 'was', 'a', 'really', 'bad', 'dream', '...']
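But I don't know how to get the index of the word from there, or how to link it back to the index of the sentence. Does anyone know how to do this?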
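For illustration, here is a minimal sketch of what such an f might look like. The name `find_word` is hypothetical, and it assumes plain whitespace splitting with `str.split()` in place of `word_tokenize`, so punctuation stays attached to words (for example `'dream...'` remains a single token). It returns both the sentence index and the word index of the first match.

```python
example = ['Mary had a little lamb',
           'Jack went up the hill',
           'Jill followed suit',
           'i woke up suddenly',
           'it was a really bad dream...']


def find_word(sentences, word):
    """Return (sentence_index, word_index) of the first match, or None."""
    for sent_idx, sentence in enumerate(sentences):
        # Assumption: whitespace tokenization stands in for word_tokenize.
        tokens = sentence.split()
        if word in tokens:
            # list.index gives the position of the first occurrence.
            return sent_idx, tokens.index(word)
    return None  # the word does not appear in any sentence


print(find_word(example, 'woke'))  # (3, 1)
```

Swapping `sentence.split()` for `word_tokenize(sentence)` should leave the rest of the logic unchanged, since both return a list of tokens.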
What if the word appears in multiple sentences? Return the first one? – CoryKramer
Yes, at least one, or all of the possibilities, but in my case there should only be one – alexdtx
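If all of the possibilities are wanted rather than just the first, one possible sketch is to collect every match. The name `find_all` is hypothetical, and whitespace splitting is again assumed in place of `word_tokenize`.

```python
example = ['Mary had a little lamb',
           'Jack went up the hill',
           'Jill followed suit',
           'i woke up suddenly',
           'it was a really bad dream...']


def find_all(sentences, word):
    """Return a list of (sentence_index, word_index) for every occurrence."""
    return [(sent_idx, word_idx)
            for sent_idx, sentence in enumerate(sentences)
            # Assumption: whitespace tokenization stands in for word_tokenize.
            for word_idx, token in enumerate(sentence.split())
            if token == word]


print(find_all(example, 'up'))    # [(1, 2), (3, 2)]
print(find_all(example, 'woke'))  # [(3, 1)]
```

An empty list means the word was not found, and taking the first element recovers the "return the first one" behaviour.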