Using Stanford Tregex in Python

2017-03-15 Andrea

text = ('Pusheen and Smitha walked along the beach. "I want to surf", said Smitha, the CEO of Tesla. However, she fell off the surfboard') 
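The parses that follow are read from `output1`, whose construction is not shown in the post. As a minimal sketch of one way it could be produced, assuming a Stanford CoreNLP server is already listening on `http://localhost:9000` (the helper names `build_parse_params` and `annotate` are hypothetical, not taken from the original):

```python
import json

# Assumed address of a locally running CoreNLP server.
CORENLP_URL = "http://localhost:9000"


def build_parse_params():
    """Build the query parameters asking the server for constituency parses as JSON."""
    props = {
        "annotators": "tokenize,ssplit,pos,parse",
        "outputFormat": "json",
    }
    # The CoreNLP server expects the properties as a JSON string in the URL.
    return {"properties": json.dumps(props)}


def annotate(text, url=CORENLP_URL):
    """POST raw text to the server and return the decoded JSON response."""
    import requests  # imported here so build_parse_params works without it

    r = requests.post(url, params=build_parse_params(), data=text.encode("utf-8"))
    r.raise_for_status()
    return r.json()
```

With the server running, `output1 = annotate(text)` would give a dictionary whose `output1['sentences'][i]['parse']` entries hold one bracketed tree per sentence.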

output1['sentences'][0]['parse'] 

Out[58]: '(ROOT\n (S\n (NP (NNP Pusheen)\n  (CC and)\n  (NNP Smitha))\n (VP (VBD walked)\n  (PP (IN along)\n  (NP (DT the) (NN beach))))\n (. .)))' 

output1['sentences'][1]['parse'] 

Out[59]: "(ROOT\n (SINV (`` ``)\n (S\n  (NP (PRP I))\n  (VP (VBP want)\n  (PP (TO to)\n   (NP (NN surf) ('' '')))))\n (, ,)\n (VP (VBD said))\n (NP\n  (NP (NNP Smitha))\n  (, ,)\n  (NP\n  (NP (DT the) (NNP CEO))\n  (PP (IN of)\n   (NP (NNP Tesla)))))\n (. .)))" 

output1['sentences'][2]['parse'] 

Out[60]: '(ROOT\n (S\n (ADVP (RB However))\n (, ,)\n (NP (PRP she))\n (VP (VBD fell)\n  (PRT (RP off))\n  (NP (DT the) (NN surfboard)))))' 

cd stanford-tregex-2016-10-31 
java -cp 'stanford-tregex.jar:' edu.stanford.nlp.trees.tregex.TregexPattern -f -s '(NP[$VP]>S)|(NP[$VP]>S\n)|(NP\n[$VP]>S)|(NP\n[$VP]>S\n)' /Users/AS/stanford-tregex-2016-10-31/exampletree.txt 

Pattern string: 
(NP[$VP]>S)|(NP[$VP]>S\n)|(NP\n[$VP]>S)|(NP\n[$VP]>S\n) 
Parsed representation: 
or 
    Root NP 
     and 
     $ VP 
     > S 
    Root NP 
     and 
     $ VP 
     > S\n 
    Root NP\n 
     and 
     $ VP 
     > S 
    Root NP\n 
     and 
     $ VP 
     > S\n 
Reading trees from file(s) file path 
# /Users/AS/stanford-tregex-2016-10-31/exampletree.txt 
(NP (NNP Pusheen) \n (CC and) \n (NNP Smitha)) 
# /Users/AS/stanford-tregex-2016-10-31/exampletree.txt 
(NP\n (NP (NNP Smitha)) \n (, ,) \n (NP\n (NP (DT the) (NN spokesperson)) \n (PP (IN of) \n (NP (DT the) (NNP CIA)))) \n (, ,)) 
# /Users/AS/stanford-tregex-2016-10-31/exampletree.txt 
(NP (PRP They)) 
There were 3 matches in total. 
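The four alternatives in the pattern differ only in a `\n` attached to the `NP` or `S` label, which suggests the tree file holds literal `\n` text copied from the Python string output. A possible workaround, sketched here under that assumption, is to flatten each parse to a single line before saving it, so that the first alternative `NP[$VP]>S` may be enough on its own. The function name `flatten_parse` is hypothetical, and this has not been checked against the original `exampletree.txt`.

```python
def flatten_parse(parse):
    """Return the bracketed tree on one line with single spaces between tokens."""
    # Turn literal backslash-n sequences (as pasted from a repr) into spaces.
    parse = parse.replace("\\n", " ")
    # Collapse real newlines and runs of whitespace.
    return " ".join(parse.split())


tree = '(ROOT\n (S\n (NP (NNP Pusheen)\n  (CC and)\n  (NNP Smitha))\n (. .)))'
print(flatten_parse(tree))
```

Writing the flattened string to the tree file keeps node labels such as `NP` and `S` free of stray `\n` suffixes.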

java -Xmx4g edu.stanford.nlp.pipeline.StanfordCoreNLPServer -port 9000 -timeout 15000 

import requests 

url = "http://localhost:9000/tregex" 
request_params = {"pattern": "(NP[$VP]>S)|(NP[$VP]>S\\n)|(NP\\n[$VP]>S)|(NP\\n[$VP]>S\\n)"} 
text = "Pusheen and Smitha walked along the beach." 
r = requests.post(url, data=text, params=request_params) 
print(r.json()) 

{u'sentences': [{u'0': {u'namedNodes': [], u'match': u'(NP (NNP Pusheen)\n (CC and)\n (NNP Smitha))\n'}}]} 
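The response nests each match under a string index inside a per-sentence dictionary. A small sketch for pulling the matched subtrees out as a flat list, assuming the keys are always numeric strings as in the output above (the helper name `extract_matches` is hypothetical):

```python
def extract_matches(response):
    """Collect the matched subtrees from a /tregex JSON response, in order."""
    matches = []
    for sentence in response.get("sentences", []):
        # Keys are assumed to be "0", "1", ... so they are sorted numerically.
        for key in sorted(sentence, key=int):
            matches.append(sentence[key]["match"].strip())
    return matches


# The response shown above, written as a plain dictionary.
response = {
    "sentences": [
        {"0": {"namedNodes": [],
               "match": "(NP (NNP Pusheen)\n (CC and)\n (NNP Smitha))\n"}}
    ]
}
print(extract_matches(response))
```

In practice `response` would be `r.json()` from the request above.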
