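How to tune keyword thresholds with pocketsphinx-python

I need to develop speech recognition software, so I am using Pocketsphinx for Python. I only want to recognize a few phrases, so I use a keyphrase list with thresholds to reject out-of-grammar words. This is my code so far: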
#!/usr/bin/env python
import os
from pocketsphinx import LiveSpeech, get_model_path

model_path = get_model_path()

speech = LiveSpeech(
    verbose=False,
    sampling_rate=16000,
    buffer_size=2048,
    no_search=False,
    full_utt=False,
    lm=False,  # no language model: keyword spotting only
    hmm=os.path.join(model_path, 'en-us'),
    # lm='/home/paulo/PycharmProjects/VoiceRecognition/Data/TAR4858/4858.lm',
    dic='/home/paulo/PycharmProjects/VoiceRecognition/Data/TAR4858/4858.dic',
    kws='/home/paulo/PycharmProjects/VoiceRecognition/Data/TAR4858/keyphrase.list'
)

# Print each keyphrase as it is detected
for phrase in speech:
    frase = str(phrase)
    print(frase)
This is my dictionary:
BROWSER B R AW Z ER
COMPUTER K AH M P Y UW T ER
HELLO HH AH L OW
HELLO(2) HH EH L OW
MUSIC M Y UW Z IH K
OFF AO F
ON AA N
ON(2) AO N
OPEN OW P AH N
TURN T ER N
And here is my keyphrase.list:
OPEN BROWSER /1e-1/
TURN ON MUSIC /1e-20/
TURN OFF MUSIC /1e-20/
TURN OFF COMPUTER /1e-25/
HELLO COMPUTER /1e-25/
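Since these thresholds will need adjusting, it may be easier to generate keyphrase.list from Python than to edit it by hand. Below is a minimal sketch, assuming one `PHRASE /threshold/` entry per line as in the file above. The function names and the output path are hypothetical, not part of pocketsphinx.

```python
def format_keyphrase_list(thresholds):
    """Build keyphrase.list text: one 'PHRASE /threshold/' entry per line."""
    return "".join(
        "{} /{:g}/\n".format(phrase, threshold)
        for phrase, threshold in thresholds.items()
    )


def write_keyphrase_list(path, thresholds):
    """Write the formatted list to `path` (hypothetical helper)."""
    with open(path, "w") as f:
        f.write(format_keyphrase_list(thresholds))


# Thresholds copied from the list above; change these values while tuning.
# Note that '{:g}' prints 1e-1 as 0.1, which is the same number.
THRESHOLDS = {
    "OPEN BROWSER": 1e-1,
    "TURN ON MUSIC": 1e-20,
    "TURN OFF MUSIC": 1e-20,
    "TURN OFF COMPUTER": 1e-25,
    "HELLO COMPUTER": 1e-25,
}
```

Calling `write_keyphrase_list('keyphrase.list', THRESHOLDS)` and pointing the `kws=` argument at that file should then pick up the new values on the next run.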
My setup works, but the CMU Sphinx website's section on keyword lists says I need to tune these thresholds with the command:
pocketsphinx_continuous -infile <your_file.wav> -keyphrase <your keyphrase> \
-kws_threshold <your_threshold> -time yes
But how do I do that with pocketsphinx installed for Python?
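For reference, here is a rough sketch of what a Python equivalent of that command might look like. It assumes the same pocketsphinx-python release as the code above, whose README shows `AudioFile` and `LiveSpeech` accepting `keyphrase` and `kws_threshold` arguments. It also assumes the recording is 16 kHz, 16-bit mono audio. The helper names, the exponent range, and the file name in the usage note are hypothetical.

```python
def candidate_thresholds(lowest_exp=-50, highest_exp=-1, step=5):
    """Return thresholds such as 1e-50, 1e-45, ... as floats.

    The default exponent range is an arbitrary starting point, not a
    recommendation from the library.
    """
    return [float("1e{}".format(e))
            for e in range(lowest_exp, highest_exp + 1, step)]


def count_detections(audio_path, keyphrase, threshold, dic_path=None):
    """Count how often `keyphrase` is spotted in a recording at `threshold`."""
    # Imported here so the helper above works without pocketsphinx installed.
    from pocketsphinx import AudioFile

    config = {
        "verbose": False,
        "audio_file": audio_path,
        "lm": False,                 # keyword spotting, no language model
        "keyphrase": keyphrase,
        "kws_threshold": threshold,
    }
    if dic_path is not None:
        config["dic"] = dic_path     # custom dictionary, as in the code above

    # In keyphrase mode, iterating AudioFile yields once per detection.
    return sum(1 for _ in AudioFile(**config))


def sweep(audio_path, keyphrase, dic_path=None):
    """Print the detection count for each candidate threshold."""
    for threshold in candidate_thresholds():
        hits = count_detections(audio_path, keyphrase, threshold, dic_path)
        print("{:g}\t{}".format(threshold, hits))
```

With a recording in which the phrase is spoken a known number of times, something like `sweep('test.wav', 'TURN ON MUSIC')` should show which thresholds produce misses and which produce false alarms, and the chosen value can then go back into keyphrase.list.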