2013-11-27 78 views
0

我正在使用Sphinx 4與以下wav files進行聲學模型自適應。 下面是結果使用bw -hmmdir wsj -moddeffn wsj/mdef -ts2cbfn .cont. -feat 1s_c_d_dd -cmn current -agc none -dictfn vn.dic -ctlfn lisp.fileids -lsnfn lisp.transcription -accumdir . 獅身人面像4無法對齊音頻

utt>  0     lisp_0001 53INFO: cmn.c(175): CMN: 73.43 2.89 -0.3 
4 -1.85 -0.98 -0.52 0.33 0.67 -0.77 -0.56 0.18 -0.50 -0.30 
    0 28 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0001 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  1     lisp_0002 41INFO: cmn.c(175): CMN: 74.39 2.48 -0.9 
2 -2.09 -1.31 -0.52 -0.17 0.67 0.26 -0.62 0.34 -0.26 -0.04 
    0 28 0 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0002 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  2     lisp_0003 57INFO: cmn.c(175): CMN: 75.86 2.02 -0.5 
3 -1.16 -0.79 -0.55 -0.77 0.92 -0.34 -0.82 0.63 -0.33 -0.60 
    0 40 2 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0003 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  3     lisp_0004 57INFO: cmn.c(175): CMN: 74.78 2.01 -0.3 
6 -0.52 -1.04 -1.08 -0.08 0.88 -0.51 -0.65 0.56 -0.36 -0.54 
    0 40 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0004 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  4     lisp_0005 49INFO: cmn.c(175): CMN: 75.03 1.80 -1.7 
6 -1.18 -1.56 -1.24 0.62 1.84 -0.58 -1.34 0.64 -0.26 -0.20 
    0 28 2 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0005 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  5     lisp_0006 41INFO: cmn.c(175): CMN: 76.75 0.51 -0.5 
5 -0.89 -1.18 -1.16 0.64 1.67 -1.25 -1.30 0.57 -0.26 -0.54 
    0 28 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0006 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  6     lisp_0007 22INFO: cmn.c(175): CMN: 82.68 -5.14 -4.5 
6 -1.20 -0.66 -0.34 -0.88 -0.05 1.29 1.60 0.97 -0.68 -1.65 
    0 28 0 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0007 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  7     lisp_0008 16INFO: cmn.c(175): CMN: 82.76 -6.31 -4.7 
5 -1.98 -1.04 -1.06 -0.49 1.19 1.57 1.48 0.52 -1.17 -1.32 
    0 28 0 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0008 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  8     lisp_0009 47INFO: cmn.c(175): CMN: 78.49 1.93 -0.6 
9 -0.95 -1.04 -0.06 -0.18 0.98 -0.98 -0.72 0.20 0.04 -0.54 
    0 32 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0009 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt>  9     lisp_0010 47INFO: cmn.c(175): CMN: 77.21 1.23 0.1 
8 -0.83 -0.89 -0.19 -0.39 0.80 -1.13 -0.86 0.38 -0.17 -0.47 
    0 32 3 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0010 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 10     lisp_0011 39INFO: cmn.c(175): CMN: 79.15 0.97 0.4 
2 -0.53 -1.72 -1.64 -0.36 1.03 0.23 -0.49 -0.59 -0.21 -0.16 
    0 32 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0011 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 11     lisp_0012 41INFO: cmn.c(175): CMN: 77.22 1.29 0.4 
5 -0.51 -2.12 -1.20 -0.52 1.09 -0.10 -0.56 -0.27 -0.60 -0.20 
    0 36 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0012 ignored 
utt 0.038x 0.320e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 12     lisp_0013 49INFO: cmn.c(175): CMN: 78.72 0.88 -0.8 
3 -0.17 -0.09 -0.18 -1.40 0.71 -0.16 -1.00 -0.03 0.07 -0.35 
    0 32 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0013 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 13     lisp_0014 51INFO: cmn.c(175): CMN: 77.42 0.56 -0.8 
8 -0.14 -0.05 -0.20 -1.31 0.90 -0.21 -1.39 0.07 0.01 -0.28 
    0 32 3 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0014 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 14     lisp_0015 57INFO: cmn.c(175): CMN: 74.21 1.50 0.0 
8 -1.50 -1.63 -0.97 0.65 0.63 -0.30 -0.07 -0.25 -0.71 -0.21 
    0 28 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0015 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 15     lisp_0016 54INFO: cmn.c(175): CMN: 74.42 1.22 0.0 
1 -1.77 -1.29 -1.20 0.30 0.83 0.39 -0.31 -0.32 -0.61 -0.11 
    0 28 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0016 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 16     lisp_0017 51INFO: cmn.c(175): CMN: 77.04 1.26 -1.0 
4 -0.57 -0.62 -0.27 -0.04 0.25 -0.97 -0.66 0.42 -0.16 -0.32 
    0 32 2 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0017 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 17     lisp_0018 53INFO: cmn.c(175): CMN: 76.83 0.69 -1.3 
5 -0.93 -0.46 -0.01 -0.53 0.61 -0.64 -1.03 0.85 -0.18 -0.15 
    0 32 2 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0018 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 18     lisp_0019 55INFO: cmn.c(175): CMN: 79.39 0.58 -0.5 
1 -1.02 -1.71 -0.55 0.44 0.80 0.32 -0.67 -0.73 -0.09 -0.21 
    0 36 2 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0019 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 19     lisp_0020 53INFO: cmn.c(175): CMN: 77.16 1.12 -0.4 
3 -1.27 -1.72 -1.32 -0.06 0.98 0.63 -0.42 -0.39 -0.03 -0.32 
    0 32 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0020 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 20     lisp_0023 43INFO: cmn.c(175): CMN: 78.04 1.22 -1.2 
4 -1.15 -0.43 -0.20 -0.23 0.78 -0.33 -0.37 0.05 -0.60 -0.73 
    0 24 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0023 ignored 
utt 0.036x 0.256e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 21     lisp_0024 54INFO: cmn.c(175): CMN: 77.16 0.71 -1.2 
7 -0.87 -0.57 -0.45 0.12 0.53 0.63 -0.43 0.26 -0.65 -0.38 
    0 24 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0024 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 22     lisp_0025 53INFO: cmn.c(175): CMN: 74.48 3.43 2.1 
5 -0.18 -1.62 -0.61 -0.64 -0.19 -0.28 0.38 0.05 -0.40 -0.01 
    0 32 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0025 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

utt> 23     lisp_0026 33INFO: cmn.c(175): CMN: 69.14 5.30 2.1 
9 -1.26 -2.55 0.65 -1.46 0.14 -0.22 -0.54 0.24 -0.34 -0.19 
    0 32 1 ERROR: "backward.c", line 421: Failed to align audio to trancript: 
final state of the search is not reached 
ERROR: "baum_welch.c", line 324: lisp_0026 ignored 
utt 0.000x 0.000e upd 0.000x 0.000e fwd 0.000x 0.000e bwd 0.000x 0.000e gau 0.0 
00x 0.000e rsts 0.000x 0.000e rstf 0.000x 0.000e rstu 0.000x 0.000e 

overall> stats 0 (-0) 0.000000e+000 0.000000e+000 0.000x 4.576e 
WARNING: "accum.c", line 617: Over 500 senones never occur in the input data. Th 
is is normal for context-dependent untied senone training or for adaptation, but 
could indicate a serious problem otherwise. 
INFO: s3mixw_io.c(232): Wrote ./mixw_counts [4147x1x8 array] 
INFO: s3tmat_io.c(174): Wrote ./tmat_counts [49x3x4 array] 
INFO: s3gau_io.c(478): Wrote ./gauden_counts with means with vars [4147x1x8 vect 
or arrays] 
INFO: main.c(1014): Counts saved to . 

我懷疑,因爲我的WAV文件沒有任何沉默墊,這些錯誤發生以上情況時,我得到。那是對的嗎?如果不是,錯誤的原因是什麼?

注:我錄製了一個長的音頻文件,包含所有的單詞,然後將它們切成單詞。這是由於我的記錄設備的緩慢存儲速度。這會妨礙每個較小文件的質量嗎?

在此先感謝

回答

1

您的音頻文件是在44.1 recoreded:

for f in *.wav; do 
    sox $f -r 16000 $f.new.wav; mv $f.new.wav $f; 
done 

爲:

file lisp_0009.wav 
lisp_0009.wav: RIFF (little-endian) data, WAVE audio, Microsoft PCM, 16 bit, mono 44100 Hz 

Sphinxtrain需要在16kHz的音頻率,則可以使用SOX重新取樣的聲音有關輸入音頻格式的更多信息,請參閱CMUSphinx適應教程

http://cmusphinx.sourceforge.net/wiki/tutorialadapt

我懷疑,因爲我的WAV文件沒有任何沉默墊,

這是一個問題了。音頻在邊界上必須有大約0.25秒的靜音。

然後將它們切成單詞。

您需要刪除話語上的文件,而不是文字