Special Project (3): Viterbi Decoding and Triphone Acoustic Model

Page 1: Special Project (3): Viterbi Decoding, Triphone Acoustic Model

Prof. Lin-Shan Lee, TA: Yun-Chiao Li

Page 2: Viterbi Decoding

03.04.mono0a.viterbi.sh
04.04.tri1.viterbi.sh

Page 3: Viterbi Decoding

Instead of decoding with a WFST, we now use Viterbi decoding.

The Kaldi acoustic model is converted to HTK format by Vulcan (02.02.convert.htk.feat.sh).
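
As a rough illustration of what Viterbi decoding computes, here is a minimal textbook sketch in Python of the log-domain recursion over a small HMM. It is not the code in 03.04.mono0a.viterbi.sh or 04.04.tri1.viterbi.sh, and it is not HTK's decoder; the function name `viterbi`, the two-state left-to-right model, and all probabilities are made up for illustration.

```python
import math

NEG_INF = float("-inf")


def viterbi(log_init, log_trans, log_emit):
    """Return (best_state_path, best_log_score) for one utterance.

    log_init[s]     : log P(start in state s)
    log_trans[p][s] : log P(next state s | previous state p)
    log_emit[t][s]  : log P(frame t | state s)
    """
    n_states = len(log_init)
    # Best score of any path ending in each state at frame 0.
    score = [log_init[s] + log_emit[0][s] for s in range(n_states)]
    backptr = []  # backptr[t-1][s] = best previous state for s at frame t

    for t in range(1, len(log_emit)):
        new_score, ptr = [], []
        for s in range(n_states):
            # Keep only the best predecessor: this is the Viterbi max step.
            best_prev = max(range(n_states),
                            key=lambda p: score[p] + log_trans[p][s])
            new_score.append(score[best_prev] + log_trans[best_prev][s]
                             + log_emit[t][s])
            ptr.append(best_prev)
        score = new_score
        backptr.append(ptr)

    # Trace back from the best final state.
    last = max(range(n_states), key=lambda s: score[s])
    path = [last]
    for ptr in reversed(backptr):
        path.append(ptr[path[-1]])
    path.reverse()
    return path, score[last]


# Hypothetical toy model: 2 states, 3 frames.
log = math.log
log_init = [log(1.0), NEG_INF]
log_trans = [[log(0.5), log(0.5)],
             [NEG_INF, log(1.0)]]
log_emit = [[log(0.9), log(0.1)],
            [log(0.8), log(0.2)],
            [log(0.1), log(0.9)]]

path, score = viterbi(log_init, log_trans, log_emit)
print(path)  # [0, 0, 1]
```

Working in the log domain turns the products of probabilities into sums, which avoids numerical underflow on long utterances.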

Page 4: Viterbi Decoding

Use the dev set to find the best acoustic weight (acwt).
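
The slide does not say how the search is carried out. One common approach is a simple grid search: decode the dev set once per candidate weight and keep the weight with the fewest errors. The sketch below assumes the weight scales the acoustic log-likelihood before it is added to the language-model log-probability; the helper names, the candidate hypotheses, and every number are hypothetical.

```python
def pick_hypothesis(candidates, acwt):
    """Choose the candidate with the highest combined score.

    Each candidate is (acoustic_loglik, lm_logprob, num_errors).
    Assumed combination: acwt * acoustic_loglik + lm_logprob.
    """
    return max(candidates, key=lambda c: acwt * c[0] + c[1])


def sweep_acwt(dev_set, acwt_values):
    """Return (best_acwt, {acwt: total_errors}) over the dev set."""
    results = {}
    for acwt in acwt_values:
        results[acwt] = sum(pick_hypothesis(utt, acwt)[2] for utt in dev_set)
    best = min(results, key=results.get)
    return best, results


# Made-up dev set: three utterances, two competing hypotheses each.
dev_set = [
    [(-1000.0, -20.0, 0), (-1010.0, -18.5, 2)],
    [(-800.0, -15.0, 1), (-805.0, -12.0, 0)],
    [(-500.0, -10.0, 0), (-495.0, -12.0, 1)],
]

best_acwt, errors = sweep_acwt(dev_set, [0.05, 0.1, 0.2, 0.5, 1.0])
print(best_acwt, errors)
```

The weight is tuned on the dev set rather than the test set so that the reported test accuracy is not biased by the tuning.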

Page 5: Triphone Acoustic Model

04.01~04.04

Page 6: Triphone Acoustic Model

In a monophone acoustic model, each phone (ㄅ, ㄆ, ㄇ) has its own model.

In a triphone acoustic model, a phone together with its left and right context, such as ㄅ-ㄆ-ㄇ, is one model.

This produces too many models, and there is not enough training data for each of them.
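
To make the growth in model count concrete, the sketch below expands a monophone sequence into context-dependent labels and counts the possible triphones. It writes labels in HTK's left-center+right form (l-c+r), whereas the slide writes ㄅ-ㄆ-ㄇ; the function names, the `sil` boundary symbol, and the inventory size of 40 phones are assumptions for illustration only.

```python
def to_triphones(phones, boundary="sil"):
    """Expand a monophone sequence into l-c+r triphone labels.

    The utterance is padded with a boundary symbol on both sides so
    that the first and last phones also get a full context.
    """
    padded = [boundary] + list(phones) + [boundary]
    return [f"{padded[i - 1]}-{padded[i]}+{padded[i + 1]}"
            for i in range(1, len(padded) - 1)]


def max_triphones(num_phones):
    """Upper bound on distinct triphones: any left, center, right."""
    return num_phones ** 3


print(to_triphones(["ㄅ", "ㄆ", "ㄇ"]))
# ['sil-ㄅ+ㄆ', 'ㄅ-ㄆ+ㄇ', 'ㄆ-ㄇ+sil']

# With a hypothetical inventory of 40 phones:
print(max_triphones(40))  # 64000, versus 40 monophone models
```

Many of these triphones occur rarely or never in the training data, which is why the models cannot all be trained independently.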

Page 7: Decision Tree

Use a decision tree to tie similar models together.
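
The idea of tying can be sketched as a single tree split: pool the data of all triphones that share a center phone, then pick the phonetic question whose yes/no split most increases the likelihood. Triphones that land in the same leaf share one model. This is a toy under strong assumptions: one-dimensional made-up values stand in for acoustic statistics, each group is modeled by a single Gaussian, and the phone labels and question names are hypothetical. It is not the procedure in 04.01.tri1.train.sh.

```python
import math


def gaussian_loglik(values, var_floor=1e-3):
    """Log-likelihood of 1-D samples under their own ML Gaussian."""
    n = len(values)
    if n == 0:
        return 0.0
    mean = sum(values) / n
    var = max(sum((v - mean) ** 2 for v in values) / n, var_floor)
    return -0.5 * n * (math.log(2 * math.pi * var) + 1.0)


def best_question(samples, questions):
    """Return (question_name, likelihood_gain) for the best single split.

    samples   : {triphone_label: [values]}
    questions : {name: predicate(triphone_label) -> bool}
    """
    pooled = [v for vals in samples.values() for v in vals]
    base = gaussian_loglik(pooled)
    best = None
    for name, pred in questions.items():
        yes = [v for t, vals in samples.items() if pred(t) for v in vals]
        no = [v for t, vals in samples.items() if not pred(t) for v in vals]
        if not yes or not no:
            continue  # a question that does not split the data is useless
        gain = gaussian_loglik(yes) + gaussian_loglik(no) - base
        if best is None or gain > best[1]:
            best = (name, gain)
    return best


def left_context(label):
    """Left phone of an l-c+r label."""
    return label.split("-")[0]


# Made-up data for four triphones of center phone "a".
samples = {
    "b-a+n": [1.0, 1.1],
    "p-a+n": [0.9, 1.2],
    "m-a+n": [3.0, 3.1],
    "n-a+n": [2.9, 3.2],
}
questions = {
    "left_is_stop": lambda t: left_context(t) in {"b", "p"},
    "left_is_labial": lambda t: left_context(t) in {"b", "p", "m"},
}

name, gain = best_question(samples, questions)
print(name)  # left_is_stop
```

Here b-a+n and p-a+n end up in one leaf and m-a+n and n-a+n in the other, so four triphones are covered by two tied models. A full tree repeats the split on each leaf until the gain or the amount of data falls below a threshold.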

Page 8: 04.01.tri1.train.sh (1/3)

It is very similar to 03.01.

Page 9: 04.01.tri1.train.sh (2/3)

Page 10: 04.01.tri1.train.sh (3/3)

Page 11: Homework

bash 04.01.tri1.train.sh
bash 04.02.tri1.mkgraph.sh
bash 04.03.tri1.fst.sh
bash 04.04.tri1.viterbi.sh

Page 12: Some Helpful References

"使用加權有限狀態轉換器的基於混合詞與次詞以文字及語音指令偵測口語詞彙" (Hybrid word- and subword-based spoken term detection with text and spoken queries using weighted finite-state transducers), Chapter 3: https://www.dropbox.com/s/dsaqh6xa9dp3dzw/wfst_thesis.pdf

See HDecode and HLRescore in the HTK Book.