SPEAKER AUTHENTICATION Qi Li and Biing-Hwang Juangberlin.csie.ntnu.edu.tw/PastCourses/2004S... ·...

SPEAKER AUTHENTICATION

Present by : 陳子和

Qi Li and Biing-Hwang Juang

Content:• Introduction:• Speaker Authentication :

Speaker Recognition and VerificationVerbal Information Verification

• Pattern Recognition in Speaker AuthenticationBayesian Decision TheoryStochastic Models for Stationary ProcessStochastic Models for Non-Stationary ProcessStatistical Verification

• Speaker Authentication SystemSpeaker Verification SystemVIV SystemCombining SV and VIV System

• Conclusion

Introduction:• To ensure the security of a proper access to private

information, passwords or personal identification numbers (PIN) have been used. To further enhance the level of security,biometric features such as signature, fingerprint, hand shape, eye iris, and voice have been considered.

• Speaker Authenticating

1.Speaker Recognition(by characteristics)

speaker verification (SV)

speaker identification(SID)

2.verbal information verification (VIV)(by verbal content)

Multiple-choice classification problem

Speaker Authentication

A typical SV system: enrollment and test sessions.

Speaker Recognition and Verification

Fixed pass-phrase systemthe spoken digit string is first recognized by an ASR and the standard verification procedure then follows.

Text-prompted system (A safety concern)the system prompts the user to utter a randomized sequence of words in the vocabulary.

Speaker Recognition and Verification (cont.)

Text-dependent or text-constrained SV systems

Verbal Information Verification

•mismatch significantly aspects the SV performance

•Enrollment is an inconvenience to the user

•A safety concern

Verbal Information Verication (cont.)

Tele banking system

Pattern Recognition in Speaker Authentication

Bayesian Decision Theory

the probability of being class Ci given o P(Ci|o) is the posterior probability,P(o|Ci) is the conditional probability, P(Ci) is prior probability:

Let L(αi|Cj) be the loss function describing the loss incurred for taking action whenthe true class is Cj. The expected risk associated with taking action αi is

Gaussian mixture model (GMM):

Stochastic Models for Stationary Process

EM Algorithm

Stochastic Models for Stationary Process (cont.)

EM Algorithm

When Testing: assume the prior is the same for all speakTake action

Stochastic Models for Non-Stationary Process

The stationary process ignored the temporal information. In other applications, such as speaker verification, the temporal information is necessary in making decisions.

A more powerful model, Hidden Markov Model (HMM) is then applied to characterize both the temporal structure and the corresponding statistical variations along the trajectory of an utterance.

Speech SegmentationStochastic Models for Non-Stationary Process(cont.)

Viterbi Algorithm

Statistical Verification

Statistical Verification (cont.)

False rejection : rejecting the hypothesis when it is actually true.

False acceptance: accepting it when it is actually false.

Equal error rate: the error rate when the operating point is so chosenas to achieve equal error probabilities for the two types of error.

Speaker Verification

Speaker Authentication System

Speaker Authentication System (cont.)VIV : with UV

Speaker Authentication System (cont.)

Utterance Segmentation

Subword Hypothesis Testing

Confidence Measure Calculation

Sequential Utterance Verification

Utterance Segmentation

Subword Hypothesis Testing

Confidence Measure Calculation

Sequential Utterance Verification

Speaker Authentication System (cont.)Experimental Results

Conclusion

Depend on Bayesian decision theory and hypothesis testing, the hypothesis testing may be conducted at phrase, word, phoneme, orsubword levels.

On extension to the Bayesian theory to authentication is the sequential verification procedure, which can also be applied to speaker verification to achieve even lower equal error rates.

Currently, the fixed phrase SV system is more attractive to realapplications due to its good performances. it is easy to remember and convenient to use.

since VIV is to verify the verbal content instead of the voice characteristics, it is users' responsibility to protect their personal information from impostors

To improve the user convenience and system performance, the VIV and SV is combined to construct a convenient speaker authentication system.

The combined system is convenient to users since they can start to use the system without going through a formal enrollment session and waiting for model training. On the other hand, since the training data could be collected from different channels in different VIV sessions, the acoustic mismatch problem is mitigated, potentially leading to a better system performance in test sessions.

The SD HMM's can be updated to cover different acoustic environments while the system is in use to further improve the system performance.

VIV can also be used to ensure training data for SV.

SPEAKER AUTHENTICATION Qi Li and Biing-Hwang Juangberlin.csie.ntnu.edu.tw/PastCourses/2004S... ·...

Documents

Beraterprofil Udo KönigBW350 SAP Components Extraction Release 3.0 BW360 BW Performance & Administration Release 3.0 DBW70E BI Delta Netweaver 2004s TDWI SA2 Datenmodellierung im

Spoken Language Structure - NTNUberlin.csie.ntnu.edu.tw/PastCourses/2003F-SpeechSignal... · 2003-09-16 · Spoken Language Structure Berlin Chen 2003 References:-X. Huang et. al.,

Biing-Hwan Lin Senior Economist, Food Economics Division, USDA/ERS Presentation at 中正大學經濟系 May 12, 2011 Do Taxes and Subsidies Improve Diet and Health?

SP2003F Lecture06 Linear Prediction Analysis of Speech ...berlin.csie.ntnu.edu.tw/PastCourses/2003F-SpeechSignalProcessing... · Linear Prediction Analysis of Speech Sounds References:

Dlver Install Sap Netweaver 2004s

Efficient algorithms for the scaled indexing problem Biing-Feng Wang, Jyh-Jye Lin, and Shan-Chyun Ku Journal of Algorithms 52 (2004) 82–100 Presenter:

Metallit 2004 L06e print web - butler.cc.tut.fibutler.cc.tut.fi/~juhan/2004s/metallit/L06_web.pdf · Luento 6. 2 Esitiedot Miten terästen karkenevuutta voidaan ... 1-rajan yläpuolella

Accessing ABAP Functions in Web Dynpro Java · Accessing ABAP Functions in Web Dynpro Java Applies to: Web Dynpro Java in SAP NetWeaver 7.0 (2004s) Summary . This tutorial shows how

Linguistic Essentialsberlin.csie.ntnu.edu.tw/PastCourses/NaturalLanguage... · 2003. 3. 4. · • Determiners (定詞) – Determiners describe the particular reference of a noun

MLDM2004S Paper-Genetic algorithm-based clustering techniqueberlin.csie.ntnu.edu.tw/PastCourses/2004S... · Genetic algorithm-based clustering technique 2. ... decoding fitness computation

Fuzzy Analysis I - Unicampvalle/PastCourses/SFuzzy_13/... · 2013-09-09 · MT801-Tópicosemmatemáticaaplicada Fuzzy Analysis I MichaelMacedoDiniz 05desetembrode2013 Michael Macedo

ꕢ뻉엩늣띾ꪺ륌ꕨ뉻Ꙣ뭐ꖼ꣓ - cs.nccu.edu.tlien/Class/Seminar/2004s/Speech01/shyu.pdf · ꕢ뻉엩덎ꢽ땻꒣쉟뒣ꭥ륆ꚨ 1995 0.1 0.05 0.5 ‘94 version '97 '99 Two

Miten terästen karkenevuutta voidaan parantaa? Luento 6 ...butler.cc.tut.fi/~juhan/2004s/metallit/L06_web4.pdf · •Jominy equivalent hardness(J eh) 11 41 43 Myöstö ja päästö

A. B. 1 Samuel 16:13 1 Corinthians 12:8-10 Ta-angThu . . . E...“Na nasemnu in sathau biing khat cih simloh inn sungah bangmah dang neilo hi,” a ci hi. A leibatnapa tungah a neihsa

HMMs and Speech Recognition - 國立臺灣師範大學berlin.csie.ntnu.edu.tw/PastCourses/NaturalLanguageProcessing2003S... · Outline Overview of Speech Recognition Architecture

Kupari ja kuparimetallitbutler.cc.tut.fi/~juhan/2004s/metallit/L02_web.pdf3 Esitiedot Miten sinkkipitoisuus vaikuttaa kuparisinkkiseoksen ominaisuuksiin? •Miksi sinkin lisäys nostaa

Khueeh truung San pham ., vaquangeao · Cuan sach nay khong dugc sao chep ho~c sua dbi khi chua duqe phep biing van ban cila Chuong trlnh PMt triin DVan Me Kong. Chuang trlnh PMt

Metallit 2004 L04c print web - butler.cc.tut.fibutler.cc.tut.fi/~juhan/2004s/metallit/L04_web.pdf · 5 Grafiitin ydintyminen Riippue jähtymisnopeudesta, kasvunopeudesta, koostumuksesta,

Mathematics of Fuzzy Sets and Fuzzy Logic - Unicampvalle/PastCourses/SFuzzy_13/Seminar...Mathematics of Fuzzy Sets and Fuzzy Logic Marcos Eduardo Valle Departamento de Matemática

vA DAo C(>NG HoA HOI CHU NGHiA VIJ;tTNAM TRUONG DB … dung/1.td... · So'xin tuy€n d\lng) ho~c Co'quan dang cong tac d6i vai truong hQ'pxet tuy€n d~c cach; 3. Biing t6t nghi~p