语音信号处理 Speech Signal Processing 教材:数字语音处理 ...

Preview:

DESCRIPTION

语音信号处理 Speech Signal Processing 教材:数字语音处理 华中科技大学出版社 参考书:语音信号数字处理 电子工业出版社. From prehistory to the new media of the future, speech has been and will be a primary form of communication between humans. - PowerPoint PPT Presentation

Citation preview

语音信号处理

Speech Signal Processing 教材:数字语音处理

华中科技大学出版社

参考书:语音信号数字处理

电子工业出版社

From prehistory to the new media of the future, speech has been and will be a primary form of communication between humans.

Speech will become the next major trend in the personal computer market in the near future.

学习的目标:

保障现代通信领域中人与人之间、人与机器之间的信息交换的顺畅和自然。

Representation of speech signals

waveform parametric

preamplifier

anti-aliasing filter

A/D conversion

语音信号产生的数字模型Speech Production Model

Discrete

System

SpeechExcitation

根据人类语音的发音的生理过程和语音信号的声学特征来建立。

☆短时时域处理技术☆短时傅立叶分析☆语音信号的线性预测 Linear Predication

☆语音信号的同态滤波和倒谱分析☆矢量量化技术 Vector Quantification

☆隐含马尔可夫模型 HMM(Hide Markov Model)

语音信号的处理方Methods语音信号是时变信号,只是在一个短时短内可近似看作是平稳信号。

语音信号的应用 Application 语音编码技术 Speech Coding 语音合成技术 Speech Synthesis 语音识别技术 Speech Recognition 语音增强技术 Speech Enhancement声源 定位技术 Speaker Tracking/ Sound Source Localization

语音信号处理涉及语言学、声学、认知科学、生理学、心理学和数理统计

等多学科知识。用数字化的方法进行语

音的传送、存储、合成、识别和增强等

技术是整个数字化通信网中最重要最基

本的组成部分。

Speech Coding

Digital Channel

The goal is reduce the information rate, measured in bits per second, while maintaining the quality of the original speech waveform.

PCM:64kb/s waveform coder

LPC-10:2.4kb/s parametric coder

GSM:13kb/s hybrid coder

欢迎使用微软中国研究院中文语音合成系统

采用 TTS技术合成语音http://research. Microsoft.com/speech/tts

Speech Synthesis

金山词霸 /电子词典

从音乐中提取有用信号

Speech Enhancement

the goal is to improve the quality of degraded speech.

Speech Recognition

IBM语音识别软件

SONY推出的 QRIO——家庭娱乐机器人

"白雪公主 "

Furby

Sound card enables the computer to Record sound from a microphone Output sound through a speaker Manipulate sound stored on a disk

Sound card contains A/D and D/A converters Sampling rate generator

Most sound cards use DMA to read and write audio data

声卡的功能

Two types of audio file formats: Raw format, no header

raw Self-describing format, with a file header

WavMidiAiff, au, vocMpegRm, ra, ram

常见的声音文件

Audio file parameters Sampling rate Bits per sample Channels

可在 MATLAB中用 wavread函数将*.wav的文件打开,将所有数据读入一个数组中,数组中每个数的范围在 [-1,1]之间。

例如: [y,fs,Nbits]=wavread(‘c:\tts_30250.wav’);

运行结果: fs=16kHz

Nbits=16

y=[…..,0.0002,-0.0009,0.0081,…..]

常见的语音软件

Cool Edit Pro

Adobe Audition

Cool Edit Pro 2.0.lnk

THANKS

Recommended