文件名称:kulkarniIyerSridharan-AudioSegmentation
介绍说明--下载内容均来自于网络,请自行研究使用
a novel algorithm to segment
an audio piece into its structural components.
The boundaries of the homogeneous regions are decided
based on various time and frequency domain
features. The algorithm has been designed in 2 stages.
In the first stage, a vocal/non-vocal/silence classification
is done using multinomial softmax regression. The
second stage uses a hidden Markov model to ‘smooth’
the previous output as well as enforce the time dependent
structuring.
an audio piece into its structural components.
The boundaries of the homogeneous regions are decided
based on various time and frequency domain
features. The algorithm has been designed in 2 stages.
In the first stage, a vocal/non-vocal/silence classification
is done using multinomial softmax regression. The
second stage uses a hidden Markov model to ‘smooth’
the previous output as well as enforce the time dependent
structuring.
(系统自动生成,下载前可以参看下载内容)
下载文件列表
kulkarniIyerSridharan-AudioSegmentation.pdf