AVEC 2017 menu

Mixture of Gaussians/HMM toolbox

This Matlab toolbox is a simplified and bare implementation for the creation, training and evaluation of Mixture of Gaussian models and Hidden Markov Models. The Hidden Markov Models assume a Gaussian Mixture model (with a variable number of clusters) in each of the states of the HMM. Additionally, the toolbox provides the possibility to have a minimum duration constraint for each of the states (enforcing that the HMM will stay for a certain minimum time duration in the same state). This can help significantly when the observed data is noisy, but the underlying state sequence is not expected to change rapidly.

This toolbox is primarily developed for the segmentation of audio data. The main idea of the segmentation procedure is explained in [1]. This procedure is implemented in the function hmmtimesegment.m. A signal is first oversegmented in segments of equal length. On each initial segment a single Gaussian is fitted. These single Gaussians are used as generating pdf’s for each state in a HMM. Then the HMM states (and the corresponding Gaussian models) are combined into new states. Two states are merged that increase the likelihood of the data the most. The merging continues until the likelihood stops to increase.

[1] Ajmera and Wooters. A robust speaker clustering algorithm. 2003 IEEE Workshop on Automatic Speech Recognition and Understanding, 2003. ASRU’03 pp. 411-416

Categories: voice-analysis

Leave a Reply




You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>