Log-spectral compensation of PMC Gaussian mean vectors for noisy speech recognition using log-max assumption

Reducing mismatch between HMMs trained with clean speech and speech signals recorded under background noise can be approached by distribution adaptation using parallel model combination (PMC). Accurate PMC has no closed-form expression, therefore simplification assumptions must be made in implementation. Under a new log-max assumption, adaptation formula for log-spectral parameters are presented, both for static and dynamic parameters. The system takes the mean vector (41) made up of the static part (41a) and the dynamic part (41b) and the noise vector (43) made up of the static part (43a) and dynamic part (43b) and applies to a decision circuit (45) to determine if the quiet vector plus the gain of speech produced in noise with respect to clean speech is greater than the noisy mean vector and if so the static part is equal to the gain plus the clean speech vector and the dynamic part is the change in the the quiet speech vector and if not greater than the noise vector then the static part equals the noise vector and the dynamic part is zero.
Het verminderen van wanverhouding tussen HMMs die met schone toespraak wordt opgeleid en toespraaksignalen die onder achtergrondgeluiden worden geregistreerd kan door distributieaanpassing zijn genaderd gebruikend parallelle modelcombinatie (PMC). Nauwkeurige PMC heeft geen closed-form uitdrukking, daarom moeten de vereenvoudigingsveronderstellingen in implementatie worden gemaakt. Onder een nieuwe logboek-maximum veronderstelling, wordt de aanpassingsformule voor logboek-spectrale parameters voorgesteld, zowel voor statische als dynamische parameters. Het systeem neemt gemiddelde vector (41) die uit het statische deel (41a) wordt samengesteld en het dynamische deel (41b) en lawaaivector (43) die uit het statische deel (43a) wordt samengesteld en dynamisch deel (43b) en is op een besluitkring (45) van toepassing om te bepalen als de stille vector plus de aanwinst van toespraak die in lawaai met betrekking tot schone toespraak wordt veroorzaakt groter is dan de lawaaierige gemiddelde vector en als zo het statische deel gelijk is aan de aanwinst plus de schone toespraakvector en het dynamische deel de verandering in de stille toespraakvector is en als niet groter dan de lawaaivector het statische deel dan de lawaaivector evenaart het deel is nul.

Web www.patentalert.com

< (none)

< Oxygen monitoring apparatus

> Voice activity detection speech coding to accommodate music signals

> (none)

~ 00086