Abstract
There is a considerable evidence that our perception of sound uses important features which is related to underlying signal modulations. This topic has been studied extensively via perceptual experiments, yet there are few, if any, well-developed signal processing methods which capitalize on or model these effects. We begin by summarizing evidence of the importance of modulation representations from psychophysical, physiological, and other sources. The concept of a two-dimensional joint acoustic and modulation frequency representation is proposed. A
simple single sinusoidal amplitude modulator of a sinusoidal
carrier is then used to illustrate properties of an unconstrained
and ideal joint representation. Added constraints are required to
remove or reduce undesired interference terms and to provide
invertibility. It is then noted that the constraints would also
apply to more general and complex cases of broader modulation and
carriers. Applications in single-channel speaker separation and
in audio coding are used to illustrate the applicability of this
joint representation. Other applications in signal analysis and
filtering are suggested.