


default search action
SAPA@INTERSPEECH 2004: Jeju Island, Korea
- ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing, ICC, Jeju, Korea, October 3, 2004. ISCA 2004

- Futoshi Asano, Hideki Asoh:

Sound source localization and separation based on the EM algorithm. 37 - Matti Ryynänen, Anssi Klapuri:

Modelling of note events for singing transcription. 40 - Stefan Winter, Hiroshi Sawada, Shoko Araki, Shoji Makino:

Hierarchical clustering applied to overcomplete BSS for convolutive mixtures. 48 - Kazuyoshi Yoshii, Masataka Goto, Hiroshi G. Okuno:

Drum sound identification for polyphonic music using template adaptation and matching methods. 51 - Yasunari Obuchi:

Multiple-microphone robust speech recognition using decoder-based channel selection. 52 - Tomohiro Nakatani, Keisuke Kinoshita, Masato Miyoshi, Parham Zolfaghari:

Harmonicity based blind dereverberation with time warping. 53 - Tuomas Virtanen:

Separation of sound sources by convolutive sparse coding. 55 - Guoning Hu, DeLiang Wang:

Auditory segmentation based on event detection. 62 - Plamen J. Prodanov, Andrzej Drygajlo:

Bayesian networks for error handling through multimodality fusion in spoken dialogues with mobile robots. 70 - Werner Hemmert, Marcus Holmberg, David Gelbart:

Auditory-based automatic speech recognition. 74 - Hugo Bastos de Paula, Hani C. Yehia, Mauricio Alves Loureiro:

Representation and classification of the timbre space of a single musical instrument. 86 - Guillaume Lathoud, Iain McCowan:

A sector-based approach for localization of multiple speakers with microphone arrays. 93 - Daniel P. W. Ellis, Keansub Lee:

Features for segmenting and classifying long-duration recordings of "personal" audio. 106 - Chunghsin Yeh, Axel Röbel:

Physical principles driven joint evaluation of multiple f0 hypotheses. 109 - Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano:

MAP estimation of speech spectral component under GGD a priori. 115 - Shigeki Sagayama, Keigo Takahashi, Hirokazu Kameoka, Takuya Nishimoto:

Specmurt anasylis: a piano-roll-visualization of polyphonic music signal by deconvolution of log-frequency spectrum. 128 - Marios Athineos, Hynek Hermansky, Daniel P. W. Ellis:

PLP-squared: autoregressive modeling of auditory-like 2-d spectro-temporal patterns. 129 - Hynek Hermansky:

Stochastic techniques in deriving perceptual knowledge. 136 - Manuel Reyes-Gomez, Nebojsa Jojic, Daniel P. W. Ellis:

Towards single-channel unsupervised source separation of speech mixtures: the layered harmonics/formants separation-tracking model. 137 - John R. Hershey, Trausti T. Kristjansson, Zhengyou Zhang:

Model-based fusion of bone and air sensors for speech enhancement and robust speech recognition. 139 - Aarthi M. Reddy, Bhiksha Raj:

Soft mask estimation for single channel speaker separation. 158 - Paris Smaragdis:

Discovering auditory objects through non-negativity constraints. 161

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














