Thomas, S. , Ganapathy, S. and Hermansky, H. , Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features , number Idiap-RR-04-2009, 2009.
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec , in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
Pinto, J. P. , Sivaram, G. S. V. S. , Hermansky, H. and Magimai-Doss, M. , Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, pages 355-362, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
Anemuller, J. , Back, J. -H. , Caputo, B. , Havlena, M. , Luo, J. , Kayser, H. , Leibe, B. , Motlicek, P. , Pajdla, T. , Pavel, M. , Torii, A. , van Gool, L. , Zweig, A. and Hermansky, H. , The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events , in: Proceedings of the International Conference on Multimodal Interfaces, 2008.
Pinto, J. P. , Szoke, I. , Prasanna, S. R. Mahadeva and Hermansky, H. , Fast approximate spoken term detection from sequence of phonemes , in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore,, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain , in: INTERSPEECH 2008, Brisbane, Australia, 2008.
Motlicek, P. , Ganapathy, S. , Hermansky, H. , Garudadri, H. and Athineos, M. , Perceptually motivated Sub-band Decomposition for FDLP Audio Coding , in: Text, Speech and Dialogue, pages 435-442, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain , in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
Pinto, J. P. , Sivaram, G. S. V. S. and Hermansky, H. , Reverse correlation for analyzing mlp posterior features in asr , in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008. [DOI]
Thomas, A. , Ganapathy, S. and Hermansky, H. , Recognition of reverberant speech using frequency domain linear prediction , in: IEEE Signal Processing Letters, 2008.
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Entropy coding of Quantized Spectral Components in FDLP audio codec , number Idiap-RR-71-2008, 2008.
Ganapathy, S. , Thomas, A. and Hermansky, H. , Front-end for far-field speech recognition based on frequency domain linear prediction , in: Interspeech 2008, Brisbane, Australia, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Autoregressive modelling of hilbert envelopes for wide-band audio coding , in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based features for far-field speech recognition , in: MLMI 2008, Utrecht, The Netherlands, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , number Idiap-RR-75-2008, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION , number Idiap-RR-74-2008, 2008.
Ganapathy, S. , Thomas, S. and Hermansky, H. , Modulation Frequency Features For Phoneme Recognition In Noisy Speech , in: Journal of Acoustical Society of America - Express Letters, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech , in: Interspeech 2008, Brisbane, Australia, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008. [DOI]
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting Contextual Information for Speech/Non-Speech Detection , in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting temporal context for speech/non-speech detection , number Idiap-RR-21-2008, 2008.
Valente, F. and Hermansky, H. , On the combination of auditory and modulation frequency channels for asr applications , in: Interspeech 2008, Brisbane, Australia, 2008.
Pinto, J. P. and Hermansky, H. , Combining evidence from a generative and a discriminative model in phoneme recognition , in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Pinto, J. P. , Hermansky, H. , Yegnanarayana, B. and Magimai-Doss, M. , Exploiting contextual information for improved phoneme recognition , in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
Valente, F. and Hermansky, H. , Hierarchical and parallel processing of modulation spectrum for asr applications , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
Weinshall, D. , Hermansky, H. , Zweig, A. , Luo, J. , Jimison, H. , Ohl, F. and Pavel, M. , Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree , in: Advances in Neural Information Processing Systems 21, 2008.
Sivaram, G. S. V. S. and Hermansky, H. , Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition , in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
Sivaram, G. S. V. S. and Hermansky, H. , Introducing temporal asymmetries in feature extraction for automatic speech recognition , in: Interspeech 2008, Brisbane, Australia, 2008.
Parthasarathi, S. H. K. and Hermansky, H. , A data-driven approach to speech/non-speech detection , number Idiap-RR-23-2008, 2008.
Valente, F. , Vepa, J. , Plahl, C. , Gollan, C. , Hermansky, H. and Schlüter, R. , Hierarchical neural networks feature extraction for lvcsr system , in: Interspeech 2007, 2007.
Prasanna, S. R. Mahadeva , Yegnanarayana, B. , Pinto, J. P. and Hermansky, H. , Analysis of confusion matrix to combine evidence for phoneme recognition , number 27, 2007.
Pinto, J. P. , R. M., P. , Yegnanarayana, B. and Hermansky, H. , Significance of contextual information in phoneme recognition , 2007.
Pinto, J. P. , Bourlard, H. , Graves, A. and Hermansky, H. , Comparing different word lattice rescoring approaches towards keyword spotting , number 32, 2007.
Lovitt, A. , Pinto, J. P. and Hermansky, H. , On confusions in a phoneme recognizer , 2007.
Motlicek, P. , Ganapathy, S. , Hermansky, H. and Garudadri, H. , Scalable wide-band audio codec based on frequency domain linear prediction , number 16, 2007.
Pinto, J. P. , Lovitt, A. and Hermansky, H. , Exploiting phoneme similarities in hybrid hmm-ann keyword spotting , in: Proceedings of Interspeech, 2007.
Valente, F. , Vepa, J. and Hermansky, H. , Multi-stream features combination based on dempster-shafer rule for lvcsr system , in: Interspeech 2007, 2007.
Motlicek, P. , Hermansky, H. , Ganapathy, S. and Garudadri, H. , Frequency domain linear prediction for qmf sub-bands and applications to audio coding , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
Motlicek, P. , Hermansky, H. , Ganapathy, S. , Garudadri, H. and Srinivasamurthy, N. , Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
Valente, F. and Hermansky, H. , Combination of acoustic classifiers based on dempster-shafer theory of evidence , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
Motlicek, P. , Ullal, V. and Hermansky, H. , Wide-band perceptual audio coding based on frequency-domain linear prediction , number 58, 2006.
Motlicek, P. , Hermansky, H. , Garudadri, H. and Srinivasamurthy, N. , Audio coding based on long temporal contexts , number 30, 2006.
Ketabdar, H. and Hermansky, H. , Identifying unexpected words using in-context and out-of-context phoneme posteriors , number 68, 2006.
Powered by Agaion