Guide:
  • If you want to have the list of publications issued from a specific Individual Project (IP), write in the search field (IM2.IP). IP can have the following value: DMA, AP, VP, MPR, MCA, HMI, ISD, BMI

  • If you want to find joint publications between IPs, write in the search field (joint), click on search and then click on Keywords

  • If you want to display all the publications for a specific author, use the shortcut called -Authors- located in the main menu
 

Hermansky, H.    

Firstname:H. 
Surname:Hermansky 
Email: 
Institute: 
Homepage: 

45 publications (0 read)

16 Keywords relate to this author

Arithmetic Coding
Audio Coding
Entropy Coding
Frequency Domain Linear Prediction (FDLP)
Huffman Coding
IM2.AP
IM2.AP.MPR
IM2.BMI
IM2.DMA
IM2.MPR
IM2.VP
Joint publication
Report_VI
Report_VII
Report_VIII
Speech coding




Publications as Author



2009

Thomas, S., Ganapathy, S. and Hermansky, H., Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, number Idiap-RR-04-2009, 2009.
 
Motlicek, P., Ganapathy, S. and Hermansky, H., Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
 
Pinto, J. P., Sivaram, G. S. V. S., Hermansky, H. and Magimai-Doss, M., Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, pages 355-362, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
 

2008

Anemuller, J., Back, J. -H., Caputo, B., Havlena, M., Luo, J., Kayser, H., Leibe, B., Motlicek, P., Pajdla, T., Pavel, M., Torii, A., van Gool, L., Zweig, A. and Hermansky, H., The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, in: Proceedings of the International Conference on Multimodal Interfaces, 2008.
 
Pinto, J. P., Szoke, I., Prasanna, S. R. Mahadeva and Hermansky, H., Fast approximate spoken term detection from sequence of phonemes, in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore,, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain, in: INTERSPEECH 2008, Brisbane, Australia, 2008.
 
Motlicek, P., Ganapathy, S., Hermansky, H., Garudadri, H. and Athineos, M., Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, in: Text, Speech and Dialogue, pages 435-442, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain, in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
 
Pinto, J. P., Sivaram, G. S. V. S. and Hermansky, H., Reverse correlation for analyzing mlp posterior features in asr, in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008. [DOI]
 
Thomas, A., Ganapathy, S. and Hermansky, H., Recognition of reverberant speech using frequency domain linear prediction, in: IEEE Signal Processing Letters, 2008.
 
Motlicek, P., Ganapathy, S. and Hermansky, H., Entropy coding of Quantized Spectral Components in FDLP audio codec, number Idiap-RR-71-2008, 2008.
 
Ganapathy, S., Thomas, A. and Hermansky, H., Front-end for far-field speech recognition based on frequency domain linear prediction, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Autoregressive modelling of hilbert envelopes for wide-band audio coding, in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based features for far-field speech recognition, in: MLMI 2008, Utrecht, The Netherlands, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, number Idiap-RR-75-2008, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, number Idiap-RR-74-2008, 2008.
 
Ganapathy, S., Thomas, S. and Hermansky, H., Modulation Frequency Features For Phoneme Recognition In Noisy Speech, in: Journal of Acoustical Society of America - Express Letters, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008. [DOI]
 
Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting Contextual Information for Speech/Non-Speech Detection, in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
 
Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting temporal context for speech/non-speech detection, number Idiap-RR-21-2008, 2008.
 
Valente, F. and Hermansky, H., On the combination of auditory and modulation frequency channels for asr applications, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Pinto, J. P. and Hermansky, H., Combining evidence from a generative and a discriminative model in phoneme recognition, in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Pinto, J. P., Hermansky, H., Yegnanarayana, B. and Magimai-Doss, M., Exploiting contextual information for improved phoneme recognition, in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
 
Valente, F. and Hermansky, H., Hierarchical and parallel processing of modulation spectrum for asr applications, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
 
Weinshall, D., Hermansky, H., Zweig, A., Luo, J., Jimison, H., Ohl, F. and Pavel, M., Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, in: Advances in Neural Information Processing Systems 21, 2008.
 
Sivaram, G. S. V. S. and Hermansky, H., Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition, in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
 
Sivaram, G. S. V. S. and Hermansky, H., Introducing temporal asymmetries in feature extraction for automatic speech recognition, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Parthasarathi, S. H. K. and Hermansky, H., A data-driven approach to speech/non-speech detection, number Idiap-RR-23-2008, 2008.
 

2007

Valente, F., Vepa, J., Plahl, C., Gollan, C., Hermansky, H. and Schlüter, R., Hierarchical neural networks feature extraction for lvcsr system, in: Interspeech 2007, 2007.
 
Prasanna, S. R. Mahadeva, Yegnanarayana, B., Pinto, J. P. and Hermansky, H., Analysis of confusion matrix to combine evidence for phoneme recognition, number 27, 2007.
 
Pinto, J. P., R. M., P., Yegnanarayana, B. and Hermansky, H., Significance of contextual information in phoneme recognition, 2007.
 
Pinto, J. P., Bourlard, H., Graves, A. and Hermansky, H., Comparing different word lattice rescoring approaches towards keyword spotting, number 32, 2007.
 
Lovitt, A., Pinto, J. P. and Hermansky, H., On confusions in a phoneme recognizer, 2007.
 
Motlicek, P., Ganapathy, S., Hermansky, H. and Garudadri, H., Scalable wide-band audio codec based on frequency domain linear prediction, number 16, 2007.
 
Pinto, J. P., Lovitt, A. and Hermansky, H., Exploiting phoneme similarities in hybrid hmm-ann keyword spotting, in: Proceedings of Interspeech, 2007.
 
Valente, F., Vepa, J. and Hermansky, H., Multi-stream features combination based on dempster-shafer rule for lvcsr system, in: Interspeech 2007, 2007.
 
Motlicek, P., Hermansky, H., Ganapathy, S. and Garudadri, H., Frequency domain linear prediction for qmf sub-bands and applications to audio coding, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
 
Motlicek, P., Hermansky, H., Ganapathy, S., Garudadri, H. and Srinivasamurthy, N., Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes, in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
 
Valente, F. and Hermansky, H., Combination of acoustic classifiers based on dempster-shafer theory of evidence, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
 

2006

Motlicek, P., Ullal, V. and Hermansky, H., Wide-band perceptual audio coding based on frequency-domain linear prediction, number 58, 2006.
 
Motlicek, P., Hermansky, H., Garudadri, H. and Srinivasamurthy, N., Audio coding based on long temporal contexts, number 30, 2006.
 
Ketabdar, H. and Hermansky, H., Identifying unexpected words using in-context and out-of-context phoneme posteriors, number 68, 2006.
 
Powered by Agaion