Guide:
  • If you want to have the list of publications issued from a specific Individual Project (IP), write in the search field (IM2.IP). IP can have the following value: DMA, AP, VP, MPR, MCA, HMI, ISD, BMI

  • If you want to find joint publications between IPs, write in the search field (joint), click on search and then click on Keywords

  • If you want to display all the publications for a specific author, use the shortcut called -Authors- located in the main menu
 

Keywords (234)
Mesot, B. and Barber, D., A bayesian switching linear dynamical system for scale-invariant robust speech extraction, 2007.
 
Keywords:Report_VII, IM2.AP

Paiement, J. -F., Grandvalet, Y., Bengio, S. and Eck, D., A Distance Model for Rhythms, in: 25th International Conference on Machine Learning (ICML), 2008.
 
Keywords:IM2.AP, Report_VIII

Huang, Y., Vinyals, O., Friedland, G., Müller, C., Mirghafori, N. and Wooters, C., A Fast-Match approach for robust, faster than real-time Speaker Diarization, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Keywords:Report_VII, IM2.AP

Mesot, B. and Barber, D., A gaussian sum smoother for inference in switching linear dynamical systems, 2007.
 
Keywords:Report_VII, IM2.AP

Cheng, O., Dines, J. and Magimai-Doss, M., A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition, number 62, 2006.
 
Keywords:Report_VI, IM2.AP

Gillick, D., Riedhammer, K., Favre, B. and Hakkani-Tur, D., A global optimization framework for meeting summarization, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
 
Keywords:IM2.AP, Report_VIII

Vinyals, O. and Friedland, G., A hardware-independent fast logarithm approximation with adjustable accuracy, in: 10th IEEE International Symposium on Multimedia, Berkeley, CA, USA, pages 61-65, 2008.
 
Keywords:IM2.AP, Report_VIII

Mariéthoz, J. and Bengio, S., A kernel trick for sequences applied to text-independent speaker verification systems, in: Pattern Recognition, volume 40, number 8, ISSN 0031-3203, 2007.
 
Keywords:IM2.AP, Report_VI

Keshet, J. and Chazan, D., A Kernel Wrapper for Phoneme Sequence Recognition, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keywords:IM2.AP, Report_VIII

Keshet, J., Shalev-Shwartz, S., Singer, Y. and Chazan, D., A Large Margin Algorithm for Forced Alignment, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keywords:IM2.AP, Report_VIII

Garner, P. N., A MAP Approach to Noise Compensation of Speech, number Idiap-RR-08-2009, 2009.
 
Keywords:IM2.AP, Report_VIII

Li, W., Kumatani, K., Dines, J., Magimai-Doss, M. and Bourlard, H., A neural network based regression approach for recogninizing simultaneous speech, in: Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
 
Keywords:Report_VII, IM2.AP

Li, W., Kumatani, K., Dines, J., Magimai-Doss, M. and Bourlard, H., A neural network based regression approach for recognizing simultaneous speech, number Idiap-RR-10-2008, 2008.
 
Keywords:Report_VII,IM2.AP

Valente, F., A Novel Criterion for Classifiers Combination in Multistream Speech Recognition, in: IEEE Signal Processing Letters, volume 16, number 7, pages 561-564, ISSN 1070-9908, 2009. [DOI]
 
Keywords:IM2.AP, Report_VIII

Keshet, J., A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keywords:IM2.AP, Report_VIII

Dines, J. and Magimai-Doss, M., A study of phoneme and grapheme based context-dependent asr systems, number 12, 2007.
 
Keywords:Report_VI, IM2.AP, major

Garner, P. N., A weighted finite state transducer tutorial, number Idiap-Com-03-2008, 2008.
 
Keywords:IM2.AP, Report_VII

Anguera, X., Wooters, C. and Hernando, J., Acoustic Beamforming for Speaker Diarization of Meetings, in: to appear in IEEE Transactions on Audio, Speech and Language Processing, 2007.
 
Keywords:Report_VI, IM2.AP

Aradilla, G., Acoustic models for posterior features in speech recognition, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Keywords:IM2.AP, Report_VII

Kumatani, K., McDonough, J., Klakow, D., Garner, P. N. and Li, W., Adaptive beamforming with a maximum negentropy criterion,, in: The Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2008.
 
Keywords:Report_VII, IM2.AP

Kumatani, K., Mayer, H., Gehrig, T., Stoimenov, E., McDonough, J. and Wölfel, M., Adaptive beamforming with a minimum mutual information criterion, pages 2527--2541, 2007. [DOI]
 
Keywords:IM2.AP, Report_VII

Valente, F., Bourlard, H. and Deepu, V., Agglomerative information bottleneck for speaker diarization of meetings data, number 31, 2007.
 
Keywords:Report_VI, IM2.AP

Aradilla, G., Vepa, J. and Bourlard, H., An acoustic model based on kullback-leibler divergence for posterior features, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
 
Keywords:Report_VI, IM2.AP

Cuendet, S., Shriberg, E., Favre, B., Fung, J. and Hakkani-Tur, D., An analysis of sentence segmentation features for broadcast news, broadcast conversations, and meetings, in: SIGIR Workshop on Searching Conversational Spontaneous Speech, 2007.
 
Keywords:Report_VII, IM2.AP

Cetin, O., Kantor, A., King, S., Bartels, C., Magimai-Doss, M., Frankel, J. and Livescu, K., An Articulatory Feature-based Tandem Approach and Factored Observation Modeling, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

Kaufmann, T. and Pfister, B., An HPSG parser supporting discontinuous licenser rules, in: International Conference on HPSG, 2007.
 
Keywords:Report_VI, IM2.AP

Vijayasenan, D., Valente, F. and Bourlard, H., An Information Theoretic Approach to Speaker Diarization of Meeting Data, in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 7, pages 1382-1393, 2009. [DOI]
 
Keywords:IM2.AP, Report_VIII

Kamangar, K., Hakkani-Tur, D., Tur, G. and Levit, M., An iterative unsupervised learning method for information distillation, in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
 
Keywords:Report_VII, IM2.AP

Prasanna, S. R. Mahadeva, Yegnanarayana, B., Pinto, J. P. and Hermansky, H., Analysis of confusion matrix to combine evidence for phoneme recognition, number 27, 2007.
 
Keywords:IM2.AP, Report_VII

Kaufmann, T. and Pfister, B., Applying licenser rules to a grammar with continuous constituents, in: The Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, 2007.
 
Keywords:Report_VII, IM2.AP

Motlicek, P., Ganapathy, S. and Hermansky, H., Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
 
Keywords:Arithmetic Coding, Audio Coding, Entropy Coding, Frequency Domain Linear Prediction (FDLP), Huffman Coding, IM2.AP, Report_VIII

Livescu, K., Cetin, O., Hasegawa-Johnson, M., King, S., Bartels, C., Borges, N., Kantor, A., Lal, P., Yung, L., Bezman, A., Dawson-Haggerty, S., Woods, B., Frankel, J., Magimai-Doss, M. and Saenko, K., Articulatory Feature-based Methods for Acoustic and Audio-visual speech Recognition: Summary from the 2006 JHU Summer Workshop, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

A. Peregoudov, , Vinciarelli, A. and Bourlard, H., Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, number 56, 2006.
 
Keywords:Report_VI, IM2.AP.MCA, joint publication

Motlicek, P., Hermansky, H., Garudadri, H. and Srinivasamurthy, N., Audio coding based on long temporal contexts, number 30, 2006.
 
Keywords:Report_VI, IM2.AP

Ullal, V. and Motlicek, P., Audio coding based on long temporal segments: experiments with quantization of excitation signal, number 46, 2006.
 
Keywords:Report_VI, IM2.AP

Cuendet, S., Hakkani-Tur, D. and Shriberg, E., Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech, in: to appear in Proceedings of MLMI, Brno, Czech Republic, 2007.
 
Keywords:Report_VI, IM2.AP

Knox, M. and Mirghafori, N., Automatic Laughter Detection Using Neural Networks, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Keywords:Report_VI, IM2.AP

Motlicek, P., Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, in: 10thAnnual Conference of the International Speech Communication Association, pages 1215-1218, ISCA, Brighton, England, 2009.
 
Keywords:IM2.AP, Report_VIII

Motlicek, P., Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, in: 10thAnnual Conference of the International Speech Communication Association, ISCA, 2009.
 
Keywords:IM2.AP,Report_VIII

Keshet, J. and Bengio, S., Automatic speech and speaker recognition: large margin and kernel methods, John Wiley & Sons, 2008.
 
Keywords:IM2.AP, Report_VII

Anguera, X., Wooters, C., Pardo, J. M. and Hernando, J., Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Autoregressive modelling of hilbert envelopes for wide-band audio coding, in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
 
Keywords:IM2.AP, Report_VII

Kumatani, K., McDonough, J., Rauch, B., Klakow, D., Garner, P. N. and Li, W., Beamforming with a Maximum Negentropy Criterion, in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 5, pages 994-1008, 2008.
 
Keywords:IM2.AP, Report_VIII

Orabona, F., Keshet, J. and Caputo, B., Bounded kernel-based perceptrons, in: Journal of Machine Learning Research, volume Accepted for pub, 2009.
 
Keywords:IM2.AP, Report_VIII

Vinciarelli, A. and Favre, S., Broadcast news story segmentation using social network analysis and hidden markov models, in: ACM International Conference on Multimedia, pages 261-264, 2007.
 
Keywords:Report_VI, IM2.AP.MPR, joint publication

Hwang, M. -Y., Peng, G., Wang, W., Faria, A., Heidel, A. and Ostendorf, M., Building a Highly Accurate Mandarin Speech Recognizer, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Keywords:Report_VII, IM2.AP

Garg, N., Favre, B., Riedhammer, K. and Hakkani-Tur, D., Clusterrank: a graph based method for meeting summarization, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Keywords:IM2.AP, Report_VIII

Valente, F. and Hermansky, H., Combination of acoustic classifiers based on dempster-shafer theory of evidence, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
 
Keywords:Report_VI, IM2.AP

Vijayasenan, D., Valente, F. and Bourlard, H., Combination of agglomerative and sequential clustering for speaker diarization, in: International Conference on Acoustics, Speech and Signal Processing, 2008.
 
Keywords:Report_VII, IM2.AP

Zheng, J., Cetin, O., Hwang, M. -Y., Lei, X., Stolcke, A. and Morgan, N., Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition, in: Proc. ICASSP, Honolulu., 2007.
 
Keywords:Report_VI, IM2.AP

Pinto, J. P. and Hermansky, H., Combining evidence from a generative and a discriminative model in phoneme recognition, in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:IM2.AP, Report_VII

Müller, C. and Burkhardt, F., Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Keywords:Report_VI, IM2.AP

Pinto, J. P., Bourlard, H., Graves, A. and Hermansky, H., Comparing different word lattice rescoring approaches towards keyword spotting, number 32, 2007.
 
Keywords:IM2.AP, Report_VII

Liu, Y. and Shriberg, E., Comparing Evaluation Metrics for Sentence Boundary Detection, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

Faria, A. and Morgan, N., Corrected Tandem Features for Acoustic Model Training, in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
 
Keywords:Report_VII, IM2.AP

Faria, A. and Morgan, N., Corrected tandem features for acoustic model training, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Keywords:Report_VII, IM2.AP

Lovitt, A., Correcting confusion matrices for phone recognizers, number 03, 2007.
 
Keywords:Report_VI, IM2.AP

Guz, U., Cuendet, S., Hakkani-Tur, D. and Tur, G., Co-training Using Prosodic and Lexical Information for Sentence Segmentation, in: to appear in Proceedings of Interspeech, Antwerp, 2007.
 
Keywords:Report_VI, IM2.AP

Cuendet, S., Hakkani-Tur, D., Shriberg, E., Fung, J. and Favre, B., Cross-Genre Feature Comparisons for Spoken Sentence Segmentation, in: International Conference on Semantic Computing (ICSC), Irvine, CA, 2007.
 
Keywords:Report_VII, IM2.AP

Singla, A. and Hakkani-Tur, D., Cross-lingual sentence extraction for information distillation, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:Report_VII, IM2.AP

Aradilla, G. and Ajmera, J., Detection and recognition of number sequences within spoken utterances, in: 2nd Workshop on Speech in Mobile and Pervasive Environments, 2007.
 
Keywords:Report_VII, IM2.AP

Vergyri, D., Mandal, A., Wang, W., Stolcke, A., Zheng, J., Graciarena, M., Rybach, D., Gollan, C., Schlater, R., Kirchoff, K., Faria, A. and Morgan, N., Development of the sri/nightingale arabic asr system, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:Report_VII, IM2.AP

Vergyri, D., Mandal, A., Wang, W., Stolcke, A., Zheng, J., Graciarena, M., Rybach, D., Gollan, C., Schlater, R., Kirchoff, K., Faria, A. and Morgan, N., Development of the sri/nightingale arabic asr system, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 1437-1440, 2008.
 
Keywords:IM2.AP, Report_VIII

Dines, J. and Vepa, J., Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, number 13, 2007.
 
Keywords:Report_VI, IM2.AP

Grangier, D., Keshet, J. and Bengio, S., Discriminative Keyword Spotting, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keywords:IM2.AP, Report_VIII

Keshet, J., Grangier, D. and Bengio, S., Discriminative Keyword Spotting, in: Speech Communication, volume 51, number 4, pages 317-329, 2009.
 
Keywords:IM2.AP, Report_VIII

Mariéthoz, J., Discrmininant models for text-independent speaker verification, number 70, 2006.
 
Keywords:Report_VI, IM2.AP

Li, W., Effective post-processing for single-channel frequency-domain speech enhancement, pages 149-152, 2008. [DOI]
 
Keywords:IM2.AP, Report_VII

Li, W., Effective post-processing of single-channel frequency-domain speech enhancement, in: IEEE conference on multimedia and expo, 2008.
 
Keywords:Report_VII, IM2.AP

Sivaram, G. S. V. S. and Hermansky, H., Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition, in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
 
Keywords:IM2.AP, Report_VII

Ketabdar, H. and Bourlard, H., Enhanced phone posteriors for improving speech recognition systems, number Idiap-RR-39-2008, 2008.
 
Keywords:IM2.AP, Report_VII

Ketabdar, H., Enhancing posterior based speech recognition systems, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Keywords:IM2.AP, Report_VIII

Magimai-Doss, M., Hakkani-Tur, D., Cetin, O., Shriberg, E., Fung, J. and Mirghafori, N., Entropy Based Classifier Combination for Sentence Segmentation,, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

Motlicek, P., Ganapathy, S. and Hermansky, H., Entropy coding of Quantized Spectral Components in FDLP audio codec, number Idiap-RR-71-2008, 2008.
 
Keywords:IM2.AP, Report_VIII

Hung, H., Huang, Y., Friedland, G. and Gatica-Perez, D., Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Keywords:Report_VII, IM2.AP

Pinto, J. P., Hermansky, H., Yegnanarayana, B. and Magimai-Doss, M., Exploiting contextual information for improved phoneme recognition, in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
 
Keywords:IM2.AP, Report_VII

Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting Contextual Information for Speech/Non-Speech Detection, in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
 
Keywords:IM2.AP, Report_VIII

Pinto, J. P., Lovitt, A. and Hermansky, H., Exploiting phoneme similarities in hybrid hmm-ann keyword spotting, in: Proceedings of Interspeech, 2007.
 
Keywords:Report_VI, IM2.AP

Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting temporal context for speech/non-speech detection, number Idiap-RR-21-2008, 2008.
 
Keywords:IM2.AP,Report_VII

Pinto, J. P., Szoke, I., Prasanna, S. R. Mahadeva and Hermansky, H., Fast approximate spoken term detection from sequence of phonemes, in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore,, 2008.
 
Keywords:IM2.AP, Report_VII

Kumatani, K., McDonough, J., Schacht, S., Klakow, D., Garner, P. N. and Li, W., Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming, in: International Conferance on Acoustics Speech and Signal Processing, 2008.
 
Keywords:Report_VII, IM2.AP

Kumatani, K., McDonough, J., Schacht, S., Klakow, D., Garner, P. N. and Li, W., Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, number Idiap-RR-02-2008, 2008.
 
Keywords:IM2.AP, Report_VIII

Huijbregts, M., Wooters, C. and Ordelman, R., Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections, in: to appear in Proceedings of Interspeech, Antwerp, 2007.
 
Keywords:Report_VI, IM2.AP

Motlicek, P., Hermansky, H., Ganapathy, S. and Garudadri, H., Frequency domain linear prediction for qmf sub-bands and applications to audio coding, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
 
Keywords:IM2.AP, Report_VI

Ganapathy, S., Thomas, A. and Hermansky, H., Front-end for far-field speech recognition based on frequency domain linear prediction, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:IM2.AP, Report_VII

Friedland, G., Vinyals, O., Huang, Y. and Muller, C., Fusion of short-term and long-term features for improved speaker diarization, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4077-4080, 2009.
 
Keywords:IM2.AP, Report_VIII

Knox, M., Morgan, N. and Mirghafori, N., Getting the last laugh: automatic laughter segmentation in meetings, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 797-800, 2008.
 
Keywords:IM2.AP, Report_VIII

Knox, M., Morgan, N. and Mirghafori, N., Getting the last laugh: automatic laughter segmentation in meetings, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:Report_VII, IM2.AP

Valente, F. and Hermansky, H., Hierarchical and parallel processing of modulation spectrum for asr applications, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
 
Keywords:IM2.AP, Report_VII

Ketabdar, H. and Bourlard, H., Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Keywords:Report_VII, IM2.AP

Valente, F., Vepa, J., Plahl, C., Gollan, C., Hermansky, H. and Schlüter, R., Hierarchical neural networks feature extraction for lvcsr system, in: Interspeech 2007, 2007.
 
Keywords:Report_VI, IM2.AP

Valente, F., Magimai-Doss, M., Plahl, C. and Suman, R., Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009.
 
Keywords:speech recognition, TANDEM features, IM2.AP, Report_VIII

Shriberg, E., Higher level features in speaker recognition, in: in C. Muller (Ed.) Speaker Classification I. Springer-Verlag, New York, 2008.
 
Keywords:Report_VII, IM2.AP

Shriberg, E., Higher level features in speaker recognition, in: Speaker Classification I, Lecture Notes in Computer Science, Springer, 2007.
 
Keywords:Report_VII, IM2.AP

Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based features for far-field speech recognition, in: MLMI 2008, Utrecht, The Netherlands, 2008.
 
Keywords:IM2.AP, Report_VII

Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:IM2.AP, Report_VII

Gelbart, D., Morgan, N. and Tsymbal, A., Hill-climbing feature selection for multi-stream asr, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Keywords:IM2.AP, Report_VIII

Dutoit, T., Couvreur, L. and Bourlard, H., How does a dictation machine recognize speech ?, in: Applied Signal Processing--A MATLAB approach, pages 104-148, Springer MA, 2008.
 
Keywords:IM2.AP, Report_VIII

Plauché, M., Cetin, O. and Uhdaykumar, N., How to build a spoken dialog system with limited (or no) resources, in: AI in ICT for Development Workshop of the Twentieth Intl. Joint Conf. on AI, Hyderabad, India, 2007.
 
Keywords:Report_VI, IM2.AP

Ketabdar, H. and Hermansky, H., Identifying unexpected words using in-context and out-of-context phoneme posteriors, number 68, 2006.
 
Keywords:Report_VI, IM2.AP

Hillard, D., Huang, Z., Ji, H., Grishman, R., Hakkani-Tur, D., Harper, M., Ostendorf, M. and Wang, W., Impact of Automatic Comma Prediction on POS/Name Tagging of Speech, in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
 
Keywords:Report_VI, IM2.AP

Picart, B., Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, number Idiap-RR-18-2009, 2009.
 
Keywords:IM2.AP, Report_VIII

Ketabdar, H. and Bourlard, H., In-context phone posteriors as complementary features for tandem asr, in: ICSLP'08, Brisbane, Australia,, 2008.
 
Keywords:IM2.AP, Report_VII

Mesot, B., Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Keywords:Report_VII,IM2.AP

Levit, M., Hakkani-Tur, D., Tur, G. and Gillick, D., Integrating Several Annotation Layers for Statistical Information Distillation, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Keywords:Report_VII, IM2.AP

Levit, M., Hakkani-Tur, D., Tur, G. and Gillick, D., Integrating several annotation layers for statistical information distillation, in: Workshop on Automatic Speech Recognition and Understanding, 2007.
 
Keywords:Report_VII, IM2.AP

Vijayasenan, D., Valente, F. and Bourlard, H., Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, in: Interspeech 2008, 2008.
 
Keywords:IM2.AP, Report_VIII

Sivaram, G. S. V. S. and Hermansky, H., Introducing temporal asymmetries in feature extraction for automatic speech recognition, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:IM2.AP, Report_VII

Parthasarathi, S. H. K., Magimai-Doss, M., Bourlard, H. and Gatica-Perez, D., Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, in: Proceedings of Interspeech 2009, 2009.
 
Keywords:IM2.AP, IM2.MCA, Report_VIII

Mariéthoz, J., Bengio, S. and Grandvalet, Y., Kernel Based Text-Independnent Speaker Verification, number Idiap-RR-68-2008, 2008.
 
Keywords:IM2.AP, Report_VIII

Zacharie, D. G. and Pinto, J. P., Keyword spotting on word lattices, number 22, 2007.
 
Keywords:Report_VI, IM2.AP

Vijayasenan, D., Valente, F. and Bourlard, H., KL Realignment for Speaker Diarization with Multiple Feature Streams, in: 10th Annual Conference of the International Speech Communication Association, 2009.
 
Keywords:IM2.AP, Report_VIII

Andreani, G., Di Fabbrizio, G., Gilbert, M., Gillick, D., Hakkani-Tur, D. and Lemon, O., Lets DiSCoH: Collecting an Annotated Open Corpus with Dialog Acts and Reward Signals for Natural Language Helpdesks, in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
 
Keywords:Report_VI, IM2.AP

Xie, S., Favre, B., Hakkani-Tur, D. and Liu, Y., Leveraging sentence weights in a concept-based optimization framework for extractive meeting summarization, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Keywords:IM2.AP, Report_VIII

Friedland, G. and Vinyals, O., Live speaker identification in conversations, in: ACM Multimedia 2008, Vancouver, Canada, pages 1017-1018, 2008.
 
Keywords:IM2.AP, Report_VIII

Vinyals, O. and Friedland, G., Live speaker identification in meetings: "who is speaking now?", in: Technical Report TR-08-001, International Computer Science Institute, Berkeley, CA, 2008.
 
Keywords:Report_VII, IM2.AP

Ganapathy, S., Motlicek, P. and Hermansky, H., Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, number Idiap-RR-75-2008, 2008.
 
Keywords:IM2.AP,Report_VIII

Grangier, D., Machine Learning for Information Retrieval, École Polytechnique Fédérale de Lausanne, 2008.
 
Keywords:discriminative learning, image retrieval, Information Retrieval, learning to rank, machine learning, online learning, spoken keyword spotting, text retrieval, IM2.AP,Report_VIII

Livescu, K., Bezman, A., Borges, N., Yung, L., Cetin, O., Frankel, J., King, S., Magimai-Doss, M., Chi, X. and Lavoie, L., Manual Transcription of Conversational Speech at the Articulatory Feature Level, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

Hemptinne, C., Master thesis: integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts), number 69, 2006.
 
Keywords:Report_VI, IM2.AP

Kumatani, K., McDonough, J., Rauch, B., Garner, P. N., Li, W. and Dines, J., Maximum kurtosis beamforming with the generalized sidelobe canceller, in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2009.
 
Keywords:IM2.AP, Report_VIII

Kumatani, K., McDonough, J., Klakow, D., Garner, P. N. and Li, W., Maximum negentropy beamforming, number Idiap-RR-07-2008, 2008.
 
Keywords:Report_VII, IM2.AP

Dines, J., Yamagishi, J. and King, S., Measuring the gap between HMM-based ASR and TTS, in: Proceedings of Interspeech, Brighton, U.K., 2009.
 
Keywords:speech recognition, speech synthesis, unified models, IM2.AP,Report_VIII

Kumatani, K., Mayer, H., Gehrig, T., Stoimenov, E., McDonough, J. and Wölfel, M., Minimum mutual information beamforming for simultaneous active speakers, in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 71-76, Kyoto, 2007. [DOI]
 
Keywords:IM2.AP, Report_VII

Li, W., Doss, M. M., Dines, J. and Bourlard, H., Mlp-based log spectral energy mapping for robust overlapping speech recognition, in: European Signal Processing Conference, 2008.
 
Keywords:Report_VII, IM2.AP

Tur, G., Guz, U. and Hakkani-Tur, D., Model Adaptation for Dialog Act Tagging, in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
 
Keywords:Report_VI, IM2.AP

Cuendet, S., Hakkani-Tur, D. and Tur, G., Model Adaptation for Sentence Segmentation from Speech, in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
 
Keywords:Report_VI, IM2.AP

Cuendet, S., Model adaptation for sentence unit segmentation from speech, number 64, 2006.
 
Keywords:Report_VI, IM2.AP

Anguera, X., Shinozaki, T., Wooters, C. and Hernando, J., Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

Ganapathy, S., Motlicek, P. and Hermansky, H., MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, number Idiap-RR-74-2008, 2008.
 
Keywords:IM2.AP, Report_VIII

Ganapathy, S., Thomas, S. and Hermansky, H., Modulation Frequency Features For Phoneme Recognition In Noisy Speech, in: Journal of Acoustical Society of America - Express Letters, 2008.
 
Keywords:IM2.AP, Report_VIII

Vinyals, O. and Friedland, G., Modulation spectrogram features for speaker diarization, in: Interspeech 2008, Brisbane, Australia, pages 630-633, 2008.
 
Keywords:IM2.AP, Report_VIII

Vinyals, O. and Friedland, G., Modulation spectrogram features for speaker diarization, in: to appear in proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:Report_VII, IM2.AP

Friedland, G., Hung, H. and Yeo, C., Multi-modal speaker diarization of real-world meetings using compressed-domain video features, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4069-4072, 2009.
 
Keywords:IM2.AP, Report_VIII

Valente, F., Vepa, J. and Hermansky, H., Multi-stream features combination based on dempster-shafer rule for lvcsr system, in: Interspeech 2007, 2007.
 
Keywords:Report_VI, IM2.AP.MPR, joint publication

Zhao, S. Y. and Morgan, N., Multi-stream spectro-temporal features for robust speech recognition, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 898-901, 2008.
 
Keywords:IM2.AP, Report_VIII

Zhao, S. and Morgan, N., Multi-stream spectro-temporal features for robust speech recognition, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:Report_VII, IM2.AP

Zhao, S. Y., Ravuri, R. and Morgan, N., Multi-stream to many-stream: using spectro-temporal features for asr, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Keywords:IM2.AP, Report_VIII

Vijayasenan, D., Valente, F. and Bourlard, H., Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, in: Proceedings of International conference on acoustics speech and signal processing, 2009.
 
Keywords:IM2.AP, Report_VIII

Vijayasenan, D., Valente, F. and Bourlard, H., MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009.
 
Keywords:IM2.AP, Report_VIII

Stoyanchev, S., Tur, G. and Hakkani-Tur, D., Name-aware speech recognition for interactive question answering, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Keywords:Report_VII, IM2.AP

Li, W., Dines, J., Magimai-Doss, M. and Bourlard, H., Neural network based regression for robust overlapping speech recognition using microphone arrays, in: Interspeech, 2008.
 
Keywords:Report_VII, IM2.AP

Li, W., Dines, J., Magimai-Doss, M. and Bourlard, H., Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
 
Keywords:binary masking, microphone array, neural network, overlapping speech recognition, speech separation, IM2.AP, Report_VIII

Li, W. and Bourlard, H., Non-linear spectral stretching for in-car speech recognition, in: Interspeech, 2007.
 
Keywords:Report_VII, IM2.AP

Motlicek, P., Hermansky, H., Ganapathy, S., Garudadri, H. and Srinivasamurthy, N., Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes, in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
 
Keywords:IM2.AP, Report_VII

Imseng, D., Novel initialization methods for Speaker Diarization, number Idiap-RR-07-2009, 2009.
 
Keywords:IM2.AP, Report_VIII

Luo, J., Caputo, B., Zweig, A., Back, J. -H. and Anemuller, J., Object category detection using audio-visual cues, in: International Conference on Computer Vision Systems (ICVS08), 2008.
 
Keywords:IM2.AP, Report_VII

Lathoud, G., Observations on multi-band asynchrony in distant speech recordings, number 74, 2006.
 
Keywords:Report_VI, IM2.AP

Lovitt, A., Pinto, J. P. and Hermansky, H., On confusions in a phoneme recognizer, 2007.
 
Keywords:Report_VI, IM2.AP

Magimai-Doss, M., Aradilla, G. and Bourlard, H., On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, number Idiap-RR-24-2009, 2009.
 
Keywords:IM2.AP, Report_VIII

Valente, F. and Hermansky, H., On the combination of auditory and modulation frequency channels for asr applications, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:IM2.AP, Report_VII

Scaringella, N., On the design of audio features robust to the album-effect for music information retrieval., Ecole Polytechnique Fédérale de Lausanne, 2009.
 
Keywords:channel normalization, machine learning, music information retrieval, neural networks, rhythm, timbre, IM2.AP,Report_VIII

Gottlieb, L. and Friedland, G., On the use of artificial conversation data for speaker recognition in cars, in: IEEE International Conference for Semantic Computing, Berkeley, USA, 2009.
 
Keywords:IM2.AP, Report_VIII

Boakye, K., Trueba-Hornero, B., Vinyals, O. and Friedland, G., Overlapped speech detection for improved speaker diarization in multiparty meetings, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Keywords:Report_VII, IM2.AP

Riedhammer, K., Gillick, D., Favre, B. and Hakkani-Tur, D., Packing the meeting summarization knapsack, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:Report_VII, IM2.AP

Gerber, M., Kaufmann, T. and Pfister, B., Perceptron-based class verification, in: Proceedings of NOLISP (ISCA Workshop on non linear speech processing), 2007.
 
Keywords:Report_VI, IM2.AP

Thomas, S., Ganapathy, S. and Hermansky, H., Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, number Idiap-RR-04-2009, 2009.
 
Keywords:IM2.AP, Report_VIII

Aradilla, G., Bourlard, H. and Magimai-Doss, M., Posterior features applied to speech recognition tasks with limited training data, number Idiap-RR-15-2008, 2008.
 
Keywords:IM2.AP, Report_VII

Aradilla, G., Bourlard, H. and Magimai-Doss, M., Posterior features applied to speech recognition tasks with user-defined vocabulary, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
 
Keywords:IM2.AP, Report_VIII

Aradilla, G. and Bourlard, H., Posterior-based features and distances in template matching for speech recognition, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 204-214, 2007. [DOI]
 
Keywords:Report_VII, IM2.AP

Paiement, J. -F., Grandvalet, Y. and Bengio, S., Predictive Models for Music, number Idiap-RR-51-2008, 2008.
 
Keywords:IM2.AP, Report_VIII

Paiement, J. -F., Bengio, S. and Eck, D., Probabilistic Models for Melodic Prediction, number Idiap-RR-50-2008, 2008.
 
Keywords:IM2.AP, Report_VIII

Paiement, J. -F., Probabilistic models for music, École Polytechnique Fédérale de Lausanne, 2008.
 
Keywords:chord progressions, generative models, machine learning, melodies, music, probabilistic models, IM2.AP, Report_VIII

Friedland, G., Vinyals, O., Huang, Y. and Muller, C., Prosodic and other long-term features for speaker diarization, in: IEEE Transactions on Audio, Speech and Language Processing, volume 17, number 5, pages 985-993, 2009.
 
Keywords:IM2.AP, Report_VIII

Favre, B., Grishman, R., Hillard, D., Ji, H., Hakkani-Tur, D. and Ostendorf, M., Punctuating speech for information extraction, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Keywords:Report_VII, IM2.AP

Gerber, M., Beutler, R. and Pfister, B., Quasi text-independent speaker verification based on pattern matching, in: Proceedings of Interspeech, ISCA, 2007.
 
Keywords:Report_VI, IM2.AP

Garner, P. N., Dines, J., Hain, T., El Hannani, A., Karafiat, M., Korchagin, D., Lincoln, M., Wan, V. and Zhang, L., Real-Time ASR from Meetings, in: Proceedings of Interspeech, Brighton, UK., 2009.
 
Keywords:IM2.AP, Report_VIII

Bourlard, H. and Renals, S., Recognition and understanding of meetings overview of the european ami and amida projects, in: LangTech 2008, Rome, 2008.
 
Keywords:IM2.AP, Report_VII

Thomas, A., Ganapathy, S. and Hermansky, H., Recognition of reverberant speech using frequency domain linear prediction, in: IEEE Signal Processing Letters, 2008.
 
Keywords:IM2.AP, Report_VII

Baker, J., Deng, L., Glass, J., Khudanpur, S., Lee, C. -H., Morgan, N. and O'Shgughnessy, D., Research developments and directions in speech recognition and understanding, in: IEEE Signal Processing Magazine, volume 26, number 3, pages 75-80, 2009.
 
Keywords:IM2.AP, Report_VIII

Baker, J., Deng, L., Glass, J., Khudanpur, S., Lee, C. -H., Morgan, N. and O'Shgughnessy, D., Research developments and directions in speech recognition and understanding, in: IEEE Signal Processing Magazine, volume 26, number 4, pages 78-85, 2009.
 
Keywords:IM2.AP, Report_VIII

Pinto, J. P., Sivaram, G. S. V. S. and Hermansky, H., Reverse correlation for analyzing mlp posterior features in asr, in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008. [DOI]
 
Keywords:IM2.AP, Report_VII

Vinyals, O., Friedland, G. and Mirghafori, N., Revisiting a basic function on current CPUs: A fast logarithm implementation with adjustable accuracy, in: ICSI Technical Report number TR-07-002, 2007.
 
Keywords:Report_VII, IM2.AP

Huang, Y., Robust and rapid speaker diarization, in: Master Thesis, University of California, Berkeley, 2007.
 
Keywords:Report_VII, IM2.AP

Wöllmer, M., Eyben, F., Keshet, J., Graves, A., Schuller, B. and Rigoll, G., Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009.
 
Keywords:IM2.AP, Report_VIII

Li, W., Dines, J. and Magimai-Doss, M., Robust overlapping speech recognition based on neural networks, number Idiap-RR-55-2007, 2007.
 
Keywords:IM2.AP, Report_VII

Imseng, D. and Friedland, G., Robust Speaker Diarization for Short Speech Recordings, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
 
Keywords:IM2.AP, IM2.MCA, Report_VIII

Rajan, P., Parthasarathi, S. H. K. and Murthy, H., Robustness of Phase based Features for Speaker Recognition, in: Proceedings of Interspeech, 2009.
 
Keywords:IM2.AP, Report_VIII

Vinciarelli, A., Role recognition in broadcast news using social network analysis and duration distribution modeling, in: IEEE Transactions on Multimedia, 2007.
 
Keywords:Report_VI, IM2.AP.MCA, joint publucation

Motlicek, P., Ganapathy, S., Hermansky, H. and Garudadri, H., Scalable wide-band audio codec based on frequency domain linear prediction, number 16, 2007.
 
Keywords:Report_VI, IM2.AP

Vinciarelli, A., Fernàndez, F. and Favre, S., Semantic segmentation of radio programs using social network analysis and duration distribution modeling, in: IEEE International Conference on Multimedia and Expo (ICME), 2007.
 
Keywords:Report_VI, IM2.AP.MPR, joint publication

Lathoud, G. and Odobez, J. -M., Short-term spatio-temporal clustering applied to multiple moving speakers, in: IEEE Transactions on Audio, Speech and Language Processing, 2007.
 
Keywords:Report_VI, IM2.AP.MPR, joint publication

Pinto, J. P., R. M., P., Yegnanarayana, B. and Hermansky, H., Significance of contextual information in phoneme recognition, 2007.
 
Keywords:Report_VI, IM2.AP

Garner, P. N., Silence models in weighted finite-state transducers, in: Interspeech, Brisbane, Australia, 2008.
 
Keywords:IM2.AP, Report_VII

Garner, P. N., SNR Features for Automatic Speech Recognition, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
 
Keywords:IM2.AP, Report_VIII

Lathoud, G., Spatio-temporal analysis of spontaneous speech with microphone arrays, École Polytechnique Fédérale de Lausanne, 2006.
 
Keywords:Report_VI, IM2.AP.VP, joint publication

Kolar, J., Liu, Y. and Shriberg, E., Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Keywords:Report_VI, IM2.AP

Parthasarathi, S. H. K., Magimai-Doss, M., Gatica-Perez, D. and Bourlard, H., Speaker Change Detection with Privacy-Preserving Audio Cues, in: Proceedings of ICMI-MLMI 2009, 2009.
 
Keywords:IM2.AP, Report_VIII

Friedland, G. and van Leeuwen, D., Speaker diarization and identification, IEEE Press/Wiley, 2009.
 
Keywords:IM2.AP, Report_VIII

Pardo, J. M., Anguera, X. and Wooters, C., Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information, in: to appear in IEEE Transactions on Computers, 2007.
 
Keywords:Report_VI, IM2.AP

Stoll, L., Frankel, J. and Mirghafori, N., Speaker Recognition Via Nonlinear Discriminant Features, in: Proceedings of NOLISP, Paris, France,, 2007.
 
Keywords:Report_VI, IM2.AP

Stolcke, A., Kajarekar, S., Ferrer, L. and Shriberg, E., Speaker recognition with session variability normalization based on mllr adaptation transforms, in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, pages 1987-1998, 2007.
 
Keywords:Report_VII, IM2.AP

Stolcke, A., Kajarekar, S., Ferrer, L. and Shriberg, E., Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms, in: IEEE Transactions on Audio, Speech, and Language Processing, special issue on speaker and language recognition, 2007.
 
Keywords:Report_VII, IM2.AP

Garg, N. and Hakkani-Tur, D., Speaker role detection in meetings using lexical information and social network analysis, in: Technical Report TR-08-004, International Computer Science Institute, Berkeley, CA, 2008.
 
Keywords:Report_VII, IM2.AP

Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain, in: INTERSPEECH 2008, Brisbane, Australia, 2008.
 
Keywords:IM2.AP, Report_VII

Thomas, A., Ganapathy, S. and Hermansky, H., Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain, in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
 
Keywords:IM2.AP, Report_VII

Gaudard, C., Aradilla, G. and Bourlard, H., Speech recognition based on template matching and phone posterior probabilities, number 02, 2007.
 
Keywords:Report_VI, IM2.AP

Dines, J., Saheer, L. and Liang, H., Speech recognition with speech synthesis models by marginalising over decision tree leaves, in: Proceedings of Interspeech, Brighton, U.K., 2009.
 
Keywords:decision trees, speech recognition, speech synthesis, unified models, IM2.AP, Report_VIII

Huang, Y., Friedland, G., Müller, C. and Mirghafori, N., Speeding up speaker diarization by using prosodic features, in: Technical Report TR-07-004, International Computer Science Institute, Berkeley, California, 2007.
 
Keywords:Report_VII, IM2.AP

Hakkani-Tur, D. and Tur, G., Statistical Sentence Extraction for Information Distillation, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP

Vepa, J. and King, S., Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis, in: IEEE Trans. on Audio, Speech and Language Processing, volume 14, number 5, pages 1763-1771, 2006.
 
Keywords:Report_VI, IM2.AP

Grandvalet, Y., Rakotomamonjy, A., Keshet, J. and Canu, S., Support Vector Machines with a Reject Option, in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008.
 
Keywords:IM2.AP,Report_VIII

Mesot, B. and Barber, D., Switching linear dynamical systems for noise robust speech recognition, number 08, 2006.
 
Keywords:Report_VI, IM2.AP

Mesot, B., Switching linear dynamical systems for noise robust speech recognition of isolated degits, STI School of Engineering, EPFL, 2008.
 
Keywords:Report_VII, IM2.AP

Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008. [DOI]
 
Keywords:IM2.AP, Report_VII

Romsdorfer, H. and Pfister, B., Text analysis and language identification for polyglot text-to-speech synthesis, in: Speech Communication (Elsevier), 2007.
 
Keywords:Report_VI, IM2.AP

Huijbregts, M. and Wooters, C., The Blame Game: Performance Analysis of Speaker Diarization System Components, in: to appear in Proc. Interspeech, Antwerp., 2007.
 
Keywords:Report_VI, IM2.AP

Wooters, C. and Huijbregts, M., The ICSI RT07s Speaker Diarization System, in: to appear in Lecture Notes in Computer Science, 2007.
 
Keywords:Report_VI, IM2.AP

Wooters, C. and Huijbregts, M., The ICSI RT07s speaker diarization system, in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
 
Keywords:Report_VII, IM2.AP

Janin, A., Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Frankel, J. and Zheng, J., The ICSI-SRI Spring 2006 Meeting Evaluation System, in: In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006); Lecture Notes in Computer Science. Springer, 2006.
 
Keywords:Report_VI, IM2.AP

Moore, D., The juicer lvcsr decoder - user manual for juicer version 0.5.0, number 03, 2006.
 
Keywords:Report_VI, IM2.AP

Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Janin, A., Magimai-Doss, M., Wooters, C. and Zheng, J., The sri-icsi spring 2007 meeting and lecture recognition system, in: Lecture Notes in Computer Science, 2007.
 
Keywords:Report_VII, IM2.AP, joint publication

Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Janin, A., Magimai-Doss, M., Wooters, C. and Zheng, J., The SRI-ICSI spring 2007 meeting and lecture recognition system, in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
 
Keywords:Report_VII, IM2.AP, joint publication

Keshet, J., Theoretical foundations for large-margin kernel-based continuous speech recognition, number Idiap-RR-44-2007, 2007.
 
Keywords:IM2.AP, Report_VII

Scaringella, N., Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, in: "Int. Conf. on Music Information Retrieval (ISMIR)", 2008.
 
Keywords:IM2.AP, Report_VIII

Hung, H. and Friedland, G., Towards audio-visual on-line diarization of participants in group meetings, in: European Conference on Computer Vision (ECCV) 2008, Marseille, France, 2008.
 
Keywords:IM2.AP, Report_VIII

Hakkani-Tur, D., Towards automatic argument diagramming of multiparty meetings, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
 
Keywords:IM2.AP, Report_VIII

Vinyals, O. and Friedland, G., Towards semantic analysis of conversations: a system for the live identification of speakers in meetings, in: to appear in Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, CA, 2008.
 
Keywords:Report_VII, IM2.AP

Lovitt, A., Truncation confusion patterns in onset consonants, in: Interspeech 2007, 2007.
 
Keywords:Report_VI, IM2.AP

Boakye, K., Vinyals, O. and Friedland, G., Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech, in: Interspeech, 2008.
 
Keywords:Report_VII, IM2.AP

Boakye, K., Vinyals, O. and Friedland, G., Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech, in: Interspeech 2008, Brisbane, Australia, pages 32-35, 2008.
 
Keywords:IM2.AP, Report_VIII

Gillick, D., Hakkani-Tur, D. and Levit, M., Unsupervised learning of edit parameters for matching name variants, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Keywords:Report_VII, IM2.AP

Lathoud, G., Magimai-Doss, M. and Bourlard, H., Unsupervised spectral subtraction for noise-robust asr on unknown transmission channels, number 09, 2006.
 
Keywords:Report_VI, IM2.AP

Maganti, H. K., Motlicek, P. and Gatica-Perez, D., Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms, number 57, 2006.
 
Keywords:Report_VI, IM2.AP

BenZeghiba, M. F. and Bourlard, H., User-customized password speaker verification using multiple reference and background models, in: Speech Communication, volume 8, pages 1200-1213, 2006.
 
Keywords:Report_VI, IM2.AP

Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in a group meeting multi-layer background subtraction based on color and texture, in: Proc. ACM Multi Media, Augsburg, Germany, 2007.
 
Keywords:Report_VII, IM2.AP.VP, joint publication

Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in meetings, in: Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany, 2007.
 
Keywords:Report_VII, IM2.AP.VP, joint publication

Aradilla, G., Bourlard, H. and Magimai-Doss, M., Using kl-based acoustic models in a large vocabulary recognition task, number Idiap-RR-14-2008, 2008.
 
Keywords:IM2.AP, Report_VII

Friedland, G., Yeo, C. and Hung, H., Visual speaker localization aided by acoustic models (full paper), in: Proceedings of ACM Multimedia, Beijing, China, 2009.
 
Keywords:IM2.AP, Report_VIII

Pinto, J. P., Sivaram, G. S. V. S., Hermansky, H. and Magimai-Doss, M., Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
 
Keywords:IM2.AP, Report_VIII

Faria, A. and Morgan, N., When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features, in: Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, 2008.
 
Keywords:Report_VII, IM2.AP

Motlicek, P., Ullal, V. and Hermansky, H., Wide-band perceptual audio coding based on frequency-domain linear prediction, number 58, 2006.
 
Keywords:Report_VI, IM2.AP

Lei, H. and Mirghafori, N., Word-Conditioned HMM Supervectors for Speaker Recognition, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Keywords:Report_VI, IM2.AP

Lei, H. and Mirghafori, N., Word-conditioned phone N-grams for speaker recognition, in: Proc. ICASSP, Honolulu, 2007.
 
Keywords:Report_VI, IM2.AP





Powered by Agaion