Mesot, B. and Barber, D. , A bayesian switching linear dynamical system for scale-invariant robust speech extraction , 2007.
Keywords: Report_VII, IM2.AP
Paiement, J. -F. , Grandvalet, Y. , Bengio, S. and Eck, D. , A Distance Model for Rhythms , in: 25th International Conference on Machine Learning (ICML), 2008.
Keywords: IM2.AP , Report_VIII
Huang, Y. , Vinyals, O. , Friedland, G. , Müller, C. , Mirghafori, N. and Wooters, C. , A Fast-Match approach for robust, faster than real-time Speaker Diarization , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Keywords: Report_VII, IM2.AP
Mesot, B. and Barber, D. , A gaussian sum smoother for inference in switching linear dynamical systems , 2007.
Keywords: Report_VII, IM2.AP
Cheng, O. , Dines, J. and Magimai-Doss, M. , A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition , number 62, 2006.
Keywords: Report_VI, IM2.AP
Gillick, D. , Riedhammer, K. , Favre, B. and Hakkani-Tur, D. , A global optimization framework for meeting summarization , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
Keywords: IM2.AP , Report_VIII
Vinyals, O. and Friedland, G. , A hardware-independent fast logarithm approximation with adjustable accuracy , in: 10th IEEE International Symposium on Multimedia, Berkeley, CA, USA, pages 61-65, 2008.
Keywords: IM2.AP , Report_VIII
Mariéthoz, J. and Bengio, S. , A kernel trick for sequences applied to text-independent speaker verification systems , in: Pattern Recognition, volume 40, number 8, ISSN 0031-3203, 2007.
Keywords: IM2.AP , Report_VI
Keshet, J. and Chazan, D. , A Kernel Wrapper for Phoneme Sequence Recognition , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keywords: IM2.AP , Report_VIII
Keshet, J. , Shalev-Shwartz, S. , Singer, Y. and Chazan, D. , A Large Margin Algorithm for Forced Alignment , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keywords: IM2.AP , Report_VIII
Garner, P. N. , A MAP Approach to Noise Compensation of Speech , number Idiap-RR-08-2009, 2009.
Keywords: IM2.AP , Report_VIII
Li, W. , Kumatani, K. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , A neural network based regression approach for recognizing simultaneous speech , in: Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
Keywords: Report_VII, IM2.AP
Li, W. , Kumatani, K. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , A neural network based regression approach for recognizing simultaneous speech , number Idiap-RR-10-2008, 2008.
Keywords: Report_VII,IM2.AP
Valente, F. , A Novel Criterion for Classifiers Combination in Multistream Speech Recognition , in: IEEE Signal Processing Letters, volume 16, number 7, pages 561-564, ISSN 1070-9908, 2009. [DOI]
Keywords: IM2.AP , Report_VIII
Keshet, J. , A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keywords: IM2.AP , Report_VIII
Dines, J. and Magimai-Doss, M. , A study of phoneme and grapheme based context-dependent asr systems , number 12, 2007.
Keywords: Report_VI, IM2.AP , major
Garner, P. N. , A weighted finite state transducer tutorial , number Idiap-Com-03-2008, 2008.
Keywords: IM2.AP , Report_VII
Anguera, X. , Wooters, C. and Hernando, J. , Acoustic Beamforming for Speaker Diarization of Meetings , in: to appear in IEEE Transactions on Audio, Speech and Language Processing, 2007.
Keywords: Report_VI, IM2.AP
Aradilla, G. , Acoustic models for posterior features in speech recognition , Ecole Polytechnique Fédérale de Lausanne, 2008.
Keywords: IM2.AP , Report_VII
Kumatani, K. , McDonough, J. , Klakow, D. , Garner, P. N. and Li, W. , Adaptive beamforming with a maximum negentropy criterion , in: The Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2008.
Keywords: Report_VII, IM2.AP
Kumatani, K. , Mayer, H. , Gehrig, T. , Stoimenov, E. , McDonough, J. and Wölfel, M. , Adaptive beamforming with a minimum mutual information criterion , pages 2527-2541, 2007. [DOI]
Keywords: IM2.AP , Report_VII
Valente, F. , Bourlard, H. and Deepu, V. , Agglomerative information bottleneck for speaker diarization of meetings data , number 31, 2007.
Keywords: Report_VI, IM2.AP
Aradilla, G. , Vepa, J. and Bourlard, H. , An acoustic model based on kullback-leibler divergence for posterior features , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
Keywords: Report_VI, IM2.AP
Cuendet, S. , Shriberg, E. , Favre, B. , Fung, J. and Hakkani-Tur, D. , An analysis of sentence segmentation features for broadcast news, broadcast conversations, and meetings , in: SIGIR Workshop on Searching Conversational Spontaneous Speech, 2007.
Keywords: Report_VII, IM2.AP
Cetin, O. , Kantor, A. , King, S. , Bartels, C. , Magimai-Doss, M. , Frankel, J. and Livescu, K. , An Articulatory Feature-based Tandem Approach and Factored Observation Modeling , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Kaufmann, T. and Pfister, B. , An HPSG parser supporting discontinuous licenser rules , in: International Conference on HPSG, 2007.
Keywords: Report_VI, IM2.AP
Vijayasenan, D. , Valente, F. and Bourlard, H. , An Information Theoretic Approach to Speaker Diarization of Meeting Data , in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 7, pages 1382-1393, 2009. [DOI]
Keywords: IM2.AP , Report_VIII
Kamangar, K. , Hakkani-Tur, D. , Tur, G. and Levit, M. , An iterative unsupervised learning method for information distillation , in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
Keywords: Report_VII, IM2.AP
Prasanna, S. R. Mahadeva , Yegnanarayana, B. , Pinto, J. P. and Hermansky, H. , Analysis of confusion matrix to combine evidence for phoneme recognition , number 27, 2007.
Keywords: IM2.AP , Report_VII
Kaufmann, T. and Pfister, B. , Applying licenser rules to a grammar with continuous constituents , in: The Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, 2007.
Keywords: Report_VII, IM2.AP
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec , in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
Keywords: Arithmetic Coding, Audio Coding, Entropy Coding, Frequency Domain Linear Prediction (FDLP), Huffman Coding, IM2.AP , Report_VIII
Livescu, K. , Cetin, O. , Hasegawa-Johnson, M. , King, S. , Bartels, C. , Borges, N. , Kantor, A. , Lal, P. , Yung, L. , Bezman, A. , Dawson-Haggerty, S. , Woods, B. , Frankel, J. , Magimai-Doss, M. and Saenko, K. , Articulatory Feature-based Methods for Acoustic and Audio-visual speech Recognition: Summary from the 2006 JHU Summer Workshop , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Peregoudov, A. , Vinciarelli, A. and Bourlard, H. , Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations , number 56, 2006.
Keywords: Report_VI, IM2.AP .MCA, joint publication
Motlicek, P. , Hermansky, H. , Garudadri, H. and Srinivasamurthy, N. , Audio coding based on long temporal contexts , number 30, 2006.
Keywords: Report_VI, IM2.AP
Ullal, V. and Motlicek, P. , Audio coding based on long temporal segments: experiments with quantization of excitation signal , number 46, 2006.
Keywords: Report_VI, IM2.AP
Cuendet, S. , Hakkani-Tur, D. and Shriberg, E. , Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech , in: to appear in Proceedings of MLMI, Brno, Czech Republic, 2007.
Keywords: Report_VI, IM2.AP
Knox, M. and Mirghafori, N. , Automatic Laughter Detection Using Neural Networks , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Keywords: Report_VI, IM2.AP
Motlicek, P. , Automatic Out-of-Language Detection Based on Confidence Measures Derived from LVCSR Word and Phone Lattices , in: 10th Annual Conference of the International Speech Communication Association, pages 1215-1218, ISCA, Brighton, England, 2009.
Keywords: IM2.AP , Report_VIII
Motlicek, P. , Automatic Out-of-Language Detection Based on Confidence Measures Derived from LVCSR Word and Phone Lattices , in: 10th Annual Conference of the International Speech Communication Association, ISCA, 2009.
Keywords: IM2.AP ,Report_VIII
Keshet, J. and Bengio, S. , Automatic speech and speaker recognition: large margin and kernel methods , John Wiley & Sons, 2008.
Keywords: IM2.AP , Report_VII
Anguera, X. , Wooters, C. , Pardo, J. M. and Hernando, J. , Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Autoregressive modelling of hilbert envelopes for wide-band audio coding , in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
Keywords: IM2.AP , Report_VII
Kumatani, K. , McDonough, J. , Rauch, B. , Klakow, D. , Garner, P. N. and Li, W. , Beamforming with a Maximum Negentropy Criterion , in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 5, pages 994-1008, 2008.
Keywords: IM2.AP , Report_VIII
Orabona, F. , Keshet, J. and Caputo, B. , Bounded kernel-based perceptrons , in: Journal of Machine Learning Research, volume Accepted for pub, 2009.
Keywords: IM2.AP , Report_VIII
Vinciarelli, A. and Favre, S. , Broadcast news story segmentation using social network analysis and hidden markov models , in: ACM International Conference on Multimedia, pages 261-264, 2007.
Keywords: Report_VI, IM2.AP .MPR, joint publication
Hwang, M. -Y. , Peng, G. , Wang, W. , Faria, A. , Heidel, A. and Ostendorf, M. , Building a Highly Accurate Mandarin Speech Recognizer , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Keywords: Report_VII, IM2.AP
Garg, N. , Favre, B. , Riedhammer, K. and Hakkani-Tur, D. , Clusterrank: a graph based method for meeting summarization , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Keywords: IM2.AP , Report_VIII
Valente, F. and Hermansky, H. , Combination of acoustic classifiers based on dempster-shafer theory of evidence , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
Keywords: Report_VI, IM2.AP
Vijayasenan, D. , Valente, F. and Bourlard, H. , Combination of agglomerative and sequential clustering for speaker diarization , in: International Conference on Acoustics, Speech and Signal Processing, 2008.
Keywords: Report_VII, IM2.AP
Zheng, J. , Cetin, O. , Hwang, M. -Y. , Lei, X. , Stolcke, A. and Morgan, N. , Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Pinto, J. P. and Hermansky, H. , Combining evidence from a generative and a discriminative model in phoneme recognition , in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Müller, C. and Burkhardt, F. , Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Keywords: Report_VI, IM2.AP
Pinto, J. P. , Bourlard, H. , Graves, A. and Hermansky, H. , Comparing different word lattice rescoring approaches towards keyword spotting , number 32, 2007.
Keywords: IM2.AP , Report_VII
Liu, Y. and Shriberg, E. , Comparing Evaluation Metrics for Sentence Boundary Detection , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Faria, A. and Morgan, N. , Corrected Tandem Features for Acoustic Model Training , in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
Keywords: Report_VII, IM2.AP
Faria, A. and Morgan, N. , Corrected tandem features for acoustic model training , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Keywords: Report_VII, IM2.AP
Lovitt, A. , Correcting confusion matrices for phone recognizers , number 03, 2007.
Keywords: Report_VI, IM2.AP
Guz, U. , Cuendet, S. , Hakkani-Tur, D. and Tur, G. , Co-training Using Prosodic and Lexical Information for Sentence Segmentation , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Keywords: Report_VI, IM2.AP
Cuendet, S. , Hakkani-Tur, D. , Shriberg, E. , Fung, J. and Favre, B. , Cross-Genre Feature Comparisons for Spoken Sentence Segmentation , in: International Conference on Semantic Computing (ICSC), Irvine, CA, 2007.
Keywords: Report_VII, IM2.AP
Singla, A. and Hakkani-Tur, D. , Cross-lingual sentence extraction for information distillation , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: Report_VII, IM2.AP
Aradilla, G. and Ajmera, J. , Detection and recognition of number sequences within spoken utterances , in: 2nd Workshop on Speech in Mobile and Pervasive Environments, 2007.
Keywords: Report_VII, IM2.AP
Vergyri, D. , Mandal, A. , Wang, W. , Stolcke, A. , Zheng, J. , Graciarena, M. , Rybach, D. , Gollan, C. , Schlüter, R. , Kirchhoff, K. , Faria, A. and Morgan, N. , Development of the sri/nightingale arabic asr system , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: Report_VII, IM2.AP
Vergyri, D. , Mandal, A. , Wang, W. , Stolcke, A. , Zheng, J. , Graciarena, M. , Rybach, D. , Gollan, C. , Schlüter, R. , Kirchhoff, K. , Faria, A. and Morgan, N. , Development of the sri/nightingale arabic asr system , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 1437-1440, 2008.
Keywords: IM2.AP , Report_VIII
Dines, J. and Vepa, J. , Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics , number 13, 2007.
Keywords: Report_VI, IM2.AP
Grangier, D. , Keshet, J. and Bengio, S. , Discriminative Keyword Spotting , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keywords: IM2.AP , Report_VIII
Keshet, J. , Grangier, D. and Bengio, S. , Discriminative Keyword Spotting , in: Speech Communication, volume 51, number 4, pages 317-329, 2009.
Keywords: IM2.AP , Report_VIII
Mariéthoz, J. , Discriminant models for text-independent speaker verification , number 70, 2006.
Keywords: Report_VI, IM2.AP
Li, W. , Effective post-processing for single-channel frequency-domain speech enhancement , pages 149-152, 2008. [DOI]
Keywords: IM2.AP , Report_VII
Li, W. , Effective post-processing of single-channel frequency-domain speech enhancement , in: IEEE conference on multimedia and expo, 2008.
Keywords: Report_VII, IM2.AP
Sivaram, G. S. V. S. and Hermansky, H. , Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition , in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
Keywords: IM2.AP , Report_VII
Ketabdar, H. and Bourlard, H. , Enhanced phone posteriors for improving speech recognition systems , number Idiap-RR-39-2008, 2008.
Keywords: IM2.AP , Report_VII
Ketabdar, H. , Enhancing posterior based speech recognition systems , Ecole Polytechnique Fédérale de Lausanne, 2008.
Keywords: IM2.AP , Report_VIII
Magimai-Doss, M. , Hakkani-Tur, D. , Cetin, O. , Shriberg, E. , Fung, J. and Mirghafori, N. , Entropy Based Classifier Combination for Sentence Segmentation , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Entropy coding of Quantized Spectral Components in FDLP audio codec , number Idiap-RR-71-2008, 2008.
Keywords: IM2.AP , Report_VIII
Hung, H. , Huang, Y. , Friedland, G. and Gatica-Perez, D. , Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies , in: IEEE ICASSP, Las Vegas, NV, 2008.
Keywords: Report_VII, IM2.AP
Pinto, J. P. , Hermansky, H. , Yegnanarayana, B. and Magimai-Doss, M. , Exploiting contextual information for improved phoneme recognition , in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
Keywords: IM2.AP , Report_VII
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting Contextual Information for Speech/Non-Speech Detection , in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
Keywords: IM2.AP , Report_VIII
Pinto, J. P. , Lovitt, A. and Hermansky, H. , Exploiting phoneme similarities in hybrid hmm-ann keyword spotting , in: Proceedings of Interspeech, 2007.
Keywords: Report_VI, IM2.AP
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting temporal context for speech/non-speech detection , number Idiap-RR-21-2008, 2008.
Keywords: IM2.AP ,Report_VII
Pinto, J. P. , Szoke, I. , Prasanna, S. R. Mahadeva and Hermansky, H. , Fast approximate spoken term detection from sequence of phonemes , in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore, 2008.
Keywords: IM2.AP , Report_VII
Kumatani, K. , McDonough, J. , Schacht, S. , Klakow, D. , Garner, P. N. and Li, W. , Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming , in: International Conference on Acoustics Speech and Signal Processing, 2008.
Keywords: Report_VII, IM2.AP
Kumatani, K. , McDonough, J. , Schacht, S. , Klakow, D. , Garner, P. N. and Li, W. , Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition , number Idiap-RR-02-2008, 2008.
Keywords: IM2.AP , Report_VIII
Huijbregts, M. , Wooters, C. and Ordelman, R. , Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Keywords: Report_VI, IM2.AP
Motlicek, P. , Hermansky, H. , Ganapathy, S. and Garudadri, H. , Frequency domain linear prediction for qmf sub-bands and applications to audio coding , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
Keywords: IM2.AP , Report_VI
Ganapathy, S. , Thomas, A. and Hermansky, H. , Front-end for far-field speech recognition based on frequency domain linear prediction , in: Interspeech 2008, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Friedland, G. , Vinyals, O. , Huang, Y. and Muller, C. , Fusion of short-term and long-term features for improved speaker diarization , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4077-4080, 2009.
Keywords: IM2.AP , Report_VIII
Knox, M. , Morgan, N. and Mirghafori, N. , Getting the last laugh: automatic laughter segmentation in meetings , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 797-800, 2008.
Keywords: IM2.AP , Report_VIII
Knox, M. , Morgan, N. and Mirghafori, N. , Getting the last laugh: automatic laughter segmentation in meetings , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: Report_VII, IM2.AP
Valente, F. and Hermansky, H. , Hierarchical and parallel processing of modulation spectrum for asr applications , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
Keywords: IM2.AP , Report_VII
Ketabdar, H. and Bourlard, H. , Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Keywords: Report_VII, IM2.AP
Valente, F. , Vepa, J. , Plahl, C. , Gollan, C. , Hermansky, H. and Schlüter, R. , Hierarchical neural networks feature extraction for lvcsr system , in: Interspeech 2007, 2007.
Keywords: Report_VI, IM2.AP
Valente, F. , Magimai-Doss, M. , Plahl, C. and Suman, R. , Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system , in: Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009.
Keywords: speech recognition, TANDEM features, IM2.AP , Report_VIII
Shriberg, E. , Higher level features in speaker recognition , in: C. Muller (Ed.), Speaker Classification I, Springer-Verlag, New York, 2008.
Keywords: Report_VII, IM2.AP
Shriberg, E. , Higher level features in speaker recognition , in: Speaker Classification I, Lecture Notes in Computer Science, Springer, 2007.
Keywords: Report_VII, IM2.AP
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based features for far-field speech recognition , in: MLMI 2008, Utrecht, The Netherlands, 2008.
Keywords: IM2.AP , Report_VII
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech , in: Interspeech 2008, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Gelbart, D. , Morgan, N. and Tsymbal, A. , Hill-climbing feature selection for multi-stream asr , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Keywords: IM2.AP , Report_VIII
Dutoit, T. , Couvreur, L. and Bourlard, H. , How does a dictation machine recognize speech? , in: Applied Signal Processing--A MATLAB approach, pages 104-148, Springer MA, 2008.
Keywords: IM2.AP , Report_VIII
Plauché, M. , Cetin, O. and Uhdaykumar, N. , How to build a spoken dialog system with limited (or no) resources , in: AI in ICT for Development Workshop of the Twentieth Intl. Joint Conf. on AI, Hyderabad, India, 2007.
Keywords: Report_VI, IM2.AP
Ketabdar, H. and Hermansky, H. , Identifying unexpected words using in-context and out-of-context phoneme posteriors , number 68, 2006.
Keywords: Report_VI, IM2.AP
Hillard, D. , Huang, Z. , Ji, H. , Grishman, R. , Hakkani-Tur, D. , Harper, M. , Ostendorf, M. and Wang, W. , Impact of Automatic Comma Prediction on POS/Name Tagging of Speech , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Keywords: Report_VI, IM2.AP
Picart, B. , Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity , number Idiap-RR-18-2009, 2009.
Keywords: IM2.AP , Report_VIII
Ketabdar, H. and Bourlard, H. , In-context phone posteriors as complementary features for tandem asr , in: ICSLP'08, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Mesot, B. , Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits , Ecole Polytechnique Fédérale de Lausanne, 2008.
Keywords: Report_VII,IM2.AP
Levit, M. , Hakkani-Tur, D. , Tur, G. and Gillick, D. , Integrating Several Annotation Layers for Statistical Information Distillation , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Keywords: Report_VII, IM2.AP
Levit, M. , Hakkani-Tur, D. , Tur, G. and Gillick, D. , Integrating several annotation layers for statistical information distillation , in: Workshop on Automatic Speech Recognition and Understanding, 2007.
Keywords: Report_VII, IM2.AP
Vijayasenan, D. , Valente, F. and Bourlard, H. , Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization , in: Interspeech 2008, 2008.
Keywords: IM2.AP , Report_VIII
Sivaram, G. S. V. S. and Hermansky, H. , Introducing temporal asymmetries in feature extraction for automatic speech recognition , in: Interspeech 2008, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Parthasarathi, S. H. K. , Magimai-Doss, M. , Bourlard, H. and Gatica-Perez, D. , Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations , in: Proceedings of Interspeech 2009, 2009.
Keywords: IM2.AP , IM2.MCA, Report_VIII
Mariéthoz, J. , Bengio, S. and Grandvalet, Y. , Kernel Based Text-Independent Speaker Verification , number Idiap-RR-68-2008, 2008.
Keywords: IM2.AP , Report_VIII
Zacharie, D. G. and Pinto, J. P. , Keyword spotting on word lattices , number 22, 2007.
Keywords: Report_VI, IM2.AP
Vijayasenan, D. , Valente, F. and Bourlard, H. , KL Realignment for Speaker Diarization with Multiple Feature Streams , in: 10th Annual Conference of the International Speech Communication Association, 2009.
Keywords: IM2.AP , Report_VIII
Andreani, G. , Di Fabbrizio, G. , Gilbert, M. , Gillick, D. , Hakkani-Tur, D. and Lemon, O. , Lets DiSCoH: Collecting an Annotated Open Corpus with Dialog Acts and Reward Signals for Natural Language Helpdesks , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Keywords: Report_VI, IM2.AP
Xie, S. , Favre, B. , Hakkani-Tur, D. and Liu, Y. , Leveraging sentence weights in a concept-based optimization framework for extractive meeting summarization , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Keywords: IM2.AP , Report_VIII
Friedland, G. and Vinyals, O. , Live speaker identification in conversations , in: ACM Multimedia 2008, Vancouver, Canada, pages 1017-1018, 2008.
Keywords: IM2.AP , Report_VIII
Vinyals, O. and Friedland, G. , Live speaker identification in meetings: "who is speaking now?" , in: Technical Report TR-08-001, International Computer Science Institute, Berkeley, CA, 2008.
Keywords: Report_VII, IM2.AP
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , number Idiap-RR-75-2008, 2008.
Keywords: IM2.AP ,Report_VIII
Grangier, D. , Machine Learning for Information Retrieval , École Polytechnique Fédérale de Lausanne, 2008.
Keywords: discriminative learning, image retrieval, Information Retrieval, learning to rank, machine learning, online learning, spoken keyword spotting, text retrieval, IM2.AP ,Report_VIII
Livescu, K. , Bezman, A. , Borges, N. , Yung, L. , Cetin, O. , Frankel, J. , King, S. , Magimai-Doss, M. , Chi, X. and Lavoie, L. , Manual Transcription of Conversational Speech at the Articulatory Feature Level , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Hemptinne, C. , Master thesis: integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts) , number 69, 2006.
Keywords: Report_VI, IM2.AP
Kumatani, K. , McDonough, J. , Rauch, B. , Garner, P. N. , Li, W. and Dines, J. , Maximum kurtosis beamforming with the generalized sidelobe canceller , in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2009.
Keywords: IM2.AP , Report_VIII
Kumatani, K. , McDonough, J. , Klakow, D. , Garner, P. N. and Li, W. , Maximum negentropy beamforming , number Idiap-RR-07-2008, 2008.
Keywords: Report_VII, IM2.AP
Dines, J. , Yamagishi, J. and King, S. , Measuring the gap between HMM-based ASR and TTS , in: Proceedings of Interspeech, Brighton, U.K., 2009.
Keywords: speech recognition, speech synthesis, unified models, IM2.AP ,Report_VIII
Kumatani, K. , Mayer, H. , Gehrig, T. , Stoimenov, E. , McDonough, J. and Wölfel, M. , Minimum mutual information beamforming for simultaneous active speakers , in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 71-76, Kyoto, 2007. [DOI]
Keywords: IM2.AP , Report_VII
Li, W. , Doss, M. M. , Dines, J. and Bourlard, H. , Mlp-based log spectral energy mapping for robust overlapping speech recognition , in: European Signal Processing Conference, 2008.
Keywords: Report_VII, IM2.AP
Tur, G. , Guz, U. and Hakkani-Tur, D. , Model Adaptation for Dialog Act Tagging , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Keywords: Report_VI, IM2.AP
Cuendet, S. , Hakkani-Tur, D. and Tur, G. , Model Adaptation for Sentence Segmentation from Speech , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Keywords: Report_VI, IM2.AP
Cuendet, S. , Model adaptation for sentence unit segmentation from speech , number 64, 2006.
Keywords: Report_VI, IM2.AP
Anguera, X. , Shinozaki, T. , Wooters, C. and Hernando, J. , Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Modified Discrete Cosine Transform for Encoding Residual Signals in Frequency Domain Linear Prediction , number Idiap-RR-74-2008, 2008.
Keywords: IM2.AP , Report_VIII
Ganapathy, S. , Thomas, S. and Hermansky, H. , Modulation Frequency Features For Phoneme Recognition In Noisy Speech , in: Journal of Acoustical Society of America - Express Letters, 2008.
Keywords: IM2.AP , Report_VIII
Vinyals, O. and Friedland, G. , Modulation spectrogram features for speaker diarization , in: Interspeech 2008, Brisbane, Australia, pages 630-633, 2008.
Keywords: IM2.AP , Report_VIII
Vinyals, O. and Friedland, G. , Modulation spectrogram features for speaker diarization , in: to appear in proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: Report_VII, IM2.AP
Friedland, G. , Hung, H. and Yeo, C. , Multi-modal speaker diarization of real-world meetings using compressed-domain video features , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4069-4072, 2009.
Keywords: IM2.AP , Report_VIII
Valente, F. , Vepa, J. and Hermansky, H. , Multi-stream features combination based on dempster-shafer rule for lvcsr system , in: Interspeech 2007, 2007.
Keywords: Report_VI, IM2.AP .MPR, joint publication
Zhao, S. Y. and Morgan, N. , Multi-stream spectro-temporal features for robust speech recognition , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 898-901, 2008.
Keywords: IM2.AP , Report_VIII
Zhao, S. and Morgan, N. , Multi-stream spectro-temporal features for robust speech recognition , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: Report_VII, IM2.AP
Zhao, S. Y. , Ravuri, R. and Morgan, N. , Multi-stream to many-stream: using spectro-temporal features for asr , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Keywords: IM2.AP , Report_VIII
Vijayasenan, D. , Valente, F. and Bourlard, H. , Mutual Information based Channel Selection for Speaker Diarization of Meetings Data , in: Proceedings of International conference on acoustics speech and signal processing, 2009.
Keywords: IM2.AP , Report_VIII
Vijayasenan, D. , Valente, F. and Bourlard, H. , Mutual Information Based Channel Selection for Speaker Diarization of Meetings Data , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009.
Keywords: IM2.AP , Report_VIII
Stoyanchev, S. , Tur, G. and Hakkani-Tur, D. , Name-aware speech recognition for interactive question answering , in: IEEE ICASSP, Las Vegas, NV, 2008.
Keywords: Report_VII, IM2.AP
Li, W. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , Neural network based regression for robust overlapping speech recognition using microphone arrays , in: Interspeech, 2008.
Keywords: Report_VII, IM2.AP
Li, W. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
Keywords: binary masking, microphone array, neural network, overlapping speech recognition, speech separation, IM2.AP , Report_VIII
Li, W. and Bourlard, H. , Non-linear spectral stretching for in-car speech recognition , in: Interspeech, 2007.
Keywords: Report_VII, IM2.AP
Motlicek, P. , Hermansky, H. , Ganapathy, S. , Garudadri, H. and Srinivasamurthy, N. , Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
Keywords: IM2.AP , Report_VII
Imseng, D. , Novel initialization methods for Speaker Diarization , number Idiap-RR-07-2009, 2009.
Keywords: IM2.AP , Report_VIII
Luo, J. , Caputo, B. , Zweig, A. , Back, J. -H. and Anemuller, J. , Object category detection using audio-visual cues , in: International Conference on Computer Vision Systems (ICVS08), 2008.
Keywords: IM2.AP , Report_VII
Lathoud, G. , Observations on multi-band asynchrony in distant speech recordings , number 74, 2006.
Keywords: Report_VI, IM2.AP
Lovitt, A. , Pinto, J. P. and Hermansky, H. , On confusions in a phoneme recognizer , 2007.
Keywords: Report_VI, IM2.AP
Magimai-Doss, M. , Aradilla, G. and Bourlard, H. , On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR , number Idiap-RR-24-2009, 2009.
Keywords: IM2.AP , Report_VIII
Valente, F. and Hermansky, H. , On the combination of auditory and modulation frequency channels for asr applications , in: Interspeech 2008, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Scaringella, N. , On the design of audio features robust to the album-effect for music information retrieval , École Polytechnique Fédérale de Lausanne, 2009.
Keywords: channel normalization, machine learning, music information retrieval, neural networks, rhythm, timbre, IM2.AP , Report_VIII
Gottlieb, L. and Friedland, G. , On the use of artificial conversation data for speaker recognition in cars , in: IEEE International Conference for Semantic Computing, Berkeley, USA, 2009.
Keywords: IM2.AP , Report_VIII
Boakye, K. , Trueba-Hornero, B. , Vinyals, O. and Friedland, G. , Overlapped speech detection for improved speaker diarization in multiparty meetings , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Keywords: Report_VII, IM2.AP
Riedhammer, K. , Gillick, D. , Favre, B. and Hakkani-Tur, D. , Packing the meeting summarization knapsack , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: Report_VII, IM2.AP
Gerber, M. , Kaufmann, T. and Pfister, B. , Perceptron-based class verification , in: Proceedings of NOLISP (ISCA Workshop on non linear speech processing), 2007.
Keywords: Report_VI, IM2.AP
Thomas, S. , Ganapathy, S. and Hermansky, H. , Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features , number Idiap-RR-04-2009, 2009.
Keywords: IM2.AP , Report_VIII
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Posterior features applied to speech recognition tasks with limited training data , number Idiap-RR-15-2008, 2008.
Keywords: IM2.AP , Report_VII
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Posterior features applied to speech recognition tasks with user-defined vocabulary , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
Keywords: IM2.AP , Report_VIII
Aradilla, G. and Bourlard, H. , Posterior-based features and distances in template matching for speech recognition , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 204-214, 2007.
Keywords: Report_VII, IM2.AP
Paiement, J. -F. , Grandvalet, Y. and Bengio, S. , Predictive Models for Music , number Idiap-RR-51-2008, 2008.
Keywords: IM2.AP , Report_VIII
Paiement, J. -F. , Bengio, S. and Eck, D. , Probabilistic Models for Melodic Prediction , number Idiap-RR-50-2008, 2008.
Keywords: IM2.AP , Report_VIII
Paiement, J. -F. , Probabilistic models for music , École Polytechnique Fédérale de Lausanne, 2008.
Keywords: chord progressions, generative models, machine learning, melodies, music, probabilistic models, IM2.AP , Report_VIII
Friedland, G. , Vinyals, O. , Huang, Y. and Müller, C. , Prosodic and other long-term features for speaker diarization , in: IEEE Transactions on Audio, Speech and Language Processing, volume 17, number 5, pages 985-993, 2009.
Keywords: IM2.AP , Report_VIII
Favre, B. , Grishman, R. , Hillard, D. , Ji, H. , Hakkani-Tur, D. and Ostendorf, M. , Punctuating speech for information extraction , in: IEEE ICASSP, Las Vegas, NV, 2008.
Keywords: Report_VII, IM2.AP
Gerber, M. , Beutler, R. and Pfister, B. , Quasi text-independent speaker verification based on pattern matching , in: Proceedings of Interspeech, ISCA, 2007.
Keywords: Report_VI, IM2.AP
Garner, P. N. , Dines, J. , Hain, T. , El Hannani, A. , Karafiat, M. , Korchagin, D. , Lincoln, M. , Wan, V. and Zhang, L. , Real-Time ASR from Meetings , in: Proceedings of Interspeech, Brighton, UK., 2009.
Keywords: IM2.AP , Report_VIII
Bourlard, H. and Renals, S. , Recognition and understanding of meetings overview of the european ami and amida projects , in: LangTech 2008, Rome, 2008.
Keywords: IM2.AP , Report_VII
Thomas, S. , Ganapathy, S. and Hermansky, H. , Recognition of reverberant speech using frequency domain linear prediction , in: IEEE Signal Processing Letters, 2008.
Keywords: IM2.AP , Report_VII
Baker, J. , Deng, L. , Glass, J. , Khudanpur, S. , Lee, C. -H. , Morgan, N. and O'Shaughnessy, D. , Research developments and directions in speech recognition and understanding , in: IEEE Signal Processing Magazine, volume 26, number 3, pages 75-80, 2009.
Keywords: IM2.AP , Report_VIII
Baker, J. , Deng, L. , Glass, J. , Khudanpur, S. , Lee, C. -H. , Morgan, N. and O'Shaughnessy, D. , Research developments and directions in speech recognition and understanding , in: IEEE Signal Processing Magazine, volume 26, number 4, pages 78-85, 2009.
Keywords: IM2.AP , Report_VIII
Pinto, J. P. , Sivaram, G. S. V. S. and Hermansky, H. , Reverse correlation for analyzing mlp posterior features in asr , in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008.
Keywords: IM2.AP , Report_VII
Vinyals, O. , Friedland, G. and Mirghafori, N. , Revisiting a basic function on current CPUs: A fast logarithm implementation with adjustable accuracy , in: ICSI Technical Report number TR-07-002, 2007.
Keywords: Report_VII, IM2.AP
Huang, Y. , Robust and rapid speaker diarization , in: Master Thesis, University of California, Berkeley, 2007.
Keywords: Report_VII, IM2.AP
Wöllmer, M. , Eyben, F. , Keshet, J. , Graves, A. , Schuller, B. and Rigoll, G. , Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009.
Keywords: IM2.AP , Report_VIII
Li, W. , Dines, J. and Magimai-Doss, M. , Robust overlapping speech recognition based on neural networks , number Idiap-RR-55-2007, 2007.
Keywords: IM2.AP , Report_VII
Imseng, D. and Friedland, G. , Robust Speaker Diarization for Short Speech Recordings , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
Keywords: IM2.AP , IM2.MCA, Report_VIII
Rajan, P. , Parthasarathi, S. H. K. and Murthy, H. , Robustness of Phase based Features for Speaker Recognition , in: Proceedings of Interspeech, 2009.
Keywords: IM2.AP , Report_VIII
Vinciarelli, A. , Role recognition in broadcast news using social network analysis and duration distribution modeling , in: IEEE Transactions on Multimedia, 2007.
Keywords: Report_VI, IM2.AP .MCA, joint publication
Motlicek, P. , Ganapathy, S. , Hermansky, H. and Garudadri, H. , Scalable wide-band audio codec based on frequency domain linear prediction , number 16, 2007.
Keywords: Report_VI, IM2.AP
Vinciarelli, A. , Fernàndez, F. and Favre, S. , Semantic segmentation of radio programs using social network analysis and duration distribution modeling , in: IEEE International Conference on Multimedia and Expo (ICME), 2007.
Keywords: Report_VI, IM2.AP .MPR, joint publication
Lathoud, G. and Odobez, J. -M. , Short-term spatio-temporal clustering applied to multiple moving speakers , in: IEEE Transactions on Audio, Speech and Language Processing, 2007.
Keywords: Report_VI, IM2.AP .MPR, joint publication
Pinto, J. P. , R. M., P. , Yegnanarayana, B. and Hermansky, H. , Significance of contextual information in phoneme recognition , 2007.
Keywords: Report_VI, IM2.AP
Garner, P. N. , Silence models in weighted finite-state transducers , in: Interspeech, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Garner, P. N. , SNR Features for Automatic Speech Recognition , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
Keywords: IM2.AP , Report_VIII
Lathoud, G. , Spatio-temporal analysis of spontaneous speech with microphone arrays , École Polytechnique Fédérale de Lausanne, 2006.
Keywords: Report_VI, IM2.AP .VP, joint publication
Kolar, J. , Liu, Y. and Shriberg, E. , Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings , in: to appear in Proceedings of Interspeech, Antwerp., 2007.
Keywords: Report_VI, IM2.AP
Parthasarathi, S. H. K. , Magimai-Doss, M. , Gatica-Perez, D. and Bourlard, H. , Speaker Change Detection with Privacy-Preserving Audio Cues , in: Proceedings of ICMI-MLMI 2009, 2009.
Keywords: IM2.AP , Report_VIII
Friedland, G. and van Leeuwen, D. , Speaker diarization and identification , IEEE Press/Wiley, 2009.
Keywords: IM2.AP , Report_VIII
Pardo, J. M. , Anguera, X. and Wooters, C. , Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information , in: to appear in IEEE Transactions on Computers, 2007.
Keywords: Report_VI, IM2.AP
Stoll, L. , Frankel, J. and Mirghafori, N. , Speaker Recognition Via Nonlinear Discriminant Features , in: Proceedings of NOLISP, Paris, France, 2007.
Keywords: Report_VI, IM2.AP
Stolcke, A. , Kajarekar, S. , Ferrer, L. and Shriberg, E. , Speaker recognition with session variability normalization based on mllr adaptation transforms , in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, pages 1987-1998, 2007.
Keywords: Report_VII, IM2.AP
Stolcke, A. , Kajarekar, S. , Ferrer, L. and Shriberg, E. , Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms , in: IEEE Transactions on Audio, Speech, and Language Processing, special issue on speaker and language recognition, 2007.
Keywords: Report_VII, IM2.AP
Garg, N. and Hakkani-Tur, D. , Speaker role detection in meetings using lexical information and social network analysis , in: Technical Report TR-08-004, International Computer Science Institute, Berkeley, CA, 2008.
Keywords: Report_VII, IM2.AP
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain , in: INTERSPEECH 2008, Brisbane, Australia, 2008.
Keywords: IM2.AP , Report_VII
Thomas, S. , Ganapathy, S. and Hermansky, H. , Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain , in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
Keywords: IM2.AP , Report_VII
Gaudard, C. , Aradilla, G. and Bourlard, H. , Speech recognition based on template matching and phone posterior probabilities , number 02, 2007.
Keywords: Report_VI, IM2.AP
Dines, J. , Saheer, L. and Liang, H. , Speech recognition with speech synthesis models by marginalising over decision tree leaves , in: Proceedings of Interspeech, Brighton, U.K., 2009.
Keywords: decision trees, speech recognition, speech synthesis, unified models, IM2.AP , Report_VIII
Huang, Y. , Friedland, G. , Müller, C. and Mirghafori, N. , Speeding up speaker diarization by using prosodic features , in: Technical Report TR-07-004, International Computer Science Institute, Berkeley, California, 2007.
Keywords: Report_VII, IM2.AP
Hakkani-Tur, D. and Tur, G. , Statistical Sentence Extraction for Information Distillation , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP
Vepa, J. and King, S. , Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis , in: IEEE Trans. on Audio, Speech and Language Processing, volume 14, number 5, pages 1763-1771, 2006.
Keywords: Report_VI, IM2.AP
Grandvalet, Y. , Rakotomamonjy, A. , Keshet, J. and Canu, S. , Support Vector Machines with a Reject Option , in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008.
Keywords: IM2.AP , Report_VIII
Mesot, B. and Barber, D. , Switching linear dynamical systems for noise robust speech recognition , number 08, 2006.
Keywords: Report_VI, IM2.AP
Mesot, B. , Switching linear dynamical systems for noise robust speech recognition of isolated digits , STI School of Engineering, EPFL, 2008.
Keywords: Report_VII, IM2.AP
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008.
Keywords: IM2.AP , Report_VII
Romsdorfer, H. and Pfister, B. , Text analysis and language identification for polyglot text-to-speech synthesis , in: Speech Communication (Elsevier), 2007.
Keywords: Report_VI, IM2.AP
Huijbregts, M. and Wooters, C. , The Blame Game: Performance Analysis of Speaker Diarization System Components , in: to appear in Proc. Interspeech, Antwerp., 2007.
Keywords: Report_VI, IM2.AP
Wooters, C. and Huijbregts, M. , The ICSI RT07s Speaker Diarization System , in: to appear in Lecture Notes in Computer Science, 2007.
Keywords: Report_VI, IM2.AP
Wooters, C. and Huijbregts, M. , The ICSI RT07s speaker diarization system , in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
Keywords: Report_VII, IM2.AP
Janin, A. , Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Frankel, J. and Zheng, J. , The ICSI-SRI Spring 2006 Meeting Evaluation System , in: S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006); Lecture Notes in Computer Science. Springer, 2006.
Keywords: Report_VI, IM2.AP
Moore, D. , The juicer lvcsr decoder - user manual for juicer version 0.5.0 , number 03, 2006.
Keywords: Report_VI, IM2.AP
Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Janin, A. , Magimai-Doss, M. , Wooters, C. and Zheng, J. , The sri-icsi spring 2007 meeting and lecture recognition system , in: Lecture Notes in Computer Science, 2007.
Keywords: Report_VII, IM2.AP , joint publication
Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Janin, A. , Magimai-Doss, M. , Wooters, C. and Zheng, J. , The SRI-ICSI spring 2007 meeting and lecture recognition system , in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
Keywords: Report_VII, IM2.AP , joint publication
Keshet, J. , Theoretical foundations for large-margin kernel-based continuous speech recognition , number Idiap-RR-44-2007, 2007.
Keywords: IM2.AP , Report_VII
Scaringella, N. , Timbre and Rhythmic TRAP-TANDEM features for music information retrieval , in: Int. Conf. on Music Information Retrieval (ISMIR), 2008.
Keywords: IM2.AP , Report_VIII
Hung, H. and Friedland, G. , Towards audio-visual on-line diarization of participants in group meetings , in: European Conference on Computer Vision (ECCV) 2008, Marseille, France, 2008.
Keywords: IM2.AP , Report_VIII
Hakkani-Tur, D. , Towards automatic argument diagramming of multiparty meetings , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
Keywords: IM2.AP , Report_VIII
Vinyals, O. and Friedland, G. , Towards semantic analysis of conversations: a system for the live identification of speakers in meetings , in: to appear in Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, CA, 2008.
Keywords: Report_VII, IM2.AP
Lovitt, A. , Truncation confusion patterns in onset consonants , in: Interspeech 2007, 2007.
Keywords: Report_VI, IM2.AP
Boakye, K. , Vinyals, O. and Friedland, G. , Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech , in: Interspeech, 2008.
Keywords: Report_VII, IM2.AP
Boakye, K. , Vinyals, O. and Friedland, G. , Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech , in: Interspeech 2008, Brisbane, Australia, pages 32-35, 2008.
Keywords: IM2.AP , Report_VIII
Gillick, D. , Hakkani-Tur, D. and Levit, M. , Unsupervised learning of edit parameters for matching name variants , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Keywords: Report_VII, IM2.AP
Lathoud, G. , Magimai-Doss, M. and Bourlard, H. , Unsupervised spectral subtraction for noise-robust asr on unknown transmission channels , number 09, 2006.
Keywords: Report_VI, IM2.AP
Maganti, H. K. , Motlicek, P. and Gatica-Perez, D. , Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms , number 57, 2006.
Keywords: Report_VI, IM2.AP
BenZeghiba, M. F. and Bourlard, H. , User-customized password speaker verification using multiple reference and background models , in: Speech Communication, volume 48, pages 1200-1213, 2006.
Keywords: Report_VI, IM2.AP
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in a group meeting , in: Proc. ACM Multi Media, Augsburg, Germany, 2007.
Keywords: Report_VII, IM2.AP .VP, joint publication
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in meetings , in: Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany, 2007.
Keywords: Report_VII, IM2.AP .VP, joint publication
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Using kl-based acoustic models in a large vocabulary recognition task , number Idiap-RR-14-2008, 2008.
Keywords: IM2.AP , Report_VII
Friedland, G. , Yeo, C. and Hung, H. , Visual speaker localization aided by acoustic models (full paper) , in: Proceedings of ACM Multimedia, Beijing, China, 2009.
Keywords: IM2.AP , Report_VIII
Pinto, J. P. , Sivaram, G. S. V. S. , Hermansky, H. and Magimai-Doss, M. , Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
Keywords: IM2.AP , Report_VIII
Faria, A. and Morgan, N. , When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features , in: Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, 2008.
Keywords: Report_VII, IM2.AP
Motlicek, P. , Ullal, V. and Hermansky, H. , Wide-band perceptual audio coding based on frequency-domain linear prediction , number 58, 2006.
Keywords: Report_VI, IM2.AP
Lei, H. and Mirghafori, N. , Word-Conditioned HMM Supervectors for Speaker Recognition , in: to appear in Proceedings of Interspeech, Antwerp., 2007.
Keywords: Report_VI, IM2.AP
Lei, H. and Mirghafori, N. , Word-conditioned phone N-grams for speaker recognition , in: Proc. ICASSP, Honolulu, 2007.
Keywords: Report_VI, IM2.AP