Ali, K. , Fleuret, F. , Hasler, D. and Fua, P. , Joint learning of pose estimators and features for object detection , in: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Posterior features applied to speech recognition tasks with user-defined vocabulary , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
Ba, S. and Odobez, J. -M. , Recognizing human visual focus of attention from head pose in meetings , in: IEEE Trans. on System, Man and Cybernetics: part B, Man, volume 39, number 1, pages 16-34, 2009.
Ba, S. , Hung, H. and Odobez, J. -M. , Visual activity context for focus of attention estimation in dynamic meetings , in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2009.
Baechler, M. , Bloechle, J. -L. , Humm, A. , Ingold, R. and Hennebert, J. , Labeled images verification using Gaussian mixture models , in: Proceedings of 24th Annual ACM Symposium on Applied Computing (ACM SAC'09), pages 1331-1336, 2009.
Baker, J. , Deng, L. , Glass, J. , Khudanpur, S. , Lee, C. -H. , Morgan, N. and O'Shaughnessy, D. , Research developments and directions in speech recognition and understanding , in: IEEE Signal Processing Magazine, volume 26, number 4, pages 78-85, 2009.
Baker, J. , Deng, L. , Glass, J. , Khudanpur, S. , Lee, C. -H. , Morgan, N. and O'Shaughnessy, D. , Research developments and directions in speech recognition and understanding , in: IEEE Signal Processing Magazine, volume 26, number 3, pages 75-80, 2009.
Beekhof, F. , Voloshynovskiy, S. , Koval, O. and Holotyak, T. , Multi-class classifiers based on binary classifiers: performance, efficiency, and minimum coding matrix distances , in: MLSP 2009, 2009.
Berclaz, J. , Fleuret, F. and Fua, P. , Multiple object tracking using flow linear programming , number 10-2009, 2009.
Bertini, E. , Lalanne, D. and Rigamonti, M. , Extended excentric labeling , in: International Journal of the Eurographics Association, volume 28, 2009.
Bertini, E. and Lalanne, D. , Surveying the complementary roles of automatic data analysis and visualization in knowledge discovery , in: Proceedings of ACM SIGKDD Workshop on Visual Analytics and Knowledge Discovery, VAKD '09, 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (VAKD 2009), pages 12-20, 2009.
Bloechle, J. -L. , Lalanne, D. and Ingold, R. , OCD: an optimized and canonical document format , in: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR 2009), pages 236-240, 2009.
Bologna, G. , Deville, B. and Pun, T. , Blind navigation along a sinuous path by means of the see color interface , in: IWINAC2009, 3rd International Work-conference on the Interplay between Natural and Artificial Computation, Santiago de Compostela, Spain, June 22--27, 2009.
Bologna, G. , Deville, B. and Pun, T. , On the use of the auditory pathway to represent image scenes in real-time , in: Neurocomputing, volume 72, pages 839-849, 2009.
Bologna, G. , Malandain, S. , Deville, B. and Pun, T. , The multi-touch see color interface , in: ICTA 2009, The 2nd International Conference on Information and Communication Technologies and Accessibility, Hammamet, Tunisia, May 7--9, 2009.
Bruno, E. and Marchand-Maillet, S. , Multimodal preference aggregation for multimedia information retrieval , in: To appear in Journal of Multimedia, 2009.
Bruno, E. and Marchand-Maillet, S. , Multiview clustering: a late fusion approach using latent models , in: Proceedings of the 32nd ACM Special Interest Group on Information Retrieval Conference, SIGIR 09, 2009.
Caputo, B. , Hayman, E. , Fritz, M. and Eklundh, J. -O. , Classifying Material in the Real World , in: Image and Vision Computing, accepted for publication, 2009.
Chanel, G. , Kierkels, J. , Soleymani, M. and Pun, T. , Short-term emotion assessment in a recall paradigm , in: International Journal of Human-Computer Studies, volume 67, number 8, pages 607-627, 2009.
Dines, J. , Yamagishi, J. and King, S. , Measuring the gap between HMM-based ASR and TTS , in: Proceedings of Interspeech, Brighton, U.K., 2009.
Dines, J. , Saheer, L. and Liang, H. , Speech recognition with speech synthesis models by marginalising over decision tree leaves , in: Proceedings of Interspeech, Brighton, U.K., 2009.
Drygajlo, A. , Li, W. and Zhu, K. , Q-stack aging model for face verification , in: 17th European Signal Processing Conference, 2009.
Duffner, S. , Odobez, J. -M. and Ricci, E. , Dynamic Partitioned Sampling For Tracking With Discriminative Features , in: Proceedings of the British Machine Vision Conference, London, 2009.
Dumas, B. , Lalanne, D. and Ingold, R. , Benchmarking fusion engines of multimodal interactive systems , in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
Favre, S. , Dielmann, A. and Vinciarelli, A. , Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models , in: ACM International Conference on Multimedia, To Appear, 2009.
Fleuret, F. , Multi-layer boosting for pattern recognition , in: Pattern Recognition Letters (PRL), volume 30, pages 237-241, 2009.
Friedland, G. , Vinyals, O. , Huang, Y. and Muller, C. , Fusion of short-term and long-term features for improved speaker diarization , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4077-4080, 2009.
Friedland, G. , Hung, H. and Yeo, C. , Multi-modal speaker diarization of real-world meetings using compressed-domain video features , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4069-4072, 2009.
Friedland, G. , Vinyals, O. , Huang, Y. and Muller, C. , Prosodic and other long-term features for speaker diarization , in: IEEE Transactions on Audio, Speech and Language Processing, volume 17, number 5, pages 985-993, 2009.
Friedland, G. and van Leeuwen, D. , Speaker diarization and identification , IEEE Press/Wiley, 2009.
Friedland, G. , Yeo, C. and Hung, H. , Visual Speaker Localization Aided by Acoustic Models , in: ACM Multimedia, 2009.
Friedland, G. , Yeo, C. and Hung, H. , Visual speaker localization aided by acoustic models (full paper) , in: Proceedings of ACM Multimedia, Beijing, China, 2009.
Frinken, V. and Bunke, H. , Evaluating retraining rules for semi-supervised learning in neural network based cursive word recognition , in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 31-35, 2009.
Frinken, V. , Riesen, K. and Bunke, H. , Improving graph classification by isomap , in: Graph-Based Representations in Pattern Recognition, pages 205-214, Springer, 2009.
Frinken, V. and Bunke, H. , Self-training strategies for handwriting word recognition , in: Proc. Industrial Conf. Advances in Data Mining. Applications and Theoretical Aspects, pages 291-300, Springer, 2009.
Galbally, J. , McCool, C. , Fierrez, J. , Marcel, S. and Ortega-Garcia, J. , Hill-Climbing Attack to an Eigenface-Based Face Verification System , in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, pages 355-362, Springer-Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
Garau, G. , Ba, S. , Bourlard, H. and Odobez, J. -M. , Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation , in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009.
Garg, N. , Favre, B. , Riedhammer, K. and Hakkani-Tur, D. , Clusterrank: a graph based method for meeting summarization , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Garg, N. , Co-occurrence Models for Image Annotation and Retrieval , number Idiap-RR-22-2009, 2009.
Garg, N. and Gatica-Perez, D. , Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr , number Idiap-RR-21-2009, 2009.
Garner, P. N. , A MAP Approach to Noise Compensation of Speech , number Idiap-RR-08-2009, 2009.
Garner, P. N. , Dines, J. , Hain, T. , El Hannani, A. , Karafiat, M. , Korchagin, D. , Lincoln, M. , Wan, V. and Zhang, L. , Real-Time ASR from Meetings , in: Proceedings of Interspeech, Brighton, UK, 2009.
Garner, P. N. , SNR Features for Automatic Speech Recognition , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
Gatica-Perez, D. , Automatic nonverbal analysis of social interaction in small groups: a review , in: Image and Vision Computing, Special Issue on Human Naturalistic Behavior, in press, 2009.
Gelbart, D. , Morgan, N. and Tsymbal, A. , Hill-climbing feature selection for multi-stream ASR , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Gillick, D. , Riedhammer, K. , Favre, B. and Hakkani-Tur, D. , A global optimization framework for meeting summarization , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
Gonzalez, G. , Fleuret, F. and Fua, P. , Learning rotational features for filament detection , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2009.
Gonzalez, G. , Aguet, F. , Fleuret, F. , Unser, M. and Fua, P. , Steerable features for statistical 3D dendrite detection , in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2009.
Gottlieb, L. and Friedland, G. , On the use of artificial conversation data for speaker recognition in cars , in: IEEE International Conference for Semantic Computing, Berkeley, USA, 2009.
Graves, A. , Liwicki, M. , Fernandez, S. , Bertolami, R. , Bunke, H. and Schmidhuber, J. , A novel connectionist system for unconstrained handwriting recognition , in: IEEE Trans. PAMI, volume 31, number 5, pages 855-869, ISSN 0162-8828, 2009.
Gurban, M. and Thiran, J. -Ph. , Information theoretic feature extraction for audio-visual speech recognition , in: IEEE Trans. on Signal Processing, in press, 2009.
Hakkani-Tur, D. , Towards automatic argument diagramming of multiparty meetings , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
Heusch, G. and Marcel, S. , Bayesian Networks to Combine Intensity and Color Information in Face Recognition , number Idiap-RR-27-2009, 2009.
Humm, A. , Hennebert, J. and Ingold, R. , Combined handwriting and speech modalities for user authentication , in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 39, 2009.
Humm, A. , Ingold, R. and Hennebert, J. , Spoken handwriting for user authentication using joint modelling systems , in: Proceedings of 6th International Symposium on Image and Signal Processing and Analysis (ISPA'09), 2009.
Hung, H. and Ba, S. , Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features , number Idiap-RR-20-2009, 2009.
Imseng, D. , Novel initialization methods for Speaker Diarization , number Idiap-RR-07-2009, 2009.
Imseng, D. and Friedland, G. , Robust Speaker Diarization for Short Speech Recordings , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
Indermühle, E. , Liwicki, M. and Bunke, H. , Combining alignment results for historical handwritten document analysis , in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 1186-1190, 2009.
Ivanov, I. , Dufaux, F. , Ha, T. M. and Ebrahimi, T. , Towards Generic Detection of Unusual Events in Video Surveillance , in: 6th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS'09), Genoa, Italy, 2009.
Jayagopi, D. , Bogdan, R. and Gatica-Perez, D. , Characterising Conversational Group Dynamics Using Nonverbal Behaviour , in: Proceedings ICME 2009, 2009.
Jayagopi, D. and Gatica-Perez, D. , Discovering group nonverbal conversational patterns with topics , in: accepted for publication in Proc. ICMI-MLMI, 2009.
Jayagopi, D. , Modeling dominance in group conversations using nonverbal activity cues , in: IEEE Trans. on Audio, Speech, and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, volume 17, pages 501-513, 2009.
Keshet, J. , Grangier, D. and Bengio, S. , Discriminative Keyword Spotting , in: Speech Communication, volume 51, number 4, pages 317-329, 2009.
Koval, O. , Voloshynovskiy, S. , Caire, F. and Bas, P. , On security threats for robust perceptual hashing , in: Electronic Imaging 2009, 2009.
Kryszczuk, K. and Drygajlo, A. , Improving biometric verification with class-independent quality information , in: IET Signal Processing, Special Issue on Biometric Recognition, volume 3, number 4, pages 310-321, 2009.
Kumatani, K. , McDonough, J. , Rauch, B. , Garner, P. N. , Li, W. and Dines, J. , Maximum kurtosis beamforming with the generalized sidelobe canceller , in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2009.
Lalanne, D. , Nigay, L. , Palanque, P. , Robinson, P. , Vanderdonckt, J. and Ladry, J. -F. , Fusion engines for multimodal interfaces: a survey , in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
Lalanne, D. and Kohlas, J. , Human machine interaction , 2009.
Le, Q. A. and Popescu-Belis, A. , Automatic vs. human question answering over multimedia meeting recordings , in: Interspeech 2009 (10th Annual Conference of the International Speech Communication Association), 2009.
Lee, J. -S. , De Simone, F. and Ebrahimi, T. , Video coding based on audio-visual attention , in: IEEE International Conference on Multimedia and Expo (ICME'09), New York, USA, 2009.
Lefèvre, S. and Odobez, J. -M. , Structure and appearance features for robust 3D facial actions tracking , in: International Conference on Multimedia and Expo (ICME), 2009.
Li, W. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
Luo, J. , Orabona, F. and Caputo, B. , An online framework for learning novel concepts over multiple cues , in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009.
Magimai-Doss, M. , Aradilla, G. and Bourlard, H. , On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR , number Idiap-RR-24-2009, 2009.
Marchand-Maillet, S. , Szekely, E. and Bruno, E. , Optimizing strategies for the exploration of social-networks and associated data collections , in: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'09) - Special session on "People, Pixels, Peers: Interactive Content in Social Networks", 2009.
McCool, C. and Marcel, S. , Parts-Based Face Verification using Local Frequency Bands , in: Proceedings of IEEE/IAPR International Conference on Biometrics, 2009.
Monay, F. , Quelhas, P. , Odobez, J. -M. and Gatica-Perez, D. , Contextual classification of image patches with latent aspect models , in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009.
Morrison, D. , Bruno, E. and Marchand-Maillet, S. , Capturing the semantics of user interaction: a review and case study , in: Emergent Web Intelligence, Springer, 2009.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Modelling long-term relevance feedback , in: Proceedings of the ECIR Workshop on Information Retrieval over Social Networks, 2009.
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec , in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
Motlicek, P. , Automatic Out-of-Language Detection Based on Confidence Measures Derived from LVCSR Word and Phone Lattices , in: 10th Annual Conference of the International Speech Communication Association, pages 1215-1218, ISCA, Brighton, England, 2009.
Negoescu, R. -A. , Gatica-Perez, D. , Adams, B. , Phung, D. and Venkatesh, S. , Flickr Hypergroups , number Idiap-Internal-RR-73-2009, 2009.
Noceti, N. , Caputo, B. , Castellini, C. , Baldassarre, L. , Barla, A. , Rosasco, L. , Odone, F. and Sandini, G. , Towards a theoretical framework for learning multi-modal patterns for embodied agents , in: International Conference on Image Analysis and Processing, 2009.
Orabona, F. , Caputo, B. , Fillbrandt, A. and Ohl, F. , A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems , in: International Conference on Developmental Learning, 2009.
Orabona, F. , Keshet, J. and Caputo, B. , Bounded kernel-based perceptrons , in: Journal of Machine Learning Research, accepted for publication, 2009.
Orabona, F. , Castellini, C. , Caputo, B. , Fiorilla, A. E. and Sandini, G. , Model adaptation with least-square SVM for adaptive hand prosthetics , in: IEEE International Conference on Robotics and Automation, 2009.
Orabona, F. , Castellini, C. , Caputo, B. , Luo, J. and Sandini, G. , Towards Life-long Learning for Cognitive Systems: Online Independent Support Vector Machine , in: Pattern Recognition, accepted for publication, 2009.
Ortega-Garcia, J. , Fierrez, J. , Alonso-Fernandez, F. , Galbally, J. , Freire, M. R. , Gonzalez-Rodriguez, J. , Garcia-Mateo, C. , Alba-Castro, J. -L. , Gonzalez-Agulla, E. , Otero-Muras, E. , Garcia-Salicetti, S. , Allano, L. , Ly-Van, B. , Dorizzi, B. , Kittler, J. , Bourlai, T. , Poh, N. , Deravi, F. , Ng, M. W. R. , Fairhurst, M. , Hennebert, J. , Humm, A. , Tistarelli, M. , Brodo, L. , Richiardi, J. , Drygajlo, A. , Ganster, H. , Sukno, F. M. , Pavani, S. -K. , Frangi, A. , Akarun, L. and Savran, A. , The multi-scenario multi-environment BioSecure multimodal database (BMDB) , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 2009.
Pantic, M. and Vinciarelli, A. , Implicit Human Centered Tagging , in: IEEE Signal Processing Magazine, volume 26, 2009.
Parthasarathi, S. H. K. , Magimai-Doss, M. , Bourlard, H. and Gatica-Perez, D. , Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations , in: Proceedings of Interspeech 2009, 2009.
Parthasarathi, S. H. K. , Magimai-Doss, M. , Gatica-Perez, D. and Bourlard, H. , Speaker Change Detection with Privacy-Preserving Audio Cues , in: Proceedings of ICMI-MLMI 2009, 2009.
Perrin, X. , Chavarriaga, R. , Pradalier, C. , Millán, J. del R. and Siegwart, R. , Dialog Management Technique for Brain-Computer Interfaces , 2009.
Perrin, X. , Colas, F. , Pradalier, C. and Siegwart, R. , Learning human habits and reactions to external events with a dynamic Bayesian network , 2009.
Perrin, X. , Colas, F. , Pradalier, C. and Siegwart, R. , Learning to identify users and predict their destination in a robotic guidance application , in: Field and Service Robotics (FSR), 2009.
Picart, B. , Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity , number Idiap-RR-18-2009, 2009.
Pinto, J. P. , Sivaram, G. S. V. S. , Hermansky, H. and Magimai-Doss, M. , Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
Popescu-Belis, A. , Poller, P. , Kilgour, J. , Boertjes, E. , Carletta, J. , Castronovo, S. , Fapso, M. , Flynn, M. , Nanchen, A. , Wilson, T. , Wit, J. de and Yazdani, M. , A multimedia retrieval system using speech input , in: ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), 2009.
Popescu-Belis, A. , Carletta, J. , Kilgour, J. and Poller, P. , Accessing a large multimodal corpus using an automatic content linking device , in: Multimodal Corpora, Springer-Verlag, 2009.
Popescu-Belis, A. , Comparing meeting browsers using a task-based evaluation method , number Idiap-RR-11-2009, 2009.
Popescu-Belis, A. and Vinciarelli, A. , Multimedia meeting processing and retrieval at the Idiap Research Institute , in: Informer (Newsletter of the BCS Information Retrieval Specialist Group), volume 29, pages 14-16, 2009.
Pronobis, A. and Caputo, B. , COLD: The COsy Localization Database , in: International Journal of Robotics Research, volume 28, number 5, pages 588-594, 2009.
Raducanu, B. and Gatica-Perez, D. , You are fired! Nonverbal role analysis in competitive meetings , in: Proc. ICASSP, Taiwan, 2009.
Rajan, P. , Parthasarathi, S. H. K. and Murthy, H. , Robustness of Phase based Features for Speaker Recognition , in: Proceedings of Interspeech, 2009.
Ricci, E. and Odobez, J. -M. , Real-time simultaneous head tracking and pose estimation , in: IEEE International Conference on Image Processing (ICIP), 2009.
Richiardi, J. , Drygajlo, A. and Kryszczuk, K. , Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification , pages 140-149, 2009.
Richiardi, J. , Kryszczuk, K. and Drygajlo, A. , Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification , in: Advances in Biometrics, Lecture Notes in Computer Science 5558, pages 1200-1208, 2009.
Roy, A. and Marcel, S. , Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection , number Idiap-RR-28-2009, 2009.
Salamin, H. , Favre, S. and Vinciarelli, A. , Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction , in: IEEE Transactions on Multimedia, To Appear, 2009.
Scaringella, N. , On the design of audio features robust to the album-effect for music information retrieval. , Ecole Polytechnique Fédérale de Lausanne, 2009.
De Simone, F. , Dufaux, F. , Ebrahimi, T. , Delogu, C. and Baroncini, V. , A subjective study of the influence of color information on visual quality assessment of high resolution pictures , in: Fourth International Workshop on Video Processing and Quality Metrics for Consumer Electronics (VPQM-09), Scottsdale, Arizona, USA, 2009.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , Affective characterization of movie scenes based on content analysis and physiological changes , in: To appear in International Journal of Semantic Computing, 2009.
Thomas, S. , Ganapathy, S. and Hermansky, H. , Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features , number Idiap-RR-04-2009, 2009.
Tommasi, T. and Caputo, B. , The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories , in: BMVC, 2009.
Ullah, M. M. , Orabona, F. and Caputo, B. , You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism , in: International Conference on Robotic and Systems, 2009.
Valente, F. , A Novel Criterion for Classifiers Combination in Multistream Speech Recognition , in: IEEE Signal Processing Letters, volume 16, number 7, pages 561-564, ISSN 1070-9908, 2009.
Valente, F. , Magimai-Doss, M. , Plahl, C. and Suman, R. , Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system , in: Proceedings of the 10th Annual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009.
Vijayasenan, D. , Valente, F. and Bourlard, H. , An Information Theoretic Approach to Speaker Diarization of Meeting Data , in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 7, pages 1382-1393, 2009.
Vijayasenan, D. , Valente, F. and Bourlard, H. , KL Realignment for Speaker Diarization with Multiple Feature Streams , in: 10th Annual Conference of the International Speech Communication Association, 2009.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Mutual Information based Channel Selection for Speaker Diarization of Meetings Data , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009.
Vinciarelli, A. , Capturing Order in Social Interactions , in: IEEE Signal Processing Magazine, 2009.
Vinciarelli, A. , Suditu, N. and Pantic, M. , Implicit Human Centered Tagging , in: Proceedings of IEEE Conference on Multimedia and Expo, pages 1428-1431, 2009.
Vinciarelli, A. , Pantic, M. and Bourlard, H. , Social Signal Processing: Survey of an Emerging Domain , in: Image and Vision Computing, 2009.
Voloshynovskiy, S. , Koval, O. , Beekhof, F. and Holotyak, T. , Binary robust hashing based on probabilistic bit reliability , in: IEEE Workshop on Statistical Signal Processing 2009, 2009.
Voloshynovskiy, S. , Koval, O. , Beekhof, F. and Pun, T. , Random projections based item authentication , in: Electronic Imaging 2009, 2009.
Wuthrich, M. , Liwicki, M. , Fischer, A. , Indermühle, E. , Bunke, H. , Viehhauser, G. and Stolz, M. , Language model integration for the recognition of handwritten medieval documents , in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 211-215, 2009.
Wöllmer, M. , Eyben, F. , Keshet, J. , Graves, A. , Schuller, B. and Rigoll, G. , Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks , in: IEEE International Conference on Acoustics, Speech, and Signal Processing, 2009.
Xie, S. , Favre, B. , Hakkani-Tur, D. and Liu, Y. , Leveraging sentence weights in a concept-based optimization framework for extractive meeting summarization , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Yao, J. and Odobez, J. -M. , Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets , number Idiap-RR-19-2009, 2009.
Yao, J. and Odobez, J. -M. , Multi-camera multi-person 3D space tracking with MCMC in surveillance scenarios , in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2009.
Zhao, S. Y. , Ravuri, R. and Morgan, N. , Multi-stream to many-stream: using spectro-temporal features for ASR , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Zhu, K. , Drygajlo, A. and Li, W. , Q-stack aging model for face verification , 2009.
Keshet, J. and Chazan, D. , A Kernel Wrapper for Phoneme Sequence Recognition , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keshet, J. , Shalev-Shwartz, S. , Singer, Y. and Chazan, D. , A Large Margin Algorithm for Forced Alignment , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keshet, J. , A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Grangier, D. , Keshet, J. and Bengio, S. , Discriminative Keyword Spotting , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Deville, B. , Bologna, G. , Vinckenbosch, M. and Pun, T. , See color: seeing colours with an orchestra , in: Human Machine Interaction: Research Results of the MMI Program, pages 251-279, Springer, 2009.
Anemuller, J. , Back, J. -H. , Caputo, B. , Luo, J. , Ohl, F. , Orabona, F. , Vogels, R. , Weinshall, D. and Zweig, A. , Biologically Motivated Audio-Visual Cue Integration for Object , in: Proceedings of the first International Conference on Cognitive Systems, 2008.
Anemuller, J. , Back, J. -H. , Caputo, B. , Havlena, M. , Luo, J. , Kayser, H. , Leibe, B. , Motlicek, P. , Pajdla, T. , Pavel, M. , Torii, A. , van Gool, L. , Zweig, A. and Hermansky, H. , The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events , in: Proceedings of the International Conference on Multimodal Interfaces, 2008.
Aradilla, G. , Acoustic models for posterior features in speech recognition , Ecole Polytechnique Fédérale de Lausanne, 2008.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Posterior features applied to speech recognition tasks with limited training data , number Idiap-RR-15-2008, 2008.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Using KL-based acoustic models in a large vocabulary recognition task , number Idiap-RR-14-2008, 2008.
Ba, S. and Odobez, J. -M. , Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
Ba, S. and Odobez, J. -M. , Multi-person visual focus of attention from head pose and meeting contextual cues , number Idiap-RR-47-2008, 2008.
Ba, S. and Odobez, J. -M. , Recognizing visual focus of attention from head pose in natural meetings , in: accepted for publication in IEEE Trans. on System, Man and Cybernetics: Part B, Man, 2008.
Ba, S. and Odobez, J. -M. , Visual focus of attention estimation from head pose posterior probability distributions , in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2008.
Beekhof, F. , Voloshynovskiy, S. , Koval, O. and Villán, R. , Secure surface identification codes , in: Steganography, and Watermarking of Multimedia Contents X, 2008.
Berclaz, J. , Fleuret, F. and Fua, P. , Multi-camera tracking and atypical motion detection with behavioral maps , in: The 10th European Conference on Computer Vision (ECCV 2008), Marseille, France, 2008.
Berclaz, J. , Fleuret, F. and Fua, P. , Multi-camera tracking and atypical motion detection with behavioral maps , in: Proceedings of the European Conference on Computer Vision (ECCV), pages 112-125, 2008.
Berclaz, J. , Fleuret, F. and Fua, P. , Principled Detection-by-classification from Multiple Views , in: Proceedings of the International Conference on Computer Vision Theory and Applications, pages 375-382, 2008.
Bertolami, R. and Bunke, H. , Ensemble methods to improve the performance of an English handwritten text line recognizer , in: Arabic and Chinese Handwriting Recognition, pages 265-277, Springer, 2008.
Bertolami, R. and Bunke, H. , Hidden Markov model based ensemble methods for offline handwritten text line recognition , in: Pattern Recognition, volume 41, number 11, pages 3452-3460, 2008.
Bertolami, R. and Bunke, H. , Including language model information in the combination of handwritten text line recognizers , in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 25-30, 2008.
Bertolami, R. and Bunke, H. , Integration of n-gram language models in multiple classifier systems for offline handwritten text line recognition , in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 22, number 7, pages 1301-1321, 2008.
Bertolami, R. , Gutmann, C. , Spitz, L. and Bunke, H. , Shape code based lexicon reduction for offline handwriting recognition , in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 158-163, 2008.
Besson, P. , Popovici, V. , Vesin, J. M. , Thiran, J. -Ph. and Kunt, M. , Extraction of audio features specific to speech production for multimodal speaker detection , in: IEEE Transactions on Multimedia, volume 10, number 1, pages 63-73, 2008.
Boakye, K. , Trueba-Hornero, B. , Vinyals, O. and Friedland, G. , Overlapped speech detection for improved speaker diarization in multiparty meetings , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Boakye, K. , Vinyals, O. and Friedland, G. , Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech , in: Interspeech, 2008.
Boakye, K. , Vinyals, O. and Friedland, G. , Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech , in: Interspeech 2008, Brisbane, Australia, pages 32-35, 2008.
Bologna, G. , Deville, B. , Vinckenbosch, M. and Pun, T. , A perceptual interface for vision substitution in a color matching experiment , in: Proceeding on IEEE IJCNN, IEEE World congress on computational intelligence, 2008.
Bologna, G. , Deville, B. , Vinckenbosch, M. and Pun, T. , Pairing colored socks and following a red serpentine with sounds of musical instruments , in: ICAD 08, International Conference on Auditory Displays, Paris, France, June 24--27, 2008.
Bourlard, H. , Chavarriaga, R. , Galán, F. and Millán, J. del R. , Characterizing the eeg correlates of exploratory behavior , in: IEEE Transactions on Neural Systems & Rehabilitation Engineering, 2008.
Bourlard, H. and Renals, S. , Recognition and understanding of meetings overview of the european ami and amida projects , in: LangTech 2008, Rome, 2008.
Breitenstein, M. D. , Kuettel, D. , Weise, T. , van Gool, L. and Pfister, H. , Real-time face pose estimation from single range images , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
Bruno, E. , Moënne-Loccoz, N. and Marchand-Maillet, S. , Design of multimodal dissimilarity spaces for retrieval of multimedia documents , in: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Bunke, H. , Dickinson, P. , Neuhaus, M. and Stettler, M. , Matching of hypergraphs -- algorithms, applications, and experiments , in: Applied Pattern Recognition, pages 131-154, Springer, 2008.
Camastra, F. and Vinciarelli, A. , Machine learning for audio, image and video analysis , Advanced Information and Knowledge Processing, volume XVI, Springer Verlag, ISBN 978-1-84800-006-3, 2008.
Caputo, B. , Class specific object recognition using kernel Gibbs distributions , in: Electronic Letters on Computer vision and Image Analysis, volume 7, number 2, pages 96-109, 2008.
Carincotte, C. , Naturel, X. , Hick, M. , Odobez, J. -M. , Yao, J. , Bastide, A. and Corbucci, B. , Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis , in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Beijing, 2008.
Carreras, A. , Cordara, G. , Delgado, J. , Dufaux, F. , Francini, G. , Ha, T. M. , Rodriguez, E. and Tous, R. , A search and retrieval framework for the management of copyrighted audiovisual content , in: 50th International Symposium ELMAR 2008, Zadar, Croatia, 2008.
Chanel, G. , Rebetez, C. , Betrancourt, M. and Pun, T. , Boredom, engagement and anxiety as indicators for adaptation to difficulty in games , in: ACM Mindtrek conference, 2008.
Chavarriaga, R. , Galán, F. and Millán, J. del R. , Asynchronous detection and classification of oscillatory brain activity , in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
Cornelis, N. , Leibe, B. , Cornelis, K. and van Gool, L. , 3d urban scene modeling integrating recognition and reconstruction , in: International Journal of Computer Vision, volume 78, number 2-3, pages 121-141, 2008.
van den Berg, M. , Koller-Meier, E. and van Gool, L. , Fast body posture estimation using volumetric features , in: IEEE Visual Motion Computing (MOTION), 2008.
Deville, B. , Bologna, G. , Vinckenbosch, M. and Pun, T. , Guiding the focus of attention of blind people with visual saliency , in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), Satellite Workshop of the European Conference on Computer Vision (ECCV 2008), Marseille, France, October 18, 2008.
Deville, B. , Bologna, G. , Vinckenbosch, M. and Pun, T. , Guiding the focus of attention of blind people with visual saliency , in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), 2008.
Dollé, L. , Khamassi, M. , Girard, B. , Guillot, A. and Chavarriaga, R. , Analyzing interactions between navigation strategies using a computational model of action selection , in: Spatial Cognition 2008 (SC '08), pages 71-86, Freiburg, Germany, 2008.
Dufaux, F. and Ebrahimi, T. , H.264/AVC Video Scrambling for Privacy Protection , in: IEEE International Conference on Image Processing (ICIP2008), San Diego, 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Démonstration : hephaistk, une boîte à outils pour le prototypage d'interfaces multimodales , 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Démonstration : hephaistk, une boîte à outils pour le prototypage d'interfaces multimodales , in: Proceedings of 20e Conférence sur l'Interaction Homme-Machine (IHM 08), pages 215-216, 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Prototyping multimodal interfaces with smuiml modeling language , in: Proceedings of CHI 2008 Workshop on UIDLs for Next Generation User Interfaces (CHI 2008 workshop), pages 63-66, 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Prototyping multimodal interfaces with smuiml modeling language , pages 63-66, 2008.
Dumas, B. , Lalanne, D. , Guinard, D. , Koenig, R. and Ingold, R. , Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces , in: Proceedings of 2nd international conference on Tangible and Embedded Interaction (TEI 2008), pages 47-54, 2008.
Dumas, B. , Lalanne, D. , Guinard, D. , Koenig, R. and Ingold, R. , Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces , pages 47-54, 2008.
Dutoit, T. , Couvreur, L. and Bourlard, H. , How does a dictation machine recognize speech ? , in: Applied Signal Processing--A MATLAB approach, pages 104-148, Springer MA, 2008.
Ess, A. , Leibe, B. , Schindler, K. and van Gool, L. , A mobile vision system for robust multi-person tracking , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
Estrella, P. , Popescu-Belis, A. and King, M. , Improving contextual quality models for mt evaluation based on evaluators' feedback. , in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
Faria, A. and Morgan, N. , Corrected tandem features for acoustic model training , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Faria, A. and Morgan, N. , Corrected Tandem Features for Acoustic Model Training , in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
Faria, A. and Morgan, N. , When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features , in: Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, 2008.
Favre, B. , Grishman, R. , Hillard, D. , Ji, H. , Hakkani-Tur, D. and Ostendorf, M. , Punctuating speech for information extraction , in: IEEE ICASSP, Las Vegas, NV, 2008.
Favre, S. , Salamin, H. , Vinciarelli, A. , Hakkani-Tur, D. and Garg, N. , Role recognition for meeting participants: an approach based on lexical information and social network analysis , in: ACM International Conference on Multimedia, Vancouver, Canada, 2008.
Favre, S. , Salamin, H. and Vinciarelli, A. , Role recognition in multiparty recordings using social affiliation networks and discrete distributions , in: The Tenth International Conference on Multimodal Interfaces (ICMI 2008), Chania, Greece, 2008.
Ferrez, P. W. and Millán, J. del R. , Eeg-based brain-computer interaction: improved accuracy by automatic single-trial error detection , in: Advances in Neural Information Processing Systems 20, pages 441-448, Cambridge, MA, 2008.
Ferrez, P. W. and Millán, J. del R. , Error-related eeg potentials generated during simulated brain-computer interaction , in: IEEE Transactions on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008.
Ferrez, P. W. and Millán, J. del R. , Error-Related EEG Potentials Generated During Simulated Brain-Computer Interaction , in: IEEE Trans. on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008.
Ferrez, P. W. and Millán, J. del R. , Simultaneous real-time detection of motor imagery and error-related potentials for improved bci accuracy , in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008.
Fleuret, F. , Berclaz, J. , Lengagne, R. and Fua, P. , Multi-Camera People Tracking with a Probabilistic Occupancy Map , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 30, number 2, pages 267-282, 2008.
Fleuret, F. and Geman, D. , Stationary features and cat detection , in: Journal of Machine Learning Research, 2008.
Fleuret, F. and Geman, D. , Stationary features and cat detection , in: Journal of Machine Learning Research (JMLR), volume 9, pages 2549-2578, 2008.
Friedland, G. and Vinyals, O. , Live speaker identification in conversations , in: ACM Multimedia 2008, Vancouver, Canada, pages 1017-1018, 2008.
Galán, F. , Nuttin, M. , Lew, E. , Ferrez, P. W. , Vanacker, G. , Philips, J. and Millán, J. del R. , A brain-actuated wheelchair: asynchronous and non-invasive brain-computer interfaces for continuous control of robots , in: Clinical Neurophysiology, number 119, pages 2159-2169, 2008.
Galán, F. , Nuttin, M. , Vanhooydonck, D. , Lew, E. , Ferrez, P. W. , Philips, J. and Millán, J. del R. , Continuous brain-actuated control of an intelligent wheelchair by human eeg , in: 4th International Brain-Computer Interface Workshop & Training Course, Graz University of Technology, Graz, Austria, 2008.
Galán, F. , Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs , University of Barcelona, 2008.
Gammeter, S. , Ess, A. , Jaeggli, T. , Leibe, B. , Schindler, K. and van Gool, L. , Articulated multibody tracking under egomotion , in: European Conference on Computer Vision (ECCV'08), Springer, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Autoregressive modelling of hilbert envelopes for wide-band audio coding , in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
Ganapathy, S. , Thomas, A. and Hermansky, H. , Front-end for far-field speech recognition based on frequency domain linear prediction , in: Interspeech 2008, Brisbane, Australia, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , number Idiap-RR-75-2008, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Modified Discrete Cosine Transform for Encoding Residual Signals in Frequency Domain Linear Prediction , number Idiap-RR-74-2008, 2008.
Ganapathy, S. , Thomas, S. and Hermansky, H. , Modulation Frequency Features For Phoneme Recognition In Noisy Speech , in: Journal of Acoustical Society of America - Express Letters, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain , in: INTERSPEECH 2008, Brisbane, Australia, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008.
Garg, N. and Hakkani-Tur, D. , Speaker role detection in meetings using lexical information and social network analysis , in: Technical Report TR-08-004, International Computer Science Institute, Berkeley, CA, 2008.
Garipelli, G. , Chavarriaga, R. and Millán, J. del R. , Fast recognition of anticipation related potentials , in: IEEE Transactions on Biomedical Engineering, 2008.
Garipelli, G. , Chavarriaga, R. and Millán, J. del R. , Recognition of anticipatory behavior from human eeg , in: 4th Intl. Brain-Computer Interface Workshop and Training Course, Graz University, Austria, 2008.
Garner, P. N. , A weighted finite state transducer tutorial , number Idiap-Com-03-2008, 2008.
Garner, P. N. , Silence models in weighted finite-state transducers , in: Interspeech, Brisbane, Australia, 2008.
Gatica-Perez, D. and Farrahi, K. , Daily routine classification from mobile phone data , in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), Utrecht, The Netherlands, 2008.
Gatica-Perez, D. and Farrahi, K. , Discovering human routines from cell phone data with topic models , in: IEEE International Symposium on Wearable Computers (ISWC), Pittsburgh, Pennsylvania, 2008.
Gatica-Perez, D. and Farrahi, K. , What did you do today? discovering daily routines from large-scale mobile data , in: ACM International Conference on Multimedia (ACMMM), Vancouver, 2008.
Gillick, D. , Hakkani-Tur, D. and Levit, M. , Unsupervised learning of edit parameters for matching name variants , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Goldmann, L. , Adamek, T. , Vajda, P. , Karaman, M. , Mörzinger, R. , Galmar, E. , Sikora, T. , O'Connor, N. , Ha-Minh, T. , Ebrahimi, T. , Schallauer, P. and Huet, B. , Towards Fully Automatic Image Segmentation Evaluation , in: Advanced Concepts for Intelligent Vision Systems (ACIVS), Springer, Juan-les-Pins, 2008.
Gonzalez, G. , Fleuret, F. and Fua, P. , Automated delineation of dendritic networks in noisy image stacks , in: Proceedings of the European Conference on Computer Vision (ECCV), pages 214-227, 2008.
Gonzalez, G. , Fleuret, F. and Fua, P. , Automated delineation of dendritic networks in noisy image stacks , in: The 10th European Conference on Computer Vision, Marseille, France, 2008.
Grandjean, D. and Pun, T. , Multimodality in emotions and for their assessment , 2008.
Grandvalet, Y. , Rakotomamonjy, A. , Keshet, J. and Canu, S. , Support Vector Machines with a Reject Option , in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008.
Grangier, D. and Bengio, S. , A discriminative kernel-based model to rank images from text queries , in: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2008.
Grangier, D. , Machine Learning for Information Retrieval , École Polytechnique Fédérale de Lausanne, 2008.
Grossmann, E. , Gaspar, J. -A. and Orabona, F. , Calibration from statistical properties of the visual world , in: European Conf. on Computer Vision, 2008.
Gui, L. , Thiran, J. -Ph. and Paragios, N. , Cooperative object segmentation and behavior inference in image sequences , in: International Journal of Computer Vision, ISSN 0920-5691, 2008.
Gurban, M. , Thiran, J. -Ph. , Drugman, T. and Dutoit, T. , Dynamic modality weighting for multi-stream HMMs in Audio-Visual Speech Recognition , in: 10th International Conference on Multimodal Interfaces, Chania, Greece, 2008.
Gurban, M. and Thiran, J. -Ph. , Using entropy as a stream reliability estimate for audio-visual speech recognition , in: 16th European Signal Processing Conference, Lausanne, Switzerland, 2008.
Hoffmann, U. , Vesin, J. M. , Ebrahimi, T. and Diserens, K. , An efficient p300-based brain-computer interface for disabled subjects , in: Journal of Neuroscience Methods, volume 167, number 1, pages 115-125, 2008.
Hoffmann, U. , Yazdani, A. , Vesin, J. M. and Ebrahimi, T. , Bayesian feature selection applied in a p300 brain-computer interface , in: 16th European Signal Processing Conference, Lausanne, 2008.
Hoffmann, U. , Naruniec, J. , Yazdani, A. and Ebrahimi, T. , Face Detection Using Discrete Gabor Jets And Color Information , in: SIGMAP 2008 - International Conference on Signal Processing and Multimedia Applications, Porto, 2008.
Humm, A. , Hennebert, J. and Ingold, R. , Combined handwriting and speech modalities for user authentication , in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 38, 2008.
Humm, A. , Modelling combined handwriting and speech modalities for user authentication , University of Fribourg, Switzerland, 2008.
Humm, A. , Hennebert, J. and Ingold, R. , Spoken signature for user authentication , in: SPIE Journal of Electronic Imaging, volume 17, 2008.
Hung, H. , Huang, Y. , Yeo, C. and Gatica-Perez, D. , Associating audio-visual activity cues in a dominance estimation framework , in: CVPR Workshop on Human Communicative Behavior, 2008.
Hung, H. , Huang, Y. , Friedland, G. and Gatica-Perez, D. , Estimating the dominant person in multi-party conversations using speaker diarization strategies , in: ICASSP 08, 2008.
Hung, H. , Huang, Y. , Friedland, G. and Gatica-Perez, D. , Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies , in: IEEE ICASSP, Las Vegas, NV, 2008.
Hung, H. and Gatica-Perez, D. , Identifying dominant people in meetings from audio-visual sensors , in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition, Special Session on Multimodal HCI for Smart Environments, 2008.
Hung, H. and Gatica-Perez, D. , Identifying dominant people in meetings from audio-visual sensors , in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition (FG), Special Session on Multi-Sensor HCI for Smart Environments, 2008.
Hung, H. , Jayagopi, D. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Investigating automatic dominance estimation in groups from visual attention and speaking activity , in: International Conference on Multimodal Interfaces (ICMI), 2008.
Hung, H. , Jayagopi, D. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Investigating automatic dominance estimation in groups from visual attention and speaking activity , in: Proc. ICMI, 2008.
Hung, H. and Friedland, G. , Towards audio-visual on-line diarization of participants in group meetings , in: European Conference on Computer Vision (ECCV) 2008, Marseille, France, 2008.
Indermühle, E. , Liwicki, M. and Bunke, H. , Recognition of handwritten historical documents: hmm-adaptation vs. writer specific training , in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 186-191, 2008.
Jayagopi, D. , Raducanu, B. and Gatica-Perez, D. , Characterizing conversational group dynamics using nonverbal behavior , in: Proc. IEEE Int. Conf. on Multimedia (ICME), 2008.
Jayagopi, D. , Hung, H. , Yeo, C. and Gatica-Perez, D. , Modeling dominance in group conversations from nonverbal activity cues , in: IEEE Trans. on Audio, Speech and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, accepted for publication, 2008.
Jayagopi, D. , Predicting the dominant clique in meetings through fusion of nonverbal cues , in: Proc. ACM Vancouver, Canada, 2008.
Jayagopi, D. , Hung, H. , Yeo, C. and Gatica-Perez, D. , Predicting the dominant clique in meetings through fusion of nonverbal cues , in: ACM MM 2008, Vancouver, Canada, 2008.
Jayagopi, D. , Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues , in: Proc. ICMI, 2008.
Jayagopi, D. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), Special Session on Social Signal Processing, 2008.
Kamangar, K. , Hakkani-Tur, D. , Tur, G. and Levit, M. , An iterative unsupervised learning method for information distillation , in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
Keshet, J. and Bengio, S. , Automatic speech and speaker recognition: large margin and kernel methods , John Wiley & Sons, 2008.
Ketabdar, H. and Bourlard, H. , Enhanced phone posteriors for improving speech recognition systems , number Idiap-RR-39-2008, 2008.
Ketabdar, H. , Enhancing posterior based speech recognition systems , Ecole Polytechnique Fédérale de Lausanne, 2008.
Ketabdar, H. and Bourlard, H. , Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Ketabdar, H. and Bourlard, H. , In-context phone posteriors as complementary features for tandem asr , in: ICSLP'08, Brisbane, Australia, 2008.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Can feature information interaction help for information fusion in multimedia problems? , in: First International Workshop on Metadata Mining for Image Understanding, pages 23-33, 2008.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Can feature information interaction help for information fusion in multimedia problems? , in: To appear in Multimedia Tools and Applications Journal special issue on "Metadata Mining for Image Understanding", 2008.
Kludas, J. , Marchand-Maillet, S. and Bruno, E. , Exploiting document feature interactions for efficient information fusion in high dimensional spaces , in: Proceedings of the First International Workshops on Image Processing Theory, Tools and Applications (IPTA'2008), 2008.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Exploiting synergistic and redundant features for multimedia document classification , in: 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), 2008.
Knox, M. , Morgan, N. and Mirghafori, N. , Getting the last laugh: automatic laughter segmentation in meetings , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Knox, M. , Morgan, N. and Mirghafori, N. , Getting the last laugh: automatic laughter segmentation in meetings , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 797-800, 2008.
Kokiopoulou, E. , Frossard, P. and Verscheure, O. , Fast keyword detection with sparse time-frequency models , in: IEEE Int. Conf. on Multimedia & Expo (ICME), 2008.
Kokiopoulou, E. , Pirillos, S. and Frossard, P. , Graph-based classification for multiple observations of transformed patterns , in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
Kokiopoulou, E. and Frossard, P. , Minimum distance between pattern transformation manifolds: algorithm and applications , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Kokiopoulou, E. , Frossard, P. and Gkorou, D. , Optimal polynomial filtering for accelerating distributed consensus , in: IEEE Int. Symp. on Information Theory (ISIT), 2008.
Kokiopoulou, E. and Frossard, P. , Semantic coding by supervised dimensionality reduction , in: IEEE Transactions on Multimedia, volume 10, number 2, 2008.
Kosinov, S. and Pun, T. , Distance-based discriminant analysis method and its applications , in: Pattern Analysis and Applications, volume 11, number 3-4, pages 227-246, 2008.
Kosinov, S. , Bruno, E. and Marchand-Maillet, S. , Spatially-consistent partial matching for intra- and inter-image prototype selection , in: To appear in Signal Processing: Image Communication special issue on "Semantic Analysis for Interactive Multimedia Services", 2008.
Koval, O. , Voloshynovskiy, S. , Beekhof, F. and Pun, T. , Analysis of physical unclonable identification based on reference list decoding , in: Steganography, and Watermarking of Multimedia Contents X, 2008.
Koval, O. , Voloshynovskiy, S. and Pun, T. , Privacy-preserving multimodal person and object identification , in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
Koval, O. , Voloshynovskiy, S. , Caire, F. and Bas, P. , Privacy-preserving multimodal person and object identification , in: MM&Sec 2008, 2008.
Koval, O. , Voloshynovskiy, S. , Beekhof, F. and Pun, T. , Security analysis of robust perceptual hashing , in: Steganography, and Watermarking of Multimedia Contents X, 2008.
Kryszczuk, K. and Drygajlo, A. , Credence estimation and error prediction in biometric identity verification , in: Signal Processing, volume 88, number 4, pages 916-925, 2008.
Kryszczuk, K. and Drygajlo, A. , Impact of feature correlations on separation between bivariate normal distributions , 2008.
Kryszczuk, K. and Drygajlo, A. , Impact of feature correlations on separation between bivariate normal distributions , in: 19th International Conference on Pattern Recognition, 2008.
Kryszczuk, K. and Drygajlo, A. , On quality of quality measures for classification , in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 19-28, 2008.
Kryszczuk, K. and Drygajlo, A. , On quality of quality measures for classification , pages 19-28, Springer, 2008.
Kryszczuk, K. and Drygajlo, A. , What do quality measures predict in biometrics , pages -,-29, 2008.
Kryszczuk, K. and Drygajlo, A. , What do quality measures predict in biometrics , in: 16th European Signal Processing Conference, 2008.
Kumatani, K. , McDonough, J. , Klakow, D. , Garner, P. N. and Li, W. , Adaptive beamforming with a maximum negentropy criterion , in: The Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2008.
Kumatani, K. , McDonough, J. , Rauch, B. , Klakow, D. , Garner, P. N. and Li, W. , Beamforming with a Maximum Negentropy Criterion , in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 5, pages 994-1008, 2008.
Kumatani, K. , McDonough, J. , Schacht, S. , Klakow, D. , Garner, P. N. and Li, W. , Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming , in: International Conference on Acoustics Speech and Signal Processing, 2008.
Kumatani, K. , McDonough, J. , Schacht, S. , Klakow, D. , Garner, P. N. and Li, W. , Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition , number Idiap-RR-02-2008, 2008.
Kumatani, K. , McDonough, J. , Klakow, D. , Garner, P. N. and Li, W. , Maximum negentropy beamforming , number Idiap-RR-07-2008, 2008.
Lalanne, D. , Rigamonti, M. , Ingold, R. , Evéquoz, F. and Dumas, B. , An ego-centric and tangible approach to meeting indexing and browsing , Lecture Notes in Computer Science, volume 4892, Springer Berlin / Heidelberg, ISBN 978-3-540-78154-7, 2008.
Leibe, B. , Schindler, K. , Cornelis, N. and van Gool, L. , Coupled object detection and tracking from static cameras and moving vehicles , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Leibe, B. , Ettlin, A. and Schiele, B. , Learning semantic object parts for object categorization , in: Image and Vision Computing, volume 26, number 1, pages 15-26, 2008.
Leibe, B. , Leonardis, A. and Schiele, B. , Robust object detection with interleaved categorization and segmentation , in: International Journal of Computer Vision, volume 77, number 1-3, pages 259-289, 2008.
Li, W. , Kumatani, K. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , A neural network based regression approach for recognizing simultaneous speech , in: Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
Li, W. , Kumatani, K. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , A neural network based regression approach for recognizing simultaneous speech , number Idiap-RR-10-2008, 2008.
Li, W. , Effective post-processing for single-channel frequency-domain speech enhancement , pages 149-152, 2008.
Li, W. , Effective post-processing of single-channel frequency-domain speech enhancement , in: IEEE conference on multimedia and expo, 2008.
Li, W. , Doss, M. M. , Dines, J. and Bourlard, H. , Mlp-based log spectral energy mapping for robust overlapping speech recognition , in: European Signal Processing Conference, 2008.
Li, W. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , Neural network based regression for robust overlapping speech recognition using microphone arrays , in: Interspeech, 2008.
Liwicki, M. and Bunke, H. , Combining on-line and off-line blstm networks for handwritten text line recognition , in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 31-36, 2008.
Liwicki, M. and Bunke, H. , Recognition of whiteboard notes -- online, offline and combination , World Scientific, ISBN 978-9812814531, 2008.
Liwicki, M. , Schlapbach, A. and Bunke, H. , Writer-dependent recognition of handwritten whiteboard notes in smart meeting room environments , in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 151-157, 2008.
Llonch, R. Sala , Kokiopoulou, E. , Tosic, I. and Frossard, P. , 3d face recognition using sparse spherical representations , in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
Luo, J. , Caputo, B. , Zweig, A. , Back, J. -H. and Anemuller, J. , Object category detection using audio-visual cues , in: International Conference on Computer Vision Systems (ICVS08), 2008.
Mariéthoz, J. , Bengio, S. and Grandvalet, Y. , Kernel Based Text-Independent Speaker Verification , number Idiap-RR-68-2008, 2008.
Matena, L. , Jaimes, A. and Popescu-Belis, A. , Graphical representation of meetings on mobile devices , in: MobileHCI 2008 Demonstrations (10th ACM International Conference on Human-Computer Interaction with Mobile Devices and Services), 2008.
Mesot, B. , Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits , Ecole Polytechnique Fédérale de Lausanne, 2008.
Mesot, B. , Switching linear dynamical systems for noise robust speech recognition of isolated digits , STI School of Engineering, EPFL, 2008.
Meynet, J. and Thiran, J. -Ph. , Ensembles of SVMs using an Information Theoretic Criterion , in: Pattern Recognition Letters, 2008.
Meynet, J. , Arsan, T. , Cruz Mota, J. and Thiran, J. -Ph. , Fast multi-view face tracking with pose estimation , in: 16th European Signal Processing Conference, Lausanne, 2008.
Meynet, J. and Thiran, J. -Ph. , Information Theoretic Combination of Classifiers , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Millán, J. del R. , Brain-controlled robots , in: IEEE International Conference on Robotics and Automation (ICRA 2008), Pasadena, CA, USA, 2008.
Millán, J. del R. , Brain-Controlled Robots , in: IEEE Intelligent Systems, 2008.
Millán, J. del R. , Ferrez, P. W. , Galán, F. , Lew, E. and Chavarriaga, R. , Non-invasive brain-machine interaction , in: International Journal of Pattern Recognition and Artificial Intelligence, 2008.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Semantic clustering of images using patterns of relevance feedback , in: Proceedings of the 6th International Workshop on Content-based Multimedia Indexing (CBMI'2008), 2008.
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Entropy coding of Quantized Spectral Components in FDLP audio codec , number Idiap-RR-71-2008, 2008.
Motlicek, P. , Ganapathy, S. , Hermansky, H. , Garudadri, H. and Athineos, M. , Perceptually motivated Sub-band Decomposition for FDLP Audio Coding , in: Text, Speech and Dialogue, pages 435-442, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
Naturel, X. and Odobez, J. -M. , Detecting queues at vending machines: a statistical layered approach , in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008.
Negoescu, R. -A. and Gatica-Perez, D. , Analyzing flickr groups , in: Proceedings of the 2008 international conference on Content-based image and video retrieval (CIVR '08), Sheraton Fallsview Hotel, Niagara Falls, Canada, 2008.
Negoescu, R. -A. and Gatica-Perez, D. , Topickr: Flickr Groups and Users Reloaded , in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008.
Nijholt, A. , Tan, D. , Allison, B. , Millán, J. del R. , Moore, M. and Graimann, B. , Brain-computer interfaces for hci and games , in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008.
Noris, B. , Benmachiche, K. and Billard, A. , Calibration-free eye gaze direction detection with gaussian processes , in: International Conference on Computer Vision Theory and Applications (VISAPP 2008), Funchal, Portugal, 2008.
Orabona, F. , Keshet, J. and Caputo, B. , The Projectron: a Bounded Kernel-Based Perceptron , in: Int. Conf. on Machine Learning, 2008.
Ouaret, M. , Dufaux, F. and Ebrahimi, T. , Enabling Privacy For Distributed Video Coding by Transform Domain Scrambling , in: 2008 SPIE Visual Communications and Image Processing, San Diego, USA, 2008.
Paiement, J. -F. , Grandvalet, Y. , Bengio, S. and Eck, D. , A Distance Model for Rhythms , in: 25th International Conference on Machine Learning (ICML), 2008.
Paiement, J. -F. , Grandvalet, Y. and Bengio, S. , Predictive Models for Music , number Idiap-RR-51-2008, 2008.
Paiement, J. -F. , Bengio, S. and Eck, D. , Probabilistic Models for Melodic Prediction , number Idiap-RR-50-2008, 2008.
Paiement, J. -F. , Probabilistic models for music , École Polytechnique Fédérale de Lausanne, 2008.
Parthasarathi, S. H. K. and Hermansky, H. , A data-driven approach to speech/non-speech detection , number Idiap-RR-23-2008, 2008.
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting Contextual Information for Speech/Non-Speech Detection , in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting temporal context for speech/non-speech detection , number Idiap-RR-21-2008, 2008.
Pellegrini, S. , Schindler, K. and Nardi, D. , A generalization of the icp algorithm for articulated bodies , in: British Machine Vision Conference (BMVC'08), 2008.
Perrin, X. , Chavarriaga, R. , Ray, C. , Siegwart, R. and Millán, J. del R. , A comparative psychophysical and eeg study of different feedback modalities for hri , in: Human-Robot Interaction (HRI08), 2008.
Perruchoud, L. , The Anterior Cingulate Cortex , number Idiap-Com-02-2008, 2008.
Pinto, J. P. and Hermansky, H. , Combining evidence from a generative and a discriminative model in phoneme recognition , in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Pinto, J. P. , Hermansky, H. , Yegnanarayana, B. and Magimai-Doss, M. , Exploiting contextual information for improved phoneme recognition , in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
Pinto, J. P. , Szoke, I. , Prasanna, S. R. Mahadeva and Hermansky, H. , Fast approximate spoken term detection from sequence of phonemes , in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore, 2008.
Pinto, J. P. , Sivaram, G. S. V. S. and Hermansky, H. , Reverse correlation for analyzing mlp posterior features in asr , in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008. [DOI]
Popescu-Belis, A. , Dimensionality of dialogue act tagsets: an empirical analysis of large corpora , in: Language Resources and Evaluation, volume 42, number 1, pages 99-107, 2008. [DOI]
Popescu-Belis, A. , Bourlard, H. and Renals, S. , Machine learning for multimodal interaction iv , LNCS, volume 4892, Springer-Verlag, ISBN 978-3-540-78154-7, 2008.
Popescu-Belis, A. and Stiefelhagen, R. , Machine learning for multimodal interaction v , LNCS, volume 5237, Springer-Verlag, ISBN 978-3-540-85852-2, 2008.
Popescu-Belis, A. , Reference-based vs. task-based evaluation of human language technology , in: LREC 2008 ELRA Workshop on Evaluation: "Looking into the Future of Evaluation: When automatic metrics meet task-based and performance-based approaches", pages 12-16, ELRA, 2008.
Popescu-Belis, A. , Flynn, M. , Wellner, P. and Baudrion, P. , Task-based evaluation of meeting browsers: from bet task elicitation to user behavior analysis , in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
Prodanov, P. , Drygajlo, A. , Richiardi, J. and Alexander, A. , Low-level grounding in a multimodal mobile service robot conversational system using graphical models , in: Intelligent Service Robotics, volume 1, pages 3-26, 2008. [DOI]
Pronobis, M. and Magimai-Doss, M. , Integrating audio and vision for robust automatic gender recognition , number Idiap-RR-73-2008, 2008.
Pronobis, A. , Martinez Monos, O. and Caputo, B. , SVM-based Discriminative Accumulation Scheme for Place Recognition , in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008.
Quack, T. , Bay, H. and van Gool, L. , Object recognition for the internet of things , in: Internet of Things 2008, 2008.
Quack, T. , Leibe, B. and van Gool, L. , World-scale mining of objects and events from community photo collections , in: Conference on Image and Video Retrieval (CIVR'08), ACM, 2008.
Rakotomamonjy, A. , Bach, F. , Canu, S. and Grandvalet, Y. , SimpleMKL , in: Journal of Machine Learning Research, volume 9, pages 2491-2521, 2008.
Rayner, M. , Tsourakis, N. , Georgescul, M. and Bouillon, P. , Building mobile spoken dialogue applications using regulus , in: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), 2008.
Richiardi, J. , Drygajlo, A. and Todesco, L. , Promoting diversity in gaussian mixture ensembles: an application to signature verification , pages 140-149, Springer, 2008.
Richiardi, J. , Drygajlo, A. and Todesco, L. , Promoting diversity in gaussian mixture ensembles: an application to signature verification , in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 140-149, 2008.
Riedhammer, K. , Gillick, D. , Favre, B. and Hakkani-Tur, D. , Packing the meeting summarization knapsack , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Rigamonti, M. , A framework for structuring multimedia archives and for browsing efficiently through multimodal links , University of Fribourg, Switzerland, 2008.
Roth, D. , Koller-Meier, E. , Rowe, D. , Moeslund, T. B. and van Gool, L. , Event-based tracking evaluation metric , in: IEEE Workshop on Motion and Video Computing (WMVC), 2008.
Scaringella, N. , Timbre and Rhythmic TRAP-TANDEM features for music information retrieval , in: Int. Conf. on Music Information Retrieval (ISMIR), 2008.
Schindler, K. and van Gool, L. , Action snippets: how many frames does human action recognition require? , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
Schindler, K. and van Gool, L. , Combining densely sampled form and motion for human action recognition , in: DAGM Annual Pattern Recognition Symposium, Springer, 2008.
Schindler, K. and Suter, D. , Object detection by global contour shape , in: Pattern Recognition, 2008.
Schindler, K. , van Gool, L. and de Gelder, B. , Recognizing emotions expressed by body pose: a biologically inspired neural model , in: Neural Networks, 2008.
Schlapbach, A. , Liwicki, M. and Bunke, H. , A writer identification system for on-line whiteboard data , in: Pattern Recognition, volume 41, pages 2381-2397, 2008.
Schlapbach, A. , Wettstein, F. and Bunke, H. , Automatic estimation of the readability of handwritten text , in: Proc. 16th European Signal Processing Conference, 2008.
Schlapbach, A. , Bunke, H. and Wettstein, F. , Estimating the readability of handwritten text -- a support vector regression based approach , in: Proc. 19th Int. Conf. on Pattern Recognition, IEEE, 2008.
Schlapbach, A. and Bunke, H. , Off-line writer identification and verification using gaussian mixture models , in: Machine Learning in Document Analysis and Recognition, pages 409-428, Springer, 2008.
Schlapbach, A. , Writer identification and verification , volume 311, IOS Press, ISBN 978-1-58603-825-0, 2008.
Schouten, B. , Juul, N. , Drygajlo, A. and Tistarelli, M. , Biometrics and identity management , Springer, 2008.
Shahrokni, A. , Drummond, T. , Fleuret, F. and Fua, P. , Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Shriberg, E. , Higher level features in speaker recognition , in: C. Muller (Ed.), Speaker Classification I, Springer-Verlag, New York, 2008.
De Simone, F. , Ticca, D. , Dufaux, F. , Ansorge, M. and Ebrahimi, T. , A comparative study of color image compression standards using perceptually driven quality metrics , in: SPIE Optics and Photonics, San Diego, CA, USA, 2008.
De Simone, F. , Ansorge, M. and Ebrahimi, T. , A multi-channel objective model for the full-reference assessment of color pictures , in: 2nd K-space Jamboree Workshop, Paris, 2008.
Singla, A. and Hakkani-Tur, D. , Cross-lingual sentence extraction for information distillation , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Sivaram, G. S. V. S. and Hermansky, H. , Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition , in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
Sivaram, G. S. V. S. and Hermansky, H. , Introducing temporal asymmetries in feature extraction for automatic speech recognition , in: Interspeech 2008, Brisbane, Australia, 2008.
Smith, K. , Ba, S. , Gatica-Perez, D. and Odobez, J. -M. , Tracking the visual focus of attention for a varying number of wandering people , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, volume 30, number 7, pages 1212-1229, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , Affective characterization of movie scenes based on multimedia content analysis and user's physiological emotional responses , in: IEEE International Symposium on Multimedia, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , Affective ranking of movie scenes using physiological signals and content analysis , in: 2nd ACM Workshop on the Many Faces of Multimedia Semantics, ACM MM08, 2008.
Soleymani, M. , Kierkels, J. , Chanel, G. , Bruno, E. , Marchand-Maillet, S. and Pun, T. , Estimating emotions and tracking interest during movie watching based on multimedia content and physiological responses , in: Joint (IM)2-Interactive Multimodal Information Management and Affective Sciences NCCRs meeting, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , Valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses , in: MLMI 2008, 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , Valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses , in: 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
Sorci, M. , Antonini, G. , Cerretani, B. , Cruz Mota, J. , Rubin, T. , Bierlaire, M. and Thiran, J. -Ph. , Modelling human perception of static facial expressions , in: Face and Gesture Recognition 2008, Amsterdam, 2008.
Spindler, T. , Wartmann, C. , Hovestadt, L. , Roth, D. , van Gool, L. and Steffen, A. , Privacy in video surveilled spaces , in: Journal of Computer Security, volume 16, number 2, pages 199-222, 2008.
Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Janin, A. , Magimai-Doss, M. , Wooters, C. and Zheng, J. , The SRI-ICSI spring 2007 meeting and lecture recognition system , in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
Stoyanchev, S. , Tur, G. and Hakkani-Tur, D. , Name-aware speech recognition for interactive question answering , in: IEEE ICASSP, Las Vegas, NV, 2008.
Szafranski, M. , Grandvalet, Y. and Rakotomamonjy, A. , Composite Kernel Learning , in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), pages 1040-1047, Omnipress, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based features for far-field speech recognition , in: MLMI 2008, Utrecht, The Netherlands, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech , in: Interspeech 2008, Brisbane, Australia, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Recognition of reverberant speech using frequency domain linear prediction , in: IEEE Signal Processing Letters, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain , in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
Thomas, A. , Ferrari, V. , Leibe, B. , Tuytelaars, T. and van Gool, L. , Using recognition to guide a robot's attention , in: Robotics Science and Systems, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach , number Idiap-RR-77-2008, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , Cue Integration for Medical Image Annotation , in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , Discriminative cue integration for medical image annotation , in: Pattern Recognition Letters, 2008.
Torii, A. , Havlena, M. , Pajdla, T. and Leibe, B. , Measuring camera translation by the dominant apical angle , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
Tous, R. , Carreras, A. , Delgado, J. , Cordara, G. , Gianluca, F. , Peig, E. , Dufaux, F. and Galinski, G. , An Architecture for TV Content Distributed Search and Retrieval Using the MPEG Query Format (MPQF) , in: International Workshop on Ambient Media Delivery and Interactive Television (AMDIT 2008), Quebec City, Canada, 2008.
Tsourakis, N. , Lisowska, A. , Bouillon, P. and Rayner, M. , From desktop to mobile: adapting a successful voice interaction platform for use in mobile devices , in: Third ACM MobileHCI Workshop on Speech in Mobile and Pervasive Environments (SiMPE), Amsterdam, the Netherlands, 2008.
Ullah, M. M. , Pronobis, A. , Caputo, B. , Luo, J. , Jensfelt, P. and Christensen, H. I. , Towards Robust Place Recognition for Robot Localization , in: IEEE International Conference on Robotics and Automation, 2008.
Valente, F. and Hermansky, H. , Hierarchical and parallel processing of modulation spectrum for asr applications , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
Valente, F. and Hermansky, H. , On the combination of auditory and modulation frequency channels for asr applications , in: Interspeech 2008, Brisbane, Australia, 2008.
Vergyri, D. , Mandal, A. , Wang, W. , Stolcke, A. , Zheng, J. , Graciarena, M. , Rybach, D. , Gollan, C. , Schlüter, R. , Kirchhoff, K. , Faria, A. and Morgan, N. , Development of the sri/nightingale arabic asr system , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 1437-1440, 2008.
Vergyri, D. , Mandal, A. , Wang, W. , Stolcke, A. , Zheng, J. , Graciarena, M. , Rybach, D. , Gollan, C. , Schlüter, R. , Kirchhoff, K. , Faria, A. and Morgan, N. , Development of the sri/nightingale arabic asr system , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Combination of agglomerative and sequential clustering for speaker diarization , in: International Conference on Acoustics, Speech and Signal Processing, 2008.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization , in: Interspeech 2008, 2008.
Vinciarelli, A. , Pantic, M. , Bourlard, H. and Pentland, A. , Social signal processing: state-of-the-art and future perspectives of an emerging domain , in: Proceedings of the ACM International Conference on Multimedia, 2008.
Vinciarelli, A. , Pantic, M. , Bourlard, H. and Pentland, A. , Social signals, their function, and automatic analysis: a survey , in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008.
Vinyals, O. and Friedland, G. , A hardware-independent fast logarithm approximation with adjustable accuracy , in: 10th IEEE International Symposium on Multimedia, Berkeley, CA, USA, pages 61-65, 2008.
Vinyals, O. and Friedland, G. , Live speaker identification in meetings: "who is speaking now?" , in: Technical Report TR-08-001, International Computer Science Institute, Berkeley, CA, 2008.
Vinyals, O. and Friedland, G. , Modulation spectrogram features for speaker diarization , in: to appear in proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Vinyals, O. and Friedland, G. , Modulation spectrogram features for speaker diarization , in: Interspeech 2008, Brisbane, Australia, pages 630-633, 2008.
Vinyals, O. and Friedland, G. , Towards semantic analysis of conversations: a system for the live identification of speakers in meetings , in: to appear in Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, CA, 2008.
Voloshynovskiy, S. , Koval, O. , Villán, R. , Beekhof, F. and Pun, T. , Authentication of biometric identification documents via mobile devices , in: Journal of Electronic Imaging, 2008.
Voloshynovskiy, S. , Koval, O. and Pun, T. , Multimodal authentication based on random projections and distributed coding , in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
Voloshynovskiy, S. , Koval, O. , Beekhof, F. and Pun, T. , Multimodal authentication based on random projections and distributed coding , in: MM&Sec 2008, 2008.
Weinshall, D. , Hermansky, H. , Zweig, A. , Luo, J. , Jimison, H. , Ohl, F. and Pavel, M. , Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree , in: Advances in Neural Information Processing Systems 21, 2008.
Weise, T. , Leibe, B. and van Gool, L. , Accurate and robust registration for in-hand modeling , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
Wooters, C. and Huijbregts, M. , The ICSI RT07s speaker diarization system , in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
Yao, J. and Odobez, J. -M. , Fast human detection from videos using covariance features , in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008.
Yao, J. and Odobez, J. -M. , Multi-camera 3d person tracking with particle filter in a surveillance environment , in: 16th European Signal Processing Conference (EUSIPCO), 2008.
Zeng, G. and van Gool, L. , Multi-label image segmentation via point-wise repetition , in: International Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
Zhao, S. and Morgan, N. , Multi-stream spectro-temporal features for robust speech recognition , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Zhao, S. Y. and Morgan, N. , Multi-stream spectro-temporal features for robust speech recognition , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 898-901, 2008.
Bogdanova, I. , Bur, A. and Hügli, H. , The spherical approach to omnidirectional visual attention , in: XVI European Signal Processing Conference (EUSIPCO 2008), 2008.
Bogdanova, I. , Bur, A. and Hügli, H. , Visual attention on the sphere [in press] , in: IEEE Transactions on Image Processing, 2008.
Varga, T. and Bunke, H. , Perturbation models for generating synthetic training data in handwriting recognition , in: Machine Learning in Document Analysis and Recognition, pages 333-360, Springer, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , An SVM Confidence-Based Approach to Medical Image Annotation , in: Evaluating Systems for Multilingual and Multimodal Information Access -- 9th Workshop of the Cross-Language Evaluation Forum, 2008.
Popescu-Belis, A. , Bourlard, H. and Renals, S. , Machine learning for multimodal interaction iv (revised selected papers from mlmi 2007, brno, 28-30 june 2007) , LNCS 4892, Springer-Verlag, 2008.
Popescu-Belis, A. and Stiefelhagen, R. , Machine learning for multimodal interaction v (proceedings of mlmi 2008, utrecht, 8-10 september 2008) , LNCS 5237, Springer-Verlag, 2008.
Popescu-Belis, A. , Boertjes, E. , Kilgour, J. , Poller, P. , Castronovo, S. , Wilson, T. , Jaimes, A. and Carletta, J. , The amida automatic content linking device: just-in-time document retrieval in meetings , in: Machine Learning for Multimodal Interaction V (Proceedings of MLMI 2008, Utrecht, 8-10 September 2008), pages 273-284, Springer-Verlag, 2008.
Popescu-Belis, A. , Boertjes, E. , Kilgour, J. , Poller, P. , Castronovo, S. , Wilson, T. , Jaimes, A. and Carletta, J. , The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings , in: Machine Learning for Multimodal Interaction V, pages 272-283, Springer-Verlag, Utrecht, 2008. [DOI]
Popescu-Belis, A. , Baudrion, P. , Flynn, M. and Wellner, P. , Towards an objective test for meeting browsers: the bet4tqb pilot experiment , in: Machine Learning for Multimodal Interaction IV, pages 108-119, Springer-Verlag, 2008. [DOI]
Aloise, F. , Caporusso, N. , Mattia, D. , Babiloni, F. , Kauhanen, L. , Millán, J. del R. , Nuttin, M. , Marciani, M. G. and Cincotti, F. , Brain-machine interfaces through control of electroencephalographic signals and vibrotactile feedback , in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007.
Anguera, X. , Wooters, C. and Hernando, J. , Acoustic Beamforming for Speaker Diarization of Meetings , in: to appear in IEEE Transactions on Audio, Speech and Language Processing, 2007.
Anguera, X. , Wooters, C. , Pardo, J. M. and Hernando, J. , Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings , in: Proc. ICASSP, Honolulu, 2007.
Anguera, X. , Shinozaki, T. , Wooters, C. and Hernando, J. , Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization , in: Proc. ICASSP, Honolulu, 2007.
Ansari-Asl, K. , Chanel, G. and Pun, T. , A channel selection method for eeg classification in emotion assessment based on synchronization likelihood , in: Eusipco 2007, 15th Eur. Signal Proc. Conf., 2007.
Aradilla, G. , Vepa, J. and Bourlard, H. , An acoustic model based on kullback-leibler divergence for posterior features , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
Aradilla, G. and Ajmera, J. , Detection and recognition of number sequences within spoken utterances , in: 2nd Workshop on Speech in Mobile and Pervasive Environments, 2007.
Aradilla, G. and Bourlard, H. , Posterior-based features and distances in template matching for speech recognition , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 204-214, 2007. [DOI]
Ba, S. , Joint head tracking and pose estimation for visual focus of attention recognition , École Polytechnique Fédérale de Lausanne, 2007.
Ba, S. and Odobez, J. -M. , Probabilistic head pose tracking evaluation in single and multiple camera setups , in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007.
Bay, H. , Ess, A. , Tuytelaars, T. and van Gool, L. , Speeded-up robust features (surf) , in: Computer Vision and Image Understanding (CVIU), 2007.
Behera, A. , Lalanne, D. and Ingold, R. , Docmir: an automatic document-based indexing system for meeting retrieval , in: Multimedia Tools and Applications, volume 37, number 2, 2007.
Bengio, S. and Mariéthoz, J. , Biometric person authentication is a multiple classifier problem , in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007.
Bertini, E. , Hertzog, P. and Lalanne, D. , Spiralview: a visual tool to improve monitoring and understanding of security data in corporate , in: IEEE Symposium on Visual Analytics Science and Technology 2007 (VAST'07), pages to appear, 2007.
Bertolami, R. and Bunke, H. , Multiple classifier methods for offline handwritten text line recognition , in: Multiple Classifier Systems, pages 72-81, Springer, 2007.
Bertolami, R. , Uchida, S. , Zimmermann, M. and Bunke, H. , Non-uniform slant correction for handwritten text line recognition , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 18-22, 2007.
Besson, P. , Popovici, V. , Vesin, J. M. , Thiran, J. -Ph. and Kunt, M. , Extraction of audio features specific to speech production for multimodal speaker detection , in: IEEE Transactions on Multimedia, 2007. [DOI]
Bogdanova, I. , Bresson, X. , Thiran, J. -Ph. and Vandergheynst, P. , Scale-space analysis and active contours for omnidirectional images , in: IEEE Transactions on Image Processing, volume 16, number 7, pages 1888-1901, 2007. [DOI]
Bologna, G. , Deville, B. , Pun, T. and Vinckenbosch, M. , Identifying major components of pictures by audio encoding of colors , in: IWINAC2007, 2nd. Int. Work-conf. on the Interplay between Natural and Artificial Computation, 2007.
Bologna, G. , Deville, B. , Pun, T. and Vinckenbosch, M. , Transforming 3d coloured pixels into musical instrument notes for vision substitution applications , in: Eurasip J. of Image and Video Processing, Special Issue: Image and Video Processing for Disability, accepted for publication, 2007.
Bouillon, P. , Flores, G. , Starlander, M. , Chatzichrisafis, N. , Santaholma, M. , Tsourakis, N. , Rayner, M. and Hockey, B. A. , A bidirectional grammar-based medical speech translator , in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 41-48, ACL 2007, Prague, Czech Republic, 2007.
Bouillon, P. , Chatzichrisafis, N. , Halimi, S. , Hockey, B. A. , Isahara, H. , Kanzaki, K. , Nakao, Y. , Novellas Vall, B. , Rayner, M. , Santaholma, M. and Starlander, M. , Medslt: a multi-lingual grammar-based medical speech translator , in: Proceedings of First International Workshop on Intercultural Collaboration, IWIC2007, Kyoto, Japan, 2007.
Bouillon, P. , Rayner, M. , Novellas Vall, B. , Starlander, M. , Santaholma, M. , Nakao, Y. and Chatzichrisafis, N. , Une grammaire partagée multi-tâche pour le traitement de la parole : application aux langues romanes , in: TAL (Traitement Automatique des Langues), volume 47, number 3, 2007.
Bray, M. , Koller-Meier, E. and van Gool, L. , Smart particle filtering for high-dimensional tracking , in: Computer Vision and Image Understanding, 2007.
Bresson, X. , Esedoglu, S. , Vandergheynst, P. , Thiran, J. -Ph. and Osher, S. , Fast Global Minimization of the Active Contour/Snake Model , in: Journal of Mathematical Imaging and Vision, volume 28, number 2, pages 151-167, 2007. [DOI]
Broschart, M. , de Negueruela, C. , Millán, J. del R. and Menon, C. , Augmenting astronaut's capabilities through brain-machine interfaces , in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007.
Bruno, E. , Kludas, J. and Marchand-Maillet, S. , Combining multimodal preferences for multimedia information retrieval , in: ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2007.
Bruno, E. , Kludas, J. and Marchand-Maillet, S. , Combining multimodal preferences for multimedia information retrieval , in: Proc. of International Workshop on Multimedia Information Retrieval, 2007.
Bunke, H. and Neuhaus, M. , Graph matching -- exact and error-tolerant methods and the automatic learning of edit costs , in: Mining Graph Data, pages 17-34, Wiley, 2007.
Bunke, H. , Dickinson, P. , Humm, A. , Irniger, C. and Kraetzl, M. , Graph sequence visualisation and its application to computer network monitoring and abnormal event detection , in: Applied Graph Theory in Computer Vision and Pattern Recognition, pages 227-245, Springer, 2007.
Bunke, H. and Varga, T. , Off-line Roman cursive handwriting recognition , in: Digital Document Processing: Major Directions and Recent Advances, volume 20, pages 165-173, 2007.
Cetin, O. , Kantor, A. , King, S. , Bartels, C. , Magimai-Doss, M. , Frankel, J. and Livescu, K. , An Articulatory Feature-based Tandem Approach and Factored Observation Modeling , in: Proc. ICASSP, Honolulu, 2007.
Chanel, G. , Ansari-Asl, K. and Pun, T. , Valence-arousal evaluation using physiological signals in an emotion recall paradigm , in: 2007 IEEE SMC, Int. Conf. on Systems, Man and Cybernetics, Smart cooperative systems and cybernetics: advancing knowledge and security for humanity, 2007.
Chavarriaga, R. , Ferrez, P. W. and Millán, J. del R. , To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
Chavarriaga, R. , Ferrez, P. W. and Millán, J. del R. , To err is human: learning from error potentials in brain-computer interfaces , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
Chen, L. , Barber, D. and Odobez, J. -M. , Dynamical dirichlet mixture model , number 02, 2007.
Chiappa, S. and Barber, D. , Bayesian factorial linear gaussian state-space models for biosignal decomposition , in: IEEE Signal Processing Letters, 2007.
Cincotti, F. , Mattia, D. , Aloise, F. , Bufalari, S. , Astolfi, L. , Fallani, F. De Vico , Tocci, A. , Bianchi, L. , Marciani, M. G. , Gao, S. , Millán, J. del R. and Babiloni, F. , High-resolution eeg techniques for brain-computer interface applications , in: Journal of Neuroscience Methods, volume 167, pages 31-42, ISSN 0165-0270, 2007.
Cincotti, F. , Kauhanen, L. and Aloise, F. , Vibrotactile feedback for brain-computer interface operation , in: Computational Intelligence and Neuroscience, volume 2007, pages Article ID, 2007.
Cuendet, S. , Shriberg, E. , Favre, B. , Fung, J. and Hakkani-Tur, D. , An analysis of sentence segmentation features for broadcast news, broadcast conversations, and meetings , in: SIGIR Workshop on Searching Conversational Spontaneous Speech, 2007.
Cuendet, S. , Hakkani-Tur, D. and Shriberg, E. , Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech , in: to appear in Proceedings of MLMI, Brno, Czech Republic, 2007.
Cuendet, S. , Hakkani-Tur, D. , Shriberg, E. , Fung, J. and Favre, B. , Cross-Genre Feature Comparisons for Spoken Sentence Segmentation , in: International Conference on Semantic Computing (ICSC), Irvine, CA, 2007.
Dessimoz, D. , Richiardi, J. , Champod, C. and Drygajlo, A. , Multimodal biometrics for identity documents (MBioID) , in: Forensic Science International, volume 167, pages 154-159, 2007.
Dines, J. and Magimai-Doss, M. , A study of phoneme and grapheme based context-dependent asr systems , number 12, 2007.
Dines, J. and Vepa, J. , Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics , number 13, 2007.
Dornhege, G. , Millán, J. del R. , Hinterberger, T. , McFarland, D. and Müller, K. -R. , Towards brain-computer interfacing , The MIT Press, 2007.
Drugman, T. , Gurban, M. and Thiran, J. -Ph. , Relevant Feature Selection for Audio-Visual Speech Recognition , in: 9th International Workshop on Multimedia Signal Processing (MMSP), Chania, Crete, Greece, 2007.
Drygajlo, A. , Man-machine voice communication , pages 433-461, EPFL Press, 2007.
Drygajlo, A. , Multimodal biometrics for identity documents and smart cards: european challenge , in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
Einsele, F. , Hennebert, J. and Ingold, R. , Towards identification of very low resolution, anti-aliased characters , in: IEEE International Symposium on Signal Processing and its Applications (ISSPA'07), Sharjah, United Arab Emirates, 2007.
Ess, A. , Leibe, B. and van Gool, L. , Depth and appearance for mobile scene analysis , in: International Conference on Computer Vision (ICCV'07), 2007.
Ess, A. , Neubeck, A. and van Gool, L. , Generalised linear pose estimation , in: BMVC, 2007.
Evéquoz, F. and Lalanne, D. , Indexing and visualizing digital memories through personal email archive , pages 21-24, 2007.
Evéquoz, F. and Lalanne, D. , Personal information management through interactive visualizations , pages 158-160, 2007.
Ferrez, P. W. , Error-related eeg potentials in brain-computer interfaces , École Polytechnique Fédérale de Lausanne, 2007.
Ferrez, P. W. and Millán, J. del R. , Error-related eeg potentials in brain-computer interfaces , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Frapolli, F. , Hirsbrunner, B. and Lalanne, D. , Dynamic rules: towards interactive games intelligence , in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 29-32, 2007.
Galán, F. , Nuttin, M. , Lew, E. , Ferrez, P. W. , Vanacker, G. , Philips, J. , van Brussel, H. and Millán, J. del R. , An asynchronous and non-invasive brain-actuated wheelchair , in: Proceedings of the 13th International Symposium on Robotics Research, 2007.
Galán, F. , Ferrez, P. W. , Oliva, F. , Guàrdia, J. and Millán, J. del R. , Feature extraction for multi-class bci using canonical variates analysis , number 23, 2007.
Galán, F. , Palix, J. , Chavarriaga, R. , Ferrez, P. W. , Lew, E. , Hauert, C. -A. and Millán, J. del R. , Visuo-spatial attention frame recognition for brain-computer interfaces , in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007.
Gaudard, C. , Aradilla, G. and Bourlard, H. , Speech recognition based on template matching and phone posterior probabilities , number 02, 2007.
Georgescul, M. , Clark, A. and Armstrong, S. , Exploiting structural meeting-specific features for topic segmentation , in: Actes de la 14ème Conférence sur le Traitement Automatique des Langues Naturelles, Toulouse, France, 2007.
Gerber, M. , Kaufmann, T. and Pfister, B. , Perceptron-based class verification , in: Proceedings of NOLISP (ISCA Workshop on non linear speech processing), 2007.
Gerber, M. , Beutler, R. and Pfister, B. , Quasi text-independent speaker verification based on pattern matching , in: Proceedings of Interspeech, ISCA, 2007.
Germann, M. , Breitenstein, M. D. , Park, I. K. and Pfister, H. , Automatic pose estimation for range images on the gpu , in: Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007), pages 81-90, IEEE Computer Society, 2007.
Grangier, D. and Bengio, S. , Learning the inter-frame distance for discriminative template-based keyword detection , in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007.
Graves, A. , Liwicki, M. and Bunke, H. , Unconstrained on-line handwriting recognition with recurrent neural networks , in: Advances in Neural Information Processing, 2007.
Gurban, M. , Valles, A. and Thiran, J. -Ph. , Low-Dimensional Motion Features for Audio-Visual Speech Recognition , in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, 2007.
Guz, U. , Cuendet, S. , Hakkani-Tur, D. and Tur, G. , Co-training Using Prosodic and Lexical Information for Sentence Segmentation , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Hakkani-Tur, D. and Tur, G. , Statistical Sentence Extraction for Information Distillation , in: Proc. ICASSP, Honolulu, 2007.
Hennebert, J. , Loeffel, R. , Humm, A. and Ingold, R. , A new forgery scenario based on regaining dynamics of signature , in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
Hennebert, J. , Humm, A. and Ingold, R. , Modelling spoken signatures with gaussian mixture model adaptation , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07), 2007.
Hennebert, J. , Please repeat: my voice is my password. from the basics to real-life implementations of speaker verification technologies , in: Invited lecture at the Information Security Summit (IS2 2007), Prague, 2007.
Heusch, G. and Marcel, S. , A novel statistical generative model dedicated to face recognition , number Idiap-RR-39-2007, 2007.
Heusch, G. and Marcel, S. , Face authentication with salient local features and static bayesian network , in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007.
Hoffmann, U. , Vesin, J. M. and Ebrahimi, T. , Recent advances in brain-computer interfaces , in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
Huang, Y. , Vinyals, O. , Friedland, G. , Müller, C. , Mirghafori, N. and Wooters, C. , A Fast-Match approach for robust, faster than real-time Speaker Diarization , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Huang, Y. , Robust and rapid speaker diarization , in: Master Thesis, University of California, Berkeley, 2007.
Huang, Y. , Friedland, G. , Müller, C. and Mirghafori, N. , Speeding up speaker diarization by using prosodic features , in: Technical Report TR-07-004, International Computer Science Institute, Berkeley, California, 2007.
Huijbregts, M. , Wooters, C. and Ordelman, R. , Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Huijbregts, M. and Wooters, C. , The Blame Game: Performance Analysis of Speaker Diarization System Components , in: to appear in Proc. Interspeech, Antwerp, 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Database and evaluation protocols for user authentication using combined handwriting and speech modalities , 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Hidden markov models for spoken signature verification , 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Modelling combined handwriting and speech modalities , in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Spoken handwriting verification using statistical models , in: Accepted for publication, International Conference on Document Analysis and Recognition (ICDAR 07), Curitiba Brazil, 2007.
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in a group meeting , 2007.
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in a group meeting , in: Proc. ACM Multimedia, Augsburg, Germany, 2007.
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in meetings , in: Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany, 2007.
Hwang, M. -Y. , Peng, G. , Wang, W. , Faria, A. , Heidel, A. and Ostendorf, M. , Building a Highly Accurate Mandarin Speech Recognizer , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Hérault, R. and Grandvalet, Y. , Sparse probabilistic classifiers , in: International Conference on Machine Learning (ICML), 2007.
Jaeggli, T. , Koller-Meier, E. and van Gool, L. , Learning generative models for monocular body pose estimation , in: ACCV, 2007.
Jaeggli, T. , Koller-Meier, E. and van Gool, L. , Multi-activity tracking in lle body pose space , in: 2nd Workshop on HUMAN MOTION Understanding, Modeling, Capture and Animation, ICCV, 2007.
Jaimes, A. , Gatica-Perez, D. , Sebe, N. and Huang, T. S. , Guest Editors' Introduction: Human-Centered Computing-Toward a Human Revolution , in: Computer, volume 40, number 5, pages 30-34, 2007.
Jaimes, A. , Gatica-Perez, D. , Sebe, N. and Huang, T. S. , Human-centered computing: toward a human revolution , in: IEEE Computer, volume 40, number 5, 2007.
Kaufmann, T. and Pfister, B. , An HPSG parser supporting discontinuous licenser rules , in: International Conference on HPSG, 2007.
Kaufmann, T. and Pfister, B. , Applying licenser rules to a grammar with continuous constituents , in: The Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, 2007.
Keshet, J. , Theoretical foundations for large-margin kernel-based continuous speech recognition , number Idiap-RR-44-2007, 2007.
Kittler, J. , Poh, N. , Fatukasi, O. , Messer, K. , Kryszczuk, K. , Richiardi, J. and Drygajlo, A. , Quality dependent fusion of intramodal and multimodal biometric experts , in: Proc. SPIE Defense and Security Symposium, 2007.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Information fusion in multimedia information retrieval , in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
Knox, M. and Mirghafori, N. , Automatic Laughter Detection Using Neural Networks , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Kokiopoulou, E. and Frossard, P. , Accelerating distributed consensus using extrapolation , in: IEEE Signal Processing Letters, volume 14, number 10, pages 665-668, 2007.
Kokiopoulou, E. and Frossard, P. , Dimensionality Reduction with Adaptive Approximation , in: IEEE Int. Conf. on Multimedia & Expo (ICME), Beijing, China, 2007.
Kokiopoulou, E. and Frossard, P. , Image alignment with rotation manifolds built on sparse geometric expansions , in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
Kolar, J. , Liu, Y. and Shriberg, E. , Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Koval, O. , Voloshynovskiy, S. and Pun, T. , Analysis of multimodal binary detection systems based on dependent/independent modalities , in: Proceedings of the IEEE 2007 International Workshop on Multimedia Signal Processing, 2007.
Koval, O. , Voloshynovskiy, S. and Pun, T. , Error exponent analysis of person identification based on fusion of dependent/independent modalities , in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
Kron, E. , Rayner, M. , Santaholma, M. and Bouillon, P. , A development environment for building grammar-based speech-enabled applications , in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 49-52, ACL 2007, Prague, Czech Republic, 2007.
Kronegg, J. , Chanel, G. , Voloshynovskiy, S. and Pun, T. , Eeg-based synchronized brain-computer interfaces: a model for optimizing the number of mental tasks , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 15, number 1, pages 50-58, 2007.
Kryszczuk, K. and Drygajlo, A. , Improving classification with class-independent quality measures: q-stack in face verification , in: Proc. 2nd Int. Conference in Biometrics (ICB 2007), 2007.
Kryszczuk, K. and Drygajlo, A. , Q-stack: uni- and multimodal classifier stacking with quality measures , in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
Kryszczuk, K. , Richiardi, J. and Drygajlo, A. , Reliability estimation for multimodal error prediction and fusion , in: Proc. 7th Int. Workshop on Pattern Recognition in Information Systems (PRIS 2007), 2007.
Kryszczuk, K. , Richiardi, J. , Prodanov, P. and Drygajlo, A. , Reliability-based decision fusion in multimodal biometric verification systems , in: EURASIP Journal of Advances in Signal Processing, 2007.
Kumatani, K. , Mayer, H. , Gehrig, T. , Stoimenov, E. , McDonough, J. and Wölfel, M. , Adaptive beamforming with a minimum mutual information criterion , pages 2527-2541, 2007.
Kumatani, K. , Mayer, H. , Gehrig, T. , Stoimenov, E. , McDonough, J. and Wölfel, M. , Minimum mutual information beamforming for simultaneous active speakers , in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 71-76, Kyoto, 2007.
Lalanne, D. , Evéquoz, F. , Rigamonti, M. , Dumas, B. and Ingold, R. , An ego-centric and tangible approach to meeting indexing and browsing , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI'07), pages to appear, 2007.
Lalanne, D. , Evéquoz, F. , Chiquet, H. , Müller, M. , Radgohar, M. and Ingold, R. , Going through digital versus physical augmented gaming , in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 41-44, 2007.
Lalanne, D. and van den Hoven, E. , Supporting human memory with interactive systems , pages 215-216, 2007.
Lalanne, D. , Bertini, E. , Hertzog, P. and Bados, P. , Visual analysis of corporate network intelligence: abstracting and reasoning on yesterdays for acting today , 2007.
Laptev, I. , Caputo, B. and Lindeberg, T. , Local velocity-adapted motion events for spatio-temporal recognition , in: Computer Vision and Image Understanding, volume 108, number 3, pages 207-229, ISSN 1077-3142, 2007.
Lathoud, G. and Odobez, J. -M. , Short-term spatio-temporal clustering applied to multiple moving speakers , in: IEEE Transactions on Audio, Speech and Language Processing, 2007.
Lei, H. and Mirghafori, N. , Word-Conditioned HMM Supervectors for Speaker Recognition , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Lei, H. and Mirghafori, N. , Word-conditioned phone N-grams for speaker recognition , in: Proc. ICASSP, Honolulu, 2007.
Leibe, B. , Schindler, K. and van Gool, L. , Coupled detection and trajectory estimation for multi-object tracking , in: International Conference on Computer Vision (ICCV'07), 2007.
Leibe, B. , Cornelis, N. , Cornelis, K. and van Gool, L. , Dynamic 3d scene analysis from a moving vehicle , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
Levit, M. , Hakkani-Tur, D. , Tur, G. and Gillick, D. , Integrating Several Annotation Layers for Statistical Information Distillation , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Li, W. and Bourlard, H. , Non-linear spectral stretching for in-car speech recognition , in: Interspeech, 2007.
Li, W. , Dines, J. and Magimai-Doss, M. , Robust overlapping speech recognition based on neural networks , number Idiap-RR-55-2007, 2007.
Lisowska, A. , Betrancourt, M. , Armstrong, S. and Rajman, M. , Minimizing modality bias when exploring input preference for multimodal systems in new domains: the archivus case study , in: CHI '07, San José, California, 2007.
Lisowska, A. , Armstrong, S. , Melichar, M. , Ailomaa, M. and Rajman, M. , The wizard of oz meets multimodal language-enabled gui interfaces: new challenges , in: Proceedings of CHI '07, San José, California, 2007.
Liu, Y. and Shriberg, E. , Comparing Evaluation Metrics for Sentence Boundary Detection , in: Proc. ICASSP, Honolulu, 2007.
Livescu, K. , Cetin, O. , Hasegawa-Johnson, M. , King, S. , Bartels, C. , Borges, N. , Kantor, A. , Lal, P. , Yung, L. , Bezman, A. , Dawson-Haggerty, S. , Woods, B. , Frankel, J. , Magimai-Doss, M. and Saenko, K. , Articulatory Feature-based Methods for Acoustic and Audio-visual speech Recognition: Summary from the 2006 JHU Summer Workshop , in: Proc. ICASSP, Honolulu, 2007.
Livescu, K. , Bezman, A. , Borges, N. , Yung, L. , Cetin, O. , Frankel, J. , King, S. , Magimai-Doss, M. , Chi, X. and Lavoie, L. , Manual Transcription of Conversational Speech at the Articulatory Feature Level , in: Proc. ICASSP, Honolulu, 2007.
Liwicki, M. , Graves, A. , Bunke, H. and Schmidhuber, J. , A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 367-371, 2007.
Liwicki, M. , Schlapbach, A. , Loretan, P. and Bunke, H. , Automatic detection of gender and handedness from on-line handwriting , in: Proc. 13th Conf. of the Graphonomics Society, pages 179-183, 2007.
Liwicki, M. and Bunke, H. , Combining on-line and off-line systems for handwriting recognition , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 372-376, 2007.
Liwicki, M. and Bunke, H. , Feature selection for on-line handwriting recognition of whiteboard notes , in: Proc. 13th Conf. of the Graphonomics Society, pages 101-105, 2007.
Liwicki, M. and Bunke, H. , Handwriting recognition of whiteboard notes -- studying the influence of training set size and type , in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 21, number 1, pages 83-98, 2007.
Liwicki, M. , Indermühle, E. and Bunke, H. , On-line handwritten text line detection using dynamic programming , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 447-451, 2007.
Lovitt, A. , Correcting confusion matrices for phone recognizers , number 03, 2007.
Lovitt, A. , Pinto, J. P. and Hermansky, H. , On confusions in a phoneme recognizer , 2007.
Lovitt, A. , Truncation confusion patterns in onset consonants , in: Interspeech 2007, 2007.
Lüthy, F. , Varga, T. and Bunke, H. , Using hidden Markov models as a tool for handwritten text line segmentation , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 8-12, 2007.
Magimai-Doss, M. , Hakkani-Tur, D. , Cetin, O. , Shriberg, E. , Fung, J. and Mirghafori, N. , Entropy Based Classifier Combination for Sentence Segmentation , in: Proc. ICASSP, Honolulu, 2007.
Marcel, S. , Abbet, P. and Guillemot, M. , Google portrait , number Idiap-Com-07-2007, 2007.
Marcel, S. , Joint bi-modal face and speaker authentication using explicit polynomial expansion , number 14, 2007.
Marcel, S. , Rodriguez, Y. and Heusch, G. , On the recent use of local binary patterns for face authentication , in: International Journal on Image and Video Processing Special Issue on Facial Image Processing, 2007.
Marcel, S. and Millán, J. del R. , Person authentication using brainwaves (eeg) and maximum a posteriori model adaptation , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, Special Issue on Biometrics, 2007.
Marchand-Maillet, S. , Bruno, E. , Nürnberger, A. and Detyniecki, M. , Adaptive multimedia retrieval: user, context and feedback , Springer, 2007.
Mariéthoz, J. and Bengio, S. , A kernel trick for sequences applied to text-independent speaker verification systems , in: Pattern Recognition, volume 40, number 8, ISSN 0031-3203, 2007.
McCowan, I. , Maganti, H. K. and Gatica-Perez, D. , Speech enhancement and recognition in meetings with an audio-visual sensor array , in: IEEE Trans. on Audio, Speech, and Language Processing, volume 15, number 8, pages 2257-2269, 2007.
Mesot, B. and Barber, D. , A bayesian switching linear dynamical system for scale-invariant robust speech extraction , 2007.
Mesot, B. and Barber, D. , A gaussian sum smoother for inference in switching linear dynamical systems , 2007.
Meynet, J. , Popovici, V. and Thiran, J. -Ph. , Face Detection with Boosted Gaussian Features , in: Pattern Recognition, volume 40, number 8, pages 2283-2291, 2007.
Meynet, J. and Thiran, J. -Ph. , Information Theoretic Combination of Classifiers with Application to AdaBoost , in: 7th International Workshop on Multiple Classifier Systems (MCS), Prague, 2007.
Meynet, J. , Popovici, V. and Thiran, J. -Ph. , Mixtures of Boosted Classifiers for Frontal Face Detection , in: Signal, Image and Video Processing, volume 1, number 1, pages 29-38, 2007.
Millán, J. del R. , Buttfield, A. , Vidaurre, C. , Krauledat, M. , Schlögl, A. , Shenoy, P. , Blankertz, B. , Rao, R. P. N. , Cabeza, R. , Pfurtscheller, G. and Müller, K. -R. , Adaptation in brain-computer interfaces , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Millán, J. del R. , Ferrez, P. W. , Galán, F. , Lew, E. and Chavarriaga, R. , Non-invasive brain-actuated interaction , in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007.
Millán, J. del R. , Ferrez, P. W. and Buttfield, A. , The idiap brain-computer interface: an asynchronous multi-class approach , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Monay, F. , Learning the structure of image collections with latent aspect models , 2007.
Monay, F. and Gatica-Perez, D. , Modeling semantic aspects for cross-media image indexing , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, pages 1802-1817, ISSN 0162-8828, 2007.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Automatic image annotation with relevance feedback and latent semantic analysis , in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Hierarchical long-term learning for automatic image annotation , in: Proceedings 2nd International Conference on Semantic and Digital Media Technologies, 2007.
Motlicek, P. , Hermansky, H. , Ganapathy, S. and Garudadri, H. , Frequency domain linear prediction for qmf sub-bands and applications to audio coding , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
Motlicek, P. , Hermansky, H. , Ganapathy, S. , Garudadri, H. and Srinivasamurthy, N. , Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
Motlicek, P. , Ganapathy, S. , Hermansky, H. and Garudadri, H. , Scalable wide-band audio codec based on frequency domain linear prediction , number 16, 2007.
Müller, C. and Burkhardt, F. , Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Müller, P. , Zeng, G. , Wonka, P. and van Gool, L. , Image-based procedural modeling of facades , in: Proceedings of ACM SIGGRAPH 2007 / ACM Transactions on Graphics, ACM Press, 2007.
Neuhaus, M. and Bunke, H. , A quadratic programming approach to the graph edit distance problem , in: Graph-Based Representations in Pattern Recognition, pages 92-102, Springer, 2007.
Neuhaus, M. and Bunke, H. , Bridging the gap between graph edit distance and kernel machines , Machine Perception and Artificial Intelligence, volume 68, World Scientific, ISBN 978-981-270-817-5, 2007.
Noris, B. , Benmachiche, K. , Meynet, J. , Thiran, J. -Ph. and Billard, A. , Analysis of Head Mounted Wireless Camera Videos for Early Diagnosis of Autism , in: International Conference on Recognition Systems, 2007.
Odobez, J. -M. and Ba, S. , A cognitive and unsupervised map adaptation approach to the recognition of the focus of attention from head pose , in: International Conference on Multi-Media & Expo (ICME07), 2007.
Orabona, F. , Castellini, C. , Caputo, B. , Luo, J. and Sandini, G. , Indoor place recognition using online independent support vector machines , in: 18th British Machine Vision Conference (BMVC07), pages 1090-1099, Warwick, UK, 2007.
Orabona, F. , Castellini, C. , Caputo, B. , Luo, J. and Sandini, G. , On-line independent support vector machines for cognitive systems , number Idiap-RR-63-2007, 2007.
Ozden, K. E. , Schindler, K. and van Gool, L. , Simultaneous segmentation and 3d reconstruction of monocular image sequences , in: International Conference on Computer Vision (ICCV'07), 2007.
Pallotta, V. , Seretan, V. and Ailomaa, M. , User requirement analysis for meeting information retrieval based on query elicitation , in: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), pages 1008-1015, Association for Computational Linguistics, 2007.
Pardo, J. M. , Anguera, X. and Wooters, C. , Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information , in: to appear in IEEE Transactions on Computers, 2007.
Paugam-Moisy, H. , Martinez, R. and Bengio, S. , A supervised learning approach based on stdp and polychronization in spiking neuron networks , in: European Symposium on Artificial Neural Networks, ESANN, 2007.
Perrin, X. , Chavarriaga, R. , Siegwart, R. and Millán, J. del R. , Bayesian controller for a novel semi-autonomous navigation concept , in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007.
Philips, J. , Millán, J. del R. , Vanacker, G. , Lew, E. , Galán, F. , Ferrez, P. W. , van Brussel, H. and Nuttin, M. , Adaptive shared control of a brain-actuated simulated wheelchair , in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, pages 408-414, 2007.
Piccardi, L. , Noris, B. , Barbey, O. , Schiavone, G. , Keller, F. , Von Hofsten, C. and Billard, A. , Wearcam: a head mounted wireless camera for monitoring gaze attention and for the diagnosis of developmental disorders in young children , in: 16th IEEE International Symposium on Robot & Human Interactive Communication, RO-MAN, 2007.
Pinto, J. P. , Bourlard, H. , Graves, A. and Hermansky, H. , Comparing different word lattice rescoring approaches towards keyword spotting , number 32, 2007.
Pinto, J. P. , Lovitt, A. and Hermansky, H. , Exploiting phoneme similarities in hybrid hmm-ann keyword spotting , in: Proceedings of Interspeech, 2007.
Pinto, J. P. , Prasanna, S. R. Mahadeva , Yegnanarayana, B. and Hermansky, H. , Significance of contextual information in phoneme recognition , 2007.
Plauché, M. , Cetin, O. and Uhdaykumar, N. , How to build a spoken dialog system with limited (or no) resources , in: AI in ICT for Development Workshop of the Twentieth Intl. Joint Conf. on AI, Hyderabad, India, 2007.
Popescu-Belis, A. and Zufferey, S. , Contrasting the automatic identification of two discourse markers in multiparty dialogues , in: Proceedings of SIGDIAL 2007, pages 10, Antwerp, Belgium, 2007.
Popescu-Belis, A. , Evaluation of nlg: some analogies and differences with mt and reference resolution , in: MT Summit XI Workshop on Using Corpora for NLG and MT (UCNLG MT), pages 66-68, 2007.
Popescu-Belis, A. and Estrella, P. , Generating usable formats for metadata and annotations in a large meeting corpus , in: ACL 2007, pages 93-96, ACL 2007, Prague, Czech Republic, 2007.
Popescu-Belis, A. , Le rôle des métriques d'évaluation dans le processus de recherche en tal , in: TAL (Traitement Automatique des Langues), volume 47, number 2, 2007.
Prasanna, S. R. Mahadeva , Yegnanarayana, B. , Pinto, J. P. and Hermansky, H. , Analysis of confusion matrix to combine evidence for phoneme recognition , number 27, 2007.
Pronobis, A. and Caputo, B. , Confidence-based cue integration for visual place recognition , number 17, 2007.
Quack, T. , Ferrari, V. , Leibe, B. and van Gool, L. , Efficient mining of frequent and distinctive feature configurations , in: International Conference on Computer Vision (ICCV'07), 2007.
Quelhas, P. , Odobez, J. -M. , Gatica-Perez, D. and Tuytelaars, T. , A thousand words in a scene , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, number 9, pages 1575-1589, 2007.
Millán, J. del R. , Tapping the mind or resonating minds? , in: European Visions for the Knowledge Age, Cheshire Henbury, 2007.
Rakotomamonjy, A. , Bach, F. , Canu, S. and Grandvalet, Y. , More efficiency in multiple kernel learning , in: International Conference on Machine Learning (ICML), 2007.
Renals, S. , Hain, T. and Bourlard, H. , Recognition and understanding of meetings: the ami and amida projects , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, pages 238-247, Kyoto, 2007.
Richiardi, J. , Kryszczuk, K. and Drygajlo, A. , Quality measures in unimodal and multimodal biometric verification , in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
Richiardi, J. and Drygajlo, A. , Reliability-based voting schemes using modality-independent features in multi-classifier biometric authentication , in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
Riesen, K. , Neuhaus, M. and Bunke, H. , Bipartite graph matching for computing the edit distance of graphs , in: Graph-Based Representations in Pattern Recognition, pages 1-12, Springer, 2007.
Riesen, K. , Neuhaus, M. and Bunke, H. , Graph embedding in vector spaces by means of prototype selection , in: Graph-Based Representations in Pattern Recognition, pages 383-393, Springer, 2007.
Rigamonti, M. , Lalanne, D. and Ingold, R. , Faericworld: browsing multimedia events through static documents and links , in: Proc. of INTERACT 2007, pages to appear, Springer-Verlag, 2007.
Romsdorfer, H. and Pfister, B. , Text analysis and language identification for polyglot text-to-speech synthesis , in: Speech Communication (Elsevier), 2007.
Rytsar, R. and Pun, T. , Computational aspects of the eeg forward problem solution for real head model using finite element , in: 29th Annual Int. Conf. IEEE Engineering in Medicine and Biology Society, 2007.
Schindler, K. , Suter, D. and Wang, H. , A model-selection framework for multibody structure-and-motion of image sequences , in: International Journal of Computer Vision, volume 79, number 2, pages 159-177, 2007.
Schlapbach, A. and Bunke, H. , A writer identification and verification system using HMM based recognizers , in: Pattern Analysis and Applications, volume 10, number 1, pages 33-43, 2007.
Schlapbach, A. and Bunke, H. , Fusing asynchronous feature streams for on-line writer identification , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 103-107, 2007.
Shriberg, E. , Higher level features in speaker recognition , in: Speaker Classification I, Lecture Notes in Computer Science, Springer, 2007.
Smith, K. , Bayesian methods for visual multi-object tracking with applications to human activity recognition , École Polytechnique Fédérale de Lausanne, 2007.
Sorci, M. , Antonini, G. and Thiran, J. -Ph. , Fisher's Discriminant and Relevant Component Analysis for static facial expression classification , in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, 2007.
Starlander, M. , Using a wizard of oz as a baseline to determine which system architecture is the best for a spoken language translation system , in: Proceedings of Nodalida 2007, pages 161-164, Tartu, Estonia, 2007.
Stolcke, A. , Kajarekar, S. , Ferrer, L. and Shriberg, E. , Speaker recognition with session variability normalization based on mllr adaptation transforms , in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, pages 1987-1998, 2007.
Stolcke, A. , Kajarekar, S. , Ferrer, L. and Shriberg, E. , Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms , in: IEEE Transactions on Audio, Speech, and Language Processing, special issue on speaker and language recognition, 2007.
Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Janin, A. , Magimai-Doss, M. , Wooters, C. and Zheng, J. , The sri-icsi spring 2007 meeting and lecture recognition system , in: Lecture Notes in Computer Science, 2007.
Stoll, L. , Frankel, J. and Mirghafori, N. , Speaker Recognition Via Nonlinear Discriminant Features , in: Proceedings of NOLISP, Paris, France, 2007.
Szekely, E. , Bruno, E. and Marchand-Maillet, S. , Clustered multidimensional scaling for exploration in information retrieval , in: International Conference on the Theory of Information Retrieval, 2007.
Thomas, A. , Ferrari, V. , Leibe, B. , Tuytelaars, T. and van Gool, L. , Depth-from-recognition: inferring metadata by cognitive feedback , in: ICCV'07 Workshop on 3D Representations for Recognition, 2007.
Uldry, L. , Ferrez, P. W. and del R. Millán, J. , Feature selection methods on distributed linear inverse solutions for a non-invasive brain-machine interface , number 04, 2007.
Valente, F. , Bourlard, H. and Deepu, V. , Agglomerative information bottleneck for speaker diarization of meetings data , number 31, 2007.
Valente, F. and Hermansky, H. , Combination of acoustic classifiers based on dempster-shafer theory of evidence , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
Valente, F. , Vepa, J. , Plahl, C. , Gollan, C. , Hermansky, H. and Schlüter, R. , Hierarchical neural networks feature extraction for lvcsr system , in: Interspeech 2007, 2007.
Valente, F. , Vepa, J. and Hermansky, H. , Multi-stream features combination based on dempster-shafer rule for lvcsr system , in: Interspeech 2007, 2007.
Vanacker, G. , Millán, J. del R. , Lew, E. , Ferrez, P. W. , Galán, F. , Philips, J. , van Brussel, H. and Nuttin, M. , Context-based filtering for assisted brain-actuated wheelchair driving , in: Computational Intelligence and Neuroscience, volume 2007, pages 3, ISSN 1687-5265, 2007.
Villán, R. , Voloshynovskiy, S. , Koval, O. , Deguillaume, F. and Pun, T. , Tamper-proofing of Electronic and Printed Text Documents via Robust Hashing and Data-Hiding , in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
Vinciarelli, A. and Favre, S. , Broadcast news story segmentation using social network analysis and hidden markov models , in: ACM International Conference on Multimedia, pages 261-264, 2007.
Vinciarelli, A. , Mapping nonverbal communication into social status: automatic recognition of journalists and non-journalists in radio news , number 33, 2007.
Vinciarelli, A. , Role recognition in broadcast news using social network analysis and duration distribution modeling , in: IEEE Transactions on Multimedia, 2007.
Vinciarelli, A. and Favre, S. , Role recognition in radio programs using social affiliation networks and mixtures of discrete distributions: an approach inspired by social cognition , number Idiap-RR-40-2007, 2007.
Vinciarelli, A. , Fernàndez, F. and Favre, S. , Semantic segmentation of radio programs using social network analysis and duration distribution modeling , in: IEEE International Conference on Multimedia and Expo (ICME), 2007.
Vinyals, O. , Friedland, G. and Mirghafori, N. , Revisiting a basic function on current CPUs: A fast logarithm implementation with adjustable accuracy , in: ICSI Technical Report number TR-07-002, 2007.
Weise, T. , Leibe, B. and van Gool, L. , Fast 3d scanning with automatic motion compensation , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
Wooters, C. and Huijbregts, M. , The ICSI RT07s Speaker Diarization System , in: to appear in Lecture Notes in Computer Science, 2007.
Yao, J. and Odobez, J. -M. , Multi-layer background subtraction based on color and texture , in: CVPR 2007 Workshop on Visual Surveillance (VS2007), pages 1-8, 2007.
Zacharie, D. G. and Pinto, J. P. , Keyword spotting on word lattices , number 22, 2007.
Zheng, J. , Cetin, O. , Hwang, M. -Y. , Lei, X. , Stolcke, A. and Morgan, N. , Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition , in: Proc. ICASSP, Honolulu, 2007.
Grave de Peralta Menendez, R. , González Andino, S. L. , Ferrez, P. W. and Millán, J. del R. , Non-invasive estimates of local field potentials for brain-computer interfaces , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Fasel, B. and van Gool, L. , Interactive museum guide: accurate retrieval of object descriptions , in: Adaptive Multimedia Retrieval: User, Context, and Feedback, pages 179-191, Springer, 2007.
van Gool, L. , Zeng, G. , van den Borre, F. and Müller, P. , Towards mass-produced building models , in: Photogrammetric Image Analysis, pages 209-220, Institute of Photogrammetry and Cartography, Technische Universitaet Muenchen, 2007.
Alecu, T. I. , Voloshynovskiy, S. and Pun, T. , The gaussian transform of distributions: definition, computation and application , in: IEEE Trans. on Signal Processing, volume 54, number 8, pages 2976-2995, 2006.
Andreani, G. , Di Fabbrizio, G. , Gilbert, M. , Gillick, D. , Hakkani-Tur, D. and Lemon, O. , Lets DiSCoH: Collecting an Annotated Open Corpus with Dialog Acts and Reward Signals for Natural Language Helpdesks , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Ba, S. and Odobez, J. -M. , A study on visual focus of attention recognition from head pose in a meeting room , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006.
Ba, S. and Odobez, J. -M. , Recognizing people's focus of attention from head poses: a study , number 42, 2006.
Barber, D. and Chiappa, S. , Unified inference for variational bayesian linear gaussian state-space models , in: NIPS, 2006.
BenZeghiba, M. F. and Bourlard, H. , User-customized password speaker verification using multiple reference and background models , in: Speech Communication, volume 8, pages 1200-1213, 2006.
Bertolami, R. , Halter, B. and Bunke, H. , Combination of multiple handwritten text line recognition systems with a recursive approach , in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 61-65, 2006.
Buttfield, A. and del R. Millán, J. , Online classifier adaptation in brain-computer interfaces , number 16, 2006.
Buttfield, A. , Ferrez, P. W. and del R. Millán, J. , Towards a robust bci: error potentials and online learning , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 14, number 2, pages 164-168, 2006.
Cattin, P. C. , Bay, H. , van Gool, L. and Székely, G. , Retina mosaicing using local features , in: Medical Image Computing and Computer-Assisted Intervention (MICCAI), pages 185-192, 2006.
Chanel, G. , Kronegg, J. , Grandjean, D. and Pun, T. , Emotion assessment: arousal evaluation using eeg's and peripheral physiological signals , in: Proc. Int. Workshop Multimedia Content Representation, Classification and Security (MRCS), pages 530-537, Lecture Notes in Computer Science, Springer, 2006.
Cheng, O. , Dines, J. and Magimai-Doss, M. , A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition , number 62, 2006.
Chiappa, S. , Analysis and classification of eeg signals using probabilistic models for brain computer interfaces , École Polytechnique Fédérale de Lausanne, 2006.
Chiquet, H. , Evéquoz, F. and Lalanne, D. , Elcano, a tangible multimedia browser (demo) , in: Symposium on User Interface Software and Technology (UIST 2006), pages 51-52, 2006.
Cuendet, S. , Hakkani-Tur, D. and Tur, G. , Model Adaptation for Sentence Segmentation from Speech , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Cuendet, S. , Model adaptation for sentence unit segmentation from speech , number 64, 2006.
Dimitrakakis, C. , Ensembles for sequence learning , École Polytechnique Fédérale de Lausanne, 2006.
Everingham, M. , Zisserman, A. , Williams, C. , van Gool, L. , Allan, M. , Bishop, C. , Chapelle, O. , Dalal, N. , Deselaers, T. , Dorko, G. , Duffner, S. , Eichhorn, J. , Farquhar, J. , Fritz, M. , Garcia, C. , Griffiths, T. , Jurie, F. , Keysers, D. , Koskela, M. , Laaksonen, J. , Larlus, D. , Leibe, B. , Meng, H. , Ney, H. , Schiele, B. , Schmid, C. , Seemann, E. , Shawe-Taylor, J. , Storkey, A. , Szedmak, S. , Triggs, B. , Ulusoy, I. , Viitaniemi, V. and Zhang, J. , The 2005 pascal visual object class challenge , in: Selected Proceedings of the 1st PASCAL Challenges Workshop, Lecture Notes in AI, Springer, 2006.
Hannani, A. , Toledano, D. , Petrovska, D. , Montero-Asenjo, A. and Hennebert, J. , Using data-driven and phonetic units for speaker verification , in: IEEE Speaker and Language Recognition Workshop (Odyssey 2006), Puerto Rico, 2006.
Hemptinne, C. , Master thesis: integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts) , number 69, 2006.
Hillard, D. , Huang, Z. , Ji, H. , Grishman, R. , Hakkani-Tur, D. , Harper, M. , Ostendorf, M. and Wang, W. , Impact of Automatic Comma Prediction on POS/Name Tagging of Speech , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Janin, A. , Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Frankel, J. and Zheng, J. , The ICSI-SRI Spring 2006 Meeting Evaluation System , in: S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006); Lecture Notes in Computer Science. Springer, 2006.
Janvier, B. , Bruno, E. , Marchand-Maillet, S. and Pun, T. , Handling temporal heterogeneous data for content-based management of large video collections , in: Multimedia Tools and Applications, volume 30, pages 273-288, 2006.
Just, A. , Two-handed gestures for human-computer interaction , École Polytechnique Fédérale de Lausanne, 2006.
Keller, M. and Bengio, S. , A multitask learning approach to document representation using unlabeled data , number 44, 2006.
Keller, M. , Machine learning approaches to text representation using unlabeled data , Ecole Polytechnique Fédérale de Lausanne, 2006.
Ketabdar, H. and Hermansky, H. , Identifying unexpected words using in-context and out-of-context phoneme posteriors , number 68, 2006.
Kosinov, S. , Marchand-Maillet, S. , Kozintsev, I. , Dulong, C. and Pun, T. , Dual diffusion model of spreading activation for content-based image retrieval , in: 8th ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2006.
Koval, O. , Voloshynovskiy, S. , Holotyak, T. and Pun, T. , Information-theoretic analysis of steganalysis in real images , in: ACM Multimedia and Security Workshop 2006, 2006.
Lathoud, G. , Observations on multi-band asynchrony in distant speech recordings , number 74, 2006.
Lathoud, G. , Spatio-temporal analysis of spontaneous speech with microphone arrays , École Polytechnique Fédérale de Lausanne, 2006.
Lathoud, G. , Magimai-Doss, M. and Bourlard, H. , Unsupervised spectral subtraction for noise-robust asr on unknown transmission channels , number 09, 2006.
Leibe, B. , Mikolajczyk, K. and Schiele, B. , Efficient clustering and matching for object class recognition , in: British Machine Vision Conference (BMVC), 2006.
Leibe, B. , Cornelis, N. , Cornelis, K. and van Gool, L. , Integrating recognition and reconstruction for cognitive traffic scene analysis from a moving vehicle , in: DAGM Annual Pattern Recognition Symposium, pages 192-201, Springer, 2006.
Leibe, B. , Mikolajczyk, K. and Schiele, B. , Segmentation based multi-cue integration for object detection , in: British Machine Vision Conference (BMVC), 2006.
Liwicki, M. and Bunke, H. , HMM-based on-line recognition of handwritten whiteboard notes , in: Proceedings 10th International Workshop Frontiers in Handwriting Recognition, pages 595-599, 2006.
Luo, J. , Pronobis, A. , Caputo, B. and Jensfelt, P. , Incremental learning for place recognition in dynamic environments , number 52, 2006.
Luo, J. , Pronobis, A. and Caputo, B. , Svm-based transfer of visual knowledge across robotic platforms , number 65, 2006.
Maganti, H. K. , Motlicek, P. and Gatica-Perez, D. , Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms , number 57, 2006.
Marcel, S. , Rodriguez, Y. , Guillemot, M. and Popescu-Belis, A. , Annotation of face detection: description of xml format and files , number 06, 2006.
Marcel, S. , Keomany, J. and Rodriguez, Y. , Robust-to-illumination face localisation using active shape models and local binary patterns , number 47, 2006.
Mariéthoz, J. , Discriminant models for text-independent speaker verification , number 70, 2006.
Melichar, M. , Cenek, P. , Ailomaa, M. , Lisowska, A. and Rajman, M. , From Vocal to Multimodal Dialogue Management , in: Eighth International Conference on Multimodal Interfaces (ICMI'06), Banff, Canada, 2006.
Mendels, F. , Thiran, J. -Ph. and Vandergheynst, P. , Matching pursuit-based shape representation and recognition using scale-space , in: International Journal of Imaging Systems and Technology, volume 6, number 15, pages 162-180, 2006.
Mesot, B. and Barber, D. , A bayesian alternative to gain adaptation in autoregressive hidden markov models , number 55, 2006.
Mesot, B. and Barber, D. , Switching linear dynamical systems for noise robust speech recognition , number 08, 2006.
Moore, D. , The juicer lvcsr decoder - user manual for juicer version 0.5.0 , number 03, 2006.
Motlicek, P. , Hermansky, H. , Garudadri, H. and Srinivasamurthy, N. , Audio coding based on long temporal contexts , number 30, 2006.
Motlicek, P. , Ullal, V. and Hermansky, H. , Wide-band perceptual audio coding based on frequency-domain linear prediction , number 58, 2006.
Moënne-Loccoz, N. , Janvier, B. , Marchand-Maillet, S. and Bruno, E. , Handling temporal heterogeneous data for content-based management of large video collections , in: Multimedia Tools and Applications, volume 31, pages 309-325, 2006.
Müller, P. , Wonka, P. , Haegler, S. , Ulmer, A. and van Gool, L. , Procedural modeling of buildings , in: Proceedings of ACM SIGGRAPH 2006 / ACM Transactions on Graphics, pages 614-623, ACM Press, 2006.
Müller, M. , Evéquoz, F. and Lalanne, D. , Tjass, a smart board for augmenting card game playing and learning (demo) , in: Symposium on User Interface Software and Technology (UIST 2006), pages 67-68, 2006.
Poh, N. and Bengio, S. , Estimating the confidence interval of expected performance curve in biometric authentication using joint bootstrap , number 25, 2006.
Poh, N. , Multi-system biometric authentication: optimal fusion and user-specific information , École Polytechnique Fédérale de Lausanne, 2006.
Poh, N. and Bengio, S. , Using chimeric users to construct fusion classifiers in biometric authentication tasks: an investigation , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
Pozdnoukhov, A. , Prior knowledge in kernel methods , École Polytechnique Fédérale de Lausanne, 2006.
Pun, T. , Alecu, T. I. , Chanel, G. , Kronegg, J. and Voloshynovskiy, S. , Brain-computer interaction research at the computer vision and multimedia laboratory, university of geneva , in: IEEE Trans. Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interaction, volume 14, number 2, pages 210-213, 2006.
Pérez-Freire, L. , Pérez-González, F. and Voloshynovskiy, S. , An Accurate Analysis of Scalar Quantization-Based Data Hiding , in: IEEE Trans. on Information Forensics and Security, volume 1, number 1, pages 80-86, 2006.
Quelhas, P. and Odobez, J. -M. , Natural scene image modeling using color and texture visterms , in: Conference on Image and Video Retrieval CIVR, 2006.
del R. Millán, J. , Renkens, F. , Mouriño, J. and Gerstner, W. , Non-invasive brain-actuated control of a mobile robot by human eeg , in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006.
Radgohar, M. , Evéquoz, F. and Lalanne, D. , Phong, augmenting virtual and real gaming experience (demo) , in: Symposium on User Interface Software and Technology (UIST 2006), pages 71-72, 2006.
Richiardi, J. and Drygajlo, A. , Applying biometrics to identity documents: estimating and coping with errors , 2006.
Richiardi, J. and Drygajlo, A. , Applying biometrics to identity documents: implementation issues , 2006.
Rienks, R. , Zhang, D. , Gatica-Perez, D. and Post, W. , Detection and application of influence rankings in small group meetings , in: ICMI '06: Proceedings of the 8th international conference on Multimodal interfaces, pages 257-264, ACM Press, Banff, Alberta, Canada, 2006.
Rodriguez, Y. , Face detection and verification using local binary patterns , École Polytechnique Fédérale de Lausanne, 2006.
Schlapbach, A. and Bunke, H. , Off-line writer verification: a comparison of a hidden Markov model (HMM) and a Gaussian mixture model (GMM) based system , in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 275-280, 2006.
Smith, K. , Schreiber, S. , Beran, V. , Potúcek, I. , Rigoll, G. and Gatica-Perez, D. , Multi-person tracking in meetings: a comparative study , in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006.
Smith, K. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Tracking attention for multiple people: wandering visual focus of attention estimation , number 40, 2006.
Spindler, T. , Wartmann, C. , Roth, D. , Steffen, A. , Hovestadt, L. and van Gool, L. , Privacy in video surveilled areas , in: International Conference on Privacy, Security and Trust (PST 2006), 2006.
Torre, E. L. , Caputo, B. and Tommasi, T. , Melanoma recognition using kernel classifiers , number 53, 2006.
Tur, G. , Guz, U. and Hakkani-Tur, D. , Model Adaptation for Dialog Act Tagging , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Ullal, V. and Motlicek, P. , Audio coding based on long temporal segments: experiments with quantization of excitation signal , number 46, 2006.
Vepa, J. and King, S. , Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis , in: IEEE Trans. on Audio, Speech and Language Processing, volume 14, number 5, pages 1763-1771, 2006.
Vila-Forcén, J. E. , Voloshynovskiy, S. , Koval, O. and Pun, T. , Costa problem under channel ambiguity , in: Proceedings of 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
Vila-Forcén, J. E. , Voloshynovskiy, S. , Koval, O. and Pun, T. , Facial Image Compression Based on Structured Codebooks in Overcomplete Domain , in: EURASIP Journal on Applied Signal Processing, Frames and overcomplete representations in signal processing, communications, and information theory special issue, volume 2006, number Article ID 69042, pages 1-11, 2006.
Voloshynovskiy, S. , Koval, O. , Topak, E. , Forcen, J. E. V. and Pun, T. , On reversibility of random binning based data-hiding techniques: security perspectives , in: ACM Multimedia and Security Workshop 2006, 2006.
Voloshynovskiy, S. , Koval, O. , Mihcak, M. K. and Pun, T. , The edge process model and its application to information hiding capacity analysis , in: IEEE Trans. on Signal Processing, volume 54, number 5, pages 1813-1825, 2006.
Wey, P. , Fischer, B. , Bay, H. and Buhmann, J. M. , Dense stereo by triangular meshing and cross validation , in: DAGM-Symposium, pages 708-717, 2006.
Zhang, D. , Gatica-Perez, D. and Bengio, S. , Exploring contextual information in a layered framework for group action recognition , in: Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006.
Zhang, D. , Probabilistic graphical models for human interaction analysis , École Polytechnique Fédérale de Lausanne, 2006.
Peregoudov, A. , Vinciarelli, A. and Bourlard, H. , Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations , number 56, 2006.
Brodbeck, D. , Mazza, R. and Lalanne, D. , Interactive visualization - a survey , 0000.
Dumas, B. , Lalanne, D. and Oviatt, S. , Multimodal interfaces: a survey of principles, models and frameworks , 0000.
Gatica-Perez, D. , Modeling interest in face-to-face conversations from multimodal nonverbal behavior , in: J.-P. Thiran, H. Bourlard, and F. Marques (Eds.), Multimodal Signal Processing, Academic Press, in press, 0000.
Gatica-Perez, D. and Odobez, J. -M. , Visual attention, speaking activity, and group conversational analysis in multi-sensor environments , in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, in press, 0000.
Goldmann, L. , Samour, A. , Ebrahimi, T. and Sikora, T. , Multimodal person search combining information fusion and relevance feedback , in: IEEE International Workshop on Multimedia Signal Processing (MMSP 2009), Rio de Janeiro, Brazil, 0000.
Lee, J. -S. , De Simone, F. and Ebrahimi, T. , Influence of audio-visual attention on perceived quality of standard definition multimedia content , in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
Lee, J. -S. and Ebrahimi, T. , Two-level bimodal association for audio-visual speech recognition , in: International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVS'09), Bordeaux, France, 0000.
Mugellini, E. , Lalanne, D. , Dumas, B. , Evéquoz, F. , Gerardi, S. , Le Calvé, A. , Boder, A. , Ingold, R. and Khaled, O. , Memodules as tangible shortcuts to multimedia information , 0000.
Noris, B. , Benmachiche, K. and Billard, A. , Calibration-free eye gaze direction detection with gaussian processes , in: International Conference on Computer Vision Theory and Applications (VISAPP 08), 0000.
De Simone, F. , Naccari, M. , Tagliasacchi, M. , Dufaux, F. , Tubaro, S. and Ebrahimi, T. , Subjective assessment of H.264/AVC video sequences transmitted over a noisy channel , in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
Popescu-Belis, A. , Multimodal database annotation formats and standards, software architecture for multimodal interfaces , in: Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, Academic Press, 0000.