Ali, Karim , Fleuret, Francois , Hasler, David and Fua, Pascal , A real-time deformable detector. , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011.
Ali, Karim , Hasler, David and Fleuret, Francois , FlowBoost - Appearance Learning from Sparsely Annotated Video , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition, 2011.
Asaei, Afsaneh , Bourlard, Hervé and Cevher, Volkan , Model-based Compressive Sensing for Multi-party Distant Speech Recognition , in: 2011 IEEE International Conference on Acoustics, Speech and Signal Processing, 2011.
Asaei, Afsaneh , Bourlard, Hervé and Cevher, Volkan , Model-Based Compressive Sensing for Multi-Party Distant Speech Recognition , number Idiap-RR-04-2011, 2011.
Asaei, Afsaneh , Taghizadeh, Mohammad J. , Bourlard, Hervé and Cevher, Volkan , Multi-party Speech Recovery Exploiting Structured Sparsity Models , in: Proceedings of International Speech Communication Association, INTERSPEECH, 2011.
Asaei, Afsaneh , Taghizadeh, Mohammad J. , Bourlard, Hervé and Cevher, Volkan , Multi-party Speech Recovery Exploiting Structured Sparsity Models , number Idiap-RR-22-2011, 2011.
Aschwanden, G. , Haegler, S. , Bosché, F. , Gool, L. Van and Schmitt, G. , Empiric Design Evaluation in Urban Planning , in: Automation in Construction, volume 20, number 3, pages 299-310, 2011.
Banitalebi Dehkordi, Mehdi , Abutalebi, Hamid Reza and Ghanei, Hossein , A Compressive Sensing Based Compressed Neural Network for Sound Source Localization , in: Proceedings of International Symposium on Artificial Intelligence and Signal Processing, 2011.
Ben Shitrit, Horesh , Berclaz, Jerome , Fleuret, Francois and Fua, Pascal , Tracking Multiple Objects under Global Appearance Constraints , in: Proceedings of the IEEE International Conference on Computer Vision, 2011.
Berclaz, Jerome , Turetken, Engin , Fleuret, Francois and Fua, Pascal , Multiple Object Tracking using K-Shortest Paths Optimization , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2011.
Biel, Joan-Isaac and Gatica-Perez, Daniel , Call me Guru: user categories and large-scale behavior in YouTube , in: Social Media Computing, Springer, 2011.
Biel, Joan-Isaac , Aran, Oya and Gatica-Perez, Daniel , You Are Known by How You Vlog: Personality Impressions and Nonverbal Behavior in YouTube , in: Proceedings of AAAI International Conference on Weblogs and Social Media, 2011.
Boyandin, Ilya , Bertini, Enrico , Bak, Peter and Lalanne, Denis , Flowstrates: An Approach for Visual Exploration of Temporal Origin-Destination Data , in: Computer Graphics Forum, volume 30, number 3, pages 971-980, 2011.
Carrino, Stefano , Mugellini, Elena , Khaled, Omar Abou and Ingold, Rolf , ARAMIS: Toward a Hybrid Approach for Human- Environment Interaction , in: HCI (3), pages 165-174, 2011.
Carrino, Francesco , Tscherrig, Julien , Mugellini, Elena , Khaled, Omar Abou and Ingold, Rolf , Head-Computer Interface: A Multimodal Approach to Navigate through Real and Virtual Worlds , in: HCI (2), pages 222-230, 2011.
Chanel, G. , Rebetez, C. , Betrancourt, M. and T, Pun , Emotion assessment from physiological signals for adaptation of games difficulty , in: IEEE Trans. on Systems, Man, and Cybernetics - Part A: Systems and Humans, 2011.
Chen, Cheng , Yang, Yi , Nie, Feiping and Odobez, Jean-Marc , 3D human pose recovery from image by efficient visual feature selection , in: Computer Vision and Image Understanding, volume 115, number 3, 2011.
Chen, Cheng , Heili, Alexandre and Odobez, Jean-Marc , Combined Estimation of Location and Body Pose in Surveillance Video , in: AVSS, 2011.
Chen, Cheng , Learning a 3D Human Pose Distance Metric from Geometric Pose Descriptor , in: IEEE Transactions on Visualization and Computer Graphics, 2011.
Chittaranjan, Gokul , Aran, Oya and Gatica-Perez, Daniel , Exploiting observers' judgements for nonverbal group interaction analysis , in: IEEE Conference on Automatic Face and Gesture Recognition, pages 6, IEEE, 2011.
Chittaranjan, Gokul , Aran, Oya and Gatica-Perez, Daniel , Inferring truth from multiple annotators for social interaction analysis , in: Neural Information Processing Systems (NIPS) Workshop on Modeling Human Communication Dynamics (HCD), pages 4, 2011.
Chittaranjan, Gokul , Blom, J. and Gatica-Perez, Daniel , Who's Who with Big-Five: Analyzing and Classifying Personality Traits with Smartphones , in: International Symposium on Wearable Computing, pages 8, 2011.
DeSimone, F. , Goldmann, L. , Lee, J. S. and Ebrahimi, T. , Performance analysis of VP8 image and video compression based on subjective evaluations , in: SPIE Optics and Photonics, Applications of Digital Image Processing XXXIV, 8135, 2011.
DeSimone, F. , Naccari, M. , M.Tagliasacchi, , Dufaux, F. , Tubaro, S. and Ebrahimi, T. , Subjective quality assessment of H.264/AVC video streaming with packet losses , in: Eurasip Journal on Image and Video Processing, 2011 Article ID 190431, 2011.
DeSimone, F. , Goldmann, L. , Lee, J. -S. and Ebrahimi, T. , Towards high efficiency video coding: subjective evaluation of potential coding technologies , in: Journal of Visual Communication and Image Representation, 2011.
Dillenbourg, P. , Zufferey, G. , Alavi, H. S. , Jermann, P. , Do Lenh, S. , Bonnard, Q. , Cuendet, S. and Kaplan, F. , Classroom orchestration: The third circle of usability. , in: Proceedings of the 9th Computer-Supported Collaborative Learning Conference, Hong Kong, 2011.
Do, Cong-Thanh , Pastor, Dominique and Goalic, André , A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech , in: Speech Communication, 2011.
Do, Trinh-Minh-Tri and Gatica-Perez, Daniel , Contextual grouping: discovering real-life interaction types from longitudinal Bluetooth data , in: 12th International Conference on Mobile Data Management, 2011.
Do, Trinh-Minh-Tri and Gatica-Perez, Daniel , GroupUs: Smartphone Proximity Data and Human Interaction Type Mining , in: 15th annual International Symposium on Wearable Computers, 2011.
Duffner, Stefan and Odobez, Jean-Marc , Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking , in: IEEE Conference on Automatic Face and Gesture Recognition, 2011.
Duffner, Stefan and Odobez, Jean-Marc , Exploiting Long-Term Observations for Track Creation and Deletion in Online Multi-Face Tracking , number Idiap-RR-01-2011, 2011.
Emonet, Remi , Varadarajan, Jagannadan and Odobez, Jean-Marc , Extracting and Locating Temporal Motifs in Video Scenes Using a Hierarchical Non Parametric Bayesian Model , in: IEEE Conference on Computer Vision and Pattern Recognition, 2011.
Fanelli, G. , Gall, J. and Gool, L. Van , Real Time Head Pose Estimation with Random Regression Forest , in: Computer Vision and Pattern Recognition (CVPR), 2011.
Gall, J. , Fossati, A. and Gool, L. Van , Functional Categorization of Objects using Real-time Markerless Motion Capture , in: Computer Vision and Pattern Recognition (CVPR), 2011.
Garner, Philip N. , Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition , in: Speech Communication, volume 53, number 8, pages 991-1001, 2011.
Garner, Philip N. , Cepstral normalisation and the signal to noise ratio spectrum in automatic speech recognition. , number Idiap-RR-15-2011, 2011.
Gomez, J. , Bologna, G. , Deville, B. and Pun, T. , Multisource sonification for visual substitution in an auditory memory game: one, or two fingers? , in: ICAD 2011, Int. Conf. on Auditory Display, 2011.
Hamer, H. , Gall, J. , Urtasun, R. and Gool, L. Van , Data-Driven Animation of Hand-Object Interactions , in: IEEE Conference on Automatic Face and Gesture Recognition, 2011.
Imseng, David , Bourlard, Hervé , Dines, John , Garner, Philip N. and Magimai.-Doss, Mathew , Improving non-native ASR through stochastic multilingual phoneme space transformations , in: Proceedings of Interspeech, 2011.
Imseng, David , Bourlard, Hervé , Dines, John , Garner, Philip N. and Magimai.-Doss, Mathew , Improving non-native ASR through stochastic multilingual phoneme space transformations , number Idiap-RR-19-2011, 2011.
Imseng, David , Bourlard, Hervé , Magimai.-Doss, Mathew and Dines, John , Language dependent universal phoneme posterior estimation for mixed language speech recognition , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 5012-5015, 2011.
Imseng, David , Bourlard, Hervé , Magimai.-Doss, Mathew and Dines, John , Language dependent universal phoneme posterior estimation for mixed language speech recognition , number Idiap-RR-13-2011, 2011.
Ivanov, I. , Vajda, P. , Lee, J. -S. and Ebrahimi, T. , In tags we trust: Trust modeling in social tagging of multimedia content , in: IEEE Signal Processing Magazine, 2011.
Jayagopi, Dinesh Babu , Kim, Taemie , Pentland, Alex and Gatica-Perez, Daniel , Privacy-sensitive recognition of group conversational context with sociometers , in: Springer Multimedia Systems Journal, 2011.
Kludas, J. and Marchand-Maillet, S. , Effective Multimodal Information Fusion by Structure Learning , in: 14th International Conference on Information Fusion (FUSION 2011), 2011.
Koelstra, S. , Mühl, C. , Soleymani, M. , Lee, J. -S. , Yazdani, A. , Ebrahimi, T. , Pun, T. , Nijholt, A. and Patras, I. , DEAP: A database for emotion analysis using physiological signal , in: IEEE Trans. on Affective Computing, Special Issue on Naturalistic Affect Resources for System Building and Evaluation, 2011.
Koelstra, S. , Muehl, C. , Soleymani, M. , Lee, J. -S. , Yazdani, A. , Ebrahimi, T. , Pun, T. , Nijholt, A. and Patras, I. , DEAP: a database for emotion analysis using physiological signals , in: IEEE Trans. Affective Computing, 2011.
Korchagin, Danil , Impact of Excitation Frequency on Short-Term Recording Synchronisation and Confidence Estimation , number Idiap-RR-20-2011, 2011.
Korchagin, Danil , Motlicek, Petr , Duffner, Stefan and Bourlard, Hervé , Just-in-Time Multimodal Association and Fusion from Home Entertainment , number Idiap-RR-10-2011, 2011.
Korchagin, Danil and Abutalebi, Hamid Reza , Social Focus of Attention as a Time Function Derived from Multimodal Signals , in: Proceedings IEEE International Conference on Multimedia & Expo, 2011.
Lalanne, Denis and Masson, Agnes Lisowska , A Fitt of distraction: measuring the impact of distracters and multi-users on pointing efficiency , in: CHI Extended Abstracts, pages 2125-2130, 2011.
Larson, M. , Soleymani, M. , Serdyukov, P. , Rudinac, S. , Wartena, C. , Friedland, G. , Murdock, V. , Ordelman, R. and Jonesv, G. J. F. , Automatic tagging and geo-tagging in video collections and communities , in: ACM Int. Conf. on Multimedia Retrieval (ICMR) 2011, 2011.
Lee, J. -S. and Ebrahimi, T. , Audio-visual synchronization recovery in multimedia content , in: Proc. International Conference on Acoustics, Speech and Signal Processing (ICASSP'11), pages 2280-2283, 2011.
Lee, J. -S. , Simone, F. De and Ebrahimi, T. , Subjective quality assessment of scalable video coding , in: Proc. International Workshop on Quality of Multimedia Experience (QoMEX'11), 2011.
Lee, J. -S. , Simone, F. De and Ebrahimi, T. , Subjective quality evaluation of foveated video coding using audio-visual focus of attention , in: IEEE Journal of Selected Topics in Signal Processing, 2011.
Lee, J. -S. , Simone, F. De and Ebrahimi, T. , Subjective quality evaluation via paired comparison: application to scalable video coding , in: IEEE Transactions on Multimedia, 2011.
Lehmann, A. , Leibe, B. and Gool, L. Van , Fast PRISM: Branch and Bound Hough Transform for Object Class Detection , in: International Journal of Computer Vision, volume 94, number 2, pages 175-197, 2011.
Liang, Hui and Dines, John , Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation , in: Proceedings of Interspeech, 2011.
Liang, Hui and Dines, John , Phonological Knowledge Guided HMM State Mapping for Cross-Lingual Speaker Adaptation , number Idiap-RR-17-2011, 2011.
Madan, Anmol , Farrahi, Katayoun , Gatica-Perez, Daniel and Pentland, Alex , Pervasive Sensing to Model Political Opinions in Face-to-Face Networks , in: Pervasive, 2011.
Magimai.-Doss, Mathew , Rasipuram, Ramya , Aradilla, Guillermo and Bourlard, Hervé , GRAPHEME-BASED AUTOMATIC SPEECH RECOGNITION USING KL-HMM , in: Proceedings of Interspeech, 2011.
Mathias, M. , Martinovic, A. , Weissenberg, J. , Haegler, S. and Gool, L. Van , Automatic Architectural Style Recognition , in: 3D-ARCH 2011: “3D Virtual Reconstruction and Visualization of Complex Architecture, 2011.
Mekhaldi, Dalila , Lalanne, Denis and Ingold, Rolf , A Multimodal Alignment Framework for Spoken Documents , in: International Journal of Multimedia Tools and Applications, 2011.
Mohammadi, Gelareh and Vinciarelli, Alessandro , Automatic Attribution of Personality Traits Based on Prosodic Features , in: Proceedings of ACM Multimedia 2011 workshop, 2011.
Mohammadi, Gelareh and Vinciarelli, Alessandro , Humans as Feature Extractors: Combining Prosody and Personality Perception for Better Speaking Style Recognition , in: Proceeding of IEEE Int Conference on Systems, Man, and Cybernetics - Special Sessions, 2011.
Morrison, D. , Bruno, E. and Marchand-Maillet, S. , Query log simulation for long-term learning in image retrieval , in: ontent-based Multimedia Indexinding (CBMI'11), 2011.
Ozcan, Mert , Luo, Jie , Ferrari, Vittorio and Caputo, Barbara , A Large-Scale Database of Images and Captions for Automatic Face Naming , in: Proceedings of the 22nd British Machine Vision Conference, 2011.
Parthasarathi, Sree Hari Krishnan , Bourlard, Hervé and Gatica-Perez, Daniel , LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization , in: Interspeech, 2011.
Parthasarathi, Sree Hari Krishnan , Bourlard, Hervé and Gatica-Perez, Daniel , LP Residual Features for Robust, Privacy-Sensitive Speaker Diarization , number Idiap-RR-14-2011, 2011.
Parthasarathi, Sree Hari Krishnan , Gatica-Perez, Daniel , Bourlard, Hervé and Magimai.-Doss, Mathew , Privacy-Sensitive Audio Features for Speech/Nonspeech Detection , number Idiap-RR-12-2011, 2011.
Popescu-Belis, Andrei , Yazdani, Majid , Nanchen, Alexandre and Garner, Philip N. , A Just-in-Time Document Retrieval System for Dialogues or Monologues , in: SIGDIAL 2011 (12th annual SIGDIAL Meeting on Discourse and Dialogue), Demonstration Session, pages 350-352, 2011.
Popescu-Belis, Andrei , Yazdani, Majid , Nanchen, Alexandre and Garner, Philip N. , A Speech-based Just-in-Time Retrieval System using Semantic Search , in: Proceedings of the ACL-HLT 2011 System Demonstrations (49th Annual Meeting of the Association for Computational Linguistics), pages 80-86, 2011.
Popescu-Belis, Andrei , Yazdani, Majid , Nanchen, Alexandre and Garner, Philip N. , A Speech-based Just-in-Time Retrieval System using Semantic Search , number Idiap-RR-31-2011, 2011.
Popescu-Belis, Andrei and Zufferey, Sandrine , Automatic Identification of Discourse Markers in Multiparty Dialogues: An In-Depth Study of Like and Well , in: Computer Speech and Language, volume 25, number 3, pages 499-518, 2011.
Popescu-Belis, Andrei , Lalanne, Denis and Bourlard, Hervé , Finding Information in Multimedia Records of Meetings , in: Multimedia, IEEE, 2011.
Popescu-Belis, Andrei , Lalanne, Denis and Bourlard, Hervé , Finding Information in Multimedia Records of Meetings , number Idiap-RR-32-2011, 2011.
Popescu-Belis, Andrei , Lalanne, Denis and Bourlard, Hervé , When Users Meet Technology: The Meeting Browser Development Helix , number Idiap-RR-05-2011, 2011.
Rasipuram, Ramya and Magimai.-Doss, Mathew , Improving Articulatory Feature and Phoneme Recognition using Multitask Learning , in: Artificial Neural Networks and Machine Learning - ICANN 2011, pages 299-306, Springer Berlin / Heidelberg, 2011.
Rasipuram, Ramya and Magimai.-Doss, Mathew , Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, pages 5192-5195, 2011.
Rasipuram, Ramya and Magimai.-Doss, Mathew , INTEGRATING ARTICULATORY FEATURES USING KULLBACK-LEIBLER DIVERGENCE BASED ACOUSTIC MODEL FOR PHONEME RECOGNITION , number Idiap-RR-02-2011, 2011.
Rasipuram, Ramya and Magimai.-Doss, Mathew , MULTITASK LEARNING TO IMPROVE ARTICULATORY FEATURE ESTIMATION AND PHONEME RECOGNITION , number Idiap-RR-21-2011, 2011.
Razavi, N. , Gall, J. and Gool, L. Van , Scalable Multi-class Object Detection , in: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 2011.
Roy, Anindya , Magimai.-Doss, Mathew and Marcel, Sébastien , Phoneme Recognition using Boosted Binary Features , in: IEEE Intl. Conference on Acoustics, Speech and Signal Processing 2011, 2011.
Saheer, Lakshmi , Dines, John and Garner, Philip N. , Vocal Tract Length Normalization for Statistical Parametric Speech Synthesis , in: IEEE transactions on audio, speech and langugae processing, 2011.
Sanchez-Cortes, Dairazalia , Aran, Oya , Schmid Mast, Marianne and Gatica-Perez, Daniel , Detecting Emergent Leaders in Small Groups using Nonverbal Behavior , in: IEEE Transactions on Multimedia, 2011.
Scheffler, Carl and Odobez, Jean-Marc , Joint Adaptive Colour Modelling and Skin, Hair and Clothing Segmentation Using Coherent Probabilistic Index Maps , in: British Machine Vision Conference, 2011.
Skoumas, Georgios and Garner, Philip N. , Intuitive Recipes for Uncertainty Decoding with SNR Features for Noise Robust ASR , number Idiap-RR-23-2011, 2011.
Soldo, Serena , Magimai.-Doss, Mathew , Pinto, Joel Praveen and Bourlard, Hervé , Posterior Features for Template-based ASR , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2011.
Soleymani, M. , Koelstra, S. , Patras, I. and Pun, T. , Continuous emotion detection in response to music videos , in: EmoSPACE 2011, 1st Int. Workshop on Emotion Synthesis, rePresentation, and Analysis in Continuous spacE, in conjunction with IEEE FG 2011, 2011.
Suditu, Nicolae and Fleuret, Francois , HEAT: Iterative Relevance Feedback with One Million Images , in: International Conference on Computer Vision, 2011.
Taghizadeh, Mohammad J. , Garner, Philip N. , Bourlard, Hervé , Abutalebi, Hamid Reza and Asaei, Afsaneh , An Integrated Framework for Multi-Channel Multi-Source Localization and Voice Activity Detection , in: The Third Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2011.
Taghizadeh, Mohammad J. , Garner, Philip N. , Bourlard, Hervé , Abutalebi, Hamid Reza and Asaei, Afsaneh , AN INTEGRATED FRAMEWORK FOR MULTI-CHANNEL MULTI-SOURCE LOCALIZATION AND VOICE ACTIVITY DETECTION , number Idiap-RR-16-2011, 2011.
Vajda, P. , Ivanov, I. , Goldmann, L. and Ebrahimi, T. , Let Epitome summarize your photo collection! , in: Proc. International Conference on Multimedia and Expo (ICME'11), 2011.
Vajda, P. , Ivanov, I. , Goldmann, L. and Ebrahimi, T. , Omnidirectional object duplicate detection , in: Proc. International Workshop on Digital Signal Processing (DSPE'11), pages 332-337, 2011.
Vajda, P. , Ivanov, I. , Goldmann, L. and Ebrahimi, T. , Social game Epitome vesus automatic visual analysis , in: Proc. International Conference on Multimedia and Expo (ICME'11), 2011.
Varadarajan, Jagannadan , Emonet, Remi and Odobez, Jean-Marc , A Sequential Topic Model for Mining Recurrent Activities from Video and Audio Data Logs , in: IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011.
Wester, Mirjam and Liang, Hui , Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech , in: Proceedings of Interspeech, 2011.
Wester, Mirjam and Liang, Hui , Cross-Lingual Speaker Discrimination Using Natural and Synthetic Speech , number Idiap-RR-18-2011, 2011.
von Wyl, M. , Mohamed, H. , Bruno, E. and Marchand-Maillet, S. , A Parallel Cross-Modal Search Engine over Large-Scale Multimedia Collections with Interactive Relevance Feedback , in: ACM International Conference on Multimedia Retrieval (ACM-ICMR'11), 2011.
Yazdani, Majid and Popescu-Belis, Andrei , Using a Wikipedia-based Semantic Relatedness Measure for Document Clustering. , in: Graph-based Methods for Natural Language Processing, 2011.
Yüce, Anil , Sorci, M. and Thiran, J. -Ph. , Head pose detection using Fast Robust PCA for Side Active Appearance Models under Occlusion , in: Proceeding of the The 2011 International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCC 2011), 2011.
Yüce, A. , Sorci, M. and Thiran, J. -Ph. , Head pose detection using Fast Robust PCA for Side Active Appearance Models under Occlusion , in: International Conference on Image Processing, Computer Vision, and Pattern Recognition (IPCV 2011), 2011.
Aran, Oya and Akarun, Lale , A Multi-class Classification Strategy for Fisher Scores: Application to Signer Independent Sign Language Recognition , in: Pattern Recognition, volume 43, number 5, pages 1776-1788, 2010. [DOI]
Aran, Oya , Hung, H. and Gatica-Perez, D. , A Multimodal Corpus for Studying Dominance in Small Group Conversations , in: LREC workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Malta, May 2010, 2010.
Aran, Oya and Gatica-Perez, D. , Fusing Audio-Visual Nonverbal Cues to Detect Dominant People in Conversations , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, 2010, Istanbul, Turkey, 2010.
Asaei, Afsaneh , Picart, B. and Bourlard, H. , Analysis of Phone Posterior Feature Space Exploiting Class Specific Sparsity and MLP-based Similarity Measure , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, 2010.
Asaei, Afsaneh , Bourlard, H. and Picart, B. , Investigation of kNN Classifier on Posterior Features Towards Application in Automatic Speech Recognition , number Idiap-RR-11-2010, 2010.
Asaei, Afsaneh , Bourlard, H. and Garner, P. N. , Sparse Component Analysis for Speech Recognition in Multi-Speaker Environment , in: Proceedings of Interspeech, Makuhari, Japan, 2010.
Ba, S. and Odobez, J. -M. , Multi-Person Visual Focus of Attention from Head Pose and Meeting Contextual Cues , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, accepted for publication, november 2009, number Idiap-RR-47-2008, 2010.
Bachour, Khaled , Kaplan, Frédéric and Dillenbourg, Pierre , An Interactive Table for Supporting Participation Balance in Face-to-Face Collaborative Learning , in: IEEE Transactions on Learning Technologies, ISSN 1939-1382, 2010. [DOI]
Biel, Joan-Isaac and Gatica-Perez, Daniel , Vlogcast Yourself: Nonverbal Behavior and Attention in Social Media , in: Proceedings International Conference on Multimodal Interfaces (ICMI-MLMI), 2010.
Bogdan, R. and Gatica-Perez, D. , Inferring competitive role patterns in reality TV show through nonverbal analysis , in: Multimedia Tools and Applications, Special issue on Social Media, 2010.
Bologna, G. , Deville, B. and Pun, T. , Toward local and global perception modules for vision substitution , in: Neurocomputing, volume 74, number 8, pages 1182-1190, 2010.
Boyandin, Ilya , Bertini, E. and Lalanne, D. , Using Flow Maps to Explore Migrations Over Time , in: Proceedings of Geospatial Visual Analytics Workshop in conjunction with The 13th AGILE International Conference on Geographic Information Science, 2010.
Breitenstein, M. D. , Leibe, B. and Gool, Luc Van , Evaluation of Agent Motion in Video: Online Tracking-by-Detection , in: International Conference on Cognitive Systems, 2010.
Breitenstein, M. D. , Reichlin, F. , Leibe, B. , Koller-Meier, E. and Gool, L. Van , Online Multi-Person Tracking-by-Detection from a Single, Uncalibrated Camera , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010.
Bruegger, Pascal , Lisowska, Agnes , Lalanne, Denis and Hirsbrunner, Béat , Enriching the Design and Prototyping Loop: a Set of Tools to Support the Creation of Activity-Based Pervasive Applications , in: Journal of Mobile Multimedia, volume 6, number 4, pages 339-360, 2010.
Buchinger, S. , Simone, F. De , Hotop, E. , Hlavacs, H. and Ebrahimi, T. , Gesture and Touch Controlled Video Player Interface for Mobile Devices , in: Proceedings of the ACM Multimedia International Conference, 2010.
Bunt, Harry , Alexandersson, Jan , Carletta, J. , Choe, Jae-Woong , Fang, Alex , Hasida, Koiti , Lee, Kiyong , Petukhova, Volha , Popescu-Belis, A. , Romary, Laurent , Soria, Claudia and David, Traum. , Towards a standard for dialogue act annotation , in: 7th International Conference on Language Resources and Evaluation, Malta, 2010.
Chen, Hsin-Hsi , Efthimiadis, Efthimis N. , Savoy, Jacques , Crestani, Fabio and Marchand-Maillet, S. , Proceedings of the ACM-SIGIR 2010 conference , ACM Digital Library, 2010.
Chittaranjan, Gokul and Hung, H. , Are you a Werewolf? Detecting deceptive roles and outcomes in a conversational role-playing game , in: IEEE International Conference on Acoustics, Speech and Signal Processing, 2010.
Crestani, F. , Marchand-Maillet, S. , Chen, H. -H. , Efthimiadis, E. N. and Savoy, J. , Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2010 , ACM, New York, USA, 2010.
Deville, B. , Bologna, G. and Pun, T. , Detecting objects and obstacles for visually impaired individuals using visual saliency , in: ASSETS 2010, 12th Int. ACM SigAccess Conf. on Computers and Accessibility, Demonstrations Track, 2010.
Dillenbourg, Pierre and Jermann, Patrick , Technology for Classroom Orchestration , in: New Science of Learning, pages 525-552, Springer Science+Business Media, 2010. [DOI]
Dines, John , Yamagishi, Junichi and King, Simon , Measuring the gap between HMM-based ASR and TTS , number Idiap-RR-34-2010, 2010.
Do, Trinh-Minh-Tri and Gatica-Perez, Daniel , By their apps you shall understand them: mining large-scale patterns of mobile phone usage , in: The 9th International Conference on Mobile and Ubiquitous Multimedia, 2010.
Do, Trinh-Minh-Tri and Artieres, Thierry , Neural conditional random fields , in: Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pages 177-184, JMLR: W\&CP, Chia Laguna, Sardinia, Italy, 2010.
Dumas, Bruno , Lalanne, Denis and Ingold, Rolf , Description languages for multimodal interaction: a set of guidelines and its illustration with SMUIML , in: Journal on Multimodal User Interfaces, volume 3, number 3, pages 237-247, 2010.
Evéquoz, F. , Thomet, Julien and Lalanne, D. , La navigation par facettes appliquée à la gestion de l'information personnelle , in: Proceedings of 22ème Conférence Francophone sur l'Interaction Homme-Machine (IHM'10), 2010.
Evéquoz, Florian , Thomet, Julien and Lalanne, Denis , G\érer son information personnelle au moyen de la navigation par facettes , in: Conference Internationale Francophone sur I'Interaction Homme-Machine, pages 41-48, ACM, 2010.
Fanelli, G. , Gall, J. , Romsdorfer, H. , Weise, T. and Gool, L. Van , 3D Vision Technology for Capturing Multimodal Corpora: Chances and Challenges , in: LREC Workshop on Multimodal Corpora, 2010.
Fanelli, G. , Gall, J. , Romsdorfer, H. , Weise, T. and Gool, L. Van , A 3-D Audio-Visual Corpus of Affective Communication , in: IEEE Transactions on Multimedia, volume 12, number 6, pages 591-598, 2010.
Fanelli, G. , A.Yao, , Noel, P. -L. , Gall, J. and Gool, L. Van , Hough Forest-Based Facial Expression Recognition from Video Sequences , in: International Workshop on Sign, Gesture and Activity (SGA) 2010, in conjunction with ECCV 2010, 2010.
Farrahi, Katayoun and Gatica-Perez, Daniel , Mining Human Location-Routines Using a Multi-Level Approach to Topic Modeling , in: 2010 IEEE Second International Conference on Social Computing, SIN Symposium, 2010.
Farrahi, K. and Gatica-Perez, D. , Mining Human Location-Routines using a Multi-Level Topic Model , number Idiap-RR-28-2010, 2010.
Farrahi, K. and Gatica-Perez, D. , Probabilistic Mining of Socio-Geographic Routines from Mobile Phone Data , in: IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, volume 4, number 4, pages 746-755, 2010.
Gall, J. , Yao, A. and Gool, L. Van , 2D Action Recognition Serves 3D Human Pose Estimation , in: European Conference on Computer Vision, 2010.
Gall, J. , Razavi, N. and Gool, Luc Van , On-line Adaption of Class-specific Codebooks for Instance Trackin , in: British Machine Vision Conference, 2010.
Gall, J. , Razavi, N. and Gool, L. Van , On-line Adaption of Class-specific Codebooks for Instance Tracking , in: British Machine Vision Conference, 2010.
Gammeter, S. , Quack, T. , Tingdahl, D. and van Gool, Luc , Size does matter: improving object recognition and 3D reconstruction with cross-media analysis of image clusters , in: European Conference on Computer Vision (ECCV 2010, 2010.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Autoregressive Models of Amplitude Modulations in Audio Compression , in: IEEE TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2010.
Garau, G. , Dielmann, A. and Bourlard, H. , Audio-Visual Synchronisation for Speaker Diarisation , in: International Conference on Speech and Language Processing, Interspeech, Makuhari, Japan, 2010.
Garau, G. and Bourlard, H. , Using Audio and Visual Cues for Speaker Diarisation Initialisation , in: International Conference on Acoustics, Speech and Signal Processing, 2010.
Garner, P. N. and Dines, J. , Tracter: A Lightweight Dataflow Framework , in: Proceedings of Interspeech, Makuhari, Japan, 2010.
Garner, P. N. and Dines, J. , Tracter: A Lightweight Dataflow Framework , number Idiap-RR-10-2010, 2010.
Gatica-Perez, D. and Odobez, J. -M. , Visual Attention, Speaking Activity, and Group Conversational Analysis in Multi-Sensor Environments , in: In H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, 2010.
Goldmann, L. , Simone, F. De and Ebrahimi, T. , A Comprehensive Database and Subjective Evaluation Methodology for Quality of Experience in Stereoscopic Video , in: Proceedings of SPIE, 2010.
Goldmann, L. , Simone, F. De and Ebrahimi, T. , Impact of acquisition distortion on the quality of stereoscopic images , in: Proceedings of International Workshop on Video Processing and Quality Metrics for Consumer Electronics, 2010.
Gomez, J. D. , Bologna, G. and Pun, T. , Color-audio encoding interface for visual substitution: See Color Matlab-based demo , in: ASSETS 2010, 12th Int. ACM SigAccess Conf. on Computers and Accessibility, Demonstrations Track, 2010.
Hadjar, Karim and Ingold, Rolf , Improving XED for extracting content from Arabic PDFs , in: Document Analysis Systems, pages 371-376, 2010.
Haegler, S. , Wonka, P. , Arisona, Stefan Mueller , Gool, Luc Van and Müller, P. , Grammar-Based Encoding of Facades , in: EGSR, 2010.
Hain, T. , Burget, Lukas , Dines, J. , Garner, P. N. , El Hannani, A. , Huijbregts, M. , Karafiat, M. , Lincoln, M. and Wan, V. , The AMIDA 2009 Meeting Transcription System , in: Proceedings of Interspeech, Makuhari, Japan, 2010.
Hung, H. and Gatica-Perez, D. , Estimating Cohesion in Small Groups using Audio-Visual Nonverbal Behavior , number Idiap-RR-12-2010, 2010.
Hung, H. , Huang, Y. , Friedland, G. and Gatica-Perez, D. , Estimating Dominance in Multi-Party Meetings Using Speaker Diarization , in: IEEE Transactions on Audio, Speech, and Language Processing, 2010.
Hung, H. and Chittaranjan, Gokul , The Wolf Corpus: Exploring group behaviour in a competitive role-playing game , in: ACM Multimedia, 2010.
Imseng, D. and Friedland, G. , An Adaptive Initialization Method for Speaker Diarization based on Prosodic Features , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, pages 4946-4949, Dallas, USA, 2010.
Imseng, D. , Magimai-Doss, M. and Bourlard, H. , Hierarchical Multilayer Perceptron based Language Identification , in: Proceedings of Interspeech, Makuhari, Japan, 2010.
Imseng, D. , Bourlard, H. and Magimai-Doss, M. , Towards mixed language speech recognition systems , in: Proceedings of Interspeech, Makuhari, Japan, 2010.
Imseng, David and Friedland, Gerald , Tuning-Robust Initialization Methods for Speaker Diarization , number Idiap-RR-35-2010, 2010.
Ivanov, I. , Vajda, P. , Lee, J. -S. and Ebrahimi, T. , Epitome- a social game for photo album summarization , in: Proceedings of the International Workshop on Connected Multimedia, 2010.
Ivanov, I. , Vajda, P. , Lee, J. -S. , Goldmann, L. and Ebrahimi, T. , Geotag propagation in social networks based on user trust model , in: Multimedia Tools and Application, 2010.
Ivanov, I. , Vajda, P. , Goldmann, L. , Lee, J. -S. and Ebrahimi, T. , Object-based tag propagation for semi-automatic annotation of images , in: Proceedings of the ACM SIGMM International Conference on Multimedia Information Retrieval, pages 497-506, 2010.
Jayagopi, D. and Gatica-Perez, D. , Mining group nonverbal conversational patterns using probabilistic topic models , in: IEEE Transactions on Multimedia, 2010.
Jayagopi, Dinesh Babu , Kim, Taemie , Pentland, Alex and Gatica-Perez, Daniel , Recognizing conversational context in group interaction using privacy-sensitive mobile sensors , in: Proceedings of International Conference on Mobile and Ubiquitous Multimedia, Limassol, Cyprus, 2010.
Kierkels, J. , Soleymani, M. and Pun, T. , Identification of narrative peaks in clips: text features perform best , in: VideoCLEF 2009, Cross Language Evaluation Forum (CLEF) Workshop, Post-Conference Proceedings, Springer LNCS, 2010.
Knopp, J. , Prasad, M. , Willems, G. , Timofte, R. and Gool, L. Van , Hough Transform and 3D SURF for robust three dimensional classification , in: Proceedings of the European Conference on Computer Vision, 2010.
Knopp, J. , Prasad, M. and Gool, L. Van , Orientation invariant 3D object Classification using Hough Transform based methods , in: Proceedings of the ACM workshop on 3D object retrieval, 2010.
Koelstra, S. , Yazdani, A. , Soleymani, M. , Muehl, C. , Lee, J. -S. , Nijholt, A. , Pun, T. , Ebrahimi, T. and Patras, I. , Single trial classification of EEG and peripheral physiological signals for recognition of emotions induced by music videos , in: Proceedings of the International Conference on Brain Informatics, 2010.
Koelstra, S. , Yazdani, A. , Soleymani, M. , Muehl, C. , Lee, J. -S. , Nijholt, A. , Pun, T. , Ebrahimi, T. and Patras, I. , Single trial classification of EEG and peripheral physiological signals for recognition of emotions induced by music videos , in: Brain Informatics, 2010.
Kompatsiaris, I. , Marchand-Maillet, S. , Marcel, S. and van Zwol, R. , Image and Video Retrieval: Theory and Applications , Springer, 2010.
Korchagin, D. , Garner, P. N. and Dines, J. , Automatic Temporal Alignment of AV Data with Confidence Estimation , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010.
Korchagin, D. , Garner, P. N. and Motlicek, P. , Hands Free Audio Analysis from Home Entertainment , in: Proceedings of Interspeech, Makuhari, Japan, 2010.
Kuettel, D. , Breitenstein, M. D. , Gool, Luc Van and Ferrari, V. , What’s going on? Discovering Spatio-Temporal Dependencies in Dynamic Scenes , in: IEEE Conference on Computer Vision and Pattern Recognition, 2010.
Kurimo, Mikko , Byrne, William , Dines, John , Garner, Philip N. , Gibson, Matthew , Guan, Yong , Hirsimäki, Teemu , Karhila, Reima , King, Simon , Liang, Hui , Oura, Keiichiro , Saheer, Lakshmi , Shannon, Matt , Shiota, Sayaka , Tian, Jilei , Tokuda, Keiichi , Wester, Mirjam , Wu, Yi-Jian and Yamagishi, Junichi , Personalising speech-to-speech translation in the EMIME project , in: Proceedings of the ACL 2010 System Demonstrations, Association for Computational Linguistics, 2010.
Lalos, C. , Grabner, H. , Gool, L. Van and Varvarigo, T. , Object Fow: Learning object displacement , in: roceeding IEEE Workshop on Visual Surveillance, 2010.
Lee, J. -S. , Simone, F. De , Ramzan, N. , Zhao, Z. , Kurutepe, E. , Sikora, T. , Ostermann, J. , Izquierdo, E. and Ebrahimi, T. , Subjective evaluation of scalable video coding for content distribution , in: Proceedings of the ACM Multimedia International Conference, 2010.
Lee, J. -S. , Simone, F. De and Ebrahimi, T. , Video coding based on audio-visual focus of attention , in: Journal of Visual Communication and Image Representation, 2010.
Lefèvre, Stéphanie and Odobez, Jean-Marc , View-Based Appearance Model Online Learning for 3D Deformable Face Tracking , in: Proc. Int. Conf. on Computer Vision Theory and Applications, 2010.
Liang, H. , Dines, J. and Saheer, L. , A Comparison of Supervised and Unsupervised Cross-Lingual Speaker Adaptation Approaches for HMM-Based Speech Synthesis , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, pages 4598-4601, Dallas, U.S.A., 2010.
Liang, H. and Dines, John , An Analysis of Language Mismatch in HMM State Mapping-Based Cross-Lingual Speaker Adaptation , in: Proceedings of Interspeech, Makuhari, Japan, 2010.
Luo, J. , Orabona, F. , Fornoni, Marco , Caputo, B. and Cesa-Bianchi, Nicolo , OM-2: An Online Multi-class Multi-kernel Learning Algorithm , number Idiap-RR-06-2010, 2010.
Mansfield, A. , Gehler, P. , Gool, L. Van and Rother, C. , Scene Carving: Scene Consistent Image Retargeting , in: European Conference on Computer Vision (ECCV), 2010.
Mansfield, A. , Gehler, P. , Gool, L. Van and Rothe, C. , Visibility Maps for Improving Seam Carving , in: Media Retargeting Workshop, European Conference on Computer Vision (ECCV), 2010.
Marcel, S. , McCool, C. , Matejka, Pavel , Ahonen, Timo and Cernocky, Jan , Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation , number Idiap-RR-09-2010, 2010.
Marcel, S. , McCool, C. , Atanasoaei, Cosmin , Tarsetti, Flavio , Pesan, Jan , Matejka, Pavel , Cernocky, Jan , Helistekangas, Mika and Turtinen, Markus , MOBIO: Mobile Biometric Face and Speaker Authentication , number Idiap-RR-31-2010, 2010.
Marcel, S. , McCool, C. , Matejka, Pavel , Ahonen, Timo , Cernocky, Jan and al, , On the Results of the First Mobile Biometry (MOBIO) Face and Speaker Verification Evaluation , number Idiap-RR-30-2010, 2010.
Mekhaldi, Dalila and Lalanne, Denis , Multimodal Document Alignment: Feature-based Validation to Strengthen Thematic Links , in: Journal of Multimedia Processing Technologies, volume 1, number 1, pages 30-46, 2010.
Mohammadi, Gelareh , Vinciarelli, Alessandro and Mortillaro, Marcello , The Voice of Personality: Mapping Nonverbal Vocal Behavior into Trait Attributions , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010.
Montoliu, Raul. and Gatica-Perez, Daniel , Discovering Human Places of Interest from Multimodal Mobile Phone Data , in: Proceedings of 9th International Conference on on Mobile and Ubiquitous Multimedia, 2010.
Morrison, D. , Bruno, E. and Marchand-Maillet, S. , Capturing the semantics of user interaction: A review and case study , in: Emergent Web Intelligence: Advanced Information Retrieval, Springer, 2010.
Morrison, D. , Bruno, E. and Marchand-Maillet, S. , TagCaptcha: Annotating images with CAPTCHAs , in: ACM Multimedia 2010, 2010.
Morrison, D. , Bruno, E. and Marchand-Maillet, S. , TagCaptcha: Annotating images with CAPTCHAs , in: ACM MULTIMEDIA 2010 (Demo Program), 2010.
Motlicek, P. , Garner, P. N. , Guillemot, M. and Bozzo, Vincent , AMIDA/Klewel Mini-Project , number Idiap-RR-03-2010, 2010.
Motlicek, P. and Valente, F. , Application of Out-Of-Language Detection To Spoken-Term Detection , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010.
Motlicek, P. , Valente, F. and Garner, P. N. , English Spoken Term Detection in Multilingual Recordings , in: Proceedings of Interspeech, Makuhari, Japan, 2010, ISCA, Makuhari, Japan, 2010.
Motlicek, P. , Ganapathy, S. , Hermansky, H. and Garudadri, H. , Wide-Band Audio Coding based on Frequency Domain Linear Prediction , in: EURASIP Journal on Audio Speech and Music Processing, volume 2010, number 856280, pages 14, 2010. [DOI]
Murino, V , Cristani, M and Vinciarelli, Alessandro , Socially Intelligent Surveillance and Monitoring: Analysing Social Dimensions of Physical Space , in: Proceedings of International Workshop on Socially Intelligent Surveillance and Monitoring, pages 51-58, 2010.
Nater, F. , Vangeneugden, J. , Grabner, H. , Gool, L. Van and Vogels, R. , Discrimination of locomotion direction at different speeds: A comparison between macaque monkeys and algorithms , in: ECML Workshop on rare audio-visual cues, 2010.
Nater, Fabian , Grabner, Helmut and Gool, Luc Van , Exploiting simple hierarchies for unsupervised human behavior analysis , in: CVPR, 2010.
Nater, Fabian , Grabner, Helmut and Gool, Luc Van , Visual abnormal event detection for prologed independent livin , in: IEEE Healthcom Workshop on mHealth, 2010.
Negoescu, R. -A. , Loui, Alexander and Gatica-Perez, D. , Kodak Moments and Flickr Diamonds: How Users Shape Large-scale Media , number Idiap-RR-20-2010, 2010.
Negoescu, R. -A. and Gatica-Perez, D. , Modeling and Understanding Flickr Communities through Topic-based Analysis , number Idiap-RR-19-2010, 2010.
Negoescu, R. -A. and Gatica-Perez, D. , Modeling and Understanding Flickr Communities through Topic-based Analysis , in: IEEE Transactions on Multimedia, volume 12, number 5, pages 399-416, ISSN 1520-9210, 2010. [DOI]
Orabona, F. , Luo, J. and Caputo, B. , Online-Batch Strongly Convex Multi Kernel Learning , in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2010.
Parthasarathi, S. H. K. , Magimai-Doss, M. , Bourlard, H. and Gatica-Perez, D. , Evaluating the Robustness of Privacy-Sensitive Audio Features for Speech Detection in Personal Audio Log Scenarios , in: ICASSP 2010, 2010.
Pellegrini, S. , Ess, A. and Gool, L. Van , Improving Data Association by Joint Modeling of Pedestrian Trajectories and Groupings , in: European Conference on Computer Vision (ECCV), 2010.
Pellegrini, S. , Ess, A. , Tanaskovic, M. and Gool, L. Van , Wrong Turn - No Dead End: a Stochastic Pedestrian Motion Model , in: International Workshop on Socially Intelligent Surveillance and Monitoring (SISM), 2010.
Pinto, J. P. , Sivaram, G. S. V. S. , Magimai-Doss, M. , Hermansky, H. and Bourlard, H. , Analysis of MLP Based Hierarchical Phoneme Posterior Probability Estimator , in: IEEE Transcations on Audio, Speech, and Language Processing, 2010.
Pinto, Joel Praveen , Magimai.-Doss, Mathew and Bourlard, Hervé , Hierarchical Tandem Features for ASR in Mandarin , number Idiap-RR-39-2010, 2010.
Pinto, J. P. , Multilayer Perceptron Based Hierarchical Acoustic Modeling for Automatic Speech Recognition , Ecole polytechnique fédérale de Lausanne, 2010.
Popescu-Belis, Andrei , Kilgour, Jonathan , Poller, Peter , Nanchen, Alexandre , Boertjes, Erik and de Wit, Joost , Automatic Content Linking: Speech-based Just-in-time Retrieval for Multimedia Archives , in: Proceedings of the 33rd Annual ACM SIGIR Conference, pages 703, 2010.
Popescu-Belis, A. , Finding without searching , number Idiap-Com-01-2010, 2010.
Popescu-Belis, A. , Kilgour, J. , Nanchen, A. and Poller, P. , The ACLD: Speech-based Just-in-Time Retrieval of Meeting Transcripts, Documents and Websites , in: ACM Multimedia Workshop on Searching Spontaneous Conversational Speech, Florence, Italy, 2010.
Popescu-Belis, A. , Kilgour, J. , Nanchen, A. and Poller, P. , The ACLD: Speech-based Just-in-Time Retrieval of Multimedia Documents and Websites , number Idiap-RR-26-2010, 2010.
Pronobis, A. , Luo, J. and Caputo, Barbara , The More you Learn, the Less you Store: Memory-controlled Incremental SVM for Visual Place Recognition , in: Image and Vision Computing, 2010. [DOI]
Razavi, N. , Gall, J. and Gool, Luc Van , Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections , in: European Conference on Computer Vision, 2010.
Roy, A. , Magimai-Doss, M. and Marcel, S. , BOOSTED BINARY FEATURES FOR NOISE-ROBUST SPEAKER VERIFICATION , in: 2010 IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, Texas, 2010.
Roy, A. and Marcel, S. , Crossmodal Matching of Speakers using Lip and Voice Features in Temporally Non-overlapping Audio and Video Streams , in: 20th International Conference on Pattern Recognition, Istanbul, Turkey, International Association for Pattern Recognition (IAPR), Istanbul, Turkey, 2010.
Roy, A. and Marcel, S. , Introducing Crossmodal Biometrics:Person Identification from Distinct Audio \& Visual Streams , in: IEEE Fourth International Conference on Biometrics: Theory, Applications and Systems, 2010.
Roy, A. and Marcel, S. , Visual processing-inspired Fern-Audio features for Noise-Robust Speaker Verification , in: ACM 25th Symposium on Applied Computing, 2010, Sierre, Switzerland, Association for Computing Machinery, 2010.
Saheer, Lakshmi , Saheer, L. , Dines, John , Dines, J. , Garner, Philip N. , Garner, P. N. , Liang, H. and Liang, Hui , Implementation of VTLN for Statistical Speech Synthesis , in: Proceedings of ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010.
Saheer, Lakshmi , Dines, John , Garner, Philip N. and Liang, Hui , Implementation of VTLN for Statistical Speech Synthesis , number Idiap-RR-32-2010, 2010.
Saheer, L. , Garner, P. N. and Dines, J. , Study of Jacobian Normalization for VTLN , number Idiap-RR-25-2010, 2010.
Saheer, L. , Garner, P. N. , Dines, J. and Liang, H. , VTLN Adaptation for Statistical Speech Synthesis , in: Proceedings of ICASSP, Dallas, Texas, 2010.
Sanchez-Cortes, Dairazalia , Aran, Oya , Schmid Mast, Marianne and Gatica-Perez, Daniel , Identifying Emergent Leadership in Small Groups using Nonverbal Communicative Cues , in: Proc. ICMI-MLMI '10 International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction, ACM New York, NY, USA \textcopyright2010, 2010.
Schwaller, Matthias , Lalanne, Denis and Khaled, Omar Abou , PyGmI: creation and evaluation of a portable gestural interface , in: NordiCHI, pages 773-776, 2010.
Simone, F. De , Tagliasacchi, M. , Naccari, M. , Tubaro, S. and Ebrahimi, T. , A H.264/AVC video database for the evaluation of quality metrics , in: Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pages 2430-2433, 2010.
Simone, F. De , Goldmann, L. , Filimonov, D. and Ebrahimi, T. , On the limits of perceptually optimized JPEG , in: Proceedings of International Workshop on Video Processing and Quality Metrics for Consumer Electronics, 2010.
Simone, F. De , Goldmann, L. , Lee, J. -S. , Ebrahimi, T. and Baroncini, V. , Subjective evaluation of next-generation video compression algorithm: a case study , in: Proceedings of SPIE, 2010.
Soleymani, M. and Larson, M. , Crowdsourcing for affective annotation of video: development of a viewer-reported boredom corpus , in: 33th ACM SIGIR, Workshop on Crowdsourcing for Search Evaluatio, 2010.
Sorci, M. , Antonini, G. , Cruz Mota, J. , Rubin, T. , Bierlaire, M. and Thiran, J. -Ph. , Modelling human perception of static facial expressions , in: Image and Vision Computing, volume 28, number 5, pages 790-806, ISSN 0262-8856, 2010. [DOI]
Sproewitz, Alexander , Pouya, Soha , Bonardi, Stéphane , van den Kieboom, Jesse , Moeckel, Rico , Billard, A. , Dillenbourg, Pierre and Ijspeert, Auke , Roombots: Reconfigurable Robots for Adaptive Furniture , in: IEEE Computational Intelligence Magazine, special issue on "Evolutionary and developmental approaches to robotics", 2010. [DOI]
Stalder, S. , Grabner, H. and Gool, L. Van , Cascaded Confidence Filtering for Improved Tracking-by-Detectio , in: European Conference on Computer Vision (ECCV), 2010.
Subburaman, Venkatesh Bala and Marcel, S. , An Alternative Scanning Strategy to Detect Faces , in: Proceedings IEEE International Conference on Acoustics, Speech and Signal Processing, Dallas, USA, 2010.
Vajda, P. , Ivanov, I. , Goldmann, L. , Lee, J. -S. and Ebrahimi, T. , 3D object duplicate detection for video retrieval , in: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services, 2010.
Vajda, P. , Ivanov, I. , Lee, J. -S. , Goldmann, L. and Ebrahimi, T. , Propagation of geotags based on object duplicate detection , in: Proceedings of SPIE, 2010.
Vajda, P. , Ivanov, I. , Goldmann, L. , Lee, J. -S. and Ebrahimi, T. , Robust duplicate detection of 2D and 3D objects , in: International Journal of Multimedia Data Engineering and Management, 2010.
Valente, Fabio , Magimai.-Doss, Mathew , Plahl, Christian , Suman, Ravuri and Wen, Wang , A Comparative Study of MLP Front-ends for Mandarin ASR , in: Proceedings of Interspeech, Japan, 2010.
Valente, Fabio , Hierarchical and Parallel Processing of Auditory and Modulation Frequencies for Automatic Speech Recognition , in: Speech Communication, volume 52, number 10, 2010.
Valente, Fabio and Vinciarelli, Alessandro , Improving Speech Processing trough Social Signals: Automatic Speaker Segmentation of Political Debates using Role based Turn-Taking Patterns. , in: Proceedings of ACM Multimedia Workshop on Social Signal Processing, 2010.
Valente, Fabio , Multi-Stream Speech Recognition based on Dempster-Shafer Combination Rule , in: Speech Communication, volume 52, number 3, 2010.
Varadarajan, Jagannadan , Emonet, Remi and Odobez, Jean-Marc , A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining , in: NIPS-2010 Workshop on Practical Applications of Sparse Modeling: Open Issues and New Directions, 2010.
Varadarajan, Jagannadan , Emonet, Remi and Odobez, Jean-Marc , A Sparsity Constraint for Topic Models - Application to Temporal Activity Mining , number Idiap-RR-36-2010, 2010.
Varadarajan, Jagannadan , Emonet, Remi , Odobez, Jean-Marc and Odobez, J. -M. , Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes , in: BMVC 2010, pages 117.1-117.11, BMVA Press, Aberystwyth University, Aberystwyth, 2010.
Varadarajan, Jagannadan , Emonet, Remi and Odobez, Jean-Marc , Probabilistic Latent Sequential Motifs: Discovering temporal activity patterns in video scenes , number Idiap-RR-33-2010, 2010.
Verdet, Florian , Matrouf, Driss , Bonastre, Jean-François and Hennebert, Jean , Channel detectors for system fusion in the context of NIST LRE 2009 , in: INTERSPEECH, pages 733-736, 2010.
Veres, G. , Grabner, H. , Middleton, L. and Gool, L. Van , Automatic Workflow Monitoring in Industrial Environments , in: Proceedings Asian Conference on Computer Vision (ACCV), 2010.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Advances in Fast Multistream Diarization based on the Information Bottleneck Framework , number Idiap-RR-23-2010, 2010.
Vijayasenan, D. , Valente, F. and Bourlard, H. , An Information Theoretic Combination of MFCC and TDOA Features for Speaker Diarization , number Idiap-RR-22-2010, 2010.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Multistream Speaker Diarization beyond Two Acoustic Feature Streams , in: International Conference on Acoustics, Speech, and Signal Processing, 2010.
Vinciarelli, Alessandro , Human Behavior Understanding , Springer Verlag, 2010.
Vinciarelli, Alessandro , Murray-Smith, Roderick and Bourlard, Hervé , Mobile Social Signal Processing: vision and research issues , in: Proceedings of the International Workshop on Mobile HCI, pages 513-516, 2010.
Vinciarelli, Alessandro and Valente, Fabio , Social Signal Processing: Understanding Nonverbal Communication in Social Interactions , in: Proceedings of Measuring Behavior 2010, Eindhoven (The Netherlands), 2010.
Vinciarelli, Alessandro and Pantic, Maja , www.sspnet.eu: A Web Portal for Social Signal Processing , in: IEEE Signal Processing Magazine, volume 27, number 4, pages 142-144, 2010.
Wester, Mirjam , Dines, J. , Gibson, Matthew , Liang, H. , Wu, Yi-Jian , Saheer, L. , King, S. , Oura, Keiichiro , Garner, P. N. , Byrne, William , Guan, Yong , Hirsimäki, Teemu , Karhila, Reima , Kurimo, Mikko , Shannon, Matt , Shiota, Sayaka , Tian, Jilei , Tokuda, Keiichi and Yamagishi, J. , Speaker adaptation and the evaluation of speaker similarity in the EMIME speech-to-speech translation project , in: Proceedings of the 7th ISCA Speech Synthesis Workshop, Kyoto, Japan, 2010.
Yao, A. , Gall, J. and Gool, L. Van , A Hough Transform-Based Voting Framework for Action Recognition , in: IEEE Conference on Computer Vision and Pattern Recognition, 2010.
Yao, A. , Uebersax, D. , Gall, J. and Gool, L. Van , Tracking in Broadcast Sports , in: 32nd Annual Symposium of the German Association for Pattern Recognition, 2010.
Yazdani, M. and Popescu-Belis, A. , A Random Walk Framework to Compute Textual Semantic Similarity: a Unified Model for Three Benchmark Tasks , in: Proceedings of the 4th IEEE International Conference on Semantic Computing (ICSC 2010), Carnegie Mellon University, Pittsburgh, PA, USA, 2010.
Proceedings of the 2010 ACM Symposium on Document Engineering, Manchester, United Kingdom, September 21-24, 2010 , ACM, 2010.
Marchand-Maillet, S. , Morrison, D. , Szekely, E. and Bruno, E. , Interactive Representations of Multimodal Databases , in: Multimodal Signal Processing for Human Computer Interaction, Academis Press, 2010.
Negoescu, R. -A. and Gatica-Perez, D. , Flickr Groups: Multimedia Communities for Multimedia Analysis , number Idiap-RR-18-2010, 2010.
Floor Holder Detection and End of Speaker Turn Prediction in Meetings , in: International Conference on Speech and Language Processing, Interspeech, ISCA, Makuhari, Japan, 2010.
Towards rich mobile phone datasets: Lausanne data collection campaign , in: Proc. ACM Int. Conf. on Pervasive Services (ICPS), Berlin., 2010.
Voices of Vlogging , in: Proc. AAAI Int. Conf. on Weblogs and Social Media (ICWSM), Washington DC, 2010.
Ali, K. , Fleuret, F. , Hasler, D. and Fua, P. , Joint learning of pose estimators and features for object detection , in: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Posterior features applied to speech recognition tasks with user-defined vocabulary , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
Aschwanden, Gideon , Haegler, S. , Halatsch, Jan , Jecker, Raphael , Schmitt, Gerhard and Gool, Luc Van , Evaluation of 3D City Models Using Automatic Placed Urban Agents , in: CONVR, 2009.
Ba, S. and Odobez, J. -M. , Recognizing human visual focus of attention from head pose in meetings , in: IEEE Trans. on System, Man and Cybernetics: part B, Man, volume 39, number 1, pages 16-34, 2009.
Ba, S. , Hung, H. and Odobez, J. -M. , Visual activity context for focus of attention estimation in dynamic meetings , in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2009.
Baechler, M. , Bloechle, J. -L. , Humm, A. , Ingold, R. and Hennebert, J. , Labeled images verification using gaussian mixture models , in: Proceedings of 24th Annual ACM Symposium on Applied Computing (ACM SAC'09), pages 1331-1336, 2009.
Baker, J. , Deng, L. , Glass, J. , Khudanpur, S. , Lee, C. -H. , Morgan, N. and O'Shgughnessy, D. , Research developments and directions in speech recognition and understanding , in: IEEE Signal Processing Magazine, volume 26, number 4, pages 78-85, 2009.
Baker, J. , Deng, L. , Glass, J. , Khudanpur, S. , Lee, C. -H. , Morgan, N. and O'Shgughnessy, D. , Research developments and directions in speech recognition and understanding , in: IEEE Signal Processing Magazine, volume 26, number 3, pages 75-80, 2009.
Beekhof, F. , Voloshynovskiy, S. , Koval, O. and Holotyak, T. , Multi-class classifiers based on binary classifiers: performance, efficiency, and minimum coding matrix distances , in: MLSP 2009, 2009.
Bellotto, N. , Sommerlade, E. , Benfold, B. , Bibby, C. , Reid, I. , Roth, D. , Gool, L. Van , Fernandez, C. and Gonzalez, J. , A Distributed Camera System for Multi-Resolution Surveillance , in: Third ACM/IEEE International Conference on Distributed Smart Cameras, 2009.
Berclaz, J. , Shahrokni, A. , Fleuret, F. , Ferryman, James and Fua, P. , Evaluation of Probabilistic Occupancy Map People Detection for Surveillance Systems , in: Proceedings of the IEEE International Workshop on Performance Evaluation of Tracking and Surveillance, 2009.
Berclaz, J. , Fleuret, F. and Fua, P. , Multiple object tracking using flow linear programming , number 10-2009, 2009.
den Bergh, M. Van , Bosche, F. , Koller-Meier, E. and Gool, L. Van , Haarlet-based Hand Gesture Recognition for 3D Interaction , in: Proceedings of the IEEE Workshop on Applications of Computer Vision, 2009.
den Bergh, Michael Van , Kehl, Roland , Koller-Meier, E. and Gool, Luc Van , Real-time 3D Body Pose Estimation , in: Multi-Camera Networks: Concepts and Applications, pages 335-360, Elsevier, 2009.
den Bergh, Michael Van , Koller-Meier, E. and Gool, Luc Van , Real-time Body Pose Recognition using 2D or 3D Haarlets , in: International Journal of Computer Vision, volume 83, pages 72-84, 2009.
den Bergh, M. Van , Halatsch, J. , Kunze, A. , Bosche, F. , Gool, L. Van and Schmitt, G. , Towards Collaborative Interaction with Large nD Models for Effective Project Management , in: 9th International Conference on Construction Applications of Virtual Reality, 2009.
Bertini, E. , Lalanne, D. and Rigamonti, M. , Extended excentric labeling , in: International Journal of the Eurographics Association, volume 28, 2009.
Bertini, E. and Lalanne, D. , Investigating and reflecting on the integration of automatic data analysis and visualization in knowledge discovery , in: ACM SIGKDD Explorations, volume 22, 2009.
Bertini, E. and Lalanne, D. , Surveying the complementary roles of automatic data analysis and visualization in knowledge discovery , in: Proceedings of ACM SIGKDD Workshop on Visual Analytics and Knowledge Discovery, VAKD '09, 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (VAKD 2009), pages 12-20, 2009.
Biel, Joan-Isaac and Gatica-Perez, D. , Wearing a YouTube hat: directors, comedians, gurus, and user aggregated behavior , in: Proceedings of the 17th ACM International Conference on Multimedia, pages 833-836, ACM, 2009.
Bloechle, J. -L. , Lalanne, D. and Ingold, R. , Ocd: an optimized and canonical document format , in: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR 2009), pages 236-240, 2009.
Bologna, G. , Deville, B. and Pun, T. , Blind navigation along a sinuous path by means of the see color interface , in: IWINAC2009, 3rd International Work-conference on the Interplay between Natural and Artificial Computation, Santiago de Compostela, Spain, June 22--27, 2009.
Bologna, G. , Deville, B. and Pun, T. , On the use of the auditory pathway to represent image scenes in real-time , in: Neurocomputing, volume 72, pages 839-849, 2009.
Bologna, G. , Malandain, S. , Deville, B. and Pun, T. , The multi-touch see color interface , in: ICTA 2009, The 2nd International Conference on Information and Communication Technologies and Accessibility, Hammamet, Tunisia, May 7--9, 2009.
Bosché, F. , Haas, C. T. and Akinci, B. , Automated Recognition of 3D CAD Objects in Site Laser Scans for Project 3D Status Visualization and Performance Control , in: ASCE Journal of Computing in Civil Engineering, volume 23, number 6, pages 311-318, 2009.
Breitenstein, M. D. , Grabner, Helmut and Gool, Luc Van , Hunting Nessie -- Real-Time Abnormality Detection from Webcams , in: IEEE International Workshop on Visual Surveillance, 2009.
Breitenstein, M. D. , Reichlin, Fabian , Leibe, B. , Koller-Meier, E. and Gool, Luc Van , Robust Tracking-by-Detection using a Detector Confidence Particle Filter , in: IEEE International Conference on Computer Vision, 2009.
Bruegger, Pascal , Lalanne, D. , Lisowska, A. and Hirsbrunner, B. , A Method and Tools for Designing and Prototyping Activity-based Pervasive Applications , in: Proceedings of 7th International Conference on Advances in Mobile Computing & Multimedia (ACM MoMM 2009), pages 129-136, 2009.
Bruno, E. and Marchand-Maillet, S. , Multimodal preference aggregation for multimedia information retrieval , in: To appear in Journal of Multimedia, 2009.
Bruno, E. and Marchand-Maillet, S. , multiview clustering: a late fusion approach using latent models , in: Proceedings of the 32nd ACM Special Interest Group on Information Retrieval Conference, SIGIR 09, 2009.
Caputo, B. , Hayman, E. , Fritz, M. and Ekluhnd, J. -O , Classifying Material in the Real World , in: Image and vision Computing, volume accepted for pub, 2009.
Chanel, G. , Kierkels, J. , Soleymani, M. and Pun, T. , short-term emotion assessment in a recall paradigm , in: International Journal of Human-Computer Studies, volume 67, number 8, pages 607-627, 2009.
Dines, J. , Yamagishi, J. and King, S. , Measuring the gap between HMM-based ASR and TTS , in: Proceedings of Interspeech, Brighton, U.K., 2009.
Dines, J. , Saheer, L. and Liang, H. , Speech recognition with speech synthesis models by marginalising over decision tree leaves , in: Proceedings of Interspeech, Brighton, U.K., 2009.
Drygajlo, A. , Li, W. and Zhu, K. , Q-stack aging model for face verification , in: 17th European Signal Processing Conference, 2009.
Duffner, S. , Odobez, J. -M. and Ricci, E. , Dynamic Partitioned Sampling For Tracking With Discriminative Features , in: Proceedings of the British Maschine Vision Conference, London, 2009.
Dumas, B. , Lalanne, D. and Ingold, R. , Benchmarking fusion engines of multimodal interactive systems , in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
Dumas, B. , Lalanne, D. and Ingold, R. , Description Languages for Multimodal Interaction: a Set of Guidelines , in: Journal on Multimodal User Interfaces, volume 3, 2009.
Dumas, B. , Lalanne, D. and Ingold, R. , HephaisTK: A Toolkit for Rapid Prototyping of Multimodal Interfaces , in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), pages 231-232, 2009.
Dylla, K. , Müller, P. , Ulmer, A. , Haegler, S. and Fischer, B. , Rome Reborn 2.0: A Framework for Virtual City Reconstruction Using Procedural Modeling Techniques , in: Proceedings of Computer Applications and Quantitative Methods in Archaeology, 2009.
Eichner, Marcin and Ferrari, V. , Better Appearance Models for Pictorial Structures , in: British Machine Vision Conference, 2009.
Ess, A. , Schindler, K. , Leibe, B. and van Gool, L. , Improved Multi-Person Tracking with Active Occlusion Handling , in: ICRA Workshop on People Detection and Tracking, 2009.
Ess, A. , Leibe, B. , Schindler, K. and Gool, L. Van , Moving Obstacle Detection in Highly Dynamic Scenes , in: IEEE International Conference on Robotics and Automation, 2009.
Ess, A. , Leibe, B. , Schindler, K. and Gool, L. Van , Robust Multi-Person Tracking from a Mobile Platform , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 31, number 10, pages 1831-1846, 2009.
Estellers, Virginia , Gurban, M. and Thiran, J. -Ph. , SELECTING RELEVANT VISUAL FEATURES FOR SPEECHREADING , in: Proc. of the IEEE International Conference on Image Processing, Cairo, 2009.
Estrella, P. , Popescu-Belis, A. and King, M. , The FEMTI guidelines for contextual MT evaluation: principles and tools , in: Linguistica Antverpiensia New Series, volume 8, 2009.
Evéquoz, F. and Lalanne, D. , "I Thought You Would Show Me How To Do It" -- Studying and Supporting PIM Strategy Changes , in: Proceedings of ASIS&T PIM Workshop (ASIS&T 2009), 2009.
Evéquoz, F. , An Ethnographically-Inspired Survey of PIM Strategies. Technical Report , 2009.
Fanelli, G. , Gall, J. and Gool, L. Van , Hough Transform-based Mouth Localization for Audio-Visual Speech Recognition , in: British Machine Vision Conference, 2009.
Farrahi, K. and Gatica-Perez, D. , Learning and Predicting Multimodal Daily Life Patterns from Cell Phones , in: ICMI-MLMI, 2009.
Favre, S. , Dielmann, A. and Vinciarelli, A. , Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models , in: ACM International Conference on Multimedia, To Appear, 2009.
Favre, S. , Social Network Analysis in Multimedia Indexing: Making Sense of People in Multiparty Recordings , in: Proceedings of the Doctoral Consortium of the International Conference on Affective Computing \& Intelligent Interaction (ACII), pages 25-32, 2009.
Ferrari, V. , Marin, M. and and A. Zisserman, , 2D Human Pose Estimation in TV Shows , in: Statistical and Geometrical Approaches to Visual Motion Analysis, pages 128-147, Springer, 2009.
Fleuret, F. , Multi-layer boosting for pattern recognition , in: Pattern Recognition Letters (PRL), volume 30, pages 237-241, 2009.
Friedland, G. , Vinyals, O. , Huang, Y. and Muller, C. , Fusion of short-term and long-term features for improved speaker diarization , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4077-4080, 2009.
Friedland, G. , Hung, H. and Yeo, C. , Multi-modal speaker diarization of real-world meetings using compressed-domain video features , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4069-4072, 2009.
Friedland, G. , Hung, H. and Yeo, Chuohao , MULTI-MODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING COMPRESSED-DOMAIN VIDEO FEATURES , in: International Conference on Audio, Speech and Signal Processing, 2009.
Friedland, G. , Vinyals, O. , Huang, Y. and Muller, C. , Prosodic and other long-term features for speaker diarization , in: IEEE Transactions on Audio, Speech and Language Processing, volume 17, number 5, pages 985-993, 2009.
Friedland, G. and van Leeuwen, D. , Speaker diarization and identification , IEEE Press/Wiley, 2009.
Friedland, G. , Yeo, C. and Hung, H. , Visual Speaker Localization Aided by Acoustic Models , in: ACM Multimedia, 2009.
Friedland, G. , Yeo, C. and Hung, H. , Visual speaker localization aided by acoustic models (full paper) , in: Proceedings of ACM Multimedia, Beijing, China, 2009.
Frinken, V. and Bunke, H. , Evaluating retraining rules for semi-supervised learning in neural network based cursive word recognition , in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 31-35, 2009.
Frinken, V. , Riesen, K. and Bunke, H. , Improving graph classification by isomap , in: Graph-Based Representations in Pattern Recognition, pages 205-214, Springer, 2009.
Frinken, V. and Bunke, H. , Self-training strategies for handwriting word recognition , in: Proc. Industrial Conf. Advances in Data Mining. Applications and Theoretical Aspects, pages 291-300, Springer, 2009.
Galbally, J. , McCool, C. , Fierrez, J. , Marcel, S. and Ortega-Garcia, J. , Hill-Climbing Attack to an Eigenface-Based Face Verification System , in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009.
Galbally, J. , McCool, C. , Fierrez, J. , Marcel, S. and Ortega-Garcia, J. , On the vulnerability of face verification systems to hill-climbing attacks , in: Pattern Recognition, 2009.
Gall, J. and Lempitsky, V. , Class-Specific Hough Forests for Object Detection , in: IEEE Conference on Computer Vision and Pattern Recognition, 2009.
Gall, J. , Stoll, C. , de Aguiar, E. , Theobalt, C. , Rosenhahn, B. and Seidel, H. -P. , Motion Capture Using Joint Skeleton Tracking and Surface Estimation , in: IEEE Conference on Computer Vision and Pattern Recognition, 2009.
Gammeter, S. , Bossard, L. , Quack, T. and Gool, L. Van , I know what you did last summer: object-level auto-annotation of holiday snaps , in: International Conference on Computer Vision, 2009.
Ganapathy, S. , Thomas, S. , Motlicek, P. and Hermansky, H. , APPLICATIONS OF SIGNAL ANALYSIS USING AUTOREGRESSIVE MODELS FOR AMPLITUDE MODULATION , in: IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009, WASPAA '09., pages 341-344, IEEE, Mohonk Mountain House, New Paltz, New York, USA, 2009.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, pages 355-362, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , MDCT for Encoding Residual Signals in Frequency Domain Linear Prediction , in: Audio Engineering Society (AES), 127th Convention, Audio Engineering Society, 60 East 42nd Street, New York, New York 10165-2520, USA;, Audio Engineering Society (AES), 2009.
Garau, G. , Ba, S. , Bourlard, H. and Odobez, J. -M. , Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation , in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009.
Garg, N. , Favre, B. , Riedhammer, K. and Hakkani-Tur, D. , Clusterrank: a graph based method for meeting summarization , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Garg, N. , Co-occurrence Models for Image Annotation and Retrieval , number Idiap-RR-22-2009, 2009.
Garg, N. and Gatica-Perez, D. , Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr , number Idiap-RR-21-2009, 2009.
Garner, P. N. , A MAP Approach to Noise Compensation of Speech , number Idiap-RR-08-2009, 2009.
Garner, P. N. , Dines, J. , Hain, T. , El Hannani, A. , Karafiat, M. , Korchagin, D. , Lincoln, M. , Wan, V. and Zhang, L. , Real-Time ASR from Meetings , in: Proceedings of Interspeech, Brighton, UK., 2009.
Garner, P. N. and Garner, Philip N. , SNR Features for Automatic Speech Recognition , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
Garner, Philip N. , SNR Features for Automatic Speech Recognition , number Idiap-RR-25-2009, 2009.
Gass, T. , Deselaers, T. and Ney, H. , Deformation-aware Log-Linear Models , in: Deutsche Arbeitsgemeinschaft für Mustererkennung Symposium, 2009.
Gatica-Perez, D. , Automatic nonverbal analysis of social interaction in small groups: a review , in: Image and Vision Computing, Special Issue on Human Naturalistic Behavior, in press, 2009.
Gatica-Perez, D. , Modeling interest in face-to-face conversations from multimodal nonverbal behavior , in: In J.-P. Thiran, H. Bourlard, and F. Marques, (Eds.), Multimodal Signal Processing, Academic Press, Academic Press, 2009.
Gehler, Peter and Schölkopf, Bernhard , An introduction to kernel learning algorithms , in: Kernel Methods for Remote Sensing Data Analysis, pages 39-60, Wiley, 2009.
Gehler, Peter and Nowozin, Sebastian , Let the Kernel Figure it Out: Principled Learning of Pre-processing for Kernel Classifiers , in: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2009.
Gehler, Peter and Nowozin, Sebastian , On Feature Combination for Multiclass Object Classification , in: Proceedings of the Twelfth IEEE International Conference on Computer Vision, 2009.
Gelbart, D. , Morgan, N. and Tsymbal, A. , Hill-climbing feature selection for multi-stream asr , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Gillick, D. , Riedhammer, K. , Favre, B. and Hakkani-Tur, D. , A global optimization framework for meeting summarization , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
Gonzalez, G. , Fleuret, F. and Fua, P. , Learning rotational features for filament detection , in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2009.
Gonzalez, G. , Aguet, F. , Fleuret, F. , Unser, M. and Fua, P. , Steerable features for statistical 3d dendrite detection , in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2009.
Gottlieb, L. and Friedland, G. , On the use of artificial conversation data for speaker recognition in cars , in: IEEE International Conference for Semantic Computing, Berkeley, USA, 2009.
Graves, A. , Liwicki, M. , Fernandez, S. , Bertolami, R. , Bunke, H. and Schmidhuber, J. , A novel connectionist system for unconstrained handwriting recognition , in: IEEE Trans. PAMI, volume 31, number 5, pages 855-869, ISSN 0162-8828, 2009.
Gui, L. , Thiran, J. -Ph. and Paragios, N. , Cooperative Object Segmentation and Behavior Inference in Image Sequences , in: International Journal of Computer Vision, volume 84, number 2, pages 146-162, 2009.
Gurban, M. and Thiran, J. -Ph. , Information theoretic feature extraction for audio-visual speech recognition , in: IEEE Trans. on Signal Processing, volume in press, 2009.
Haegler, S. , Müller, P. and Gool, Luc Van , Procedural Modeling for Digital Cultural Heritage , in: EURASIP Journal on Image and Video Processing, volume 2009, 2009.
Hakkani-Tur, D. , Towards automatic argument diagramming of multiparty meetings , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
Hamer, Henning , Schindler, K. , Koller-Meier, E. and Gool, Luc Van , Tracking a Hand Manipulating an Object , in: IEEE International Conference on Computer Vision, 2009.
Hasler, N. , Rosenhahn, B. , Thormählen, T. , Wand, M. , Gall, J. and Seidel, H. -P. , Markerless Motion Capture with Unsynchronized Moving Cameras , in: IEEE Conference on Computer Vision and Pattern Recognition, 2009.
Heusch, G. and Marcel, S. , A novel statistical generative model dedicated to face recognition , in: Image \& Vision Computing, number Idiap-RR-39-2007, 2009.
Heusch, G. , Bayesian Networks as Generative Models for Face Recognition , EPFL, 2009.
Heusch, G. and Marcel, S. , Bayesian Networks to Combine Intensity and Color Information in Face Recognition , number Idiap-RR-27-2009, 2009.
Heusch, G. and Marcel, S. , Bayesian Networks to Combine Intensity and Color Information in Face Recognition , in: International Conference on Biometrics, pages 414-423, Springer, 2009.
Humm, A. , Hennebert, J. and Ingold, R. , Combined handwriting and speech modalities for user authentication , in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 39, 2009.
Humm, A. , Ingold, R. and Hennebert, J. , Spoken handwriting for user authentication using joint modelling systems , in: Proceedings of 6th International Symposium on Image and Signal Processing and Analysis (ISPA'09), 2009.
Hung, H. and Ba, S. , Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features , number Idiap-RR-20-2009, 2009.
Imseng, D. , Novel initialization methods for Speaker Diarization , number Idiap-RR-07-2009, 2009.
Imseng, D. and Friedland, G. , Robust Speaker Diarization for Short Speech Recordings , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
Indermühle, E. , Liwicki, M. and Bunke, H. , Combining alignment results for historical handwritten document analysis , in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 1186-1190, 2009.
Ivanov, I. , Dufaux, F. , Ha, T. M. and Ebrahimi, T. , Towards Generic Detection of Unusual Events in Video Surveillance , in: 6th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSSâ09), Genoa, Italy, 2009.
Jayagopi, D. , Bogdan, R. and Gatica-Perez, D. , Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour , in: Proceedings ICME 2009, 2009.
Jayagopi, D. and Gatica-Perez, D. , Discovering group nonverbal conversational patterns with topics , in: accepted for publication in Proc. ICMI-MLMI, 2009.
Jayagopi, D. , Modeling dominance in group conversations using nonverbal activity cues , in: IEEE Trans. on Audio, Speech, and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, volume 17, pages 501-513, 2009.
Jie, L. , Caputo, Barbara and Ferrari, V. , Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation , in: Advances in Neural Information Processing Systems, 2009.
Kaplan, F , Do-Lenh, S , Bachour, K , Kao, G. Y , Gault, C and Dillenbourg, P , Interpersonal Computers for Higher Education , in: Interactive Artifacts and Furniture Supporting Collaborative Work and Learning, pages 129-145, Springer US, 2009.
Keshet, J. , Grangier, D. and Bengio, S. , Discriminative Keyword Spotting , in: Speech Communication, volume 51, number 4, pages 317-329, 2009.
Kierkels, J. , Soleymani, M. and Pun, T. , Identification of narrative peaks in clips: text features perform best , in: VideoCLEF 2009, Cross Language Evaluation Forum (CLEF) Workshop, ECDL 200, 2009.
Kierkels, J. , Soleymani, M. and Pun, T. , Queries and tags in affect-based multimedia retrieval , in: International Conference on Multimedia and Expo, Special Session on Implicit Tagging, 2009.
Kierkels, J. and Pun, T. , Simultaneous exploitation of explicit and implicit tags in affect-based multimedia retrieval , in: International Conference on Affective Computing and Intelligent Interaction, pages 274-279, 2009.
Korchagin, D. , Garner, P. N. and Dines, J. , Automatic Temporal Alignment of AV Data , number Idiap-RR-39-2009, 2009.
Korchagin, D. , Garner, P. N. and Dines, J. , Automatic Temporal Alignment of AV Data with Confidence Estimation , number Idiap-RR-40-2009, 2009.
Korchagin, D. , Memoirs of Togetherness from Audio Logs , in: Proceedings International ICST Conference on User Centric Media, Venice, Italy, 2009.
Korchagin, D. , Multimodal Data Flow Controller , number Idiap-Com-01-2009, 2009.
Korchagin, D. , Out-of-Scene AV Data Detection , in: Proceedings IADIS International Conference Applied Computing, pages 244-248, Rome, Italy, 2009.
Korchagin, D. , Out-of-Scene AV Data Detection , number Idiap-RR-31-2009, 2009.
Koval, O. , Voloshynovskiy, S. , Caire, F. and Bas, P. , On security threats for robust perceptual hashin , in: Electronic Imaging 2009, 2009.
Kryszczuk, K. and Drygajlo, A. , Improving biometric verification with class-independent quality information , pages 310-321, 2009.
Kryszczuk, K. and Drygajlo, A. , Improving biometric verification with class-independent quality information , in: IET Signal Processing, Special Issue on Biometric Recognition, volume 3, number 4, pages 310-321, 2009.
Kumatani, K. , McDonough, J. , Rauch, Barbara , Klakow, D. , Garner, P. N. and Li, Weifeng , Beamforming with a Maximum Negentropy Criterion , in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 5, pages 994-1008, 2009.
Kumatani, K. , McDonough, J. , Rauch, B. , Garner, P. N. , Li, W. and Dines, J. , Maximum kurtosis beamforming with the generalized sidelobe canceller , in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2009.
Lalanne, D. , Nigay, L. , Palanque, P. , Robinson, P. , Vanderdonckt, J. and Ladry, J. -F. , Fusion engines for multimodal interfaces: a survey , in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
Lalanne, D. and Kholas, J. , Human machine interaction , 2009.
Le, Q. A. and Popescu-Belis, A. , Automatic vs. human question answering over multimedia meeting recordings , in: Interspeech 2009 (10th Annual Conference of the International Speech Communication Association), 2009.
Lee, J. -S. and Ebrahimi, T. , Efficient video coding in H.264/AVC by using audio-visual information , in: Proceedings of the IEEE International Workshop on Multimedia Signal Processing, 2009.
Lee, J. -S. , De Simone, F. and Ebrahimi, T. , Video coding based on audio-visual attention , in: IEEE International Conference on Multimedia and Expo (ICME'09), New York, USA, 2009.
Lefèvre, S. and Odobez, J. -M. , Structure and appearance features for robust 3d facial actions tracking , in: International Conference on Multimedia and Expo (ICME), 2009.
Lehmann, Alain , Leibe, B. and Gool, Luc Van , Feature-Centric Efficient Subwindow Search , in: IEEE International Conference on Computer Vision, 2009.
Lehmann, Alain , Leibe, B. and Gool, Luc Van , PRISM: PRincipled Implicit Shape Model , in: British Machine Vision Conference, 2009.
Li, W. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition , in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
Luo, J. , Orabona, F. and Caputo, B. , An online framework for learning novel concepts over multiple cues , in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009.
Luo, J. , Caputo, B. and Ferrari, V. , Who's Doing What: Joint Modeling of Names and Verbs for Simultaneous Face and Pose Annotation , in: Advances in Neural Information Processing Systems 22 (NIPS09), MIT Press, NIPS Foundation, Vancouver, B.C., Canada, 2009.
Magimai-Doss, M. , Aradilla, G. and Bourlard, H. , On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR , number Idiap-RR-24-2009, 2009.
Marchand-Maillet, S. , Szekely, E. and Bruno, E. , Optimizing strategies for the exploration of social-networks and associated data collections , in: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'09) - Special session on "People, Pixels, Peers: Interactive Content in Social Networks", 2009.
McCool, C. and Marcel, S. , MOBIO Database for the ICPR 2010 Face and Speech Competition , number Idiap-Com-02-2009, 2009.
McCool, C. and Marcel, S. , Parts-Based Face Verification using Local Frequency Bands , in: in Proceedings of IEEE/IAPR International Conference on Biometrics, 2009.
Mekhaldi, Dalila and Lalanne, D. , Joining Meeting Documents to Strengthen Multimodal Thematic Alignment , in: Proceedings of 5th International Conference on Signal Image Technology and Internet Based Systems (SITIS 2009), pages 88-96, 2009.
Monay, F. , Quelhas, P. , Odobez, J. -M. and Gatica-Perez, D. , Contextual classification of image patches with latent aspect models , in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009.
Morrison, D. , Bruno, E. and Marchand-Maillet, S. , capturing the semantics of user interaction: a review and case study , in: Emergent Web Intelligence, Springer, 2009.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Modelling long-term relevance feedback , in: Proceedings of the ECIR Workshop on Information Retrieval over Social Networks, 2009.
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec , in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
Motlicek, P. , Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices , in: 10thAnnual Conference of the International Speech Communication Association, pages 1215-1218, ISCA, Brighton, England, 2009.
Motlicek, P. , Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices , in: 10thAnnual Conference of the International Speech Communication Association, ISCA, 2009.
Nater, Fabian , Grabner, Helmut , Jaeggli, T. and Gool, Luc Van , Tracker Trees for Unusual Event Detectio , in: IEEE International Workshop on Visual Surveillance, 2009.
Negoescu, R. -A. , Adams, B. , Phung, D. , Venkatesh, S. and Gatica-Perez, D. , Flickr Hypergroups , in: Proceedings of the 17th ACM International Conference on Multimedia, 2009.
Noceti, N. , Caputo, B. , Castellini, C. , Baldassarre, L. , Barla, A. , Rosasco, L. , Odone, F. and Sandini, G. , Towards a theoretical framework for learning multi-modal patterns for embodied agents , in: International Conference on Image Analysis and Processing, 2009.
Nuessli, Marc-Antoine , Jermann, Patrick , Sangin, Mirweis and Dillenbourg, Pierre , Collaboration and abstract representations: towards predictive models based on raw speech and eye-tracking data , in: CSCL '09: Proceedings of the 2009 conference on Computer support for collaborative learning, International Society of the Learning Sciences, Rhodes, 2009.
Orabona, F. , Caputo, B. , Fillbrandt, A. and Ohl, F. , A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems , in: International Conference on Developmental Learning, 2009.
Orabona, F. , Keshet, J. and Caputo, B. , Bounded kernel-based perceptrons , in: Journal of Machine Learning Research, volume Accepted for pub, 2009.
Orabona, F. , Castellini, C. , Caputo, B. , Fiorilla, A. E. and Sandini, G. , Model adaptation with least-square SVM for adaptive hand prosthetics , in: IEEE International conference on Robotics and Automation, 2009.
Orabona, F. , Castellini, C. , Caputo, B. , Fiorilla, A. E. and Sandini, G. , Model Adaptation with Least-Squares SVM for Adaptive Hand Prosthetics , number Idiap-RR-05-2009, 2009.
Orabona, F. , Castellini, C. , Caputo, B. , Luo, J. and Sandini, G. , Towards Life-long Learning for Cognitive Systems: Online Independent Support Vector Machine , in: Pattern Recognition, volume Accepted for Pub, 2009.
Ortega-Garcia, J. , Fierrez, J. , Alonso-Fernandez, F. , Galbally, J. , M. R. Freire, , Gonzalez-Rodriguez, J. , Garcia-Mateo, C. , Alba-Castro, J. -L. , E. Gonzalez-Agulla, , E. Otero-Muras, , S. Garcia-Salicetti, , L. Allano, , B. Ly-Van, , B. Dorizzi, , Kittler, J. , Bourlai, T. , Poh, N. , Deravi, F. , M. W. R. Ng, , M. Fairhurst, , Hennebert, J. , Humm, A. , M. Tistarelli, , L. Brodo, , Richiardi, J. , Drygajlo, A. , H. Ganster, , F. M. Sukno, , Pavani, S. -K. , A. Frangi, , L. Akarun, and A. Savran, , The multi-scenario multi-environment biosecure multimodal database (bmdb) , in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 2009.
Pantic, M. and Vinciarelli, A. , Implicit Human Centered Tagging , in: IEEE Signal Processing Magazine, volume 26, 2009.
Park, I. K. , Germann, M. , Breitenstein, M. D. and Pfister, H. , Fast and Automatic Object Pose Estimation for Range Images on the GPU , in: Machine Vision and Applications, 2009.
Parthasarathi, S. H. K. , Magimai-Doss, M. , Bourlard, H. and Gatica-Perez, D. , Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations , in: Proceedings of Interspeech 2009, 2009.
Parthasarathi, S. H. K. , Magimai-Doss, M. , Gatica-Perez, D. and Bourlard, H. , Speaker Change Detection with Privacy-Preserving Audio Cues , in: Proceedings of ICMI-MLMI 2009, 2009.
Pellegrini, S. , Ess, A. , Schindler, K. and van Gool, L. , You'll Never Walk Alone: Modeling Social Behavior for Multi-target Tracking , in: International Conference on Computer Vision, 2009.
Perrin, X. , Chavarriaga, R. , Pradalier, C. , Millán, J. del R. and Siegwart, R. , Dialog Management Technique for Brain-Computer Interfaces , 2009.
Perrin, X. , Colas, F. , Pradalier, C. and Siegwart, R. , Learning human habits and reactions to external events with a dynamic Bayesian network , 2009.
Perrin, X. , Colas, F. , Pradalier, C. and Siegwart, R. , Learning to identify users and predict their destination in a robotic guidance application , in: Field and Service Robotics (FSR), 2009.
Picart, B. , Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity , number Idiap-RR-18-2009, 2009.
Pinto, J. P. , Magimai-Doss, M. and Bourlard, H. , MLP Based Hierarchical System for Task Adaptation in ASR , in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, pages 365-370, Merano, Italy, 2009.
Pinto, J. P. , Sivaram, G. S. V. S. , Hermansky, H. and Magimai-Doss, M. , Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator , in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
Popescu-Belis, A. , Poller, P. , Kilgour, J. , Boertjes, E. , Carletta, J. , Castronovo, S. , Fapso, M. , Flynn, M. , Nanchen, A. , Wilson, T. , Wit, J. de and Yazdani, M. , A multimedia retrieval system using speech input , in: ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), 2009.
Popescu-Belis, A. , Carletta, J. , Kilgour, J. and Poller, P. , Accessing a large multimodal corpus using an automatic content linking device , in: Multimodal Corpora, Springer-Verlag, 2009.
Popescu-Belis, A. , Comparing meeting browsers using a task-based evaluation method , number Idiap-RR-11-2009, 2009.
Popescu-Belis, A. and Vinciarelli, A. , Multimedia meeting processing and retrieval at the idiap research institute , in: Informer (Newsletter of the BCS Information Retrieval Specialist Group), volume 29, pages 14-16, 2009.
Popescu-Belis, A. , Poller, P. , Kilgour, J. , Flynn, M. , Germesin, Sebastian , Nanchen, A. and Yazdani, M. , User Interface Design in a Just-in-time Retrieval System for Meetings , number Idiap-RR-38-2009, 2009.
Pronobis, M. and Magimai-Doss, M. , Analysis of F0 and Cepstral Features for Robust Automatic Gender Recognition , number Idiap-RR-30-2009, 2009.
Pronobis, A. and Caputo, B. , COLD: The COsy Localization Database , in: International Journal of Robotics Research, volume 28, number 5, pages 588-594, 2009.
Raducanu, B. and Gatica-Perez, D. , You are fired! Nonverbal role analysis in competitive meetings , in: Proc. ICASSP, Taiwan, 2009.
Rajan, P. , Rajan, Padmanabhan , Parthasarathi, S. H. K. , Parthasarathi, Sree Hari Krishnan , Murthy, H. and Murthy, Hema A , Robustness of Phase based Features for Speaker Recognition , in: Proceedings of Interspeech, 2009.
Rajan, Padmanabhan , Parthasarathi, Sree Hari Krishnan and Murthy, Hema A , Robustness of Phase based Features for Speaker Recognition , number Idiap-RR-14-2009, 2009.
Ricci, E. and Odobez, J. -M. , Learning Large Margin Likelihood for Realtime Head Pose Tracking , in: IEEE Int. Conference on Image Processing, Cairo, Egypt, IEEE, 2009.
Ricci, E. and Odobez, J. -M. , Real-time simultaneous head tracking and pose estimation , in: IEEE International Conference on Image Processing (ICIP), 2009.
Richiardi, J. , Drygajlo, A. and Kryszczuk, K. , Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification , pages 140-149, 2009.
Richiardi, J. , Kryszczuk, K. and Drygajlo, A. , Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification , in: Advances in Biometrics, Lecture Notes in Computer Science 5558, pages 1200-1208, 2009.
Roman-Rangel, Edgar , Pallan, Carlos , Odobez, J. -M. and Gatica-Perez, D. , Retrieving Ancient Maya Glyphs with Shape Context , in: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, IEEE, Kyoto, Japan, 2009.
Roth, D. , Koller-Meier, E. and Gool, Luc Van , Multi-object tracking evaluated on sparse events , in: Multimedia Tools and Applications, 2009.
Roy, A. and Marcel, S. , Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection , number Idiap-RR-28-2009, 2009.
Roy, A. and Marcel, S. , Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection , in: British Machine Vision Conference 2009, 2009.
Salamin, H. , Favre, S. and Vinciarelli, A. , Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction , in: IEEE Transactions on Multimedia, To Appear, 2009.
Sanchez-Cortes, Dairazalia , Jayagopi, D. and Gatica-Perez, D. , Predicting Remote Versus Collocated Group Interactions using Nonverbal Cues , in: Proc. Int. Conf. on Multimodal Interfaces, Workshop on Multimodal Sensor-Based Systems and Mobile Phones for Social Computing,, Cambridge, 2009. [DOI]
Scaringella, N. , On the design of audio features robust to the album-effect for music information retrieval. , Ecole Polytechnique Fédérale de Lausanne, 2009.
Shaheen, M. , Gall, J. , Strzodka, R. , Gool, L. Van and Seidel, H. -P. , A Comparison of 3D Model-based Tracking Approaches for Human Motion Capture in Uncontrolled Environments , in: IEEE Workshop on Applications of Computer Vision, 2009.
De Simone, F. , Dufaux, F. , Ebrahimi, T. , Delogu, C. and Baroncini, V. , A subjective study of the influence of color information on visual quality assessment of high resolution pictures , in: Fourth International Workshop on Video Processing and Quality Metrics for Consumer Electronics (VPQM-09), Scottsdale, Arizona, USA, 2009.
Simone, F. De , Goldmann, L. , Baroncini, V. and Ebrahimi, T. , Subjective evaluation of JPEG XR image compression , in: Proceedings of SPIE, 2009.
Soldo, Serena , Magimai-Doss, M. , Pinto, J. P. and Bourlard, H. , On MLP-based Posterior Features for Template-based ASR , number Idiap-RR-37-2009, 2009.
Soleymani, M. , Kierkels, J. , Chanel, G. and Pun, T. , A Bayesian framework for video affective representation , in: International Conference on Affective Computing and Intelligent Interaction, pages 267-273, 2009.
Soleymani, M. , Davis, J. and Pun, T. , A collaborative personalized affective video retrieval system , in: International Conference on Affective Computing and Intelligent Interaction, pages 588-589, 2009.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , affective characterization of movie scenes based on content analysis and physiological changes , in: To appear in International Journal of Semantic Computing, 2009.
Sproewitz, Alexander , Billard, A. , Dillenbourg, Pierre and Ijspeert, Auke Jan , Roombots-Mechanical Design of Self-Reconfiguring Modular Robots for Adaptive Furniture , in: Proceedings of 2009 IEEE International Conference on Robotics and Automation, pages 4259-4264, Kobe, Japan, 2009. [DOI]
Stalder, S. , Grabner, H. and Gool, L. Van , Beyond Semi-Supervised Tracking: Tracking Should Be as Simple as Detection, but not Simpler than Recognition , in: OLCV 09: 3rd On-line learning for Computer Vision Workshop, 2009.
Thiran, J. -Ph. , Bourlard, H. and Marques, F. , Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems , Academic Press, ISBN 0-1237-4825-9, 2009.
Thomas, S. , Ganapathy, S. and Hermansky, H. , Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features , number Idiap-RR-04-2009, 2009.
Thomas, A. , Ferrari, V. , Leibe, B. , Tuytelaars, T. and Gool, L. Van , Shape-from-Recognition: Recognition enables Meta-data Transfer , in: Computer Vision and Image Understanding, volume 113, number 12, pages 1222-1234, 2009.
Thomas, A. , Ferrari, V. , Leibe, B. , Tuytelaars, T. and Gool, L. Van , Using Multi-view Recognition to Guide a Robot , in: International Journal of Robotics Research, volume 28, number 8, pages 976-998, 2009.
Tommasi, T. and Caputo, B. , The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories , in: BMVC, 2009.
Ullah, M. M. , Orabona, F. and Caputo, B. , You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism , in: International Conference on Robotic and Systems, 2009.
Vajda, P. , Goldmann, L. and Ebrahimi, T. , Analysis of the limits of graph-based object duplicate detection , in: Prooceedings of the IEEE International Symposium on Multimedia, pages 600-605, 2009.
Valente, F. , A Novel Criterion for Classifiers Combination in Multistream Speech Recognition , in: IEEE Signal Processing Letters, volume 16, number 7, pages 561-564, ISSN 1070-9908, 2009. [DOI]
Valente, F. , Magimai-Doss, M. , Plahl, C. and Suman, R. , Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system , in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009.
Varadarajan, Jagannadan and Odobez, J. -M. , Topic Models for Scene Analysis and Abnormality Detection , in: 9th International Workshop in Visual Surveillance, IEEE, IEEE, Kyoto, Japan, 2009.
Vijayasenan, D. , Valente, F. and Bourlard, H. , An Information Theoretic Approach to Speaker Diarization of Meeting Data , in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 7, pages 1382-1393, 2009. [DOI]
Vijayasenan, D. , Valente, F. and Bourlard, H. , KL Realignment for Speaker Diarization with Multiple Feature Streams , in: 10th Annual Conference of the International Speech Communication Association, 2009.
Vijayasenan, D. , Valente, F. and Bourlard, H. , MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA , in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Mutual Information based Channel Selection for Speaker Diarization of Meetings Data , in: Proceedings of International conference on acoustics speech and signal processing, 2009.
Vinciarelli, A. , Dielmann, A. , Favre, S. and Salamin, H. , Canal9: A database of political debates for analysis of social interactions , in: Proceedings of the International Conference on Affective Computing and Intelligent Interaction (IEEE International Workshop on Social Signal Processing), pages 1-4, Amsterdam, Netherlands, 2009. [DOI]
Vinciarelli, A. , Capturing Order in Social Interactions , in: IEEE Signal Processing Magazine, 2009.
Vinciarelli, A. , Suditu, N. and Pantic, M. , Implicit Human Centered Tagging , in: Proceedings of IEEE Conference on Multimedia and Expo, pages 1428-1431, 2009.
Vinciarelli, A. , Pantic, M. and Bourlard, H. , Social Signal Processing: Survey of an Emerging Domain , in: Image and Vision Computing, 2009.
Voloshynovskiy, S. , Koval, O. , Beekhof, F. and Holotyak, T. , Binary robust hashing based on probabilistic bit reliability , in: IEEE Workshop on Statistical Signal Processing 2009, 2009.
Voloshynovskiy, S. , Koval, O. , Beekhof, F. and Pun, T. , Random projections based item authentication , in: Electronic Imaging 2009, 2009.
Weise, T. , Wismer, T. , Leibe, B. and Gool, L. Van , In-hand Scanning with Online Loop Closure , in: IEEE International Workshop on 3-D Digital Imaging and Modeling, 2009.
Weyand, T. , Deselaers, T. and Ney, H. , Log-Linear Mixtures for Object Recognition , in: British Machine Vision Conference, 2009.
Wu, L. , Hoi, S. C. , Jin, R. , Zhu, J. and Yu, N. , Distance Metric Learning from Uncertain Side Information with Application to Automated Photo Taggin , in: ACM Multimedia 2009, 2009.
Wuthrich, M. , Liwicki, M. , Fischer, A. , Indermühle, E. , Bunke, H. , Viehhauser, G. and Stolz, M. , Language model integration for the recognition of handwritten medieval documents , in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 211-215, 2009.
Wöllmer, M. , Eyben, F. , Keshet, J. , Graves, A. , Schuller, B. and Rigoll, G. , Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks , in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009.
Xie, S. , Favre, B. , Hakkani-Tur, D. and Liu, Y. , Leveraging sentence weights in a concept-based optimization framework for extractive meeting summarization , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Yao, J. and Odobez, J. -M. , Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets , number Idiap-RR-19-2009, 2009.
Yao, J. and Odobez, J. -M. , Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios , in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2009.
Yazdani, A. , Lee, J. -S. and Ebrahimi, T. , Implicit emotional tagging of multimedia using EEG signals and brain computer interface , in: Proceedings of the International Workshop on Social Media, pages 81-88, 2009.
Zhao, S. Y. , Ravuri, R. and Morgan, N. , Multi-stream to many-stream: using spectro-temporal features for asr , in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
Zhu, K. , Drygajlo, A. and Li, W. , Q-stack aging model for face verification , 2009.
Zhu, J. , Gool, L. Van and Hoi, S. C. , Unsupervised Face Alignment by Robust Nonrigid Mapping , in: ICCV2009, 2009.
Zufferey, Guillaume , Jermann, Patrick , Do Lenh, Son and Dillenbourg, Pierre , Using Augmentations as Bridges from Concrete to Abstract Representations , in: Proceedings of the 23rd British HCI Group Annual Conference on HCI 2009: Celebrating People and Technology, pages 130-139, British Computer Society, Cambridge (UK), 2009.
Keshet, J. and Chazan, D. , A Kernel Wrapper for Phoneme Sequence Recognition , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keshet, J. , Shalev-Shwartz, S. , Singer, Y. and Chazan, D. , A Large Margin Algorithm for Forced Alignment , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Keshet, J. , A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Grangier, D. , Keshet, J. and Bengio, S. , Discriminative Keyword Spotting , in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
Deville, B. , Bologna, G. , Vinckenbosch, M. and Pun, T. , See color: seeing colours with an orchestra , in: Human Machine Interaction: Research Results of the MMI Program, pages 251-279, Springer, 2009.
Deville, B. , Bologna, G. , Vinckenbosch, M. and Pun, T. , See Color: Seeing colours with an orchestra , in: Human Machine Interaction, Research Results of the MMI Program, pages 251-279, Springer LNCS, 2009.
Popescu-Belis, A. , Managing Multimodal Data, Metadata and Annotations: Challenges and Solutions , in: Multimodal Signal Processing for Human-Computer Interaction, pages 183-203, Elsevier / Academic Press, 2009.
YOU ARE FIRED! NONVERBAL ROLE ANALYSIS IN COMPETITIVE MEETINGS , in: Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), Taiwan., 2009.
Anemuller, J. , Back, J. -H. , Caputo, B. , Luo, J. , Ohl, F. , Orabona, F. , Vogels, R. , Weinshall, D. and Zweig, A. , Biologically Motivated Audio-Visual Cue Integration for Object , in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008.
Anemuller, J. , Back, J. -H. , Caputo, B. , Havlena, M. , Luo, J. , Kayser, H. , Leibe, B. , Motlicek, P. , Pajdla, T. , Pavel, M. , Torii, A. , van Gool, L. , Zweig, A. and Hermansky, H. , The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events , in: Proceedings of the International Conference on Multimodal Interfaces, 2008.
Aradilla, G. , Acoustic models for posterior features in speech recognition , Ecole Polytechnique Fédérale de Lausanne, 2008.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Posterior features applied to speech recognition tasks with limited training data , number Idiap-RR-15-2008, 2008.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Using kl-based acoustic models in a large vocabulary recognition task , number Idiap-RR-14-2008, 2008.
Ba, S. and Odobez, J. -M. , Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
Ba, S. and Odobez, J. -M. , Multi-person visual focus of attention from head pose and meeting contextual cues , number Idiap-RR-47-2008, 2008.
Ba, S. and Odobez, J. -M. , Multi-person visual focus of attention from head pose and meeting contextual cues , number 47, 2008.
Ba, S. and Odobez, J. -M. , Recognizing visual focus of attention from head pose in natural meetings , in: accepted for publication in IEEE Trans. on System, Man and Cybernetics: Part B, Man,, 2008.
Ba, S. and Odobez, J. -M. , Visual focus of attention estimation from head pose posterior probability distributions , in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2008.
Beekhof, F. , Voloshynovskiy, S. , Koval, O. and Villán, R. , Secure surface identification codes , in: Steganography, and Watermarking of Multimedia Contents X, 2008. [DOI]
Berclaz, J. , Fleuret, F. and Fua, P. , Multi-camera tracking and atypical motion detection with behavioral maps , in: The 10th European Conference on Computer Vision (ECCV 2008), Marseille, France, 2008.
Berclaz, J. , Fleuret, F. and Fua, P. , Multi-camera tracking and atypical motion detection with behavioral maps , in: Proceedings of the European Conference on Computer Vision (ECCV), pages 112-125, 2008.
Berclaz, J. , Fleuret, F. and Fua, P. , Principled Detection-by-classification from Multiple Views , in: proceedings of the International Conference on Computer Vision Theory and Applications, pages 375-382, 2008.
Bertolami, R. and Bunke, H. , Ensemble methods to improve the performance of an english handwritten text line recognizer , in: Arabic and Chinese Handwriting Recognition, pages 265-277, Springer, 2008.
Bertolami, R. and Bunke, H. , Hidden Markov model based ensemble methods for offline handwritten text line recognition , in: Pattern Recognition, volume 41, number 11, pages 3452-3460, 2008.
Bertolami, R. and Bunke, H. , Including language model information in the combination of handwritten text line recognizers , in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 25-30, 2008.
Bertolami, R. and Bunke, H. , Integration of n-gram language models in multiple classifier systems for offline handwritten text line recognition , in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 22, number 7, pages 1301-1321, 2008.
Bertolami, R. , Gutmann, C. , Spitz, L. and Bunke, H. , Shape code based lexicon reduction for offline handwriting recognition , in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 158-163, 2008.
Besson, P. , Popovici, V. , Vesin, J. M. , Thiran, J. -Ph. and Kunt, M. , Extraction of audio features specific to speech production for multimodal speaker detection , in: IEEE Transactions on Multimedia, volume 10, number 1, pages 63-73, 2008. [DOI]
Boakye, K. , Trueba-Hornero, B. , Vinyals, O. and Friedland, G. , Overlapped speech detection for improved speaker diarization in multiparty meetings , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Boakye, K. , Vinyals, O. and Friedland, G. , Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech , in: Interspeech, 2008.
Boakye, K. , Vinyals, O. and Friedland, G. , Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech , in: Interspeech 2008, Brisbane, Australia, pages 32-35, 2008.
Bologna, G. , Deville, B. , Vinckenbosch, M. and Pun, T. , a perceptual interface for vision substitution in a color matching experiment , in: Proceeding on IEEE IJCNN, IEEE World congress on computational intelligence, 2008.
Bologna, G. , Deville, B. , Vinckenbosch, M. and Pun, T. , Pairing colored socks and following a red serpentine with sounds of musical instruments , in: ICAD 08, International Conference on Auditory Displays, Paris, France, June 24--27, 2008.
Bourlard, H. , Chavarriaga, R. , Galán, F. and Millán, J. del R. , Characterizing the eeg correlates of exploratory behavior , in: IEEE Transactions on Neural Systems & Rehabilitation Engineering, 2008.
Bourlard, H. and Renals, S. , Recognition and understanding of meetings overview of the european ami and amida projects , in: LangTech 2008, Rome, 2008.
Breitenstein, M. D. , Kuettel, D. , Weise, T. , van Gool, L. and Pfister, H. , Real-time face pose estimation from single range images , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
Bruno, E. , Moënne-Loccoz, N. and Marchand-Maillet, S. , Design of multimodal dissimilarity spaces for retrieval of multimedia documents , in: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Bunke, H. , Dickinson, P. , Neuhaus, M. and Stettler, M. , Matching of hypergraphs -- algorithms, applications, and experiments , in: Applied Pattern Recognition, pages 131-154, Springer, 2008.
Camastra, F. and Vinciarelli, A. , Machine learning for audio, image and video analysis , Advanced Information and Knowledge Processing, volume XVI, Springer Verlag, ISBN 978-1-84800-006-3, 2008.
Caputo, B. , Class specific object recognition using kernel Gibbs distributions , in: ELectronic Letters on Computer vision and Image Analysis, volume 7, number 2, pages 96-109, 2008.
Carincotte, C. , Naturel, X. , Hick, M. , Odobez, J. -M. , Yao, J. , Bastide, A. and Corbucci, B. , Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis , in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Bejing, 2008.
Carreras, A. , Cordara, G. , Delgado, J. , Dufaux, F. , Francini, G. , Ha, T. M. , Rodriguez, E. and Tous, R. , A search and retrieval framework for the management of copyrighted audiovisual content , in: 50th International Symposium ELMAR 2008, Zadar, Croatia, 2008.
Chanel, G. , Rebetez, C. , Betrancourt, M. and Pun, T. , boredom, engagement and anxiety as indicators for adaptation to difficulty in games , in: ACM Mindtrek conference, 2008.
Chavarriaga, R. , Galán, F. and Millán, J. del R. , Asynchronous detection and classification of oscillatory brain activity , in: 16 European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
Cornelis, N. , Leibe, B. , Cornelis, K. and van Gool, L. , 3d urban scene modeling integrating recognition and reconstruction , in: International Journal of Computer Vision, volume 78, number 2-3, pages 121-141, 2008.
van den Berg, M. , Koller-Meier, E. and van Gool, L. , Fast body posture estimation using volumetric features , in: IEEE Visual Motion Computing (MOTION), 2008.
Deville, B. , Bologna, G. , Vinckenbosch, M. and Pun, T. , Guiding the focus of attention of blind people with visual saliency , in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), Satellite Workshop of theEuropean Conference on Computer Vision (ECCV 2008), Marseille, France, October 18, 2008.
Deville, B. , Bologna, G. , Vinckenbosch, M. and Pun, T. , guiding the focus of attention of blind people with visual saliency , in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), 2008.
Dollé, L. , Khamassi, M. , Girard, B. , Guillot, A. and Chavarriaga, R. , Analyzing interactions between navigation strategies using a computational model of action selection , in: Spatial Cognition 2008 (SC '08), pages 71-86, Freiburg, Germany, 2008.
Dufaux, F. and Ebrahimi, T. , H.264/AVC Video Scrambling for Privacy Protection , in: IEEE International Conference on Image Processing (ICIP2008), San Diego, 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Démonstration : hephaistk, une bo\^\ite à outils pour le prototypage d'interfaces multimodales , 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Demonstration : hephaistk, une bo\^\ite à outils pour le prototypage d'interfaces multimodales , in: Proceedings of 20e Conférence sur l'Interaction Homme-Machine (IHM 08), pages 215-216, 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Prototyping multimodal interfaces with smuiml modeling language , in: Proceedings of CHI 2008 Workshop on UIDLs for Next Generation User Interfaces (CHI 2008 workshop), pages 63-66, 2008.
Dumas, B. , Lalanne, D. and Ingold, R. , Prototyping multimodal interfaces with smuiml modeling language , pages 63-66, 2008.
Dumas, B. , Lalanne, D. , Guinard, D. , Koenig, R. and Ingold, R. , Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces , in: Proceedings of 2nd international conference on Tangible and Embedded Interaction (TEI 2008), pages 47-54, 2008.
Dumas, B. , Lalanne, D. , Guinard, D. , Koenig, R. and Ingold, R. , Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces , pages 47-54, 2008.
Dutoit, T. , Couvreur, L. and Bourlard, H. , How does a dictation machine recognize speech ? , in: Applied Signal Processing--A MATLAB approach, pages 104-148, Springer MA, 2008.
Ess, A. , Leibe, B. , Schindler, K. and van Gool, L. , A mobile vision system for robust multi-person tracking , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
Estrella, P. , Popescu-Belis, A. and King, M. , Improving contextual quality models for mt evaluation based on evaluators' feedback. , in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
Faria, A. and Morgan, N. , Corrected tandem features for acoustic model training , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Faria, A. and Morgan, N. , Corrected Tandem Features for Acoustic Model Training , in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
Faria, A. and Morgan, N. , When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features , in: Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, 2008.
Favre, B. , Grishman, R. , Hillard, D. , Ji, H. , Hakkani-Tur, D. and Ostendorf, M. , Punctuating speech for information extraction , in: IEEE ICASSP, Las Vegas, NV, 2008.
Favre, S. , Salamin, H. , Vinciarelli, A. , Hakkani-Tur, D. and Garg, N. , Role recognition for meeting participants: an approach based on lexical information and social network analysis , in: ACM International Conference on Multimedia, Vancouver, Canada, 2008.
Favre, S. , Salamin, H. and Vinciarelli, A. , Role recognition in multiparty recordings using social affiliation networks and discrete distributions , in: The Tenth International Conference on Multimodal Interfaces (ICMI 2008), Chania, Greece, 2008.
Ferrez, P. W. and Millán, J. del R. , Eeg-based brain-computer interaction: improved accuracy by automatic single-trial error detection , in: Advances in Neural Information Processing Systems 20, pages 441-448, Cambridge, MA, 2008.
Ferrez, P. W. and Millán, J. del R. , Error-related eeg potentials generated during simulated brain-computer interaction , in: IEEE Transactions on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008. [DOI]
Ferrez, P. W. and Millán, J. del R. , Error-Related EEG Potentials Generated During Simulated Brain-Computer Interaction , in: IEEE Trans. on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008.
Ferrez, P. W. and Millán, J. del R. , Simultaneous real-time detection of motor imagery and error-related potentials for improved bci accuracy , in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008.
Fleuret, F. , Berclaz, J. , Lengagne, R. and Fua, P. , Multi-Camera People Tracking with a Probabilistic Occupancy Map , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 30, number 2, pages 267-282, 2008.
Fleuret, F. and Geman, D. , Stationary features and cat detection , in: Journal of Machine Learning Research, 2008.
Fleuret, F. and Geman, D. , Stationary features and cat detection , in: Journal of Machine Learning Research (JMLR), volume 9, pages 2549-2578, 2008.
Friedland, G. and Vinyals, O. , Live speaker identification in conversations , in: ACM Multimedia 2008, Vancouver, Canada, pages 1017-1018, 2008.
Galán, F. , Nuttin, M. , Lew, E. , Ferrez, P. W. , Vanacker, G. , Philips, J. and Millán, J. del R. , A brain-actuated wheelchair: asynchronous and non-invasive brain-computer interfaces for continuous control of robots , in: Clinical Neurophysiology, number 119, pages 2159-2169, 2008.
Galán, F. , Nuttin, M. , Vanhooydonck, D. , Lew, E. , Ferrez, P. W. , Philips, J. and Millán, J. del R. , Continuous brain-actuated control of an intelligent wheelchair by human eeg , in: 4th International Brain-Computer Interface Workshop & Training Course, Graz University of Technology, Graz, Austria, 2008.
Galán, F. , Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs , University of Barcelona, 2008.
Gammeter, S. , Ess, A. , Jaeggli, T. , Leibe, B. , Schindler, K. and van Gool, L. , Articulated multibody tracking under egomotion , in: European Conference on Computer Vision (ECCV'08), Springer, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Autoregressive modelling of hilbert envelopes for wide-band audio coding , in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
Ganapathy, S. , Thomas, A. and Hermansky, H. , Front-end for far-field speech recognition based on frequency domain linear prediction , in: Interspeech 2008, Brisbane, Australia, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , number Idiap-RR-75-2008, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION , number Idiap-RR-74-2008, 2008.
Ganapathy, S. , Thomas, S. and Hermansky, H. , Modulation Frequency Features For Phoneme Recognition In Noisy Speech , in: Journal of Acoustical Society of America - Express Letters, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain , in: INTERSPEECH 2008, Brisbane, Australia, 2008.
Ganapathy, S. , Motlicek, P. , Hermansky, H. and Garudadri, H. , Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008. [DOI]
Garg, N. and Hakkani-Tur, D. , Speaker role detection in meetings using lexical information and social network analysis , in: Technical Report TR-08-004, International Computer Science Institute, Berkeley, CA, 2008.
Garipelli, G. , Chavarriaga, R. and Millán, J. del R. , Fast recognition of anticipation related potentials , in: IEEE Transactions on Biomedical Engineering, 2008.
Garipelli, G. , Chavarriaga, R. and Millán, J. del R. , Recognition of anticipatory behavior from human eeg , in: 4th Intl. Brain-Computer Interface Workshop and Training Course, Graz University, Austria, 2008.
Garner, P. N. , A weighted finite state transducer tutorial , number Idiap-Com-03-2008, 2008.
Garner, P. N. , Silence models in weighted finite-state transducers , in: Interspeech, Brisbane, Australia, 2008.
Gatica-Perez, D. and Farrahi, K. , Daily routine classification from mobile phone data , in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), Utrecht, The Netherlands, 2008.
Gatica-Perez, D. and Farrahi, K. , Discovering human routines from cell phone data with topic models , in: IEEE International Symposium on Wearable Computers (ISWC), Pittsburgh, Pennsylvania, 2008.
Gatica-Perez, D. and Farrahi, K. , What did you do today? discovering daily routines from large-scale mobile data , in: ACM International Conference on Multimedia (ACMMM), Vancouver, 2008.
Gillick, D. , Hakkani-Tur, D. and Levit, M. , Unsupervised learning of edit parameters for matching name variants , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Goldmann, L. , Adamek, T. , Vajda, P. , Karaman, M. , Mörzinger, R. , Galmar, E. , Sikora, T. , O'Connor, N. , Ha-Minh, T. , Ebrahimi, T. , Schallauer, P. and Huet, B. , Towards Fully Automatic Image Segmentation Evaluation , in: Advanced Concepts for Intelligent Vision Systems (ACIVS), Springer, Juan-les-Pins, 2008.
Gonzalez, G. , Fleuret, F. and Fua, P. , Automated delineation of dendritic networks in noisy image stacks , in: Proceedings of the European Conference on Computer Vision (ECCV), pages 214-227, 2008.
Gonzalez, G. , Fleuret, F. and Fua, P. , Automated delineation of dendritic networks in noisy image stacks , in: The 10th European Conference on Computer Vision, Marseille, France, 2008.
Grandjean, D. and Pun, T. , Multimodality in emotions and for their assessment , 2008.
Grandvalet, Y. , Rakotomamonjy, A. , Keshet, J. and Canu, S. , Support Vector Machines with a Reject Option , in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008.
Grangier, D. and Bengio, S. , A discriminative kernel-based model to rank images from text queries , in: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2008.
Grangier, D. , Machine Learning for Information Retrieval , École Polytechnique Fédérale de Lausanne, 2008.
Grossmann, E. , Gaspar, J. -A. and Orabona, F. , Calibration from statistical properties of the visual world , in: European Conf. on Computer Vision, 2008.
Gui, L. , Thiran, J. -Ph. and Paragios, N. , Cooperative object segmentation and behavior inference in image sequences , in: International Journal of Computer Vision, ISSN 0920-5691, 2008. [DOI]
Gurban, M. , Thiran, J. -Ph. , Drugman, T. and Dutoit, T. , Dynamic modality weighting for multi-stream HMMs in Audio-Visual Speech Recognition , in: 10th International Conference on Multimodal Interfaces, Chania, Greece, 2008.
Gurban, M. and Thiran, J. -Ph. , Using entropy as a stream reliability estimate for audio-visual speech recognition , in: 16th European Signal Processing Conference, Lausanne, Switzerland, 2008.
Hoffmann, U. , Vesin, J. M. , Ebrahimi, T. and Diserens, K. , An efficient p300-based brain-computer interface for disabled subjects , in: Journal of Neuroscience Methods, volume 167, number 1, pages 115-125, 2008. [DOI]
Hoffmann, U. , Yazdani, A. , Vesin, J. M. and Ebrahimi, T. , Bayesian feature selection applied in a p300 brain- computer interface , in: 16th European Signal Processing Conference, Lausanne, 2008.
Hoffmann, U. , Naruniec, J. , Yazdani, A. and Ebrahimi, T. , Face Detection Using Discrete Gabor Jets And Color Information , in: SIGMAP 2008 - International Conference on Signal Processing and Multimedia Applications, Porto, 2008.
Humm, A. , Hennebert, J. and Ingold, R. , Combined handwriting and speech modalities for user authentication , in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 38, 2008.
Humm, A. , Modelling combined handwriting and speech modalities for user authentication , University of Fribourg, Switzerland, 2008.
Humm, A. , Hennebert, J. and Ingold, R. , Spoken signature for user authentication , in: SPIE Journal of Electronic Imaging, volume 17, 2008.
Humm, A. , Hennebert, J. and Ingold, R. , Spoken signature for user authentication , in: SPIE Journal of Electronic Imaging, volume 17, 2008.
Hung, H. , Huang, Y. , Yeo, C. and Gatica-Perez, D. , Associating audio-visual activity cues in a dominance estimation framework , in: CVPR Workshop on Human Communicative Behavior, 2008.
Hung, H. , Huang, Y. , Friedland, G. and Gatica-Perez, D. , Estimating the dominant person in multi-party conversations using speaker diarization strategies , in: ICASSP 08, 2008.
Hung, H. , Huang, Y. , Friedland, G. and Gatica-Perez, D. , Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies , in: IEEE ICASSP, Las Vegas, NV, 2008.
Hung, H. and Gatica-Perez, D. , Identifying dominant people in meetings from audio-visual sensors , in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition, Special Session on Multimodal HCI for Smart Environments, 2008.
Hung, H. and Gatica-Perez, D. , Identifying dominant people in meetings from audio-visual sensors , in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition (FG), Special Session on Multi-Sensor HCI for Smart Environments, 2008.
Hung, H. , Jayagopi, D. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Investigating automatic dominance estimation in groups from visual attention and speaking activity , in: International Conference on Multimodal Interfaces (ICMI), 2008.
Hung, H. , Jayagopi, D. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Investigating automatic dominance estimation in groups from visual attention and speaking activity , in: Proc. ICMI, 2008.
Hung, H. and Friedland, G. , Towards audio-visual on-line diarization of participants in group meetings , in: European Conference on Computer Vision (ECCV) 2008, Marseille, France, 2008.
Indermühle, E. , Liwicki, M. and Bunke, H. , Recognition of handwritten historical documents: hmm -adaptation vs. writer specific training , in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 186-191, 2008.
Jayagopi, D. , Raducanu, B. and Gatica-Perez, D. , Characterizing conversational group dynamics using nonverbal behavior , in: Proc. IEEE Int. Conf. on Multimedia (ICME), 2008.
Jayagopi, D. , Hung, H. , Yeo, C. and Gatica-Perez, D. , Modeling dominance in group conversations from nonverbal activity cues , in: IEEE Trans. on Audio, Speech and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, accepted for publication, 2008.
Jayagopi, D. , Predicting the dominant clique in meetings through fusion of nonverbal cues , in: Proc. ACM Vancouver, Canada, 2008.
Jayagopi, D. , Hung, H. , Yeo, C. and Gatica-Perez, D. , Predicting the dominant clique in meetings through fusion of nonverbal cues , in: ACM MM 2008, Vancouver, Canada, 2008.
Jayagopi, D. , Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues , in: Proc. ICMI, 2008.
Jayagopi, D. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues , in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), Special Session on Social Signal Processing, 2008.
Kamangar, K. , Hakkani-Tur, D. , Tur, G. and Levit, M. , An iterative unsupervised learning method for information distillation , in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
Keshet, J. and Bengio, S. , Automatic speech and speaker recognition: large margin and kernel methods , John Wiley & Sons, 2008.
Ketabdar, H. and Bourlard, H. , Enhanced phone posteriors for improving speech recognition systems , number Idiap-RR-39-2008, 2008.
Ketabdar, H. , Enhancing posterior based speech recognition systems , Ecole Polytechnique Fédérale de Lausanne, 2008.
Ketabdar, H. and Bourlard, H. , Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation , in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
Ketabdar, H. and Bourlard, H. , In-context phone posteriors as complementary features for tandem asr , in: ICSLP'08, Brisbane, Australia,, 2008.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Can feature information interaction help for information fusion in multimedia problems? , in: First International Workshop on Metadata Mining for Image Understanding, pages 23-33, 2008.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Can feature information interaction help for information fusion in multimedia problems? , in: To appear in Multimedia Tools and Applications Journal special issue on "Metadata Mining for Image Understanding", 2008.
Kludas, J. , Marchand-Maillet, S. and Bruno, E. , Exploiting document feature interactions for efficient information fusion in high dimensional spaces , in: Proceedings of the First International Workshops on Image Processing Theory, Tools and Applications (IPTA'2008), 2008.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Exploiting synergistic and redundant features for multimedia document classification , in: 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), 2008.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Exploiting synergistic and redundant features for multimedia document classification , in: 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), 2008.
Knox, M. , Morgan, N. and Mirghafori, N. , Getting the last laugh: automatic laughter segmentation in meetings , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Knox, M. , Morgan, N. and Mirghafori, N. , Getting the last laugh: automatic laughter segmentation in meetings , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 797-800, 2008.
Kokiopoulou, E. , Frossard, P. and Verscheure, O. , Fast keyword detection with sparse time-frequency models , in: IEEE Int. Conf. on Multimedia & Expo (ICME), 2008.
Kokiopoulou, E. , Pirillos, S. and Frossard, P. , Graph-based classification for multiple observations of transformed patterns , in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
Kokiopoulou, E. and Frossard, P. , Minimum distance between pattern transformation manifolds: algorithm and applications , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Kokiopoulou, E. , Frossard, P. and Gkorou, D. , Optimal polynomial filtering for accelerating distributed consensus , in: IEEE Int. Symp. on Information Theory (ISIT), 2008.
Kokiopoulou, E. and Frossard, P. , Semantic coding by supervised dimensionality reduction , in: IEEE Transactions on Multimedia, volume 10, number 2, 2008.
Kosinov, S. and Pun, T. , Distance-based discriminant analysis method and its applications , in: Pattern Analysis and Applications, volume 11, number 3-4, pages 227-246, 2008.
Kosinov, S. , Bruno, E. and Marchand-Maillet, S. , Spatially-consistent partial matching for intra- and inter-image prototype selection , in: To appear in Signal Processing: Image Communication special issue on "Semantic Analysis for Interactive Multimedia Services", 2008.
Koval, O. , Voloshynovskiy, S. , Beekhof, F. and Pun, T. , Analysis of physical unclonable identification based on reference list decoding , in: Steganography, and Watermarking of Multimedia Contents X, 2008.
Koval, O. , Voloshynovskiy, S. and Pun, T. , Privacy-preserving multimodal person and object identification , in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
Koval, O. , Voloshynovskiy, S. , Caire, F. and Bas, P. , Privacy-preserving multimodal person and object identification , in: MM&Sec 2008, 2008.
Koval, O. , Voloshynovskiy, S. , Beekhof, F. and Pun, T. , Security analysis of robust perceptual hashing , in: Steganography, and Watermarking of Multimedia Contents X, 2008.
Kryszczuk, K. and Drygajlo, A. , Credence estimation and error prediction in biometric identity verification , in: Signal Processing, volume 88, number 4, pages 916-925, 2008.
Kryszczuk, K. and Drygajlo, A. , Impact of feature correlations on separation between bivariate normal distributions , 2008.
Kryszczuk, K. and Drygajlo, A. , Impact of feature correlations on separation between bivariate normal distributions , in: 19th International Conference on Pattern Recognition, 2008.
Kryszczuk, K. and Drygajlo, A. , On quality of quality measures for classification , in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 19-28, 2008.
Kryszczuk, K. and Drygajlo, A. , On quality of quality measures for classification , pages 19-28, Springer, 2008.
Kryszczuk, K. and Drygajlo, A. , What do quality measures predict in biometrics , pages -,-29, 2008.
Kryszczuk, K. and Drygajlo, A. , What do quality measures predict in biometrics , in: 16th European Signal Processing Conference, 2008.
Kumatani, K. , McDonough, J. , Klakow, D. , Garner, P. N. and Li, W. , Adaptive beamforming with a maximum negentropy criterion, , in: The Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2008.
Kumatani, K. , McDonough, J. , Rauch, B. , Klakow, D. , Garner, P. N. and Li, W. , Beamforming with a Maximum Negentropy Criterion , in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 5, pages 994-1008, 2008.
Kumatani, K. , McDonough, J. , Schacht, S. , Klakow, D. , Garner, P. N. and Li, W. , Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming , in: International Conferance on Acoustics Speech and Signal Processing, 2008.
Kumatani, K. , McDonough, J. , Schacht, S. , Klakow, D. , Garner, P. N. and Li, W. , Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition , number Idiap-RR-02-2008, 2008.
Kumatani, K. , McDonough, J. , Klakow, D. , Garner, P. N. and Li, W. , Maximum negentropy beamforming , number Idiap-RR-07-2008, 2008.
Lalanne, D. , Rigamonti, M. , Ingold, R. , Evéquoz, F. and Dumas, B. , An ego-centric and tangible approach to meeting indexing and browsing , Lecture Notes in Computer Science, volume Volume 4892, Springer Berlin / Heidelberg, ISBN 978-3-540-78154-7, 2008. [DOI]
Leibe, B. , Schindler, K. , Cornelis, N. and van Gool, L. , Coupled object detection and tracking from static cameras and moving vehicles , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Leibe, B. , Ettlin, A. and Schiele, B. , Learning semantic object parts for object categorization , in: Image and Vision Computing, volume 26, number 1, pages 15-26, 2008.
Leibe, B. , Leonardis, A. and Schiele, B. , Robust object detection with interleaved categorization and segmentation , in: International Journal of Computer Vision, volume 77, number 1-3, pages 259-289, 2008.
Li, W. , Kumatani, K. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , A neural network based regression approach for recogninizing simultaneous speech , in: Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
Li, W. , Kumatani, K. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , A neural network based regression approach for recognizing simultaneous speech , number Idiap-RR-10-2008, 2008.
Li, W. , Effective post-processing for single-channel frequency-domain speech enhancement , pages 149-152, 2008. [DOI]
Li, W. , Effective post-processing of single-channel frequency-domain speech enhancement , in: IEEE conference on multimedia and expo, 2008.
Li, W. , Doss, M. M. , Dines, J. and Bourlard, H. , Mlp-based log spectral energy mapping for robust overlapping speech recognition , in: European Signal Processing Conference, 2008.
Li, W. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , Neural network based regression for robust overlapping speech recognition using microphone arrays , in: Interspeech, 2008.
Liwicki, M. and Bunke, H. , Combining on-line and off-line blstm networks for handwritten text line recognition , in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 31-36, 2008.
Liwicki, M. and Bunke, H. , Recognition of whiteboard notes -- online, offline and combination , World Scientific, ISBN 978-9812814531, 2008.
Liwicki, M. , Schlapbach, A. and Bunke, H. , Writer-dependent recognition of handwritten whiteboard notes in smart meeting room environments , in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 151-157, 2008.
Llonch, R. Sala , Kokiopoulou, E. , Tosic, I. and Frossard, P. , 3d face recognition using sparse spherical representations , in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
Luo, J. , Caputo, B. , Zweig, A. , Back, J. -H. and Anemuller, J. , Object category detection using audio-visual cues , in: International Conference on Computer Vision Systems (ICVS08), 2008.
Mariéthoz, J. , Bengio, S. and Grandvalet, Y. , Kernel Based Text-Independnent Speaker Verification , number Idiap-RR-68-2008, 2008.
Matena, L. , Jaimes, A. and Popescu-Belis, A. , Graphical representation of meetings on mobile devices , in: MobileHCI 2008 Demonstrations (10th ACM International Conference on Human-Computer Interaction with Mobile Devices and Services), 2008.
Mesot, B. , Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits , Ecole Polytechnique Fédérale de Lausanne, 2008.
Mesot, B. , Switching linear dynamical systems for noise robust speech recognition of isolated degits , STI School of Engineering, EPFL, 2008.
Meynet, J. and Thiran, J. -Ph. , Ensembles of SVMs using an Information Theoretic Criterion , in: Pattern Recognition Letters, 2008.
Meynet, J. , Arsan, T. , Cruz Mota, J. and Thiran, J. -Ph. , Fast multi-view face tracking with pose estimation , in: 16th European Signal Processing Conference, Lausanne, 2008.
Meynet, J. and Thiran, J. -Ph. , Information Theoretic Combination of Classifiers , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008. [DOI]
Millán, J. del R. , Brain-controlled robots , in: IEEE International Conference on Robotics and Automation (ICRA 2008), Pasadena, CA, USA,, 2008. [DOI]
Millán, J. del R. , Brain-Controlled Robots , in: IEEE Intelligent Systems, 2008.
Millán, J. del R. , Ferrez, P. W. , Galán, F. , Lew, E. and Chavarriaga, R. , Non-invasive brain-machine interaction , in: International Journal of Pattern Recognition and Artificial Intelligence, 2008.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Semantic clustering of images using patterns of relevance feedback , in: Proceedings of the 6th International Workshop on Content-based Multimedia Indexing (CBMI'2008), 2008.
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Entropy coding of Quantized Spectral Components in FDLP audio codec , number Idiap-RR-71-2008, 2008.
Motlicek, P. , Ganapathy, S. , Hermansky, H. , Garudadri, H. and Athineos, M. , Perceptually motivated Sub-band Decomposition for FDLP Audio Coding , in: Text, Speech and Dialogue, pages 435-442, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
Naturel, X. and Odobez, J. -M. , Detecting queues at vending machines: a statistical layered approach , in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008.
Negoescu, R. -A. and Gatica-Perez, D. , Analyzing flickr groups , in: Proceedings of the 2008 international conference on Content-based image and video retrieval (CIVR '08), Sheraton Fallsview Hotel, Niagara Falls, Canada, 2008.
Negoescu, R. -A. and Gatica-Perez, D. , Topickr: Flickr Groups and Users Reloaded , in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008.
Nijholt, A. , Tan, D. , Allison, B. , Millán, J. del R. , Moore, M. and Graimann, B. , Brain-computer interfaces for hci and games , in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008.
Noris, B. , Benmachiche, K. and Billard, A. , Calibration-free eye gaze direction detection with gaussian processes , in: International Conference on Computer Vision Theory and Applications (VISAPP 2008), Funchal, Portugal, 2008.
Orabona, F. , Keshet, J. and Caputo, B. , The Projectron: a Bounded Kernel-Based Perceptron , in: Int. Conf. on Machine Learning, 2008.
Ouaret, M. , Dufaux, F. and Ebrahimi, T. , Enabling Privacy For Distributed Video Coding by Transform Domain Scrambling , in: 2008 SPIE Visual Communications and Image Processing, San Diego, USA, 2008.
Paiement, J. -F. , Grandvalet, Y. , Bengio, S. and Eck, D. , A Distance Model for Rhythms , in: 25th International Conference on Machine Learning (ICML), 2008.
Paiement, J. -F. , Grandvalet, Y. and Bengio, S. , Predictive Models for Music , number Idiap-RR-51-2008, 2008.
Paiement, J. -F. , Bengio, S. and Eck, D. , Probabilistic Models for Melodic Prediction , number Idiap-RR-50-2008, 2008.
Paiement, J. -F. , Probabilistic models for music , École Polytechnique Fédérale de Lausanne, 2008.
Parthasarathi, S. H. K. and Hermansky, H. , A data-driven approach to speech/non-speech detection , number Idiap-RR-23-2008, 2008.
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting Contextual Information for Speech/Non-Speech Detection , in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting temporal context for speech/non-speech detection , number Idiap-RR-21-2008, 2008.
Pellegrini, S. , Schindler, K. and D. Nardi, , A generalization of the icp algorithm for articulated bodies , in: British Machine Vision Conference (BMVC'08), 2008.
Perrin, X. , Chavarriaga, R. , Ray, C. , Siegwart, R. and Millán, J. del R. , A comparative psychophysical and eeg study of different feedback modalities for hri , in: Human-Robot Interaction (HRI08), 2008.
Perruchoud, L. , The Anterior Cingulate Cortex , number Idiap-Com-02-2008, 2008.
Pinto, J. P. and Hermansky, H. , Combining evidence from a generative and a discriminative model in phoneme recognition , in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Pinto, J. P. , Hermansky, H. , Yegnanarayana, B. and Magimai-Doss, M. , Exploiting contextual information for improved phoneme recognition , in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
Pinto, J. P. , Szoke, I. , Prasanna, S. R. Mahadeva and Hermansky, H. , Fast approximate spoken term detection from sequence of phonemes , in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore,, 2008.
Pinto, J. P. , Sivaram, G. S. V. S. and Hermansky, H. , Reverse correlation for analyzing mlp posterior features in asr , in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008. [DOI]
Popescu-Belis, A. , Dimensionality of dialogue act tagsets: an empirical analysis of large corpora , in: Language Resources and Evaluation, volume 42, number 1, pages 99-107, 2008. [DOI]
Popescu-Belis, A. , Bourlard, H. and Renals, S. , Machine learning for multimodal interaction iv , LNCS, volume 4892, Springer-Verlag, ISBN 978-3-540-78154-7, 2008.
Popescu-Belis, A. and Stiefelhagen, R. , Machine learning for multimodal interaction v , LNCS, volume 5237, Springer-Verlag, ISBN 978-3-540-85852-2, 2008.
Popescu-Belis, A. , Reference-based vs. task-based evaluation of human language technology , in: LREC 2008 ELRA Workshop on Evaluation: "Looking into the Future of Evaluation: When automatic metrics meet task-based and performance-based approaches", pages 12-16, ELRA, 2008.
Popescu-Belis, A. , Flynn, M. , Wellner, P. and Baudrion, P. , Task-based evaluation of meeting browsers: from bet task elicitation to user behavior analysis , in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
Prodanov, P. , Drygajlo, A. , Richiardi, J. and Alexander, A. , Low-level grounding in a multimodal mobile service robot conversational system using graphical models , in: Intelligent Service Robotics, volume 1, pages 3-26, 2008. [DOI]
Pronobis, M. and Magimai-Doss, M. , Integrating audio and vision for robust automatic gender recognition , number Idiap-RR-73-2008, 2008.
Pronobis, A. , Martinez Monos, O. and Caputo, B. , SVM-based Discriminative Accumulation Scheme for Place Recognition , in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008.
Quack, T. , Bay, H. and van Gool, L. , Object recognition for the internet of things , in: Internet of Things 2008, 2008.
Quack, T. , Leibe, B. and van Gool, L. , World-scale mining of objects and events from community photo collections , in: Conference on Image and Video Retrieval (CIVR'08), ACM, 2008.
Rakotomamonjy, A. , Bach, F. , Canu, S. and Grandvalet, Y. , SimpleMKL , in: Journal of Machine Learning Research, volume 9, pages 2491-2521, 2008.
Rayner, M. , Tsourakis, N. , Georgescul, M. and Bouillon, P. , Building mobile spoken dialogue applications using regulus , in: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), 2008.
Richiardi, J. , Drygajlo, A. and Todesco, L. , Promoting diversity in gaussian mixture ensembles: an application to signature verification , pages 140-149, Springer, 2008.
Richiardi, J. , Drygajlo, A. and Todesco, L. , Promoting diversity in gaussian mixture ensembles: an application to signature verification , in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 140-149, 2008.
Riedhammer, K. , Gillick, D. , Favre, B. and Hakkani-Tur, D. , Packing the meeting summarization knapsack , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Rigamonti, M. , A framework for structuring multimedia archives and for browsing efficiently through multimodal links , University of Fribourg, Switzerland, 2008.
Rigamonti, M. , A framework for structuring multimedia archives and for browsing efficiently through multimodal links , University of Fribourg, Switzerland, 2008.
Roth, D. , Koller-Meier, E. , Rowe, D. , Moeslund, T. B. and van Gool, L. , Event-based tracking evaluation metric , in: IEEE Workshop on Motion and Video Computing (WMVC), 2008.
Scaringella, N. , Timbre and Rhythmic TRAP-TANDEM features for music information retrieval , in: "Int. Conf. on Music Information Retrieval (ISMIR)", 2008.
Schindler, K. and van Gool, L. , Action snippets: how many frames does human action recognition require? , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
Schindler, K. and van Gool, L. , Combining densely sampled form and motion for human action recognition , in: DAGM Annual Pattern Recognition Symposium, Springer, 2008.
Schindler, K. and Suter, D. , Object detection by global contour shape , in: Pattern Recognition, 2008.
Schindler, K. , van Gool, L. and B. de Gelder, , Recognizing emotions expressed by body pose: a biologically inspired neural model , in: Neural Networks, 2008.
Schlapbach, A. , Liwicki, M. and Bunke, H. , A writer identification system for on-line whiteboard data , in: Pattern Recognition, volume 41, pages 2381-2397, 2008.
Schlapbach, A. , Wettstein, F. and Bunke, H. , Automatic estimation of the readability of handwritten text , in: Proc. 16th European Signal Processing Conference, 2008.
Schlapbach, A. , Bunke, H. and Wettstein, F. , Estimating the readability of handwritten text -- a support vector regression based approach , in: Proc. 19th Int. Conf. on Pattern Recognition, IEEE, 2008.
Schlapbach, A. and Bunke, H. , Off-line writer identification and verification using gaussian mixture models , in: Machine Learning in Document Analysis and Recognition, pages 409-428, Springer, 2008.
Schlapbach, A. , Writer identification and verification , volume 311, IOS Press, ISBN 978-1-58603-825-0, 2008.
Schouten, B. , Juul, N. , Drygajlo, A. and Tistarelli, M. , Biometrics and identity management , Springer, 2008.
Schouten, B. , Juul, N. , Drygajlo, A. and Tistarelli, M. , Biometrics and identity management , Springer, 2008.
Shahrokni, A. , Drummond, T. , Fleuret, F. and Fua, P. , Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
Shriberg, E. , Higher level features in speaker recognition , in: in C. Muller (Ed.) Speaker Classification I. Springer-Verlag, New York, 2008.
De Simone, F. , Ticca, D. , Dufaux, F. , Ansorge, M. and Ebrahimi, T. , A comparative study of color image compression standards using perceptually driven quality metrics , in: SPIE Optics and Photonics, San Diego, CA USA, 2008.
De Simone, F. , Ansorge, M. and Ebrahimi, T. , A multi-channel objective model for the full-reference assessment of color pictures , in: 2nd K-space Jamboree Workshop, Paris, 2008.
Singla, A. and Hakkani-Tur, D. , Cross-lingual sentence extraction for information distillation , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Sivaram, G. S. V. S. and Hermansky, H. , Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition , in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
Sivaram, G. S. V. S. and Hermansky, H. , Introducing temporal asymmetries in feature extraction for automatic speech recognition , in: Interspeech 2008, Brisbane, Australia, 2008.
Smith, K. , Ba, S. , Gatica-Perez, D. and Odobez, J. -M. , Tracking the visual focus of attention for a varying number of wandering people , in: IEEE Trans. on Pattern Analysis and Machine Intelligence,, volume 30, number 7, pages 1212-1229, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , affective characterization of movie scenes based on multimedia content analysis and user's physiological emotional responses , in: IEEE International Symposium on Multimedia, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , affective ranking of movie scenes using physiological signals and content analysis , in: 2nd ACM Workshop on the Many Faces of Multimedia Semantics, ACM MM08, 2008.
Soleymani, M. , Kierkels, J. , Chanel, G. , Bruno, E. , Marchand-Maillet, S. and T. Pun, , Estimating emotions and tracking interest during movie watching based on multimedia content and physiological responses , in: Joint (IM)2-Interactive Multimodal Information Management and Affective Sciences NCCRs meeting, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , Valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses , in: MLMI 2008, 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
Soleymani, M. , Chanel, G. , Kierkels, J. and Pun, T. , valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses , 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
Sorci, M. , Antonini, G. , Cerretani, B. , Cruz Mota, J. , Rubin, T. , Bierlaire, M. and Thiran, J. -Ph. , Modelling human perception of static facial expressions , in: Face and Gesture Recognition 2008, Amsterdam, 2008.
Spindler, T. , Wartmann, C. , Hovestadt, L. , Roth, D. , van Gool, L. and Steffen, A. , Privacy in video surveilled spaces , in: Journal of Computer Security, volume 16, number 2, pages 199-222, 2008.
Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Janin, A. , Magimai-Doss, M. , Wooters, C. and Zheng, J. , The SRI-ICSI spring 2007 meeting and lecture recognition system , in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
Stoyanchev, S. , Tur, G. and Hakkani-Tur, D. , Name-aware speech recognition for interactive question answering , in: IEEE ICASSP, Las Vegas, NV, 2008.
Szafranski, M. , Grandvalet, Y. and Rakotomamonjy, A. , Composite Kernel Learning , in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), pages 1040-1047, Omnipress, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based features for far-field speech recognition , in: MLMI 2008, Utrecht, The Netherlands, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech , in: Interspeech 2008, Brisbane, Australia, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Recognition of reverberant speech using frequency domain linear prediction , in: IEEE Signal Processing Letters, 2008.
Thomas, A. , Ganapathy, S. and Hermansky, H. , Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain , in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
Thomas, A. , Ferrari, V. , Leibe, B. , Tuytelaars, T. and van Gool, L. , Using recognition to guide a robot's attention , in: Robotics Science and Systems, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach , number Idiap-RR-77-2008, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , Cue Integration for Medical Image Annotation , in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , Discriminative cue integration for medical image annotation , in: Pattern Recognition Letters, 2008.
Torii, A. , Havlena, M. , Pajdla, T. and B. Leibe, , Measuring camera translation by the dominant apical angle , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
Tous, R. , Carreras, A. , Delgado, J. , Cordara, G. , Gianluca, F. , Peig, E. , Dufaux, F. and Galinski, G. , An Architecture for TV Content Distributed Search and Retrieval Using the MPEG Query Format (MPQF) , in: International Workshop on Ambient Media Delivery and Interactive Television (AMDIT 2008), Quebec City, Canada, 2008.
Tsourakis, N. , Lisowska, A. , Bouillon, P. and Rayner, M. , From desktop to mobile: adapting a successful voice interaction platform for use in mobile devices , in: Third ACM MobileHCI Workshop on Speech in Mobile and Pervasive Environments (SiMPE), Amsterdam, the Netherlands., 2008.
Ullah, M. M. , Pronobis, A. , Caputo, B. , Luo, J. , Jensfelt, P. and Christensen, H. I. , Towards Robust Place Recognition for Robot Localization , in: IEEE International Conference on Robotics ad Automation, 2008.
Valente, F. and Hermansky, H. , Hierarchical and parallel processing of modulation spectrum for asr applications , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
Valente, F. and Hermansky, H. , On the combination of auditory and modulation frequency channels for asr applications , in: Interspeech 2008, Brisbane, Australia, 2008.
Vergyri, D. , Mandal, A. , Wang, W. , Stolcke, A. , Zheng, J. , Graciarena, M. , Rybach, D. , Gollan, C. , Schlater, R. , Kirchoff, K. , Faria, A. and Morgan, N. , Development of the sri/nightingale arabic asr system , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 1437-1440, 2008.
Vergyri, D. , Mandal, A. , Wang, W. , Stolcke, A. , Zheng, J. , Graciarena, M. , Rybach, D. , Gollan, C. , Schlater, R. , Kirchoff, K. , Faria, A. and Morgan, N. , Development of the sri/nightingale arabic asr system , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Combination of agglomerative and sequential clustering for speaker diarization , in: International Conference on Acoustics, Speech and Signal Processing, 2008.
Vijayasenan, D. , Valente, F. and Bourlard, H. , Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization , in: Interspeech 2008, 2008.
Vinciarelli, A. , Pantic, M. , Bourlard, H. and Pentland, A. , Social signal processing: state-of-the-art and future perspectives of an emerging domain , in: Proceedings of the ACM International Conference on Multimedia, 2008.
Vinciarelli, A. , Pantic, M. , Bourlard, H. and Pentland, A. , Social signals, their function, and automatic analysis: a survey , in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008.
Vinyals, O. and Friedland, G. , A hardware-independent fast logarithm approximation with adjustable accuracy , in: 10th IEEE International Symposium on Multimedia, Berkeley, CA, USA, pages 61-65, 2008.
Vinyals, O. and Friedland, G. , Live speaker identification in meetings: "who is speaking now?" , in: Technical Report TR-08-001, International Computer Science Institute, Berkeley, CA, 2008.
Vinyals, O. and Friedland, G. , Modulation spectrogram features for speaker diarization , in: to appear in proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Vinyals, O. and Friedland, G. , Modulation spectrogram features for speaker diarization , in: Interspeech 2008, Brisbane, Australia, pages 630-633, 2008.
Vinyals, O. and Friedland, G. , Towards semantic analysis of conversations: a system for the live identification of speakers in meetings , in: to appear in Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, CA, 2008.
Voloshynovskiy, S. , Koval, O. , Villán, R. , Beekhof, F. and Pun, T. , Authentication of biometric identification documents via mobile devices , in: Journal of Electronic Imaging, 2008.
Voloshynovskiy, S. , Koval, O. and Pun, T. , Multimodal authentication based on random projections and distributed coding , in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
Voloshynovskiy, S. , Koval, O. , Beekhof, F. and Pun, T. , Multimodal authentication based on random projections and distributed coding , in: MM&Sec 2008, 2008.
Weinshall, D. , Hermansky, H. , Zweig, A. , Luo, J. , Jimison, H. , Ohl, F. and Pavel, M. , Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree , in: Advances in Neural Information Processing Systems 21, 2008.
Weise, T. , Leibe, B. and van Gool, L. , Accurate and robust registration for in-hand modeling , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
Wooters, C. and Huijbregts, M. , The ICSI RT07s speaker diarization system , in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
Yao, J. and Odobez, J. -M. , Fast human detection from videos using covariance features , in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008.
Yao, J. and Odobez, J. -M. , Multi-camera 3d person tracking with particle filter in a surveillance environment , in: 16th European Signal processing Conference (EUSIPCO), 2008.
Zeng, G. and van Gool, L. , Multi-label image segmentation via point-wise repetition , in: International Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
Zhao, S. and Morgan, N. , Multi-stream spectro-temporal features for robust speech recognition , in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
Zhao, S. Y. and Morgan, N. , Multi-stream spectro-temporal features for robust speech recognition , in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 898-901, 2008.
I. Bogdanova, , A. Bur, and Hügli, H. , The spherical approach to omnidirectional visual attention , in: XVI European Signal Processing Conference (EUSIPCO 2008), 2008.
I. Bogdanova, , A. Bur, and Hügli, H. , Visual attention on the sphere [in press] , in: IEEE Transactios on Image Processing, 2008.
Varga, T. and Bunke, H. , Perturbation models for generating synthetic training data in handwriting recognition , in: Machine Learning in Document Analysis and Recognition, pages 333-360, Springer, 2008.
Tommasi, T. , Orabona, F. and Caputo, B. , An SVM Confidence-Based Approach to Medical Image Annotation , in: Evaluating Systems for Multilingual and Multimodal Information Access -- 9th Workshop of the Cross-Language Evaluation Forum, 2008.
Popescu-Belis, A. , Bourlard, H. and Renals, S. , Machine learning for multimodal interaction iv (revised selected papers from mlmi 2007, brno, 28-30 june 2007) , LNCS 4892, Springer-Verlag, 2008.
Popescu-Belis, A. and Stiefelhagen, R. , Machine learning for multimodal interaction v (proceedings of mlmi 2008, utrecht, 8-10 september 2008) , LNCS 5237, Springer-Verlag, 2008.
Popescu-Belis, A. , Boertjes, E. , Kilgour, J. , Poller, P. , Castronovo, S. , Wilson, T. , Jaimes, A. and Carletta, J. , The amida automatic content linking device: just-in-time document retrieval in meetings , in: Machine Learning for Multimodal Interaction V (Proceedings of MLMI 2008, Utrecht, 8-10 September 2008), pages 273-284, Springer-Verlag, 2008.
Popescu-Belis, A. , Boertjes, E. , Kilgour, J. , Poller, P. , Castronovo, S. , Wilson, T. , Jaimes, A. and Carletta, J. , The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings , in: Machine Learning for Multimodal Interaction V, pages 272-283, Springer-Verlag, Utrecht, 2008. [DOI]
Popescu-Belis, A. , Baudrion, P. , Flynn, M. and Wellner, P. , Towards an objective test for meeting browsers: the bet4tqb pilot experiment , in: Machine Learning for Multimodal Interaction IV, pages 108-119, Springer-Verlag, 2008. [DOI]
Aloise, F. , Caporusso, N. , Mattia, D. , Babiloni, F. , Kauhanen, L. , Millán, J. del R. , Nuttin, M. , Marciani, M. G. and Cincotti, F. , Brain-machine interfaces through control of electroencephalographic signals and vibrotactile feedback , in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007.
Anguera, X. , Wooters, C. and Hernando, J. , Acoustic Beamforming for Speaker Diarization of Meetings , in: to appear in IEEE Transactions on Audio, Speech and Language Processing, 2007.
Anguera, X. , Wooters, C. , Pardo, J. M. and Hernando, J. , Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings , in: Proc. ICASSP, Honolulu, 2007.
Anguera, X. , Shinozaki, T. , Wooters, C. and Hernando, J. , Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization , in: Proc. ICASSP, Honolulu, 2007.
Ansari-Asl, K. , Chanel, G. and Pun, T. , A channel selection method for eeg classification in emotion assessment based on synchronization likelihoo , in: Eusipco 2007, 15th Eur. Signal Proc. Conf., 2007.
Aradilla, G. , Vepa, J. and Bourlard, H. , An acoustic model based on kullback-leibler divergence for posterior features , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
Aradilla, G. and Ajmera, J. , Detection and recognition of number sequences within spoken utterances , in: 2nd Workshop on Speech in Mobile and Pervasive Environments, 2007.
Aradilla, G. and Bourlard, H. , Posterior-based features and distances in template matching for speech recognition , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 204-214, 2007. [DOI]
Ba, S. , Joint head tracking and pose estimation for visual focus of attention recognition , École Polytechnique Fédérale de Lausanne, 2007.
Ba, S. and Odobez, J. -M. , Probabilistic head pose tracking evaluation in single and multiple camera setups , in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007.
Bay, H. , Ess, A. , Tuytelaars, T. and van Gool, L. , Speeded-up robust features (surf) , in: Computer Vision and Image Understanding (CVIU), 2007.
Behera, A. , Lalanne, D. and Ingold, R. , Docmir: an automatic document-based indexing system for meeting retrieval , in: Multimedia Tools and Applications, volume 37, number 2, 2007.
Bengio, S. and Mariéthoz, J. , Biometric person authentication is a multiple classifier problem , in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007.
Bertini, E. , Hertzog, P. and Lalanne, D. , Spiralview: a visual tool to improve monitoring and understanding of security data in corporate , in: IEEE Symposium on Visual Analytics Science and Technology 2007 (VAST'07), pages to appear, 2007.
Bertolami, R. and Bunke, H. , Multiple classifier methods for offline handwritten text line recognition , in: Multiple Classifier Systems, pages 72-81, Springer, 2007.
Bertolami, R. , Uchida, S. , Zimmermann, M. and Bunke, H. , Non-uniform slant correction for handwritten text line recognition , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 18-22, 2007.
Besson, P. , Popovici, V. , Vesin, J. M. , Thiran, J. -Ph. and Kunt, M. , Extraction of audio features specific to speech production for multimodal speaker detection , in: IEEE Transactions on Multimedia, 2007. [DOI]
Bogdanova, I. , Bresson, X. , Thiran, J. -Ph. and Vandergheynst, P. , Scale-space analysis and active contours for omnidirectional images , in: IEEE Transactions on Image Processing, volume 16, number 7, pages 1888-1901, 2007. [DOI]
Bologna, G. , Deville, B. , Pun, T. and Vinckenbosch, M. , Identifying major components of pictures by audio encoding of colors , in: IWINAC2007, 2nd. Int. Work-conf. on the Interplay between Natural and Artificial Computation, 2007.
Bologna, G. , Deville, B. , Pun, T. and Vinckenbosch, M. , Transforming 3d coloured pixels into musical instrument notes for vision substitution applications , in: Eurasip J. of Image and Video Processing, Special Issue: Image and Video Processing for Disability, accepted for publication, 2007.
Bouillon, P. , Flores, G. , Starlander, M. , Chatzichrisafis, N. , Santaholma, M. , Tsourakis, N. , Rayner, M. and Hockey, B. A. , A bidirectional grammar-based medical speech translator , in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 41-48, ACL 2007, Prague, Czech Republic, 2007.
Bouillon, P. , Chatzichrisafis, N. , Halimi, S. , Hockey, B. A. , Isahara, H. , Kanzaki, K. , Nakao, Y. , Novellas Vall, B. , Rayner, M. , Santaholma, M. and Starlander, M. , Medslt: a multi-lingual grammar-based medical speech translator , in: Proceedings of First International Workshop on Intercultural Collaboration, IWIC2007, Kyoto, Japan, 2007.
Bouillon, P. , Rayner, M. , Novellas Vall, B. , Starlander, M. , Santaholma, M. , Nakao, Y. and Chatzichrisafis, N. , Une grammaire partagée multi-tâche pour le traitement de la parole : application aux langues romanes , in: TAL (Traitement Automatique des Langues), volume 47, number 3, 2007.
Bray, M. , Koller-Meier, E. and van Gool, L. , Smart particle filtering for high-dimensional tracking , in: Computer Vision and Image Understanding, 2007.
Bresson, X. , Esedoglu, S. , Vandergheynst, P. , Thiran, J. -Ph. and Osher, S. , Fast Global Minimization of the Active Contour/Snake Model , in: Journal of Mathematical Imaging and Vision, volume 28, number 2, pages 151-167, 2007. [DOI]
Broschart, M. , de Negueruela, C. , Millán, J. del R. and Menon, C. , Augmenting astronaut's capabilities through brain-machine interfaces , in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007.
Bruno, E. , Kludas, J. and Marchand-Maillet, S. , Combining multimodal preferences for multimedia information retrieval , in: ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2007.
Bruno, E. , Kludas, J. and Marchand-Maillet, S. , Combining multimodal preferences for multimedia information retrieval , in: Proc. of International Workshop on Multimedia Information Retrieval, 2007.
Bunke, H. and Neuhaus, M. , Graph matching -- exact and error-tolerant methods and the automatic learning of edit costs , in: Mining Graph Data, pages 17-34, Wiley, 2007.
Bunke, H. , Dickinson, P. , Humm, A. , Irniger, C. and Kraetzl, M. , Graph sequence visualisation and its application to computer network monitoring and abnormal event detection , in: Applied Graph Theory in Computer Vision and Pattern Recognition, pages 227-245, Springer, 2007.
Bunke, H. and Varga, T. , Off-line Roman cursive handwriting recognition , in: Digital Document Processing: Major Directions and Recent Advances, volume 20, pages 165-173, 2007.
Cetin, O. , Kantor, A. , King, S. , Bartels, C. , Magimai-Doss, M. , Frankel, J. and Livescu, K. , An Articulatory Feature-based Tandem Approach and Factored Observation Modeling , in: Proc. ICASSP, Honolulu, 2007.
Chanel, G. , Ansari-Asl, K. and Pun, T. , Valence-arousal evaluation using physiological signals in an emotion recall paradigm , in: 2007 IEEE SMC, Int. Conf. on Systems, Man and Cybernetics, Smart cooperative systems and cybernetics: advancing knowledge and security for humanity, 2007.
Chavarriaga, R. , Ferrez, P. W. and Millán, J. del R. , To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
Chavarriaga, R. , Ferrez, P. W. and del R. Millán, J. , To err is human: learning from error potentials in brain-computer interfaces , in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
Chen, L. , Barber, D. and Odobez, J. -M. , Dynamical dirichlet mixture model , number 02, 2007.
Chiappa, S. and Barber, D. , Bayesian factorial linear gaussian state-space models for biosignal decomposition , in: IEEE Signal Processing Letters, 2007.
Cincotti, F. , Mattia, D. , Aloise, F. , Bufalari, S. , Astolfi, L. , Fallani, F. De Vico , Tocci, A. , Bianchi, L. , Marciani, M. G. , Gao, S. , Millán, J. del R. and Babiloni, F. , High-resolution eeg techniques for brain-computer interface applications , in: Journal of Neuroscience Methods, volume 167, pages 31-42, ISSN 0165-0270, 2007.
Cincotti, F. , Kauhanen, L. and Aloise, F. , Vibrotactile feedback for brain-computer interface operation , in: Computational Intelligence and Neuroscience, volume 2007, pages Article ID, 2007.
Cuendet, S. , Shriberg, E. , Favre, B. , Fung, J. and Hakkani-Tur, D. , An analysis of sentence segmentation features for broadcast news, broadcast conversations, and meetings , in: SIGIR Workshop on Searching Conversational Spontaneous Speech, 2007.
Cuendet, S. , Hakkani-Tur, D. and Shriberg, E. , Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech , in: to appear in Proceedings of MLMI, Brno, Czech Republic, 2007.
Cuendet, S. , Hakkani-Tur, D. , Shriberg, E. , Fung, J. and Favre, B. , Cross-Genre Feature Comparisons for Spoken Sentence Segmentation , in: International Conference on Semantic Computing (ICSC), Irvine, CA, 2007.
Dessimoz, D. , Richiardi, J. , Champod, C. and Drygajlo, A. , Multimodal biometrics for identity documents (MBioID) , in: Forensic Science International, volume 167, pages 154-159, 2007. [DOI]
Dines, J. and Magimai-Doss, M. , A study of phoneme and grapheme based context-dependent asr systems , number 12, 2007.
Dines, J. and Vepa, J. , Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics , number 13, 2007.
Dornhege, G. , del R. Millán, J. , Hinterberger, T. , McFarland, D. and Müller, K. -R. , Towards brain-computer interfacing , The MIT Press, 2007.
Drugman, T. , Gurban, M. and Thiran, J. -Ph. , Relevant Feature Selection for Audio-Visual Speech Recognition , in: 9th International Workshop on Multimedia Signal Processing (MMSP), Chania, Crete, Greece, 2007.
Drygajlo, A. , Man-machine voice communication , pages 433-461, EPFL Press, 2007. [DOI]
Drygajlo, A. , Multimodal biometrics for identity documents and smart cards european challenge , in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
Einsele, F. , Hennebert, J. and Ingold, R. , Towards identification of very low resolution, anti-aliased characters , in: IEEE International Symposium on Signal Processing and its Applications (ISSPA'07), Sharjah, United Arab Emirates, 2007.
Ess, A. , Leibe, B. and van Gool, L. , Depth and appearance for mobile scene analysis , in: International Conference on Computer Vision (ICCV'07), 2007.
Ess, A. , Neubeck, A. and van Gool, L. , Generalised linear pose estimation , in: BMVC, 2007.
Evéquoz, F. and Lalanne, D. , Indexing and visualizing digital memories through personal email archive , pages 21-24, 2007.
Evéquoz, F. and Lalanne, D. , Personal information management through interactive visualizations , pages 158-160, 2007.
Ferrez, P. W. , Error-related eeg potentials in brain-computer interfaces , École Polytechnique Fédérale de Lausanne, 2007.
Ferrez, P. W. and Millán, J. del R. , Error-related eeg potentials in brain-computer interfaces , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Frapolli, F. , Hirsbrunner, B. and Lalanne, D. , Dynamic rules: towards interactive games intelligence , in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 29-32, 2007.
Galán, F. , Nuttin, M. , Lew, E. , Ferrez, P. W. , Vanacker, G. , Philips, J. , van Brussel, H. and Millán, J. del R. , An asynchronous and non-invasive brain-actuated wheelchair , in: Proceedings of the 13th International Symposium on Robotics Research, 2007.
Galán, F. , Ferrez, P. W. , Oliva, F. , Guàrdia, J. and del R. Millán, J. , Feature extraction for multi-class bci using canonical variates analysis , number 23, 2007.
Galán, F. , Palix, J. , Chavarriaga, R. , Ferrez, P. W. , Lew, E. , Hauert, C. -A. and Millán, J. del R. , Visuo-spatial attention frame recognition for brain-computer interfaces , in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007.
Gaudard, C. , Aradilla, G. and Bourlard, H. , Speech recognition based on template matching and phone posterior probabilities , number 02, 2007.
Georgescul, M. , Clark, A. and Armstrong, S. , Exploiting structural meeting-specific features for topic segmentation , in: Actes de la 14ème Conférence sur le Traitement Automatique des Langues Naturelles, Toulouse, France, 2007.
Gerber, M. , Kaufmann, T. and Pfister, B. , Perceptron-based class verification , in: Proceedings of NOLISP (ISCA Workshop on non linear speech processing), 2007.
Gerber, M. , Beutler, R. and Pfister, B. , Quasi text-independent speaker verification based on pattern matching , in: Proceedings of Interspeech, ISCA, 2007.
Germann, M. , Breitenstein, M. D. , Park, I. K. and Pfister, H. , Automatic pose estimation for range images on the gpu , in: Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007), pages 81-90, IEEE Computer Society, 2007.
Grangier, D. and Bengio, S. , Learning the inter-frame distance for discriminative template-based keyword detection , in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007.
Graves, A. , Liwicki, M. and Bunke, H. , Unconstrained on-line handwriting recognition with recurrent neural networks , in: Advances in Neural Information Processing, 2007.
Gurban, M. , Valles, A. and Thiran, J. -Ph. , Low-Dimensional Motion Features for Audio-Visual Speech Recognition , in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, Poznan, Poland, 2007.
Guz, U. , Cuendet, S. , Hakkani-Tur, D. and Tur, G. , Co-training Using Prosodic and Lexical Information for Sentence Segmentation , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Hakkani-Tur, D. and Tur, G. , Statistical Sentence Extraction for Information Distillation , in: Proc. ICASSP, Honolulu, 2007.
Hennebert, J. , Loeffel, R. , Humm, A. and Ingold, R. , A new forgery scenario based on regaining dynamics of signature , in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
Hennebert, J. , Humm, A. and Ingold, R. , Modelling spoken signatures with gaussian mixture model adaptation , in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07), 2007.
Hennebert, J. , Please repeat: my voice is my password. from the basics to real-life implementations of speaker verification technologies , in: Invited lecture at the Information Security Summit (IS2 2007), Prague, 2007.
Heusch, G. and Marcel, S. , A novel statistical generative model dedicated to face recognition , number Idiap-RR-39-2007, 2007.
Heusch, G. and Marcel, S. , Face authentication with salient local features and static bayesian network , in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007.
Hoffmann, U. , Vesin, J. M. and Ebrahimi, T. , Recent advances in brain-computer interfaces , in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
Huang, Y. , Vinyals, O. , Friedland, G. , Müller, C. , Mirghafori, N. and Wooters, C. , A Fast-Match approach for robust, faster than real-time Speaker Diarization , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Huang, Y. , Robust and rapid speaker diarization , in: Master Thesis, University of California, Berkeley, 2007.
Huang, Y. , Friedland, G. , Müller, C. and Mirghafori, N. , Speeding up speaker diarization by using prosodic features , in: Technical Report TR-07-004, International Computer Science Institute, Berkeley, California, 2007.
Huijbregts, M. , Wooters, C. and Ordelman, R. , Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections , in: to appear in Proceedings of Interspeech, Antwerp, 2007.
Huijbregts, M. and Wooters, C. , The Blame Game: Performance Analysis of Speaker Diarization System Components , in: to appear in Proc. Interspeech, Antwerp., 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Database and evaluation protocols for user authentication using combined handwriting and speech modalities , 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Hidden markov models for spoken signature verification , 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Modelling combined handwriting and speech modalities , in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Spoken handwriting verification using statistical models , in: Accepted for publication, International Conference on Document Analysis and Recognition (ICDAR 07), Curitiba Brazil, 2007.
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in a group meeting , 2007.
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in a group meeting multi-layer background subtraction based on color and texture , in: Proc. ACM Multi Media, Augsburg, Germany, 2007.
Hung, H. , Jayagopi, D. , Yeo, C. , Friedland, G. , Ba, S. , Odobez, J. -M. , Ramchandran, K. , Mirghafori, N. and Gatica-Perez, D. , Using audio and video features to classify the most dominant person in meetings , in: Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany, 2007.
Hwang, M. -Y. , Peng, G. , Wang, W. , Faria, A. , Heidel, A. and Ostendorf, M. , Building a Highly Accurate Mandarin Speech Recognizer , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Hérault, R. and Grandvalet, Y. , Sparse probabilistic classifiers , in: International Conference on Machine Learning (ICML), 2007.
Jaeggli, T. , Koller-Meier, E. and van Gool, L. , Learning generative models for monocular body pose estimation , in: ACCV, 2007.
Jaeggli, T. , Koller-Meier, E. and van Gool, L. , Multi-activity tracking in lle body pose space , in: 2nd Workshop on HUMAN MOTION Understanding, Modeling, Capture and Animation, ICCV, 2007.
Jaimes, A. , Gatica-Perez, D. , Sebe, N. and Huang, T. S. , Guest Editors' Introduction: Human-Centered Computing-Toward a Human Revolution , in: Computer, volume 40, number 5, pages 30-34, 2007.
Jaimes, A. , Gatica-Perez, D. , Sebe, N. and Huang, T. S. , Human-centered computing: toward a human revolution , in: IEEE Computer, volume 40, number 5, 2007. [DOI]
Kaufmann, T. and Pfister, B. , An HPSG parser supporting discontinuous licenser rules , in: International Conference on HPSG, 2007.
Kaufmann, T. and Pfister, B. , Applying licenser rules to a grammar with continuous constituents , in: The Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, 2007.
Keshet, J. , Theoretical foundations for large-margin kernel-based continuous speech recognition , number Idiap-RR-44-2007, 2007.
Kittler, J. , Poh, N. , Fatukasi, O. , Messer, K. , Kryszczuk, K. , Richiardi, J. and Drygajlo, A. , Quality dependent fusion of intramodal and multimodal biometric experts , in: Proc. SPIE Defense and Security Symposium, 2007.
Kludas, J. , Bruno, E. and Marchand-Maillet, S. , Information fusion in multimedia information retrieval , in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
Knox, M. and Mirghafori, N. , Automatic Laughter Detection Using Neural Networks , in: to appear in Proceedings of Interspeech, Antwerp., 2007.
Kokiopoulou, E. and Frossard, P. , Accelarating distributed consensus using extrapolation , in: IEEE Signal Processing Letters, volume 14, number 10, pages 665-668, 2007.
Kokiopoulou, E. and Frossard, P. , Accelerating Distributed Consensus Using Extrapolation , in: IEEE Signal Processing Letters, volume 14, number 10, 2007. [DOI]
Kokiopoulou, E. and Frossard, P. , Dimensionality Reduction with Adaptive Approximation , in: IEEE Int. Conf. on Multimedia & Expo (ICME), Beijing, China, 2007.
Kokiopoulou, E. and Frossard, P. , Image alignment with rotation manifolds built on sparse geometric expansions , in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
Kolar, J. , Liu, Y. and Shriberg, E. , Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings , in: to appear in Proceedings of Interspeech, Antwerp., 2007.
Koval, O. , Voloshynovskiy, S. and Pun, T. , Analysis of multimodal binary detection systems based on dependent/independent modalities , in: Proceedings of the IEEE 2007 International Workshop on Multimedia Signal Processing, 2007.
Koval, O. , Voloshynovskiy, S. and Pun, T. , Error exponent analysis of person identification based on fusion of dependent/independent modalities , in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
Kron, E. , Rayner, M. , Santaholma, M. and Bouillon, P. , A development environment for building grammar-based speech-enabled applications , in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 49-52, ACL 2007, Prague, Czech Republic, 2007.
Kronegg, J. , Chanel, G. , Voloshynovskiy, S. and Pun, T. , Eeg-based synchronized brain-computer interfaces: a model for optimizing the number of mental tasks , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 15, number 1, pages 50-58, 2007.
Kryszczuk, K. and Drygajlo, A. , Improving classification with class-independent quality measures: q-stack in face verification , in: Proc. 2nd Int. Conference in Biometrics (ICB 2007), 2007.
Kryszczuk, K. and Drygajlo, A. , Q-stack: uni- and multimodal classifier stacking with quality measures , in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
Kryszczuk, K. , Richiardi, J. and Drygajlo, A. , Reliability estimation for multimodal error prediction and fusion , in: Proc. 7th Int. Workshop on Pattern Recognition in Information Systems (PRIS 2007), 2007.
Kryszczuk, K. , Richiardi, J. , Prodanov, P. and Drygajlo, A. , Reliability-based decision fusion in multimodal biometric verification systems , in: EURASIP Journal of Advances in Signal Processing, 2007.
Kumatani, K. , Mayer, H. , Gehrig, T. , Stoimenov, E. , McDonough, J. and Wölfel, M. , Adaptive beamforming with a minimum mutual information criterion , pages 2527--2541, 2007. [DOI]
Kumatani, K. , Mayer, H. , Gehrig, T. , Stoimenov, E. , McDonough, J. and Wölfel, M. , Minimum mutual information beamforming for simultaneous active speakers , in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 71-76, Kyoto, 2007. [DOI]
Lalanne, D. , Evéquoz, F. , Rigamonti, M. , Dumas, B. and Ingold, R. , An ego-centric and tangible approach to meeting indexing and browsing , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI'07), pages to appear, 2007.
Lalanne, D. , Evéquoz, F. , Chiquet, H. , Müller, M. , Radgohar, M. and Ingold, R. , Going through digital versus physical augmented gaming , in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 41-44, 2007.
Lalanne, D. and van den Hoven, E. , Supporting human memory with interactive systems , pages 215-216, 2007.
Lalanne, D. , Bertini, E. , Hertzog, P. and Bados, P. , Visual analysis of corporate network intelligence: abstracting and reasoning on yesterdays for acting today , 2007.
Laptev, I. , Caputo, B. and Lindberg, T. , Local velocity-adapted motion events for spatio-temporal recognition , in: Computer Vision and Image Undertanding, volume 108, number 3, pages 207-229, ISSN 1077-3142, 2007.
Lathoud, G. and Odobez, J. -M. , Short-term spatio-temporal clustering applied to multiple moving speakers , in: IEEE Transactions on Audio, Speech and Language Processing, 2007.
Lei, H. and Mirghafori, N. , Word-Conditioned HMM Supervectors for Speaker Recognition , in: to appear in Proceedings of Interspeech, Antwerp., 2007.
Lei, H. and Mirghafori, N. , Word-conditioned phone N-grams for speaker recognition , in: Proc. ICASSP, Honolulu, 2007.
Leibe, B. , Schindler, K. and van Gool, L. , Coupled detection and trajectory estimation for multi-object tracking , in: International Conference on Computer Vision (ICCV'07), 2007.
Leibe, B. , Cornelis, N. , Cornelis, K. and van Gool, L. , Dynamic 3d scene analysis from a moving vehicle , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
Levit, M. , Hakkani-Tur, D. , Tur, G. and Gillick, D. , Integrating several annotation layers for statistical information distillation , in: Workshop on Automatic Speech Recognition and Understanding, 2007.
Levit, M. , Hakkani-Tur, D. , Tur, G. and Gillick, D. , Integrating Several Annotation Layers for Statistical Information Distillation , in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
Li, W. and Bourlard, H. , Non-linear spectral stretching for in-car speech recognition , in: Interspeech, 2007.
Li, W. , Dines, J. and Magimai-Doss, M. , Robust overlapping speech recognition based on neural networks , number Idiap-RR-55-2007, 2007.
Lisowska, A. , Betrancourt, M. , Armstrong, S. and Rajman, M. , Minimizing modality bias when exploring input preference for multimodal systems in new domains: the archivus case study , in: CHI' 07, San José, California, 2007.
Lisowska, A. , Armstrong, S. , Melichar, M. , Ailomaa, M. and Rajman, M. , The wizard of oz meets multimodal language-enabled gui interfaces: new challenges , in: Proceedings of CHI' 07, San José, California, 2007.
Liu, Y. and Shriberg, E. , Comparing Evaluation Metrics for Sentence Boundary Detection , in: Proc. ICASSP, Honolulu, 2007.
Livescu, K. , Cetin, O. , Hasegawa-Johnson, M. , King, S. , Bartels, C. , Borges, N. , Kantor, A. , Lal, P. , Yung, L. , Bezman, A. , Dawson-Haggerty, S. , Woods, B. , Frankel, J. , Magimai-Doss, M. and Saenko, K. , Articulatory Feature-based Methods for Acoustic and Audio-visual speech Recognition: Summary from the 2006 JHU Summer Workshop , in: Proc. ICASSP, Honolulu, 2007.
Livescu, K. , Bezman, A. , Borges, N. , Yung, L. , Cetin, O. , Frankel, J. , King, S. , Magimai-Doss, M. , Chi, X. and Lavoie, L. , Manual Transcription of Conversational Speech at the Articulatory Feature Level , in: Proc. ICASSP, Honolulu, 2007.
Liwicki, M. , Graves, A. , Bunke, H. and Schmidhuber, J. , A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 367-371, 2007.
Liwicki, M. , Schlapbach, A. , Loretan, P. and Bunke, H. , Automatic detection of gender and handedness from on-line handwriting , in: Proc. 13th Conf. of the Graphonomics Society, pages 179-183, 2007.
Liwicki, M. and Bunke, H. , Combining on-line and off-line systems for handwriting recognition , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 372-376, 2007.
Liwicki, M. and Bunke, H. , Feature selection for on-line handwriting recognition of whiteboard notes , in: Proc. 13th Conf. of the Graphonomics Society, pages 101-105, 2007.
Liwicki, M. and Bunke, H. , Handwriting recognition of whiteboard notes -- studying the influence of training set size and type , in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 21, number 1, pages 83-98, 2007.
Liwicki, M. , Indermühle, E. and Bunke, H. , On-line handwritten text line detection using dynamic programming , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 447-451, 2007.
Lovitt, A. , Correcting confusion matrices for phone recognizers , number 03, 2007.
Lovitt, A. , Pinto, J. P. and Hermansky, H. , On confusions in a phoneme recognizer , 2007.
Lovitt, A. , Truncation confusion patterns in onset consonants , in: Interspeech 2007, 2007.
Lüthy, F. , Varga, T. and Bunke, H. , Using hidden Markov models as a tool for handwritten text line segmentation , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 8-12, 2007.
Magimai-Doss, M. , Hakkani-Tur, D. , Cetin, O. , Shriberg, E. , Fung, J. and Mirghafori, N. , Entropy Based Classifier Combination for Sentence Segmentation, , in: Proc. ICASSP, Honolulu, 2007.
Marcel, S. , Abbet, P. and Guillemot, M. , Google portrait , number Idiap-Com-07-2007, 2007.
Marcel, S. , Joint bi-modal face and speaker authentication using explicit polynomial expansion , number 14, 2007.
Marcel, S. , Rodriguez, Y. and Heusch, G. , On the recent use of local binary patterns for face authentication , in: International Journal on Image and Video Processing Special Issue on Facial Image Processing, 2007.
Marcel, S. and del R. Millán, J. , Person authentication using brainwaves (eeg) and maximum a posteriori model adaptation , in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics, 2007.
Marchand-Maillet, S. , Bruno, E. , Nürnberger, A. and Detyniecki, M. , Adaptive multimedia retrieval: user, context and feedback , Springer, 2007.
Mariéthoz, J. and Bengio, S. , A kernel trick for sequences applied to text-independent speaker verification systems , in: Pattern Recognition, volume 40, number 8, ISSN 0031-3203, 2007.
McCowan, I. , Maganti, H. K. and Gatica-Perez, D. , Speech enhancement and recognition in meetings with an audio-visual sensor array , in: IEEE Trans. on Audio, Speech, and Language Processing, volume 15, number 8, pages 2257-2269, 2007.
Mesot, B. and Barber, D. , A bayesian switching linear dynamical system for scale-invariant robust speech extraction , 2007.
Mesot, B. and Barber, D. , A gaussian sum smoother for inference in switching linear dynamical systems , 2007.
Meynet, J. , Popovici, V. and Thiran, J. -Ph. , Face Detection with Boosted Gaussian Features , in: Pattern Recognition, volume 40, number 8, pages 2283-2291, 2007. [DOI]
Meynet, J. and Thiran, J. -Ph. , Information Theoretic Combination of Classifiers with Application to AdaBoost , in: 7th international Workshop on Multiple Classifier Systems (MCS), Prague, Prague, 2007.
Meynet, J. , Popovici, V. and Thiran, J. -Ph. , Mixtures of Boosted Classifiers for Frontal Face Detection , in: Signal, Image and Video Processing, volume 1, number 1, pages 29-38, 2007. [DOI]
Millán, J. del R. , Buttfield, A. , Vidaurre, C. , Krauledat, M. , Schlögl, A. , Shenoy, P. , Blankertz, B. , Rao, R. P. N. , Cabeza, R. , Pfurtscheller, G. and Müller, K. -R. , Adaptation in brain-computer interfaces , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Millán, J. del R. , Ferrez, P. W. , Galán, F. , Lew, E. and Chavarriaga, R. , Non-invasive brain-actuated interaction , in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007. [DOI]
Millán, J. del R. , Ferrez, P. W. and Buttfield, A. , The idiap brain-computer interface: an asynchronous multi-class approach , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Monay, F. , Learning the structure of image collections with latent aspect models , in: ., 2007.
Monay, F. and Gatica-Perez, D. , Modeling semantic aspects for cross-media image indexing , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, pages 1802-1817, ISSN 0162-8828, 2007. [DOI]
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Automatic image annotation with relevance feedback and latent semantic analysis , in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Hierarchical long-term learning for automatic image , in: International Conference on Semantics And digital Media Technologies (SAMT 2007), 2007.
Morrison, D. , Marchand-Maillet, S. and Bruno, E. , Hierarchical long-term learning for automatic image annotation , in: Proceedings 2nd International Conference on Semantic and Digital Media Technologies, 2007.
Motlicek, P. , Hermansky, H. , Ganapathy, S. and Garudadri, H. , Frequency domain linear prediction for qmf sub-bands and applications to audio coding , in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
Motlicek, P. , Hermansky, H. , Ganapathy, S. , Garudadri, H. and Srinivasamurthy, N. , Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes , in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
Motlicek, P. , Ganapathy, S. , Hermansky, H. and Garudadri, H. , Scalable wide-band audio codec based on frequency domain linear prediction , number 16, 2007.
Müller, C. and Burkhardt, F. , Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age , in: to appear in Proceedings of Interspeech, Antwerp., 2007.
Müller, P. , Zeng, G. , Wonka, P. and van Gool, L. , Image-based procedural modeling of facades , in: Proceedings of ACM SIGGRAPH 2007 / ACM Transactions on Graphics, ACM Press, 2007.
Neuhaus, M. and Bunke, H. , A quadratic programming approach to the graph edit distance problem , in: Graph-Based Representations in Pattern Recognition, pages 92-102, Springer, 2007.
Neuhaus, M. and Bunke, H. , Bridging the gap between graph edit distance and kernel machines , Machine Perception and Artificial Intelligence, volume 68, World Scientific, ISBN 978-981-270-817-5, 2007.
Noris, B. , Benmachiche, K. , Meynet, J. , Thiran, J. -Ph. and Billard, A. , Analysis of Head Mounted Wireless Camera Videos for Early Diagnosis of Autism , in: International Conference on Recognition Systems, 2007.
Odobez, J. -M. and Ba, S. , A cognitive and unsupervised map adaptation approach to the recognition of the focus of attention from head pose , in: International Conference on Multi-Media & Expo (ICME07), 2007.
Orabona, F. , Castellini, C. , Caputo, B. , Luo, J. and Sandini, G. , Indoor place recognition using online independent support vector machines , in: 18th British Machine Vision Conference (BMVC07), pages 1090-1099, Warwick, UK, 2007.
Orabona, F. , Castellini, C. , Caputo, B. , Luo, J. and Sandini, G. , On-line independent support vector machines for cognitive systems , number Idiap-RR-63-2007, 2007.
Ozden, K. E. , Schindler, K. and van Gool, L. , Simultaneous segmentation and 3d reconstruction of monocular image sequences , in: International Conference on Computer Vision (ICCV'07), 2007.
Pallotta, V. , Seretan, V. and Ailomaa, M. , User requirement analysis for meeting information retrieval based on query elicitation , in: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), pages 1008-1015, Association for Computational Linguistics, 2007.
Pardo, J. M. , Anguera, X. and Wooters, C. , Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information , in: to appear in IEEE Transactions on Computers, 2007.
Paugam-Moisy, H. , Martinez, R. and Bengio, S. , A supervised learning approach based on stdp and polychronization in spiking neuron networks , in: European Symposium on Artificial Neural Networks, ESANN, 2007.
Perrin, X. , Chavarriaga, R. , Siegwart, R. and del R. Millán, J. , Bayesian controller for a novel semi-autonomous navigation concept , in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007.
Philips, J. , Millán, J. del R. , Vanacker, G. , Lew, E. , Galán, F. , Ferrez, P. W. , van Brussel, H. and Nuttin, M. , Adaptive shared control of a brain-actuated simulated wheelchair , in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, pages 408-414, 2007. [DOI]
Piccardi, L. , Noris, B. , Barbey, O. , Schiavone, G. , Keller, F. , Von Hofsten, C. and Billard, A. , Wearcam: a head mounted wireless camera for monitoring gaze attention and for the diagnosis of developmental disorders in young children , in: 16th IEEE International Symposium on Robot & Human Interactive Communication, RO-MAN, 2007.
Pinto, J. P. , Bourlard, H. , Graves, A. and Hermansky, H. , Comparing different word lattice rescoring approaches towards keyword spotting , number 32, 2007.
Pinto, J. P. , Lovitt, A. and Hermansky, H. , Exploiting phoneme similarities in hybrid hmm-ann keyword spotting , in: Proceedings of Interspeech, 2007.
Pinto, J. P. , R. M., P. , Yegnanarayana, B. and Hermansky, H. , Significance of contextual information in phoneme recognition , 2007.
Plauché, M. , Cetin, O. and Uhdaykumar, N. , How to build a spoken dialog system with limited (or no) resources , in: AI in ICT for Development Workshop of the Twentieth Intl. Joint Conf. on AI, Hyderabad, India, 2007.
Popescu-Belis, A. and Zufferey, S. , Contrasting the automatic identification of two discourse markers in multiparty dialogues , in: Proceedings of SIGDIAL 2007, pages 10, Antwerp, Belgium, 2007.
Popescu-Belis, A. , Evaluation of nlg: some analogies and differences with mt and reference resolution , in: MT Summit XI Workshop on Using Corpora for NLG and MT (UCNLG MT), pages 66-68, 2007.
Popescu-Belis, A. and Estrella, P. , Generating usable formats for metadata and annotations in a large meeting corpus , in: ACL 2007, pages 93-96, ACL 2007, Prague, Czech Republic, 2007.
Popescu-Belis, A. , Le rôle des métriques d'évaluation dans le processus de recherche en tal , in: TAL (Traitement Automatique des Langues), volume 47, number 2, 2007.
Prasanna, S. R. Mahadeva , Yegnanarayana, B. , Pinto, J. P. and Hermansky, H. , Analysis of confusion matrix to combine evidence for phoneme recognition , number 27, 2007.
Pronobis, A. and Caputo, B. , Confidence-based cue integration for visual place recognition , number 17, 2007.
Quack, T. , Ferrari, V. , Leibe, B. and van Gool, L. , Efficient mining of frequent and distinctive feature configurations , in: accepted for ICCV'07, 2007.
Quack, T. , Ferrari, V. , Leibe, B. and van Gool, L. , Efficient mining of frequent and distinctive feature configurations , in: International Conference on Computer Vision (ICCV'07), 2007.
Quelhas, P. , Odobez, J. -M. , Gatica-Perez, D. and Tuytelaars, T. , A thousand words in a scene , in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, number 9, pages 151575-1589, 2007. [DOI]
del R. Millán, J. , Tapping the mind or resonating minds? , in: European Visions for the Knowledge Age, Cheshire Henbury, 2007.
Rakotomamonjy, A. , Bach, F. , Canu, S. and Grandvalet, Y. , More efficiency in multiple kernel learning , in: International Conference on Machine Learning (ICML), 2007.
Renals, S. , Hain, T. and Bourlard, H. , Recognition and understanding of meetings the ami and amida projects , in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, pages 238-247, Kyoto, 2007. [DOI]
Richiardi, J. , Kryszczuk, K. and Drygajlo, A. , Quality measures in unimodal and multimodal biometric verification , in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
Richiardi, J. and Drygajlo, A. , Reliability-based voting schemes using modality-independent features in multi-classifier biometric authentication , in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
Riesen, K. , Neuhaus, M. and Bunke, H. , Bipartite graph matching for computing the edit distance of graphs , in: Graph-Based Representations in Pattern Recognition, pages 1-12, Springer, 2007.
Riesen, K. , Neuhaus, M. and Bunke, H. , Graph embedding in vector spaces by means of prototype selection , in: Graph-Based Representations in Pattern Recognition, pages 383-393, Springer, 2007.
Rigamonti, M. , Lalanne, D. and Ingold, R. , Faericworld: browsing multimedia events through static documents and links , in: In proc. of INTERACT 2007, pages to appear, Springer-Verlag, 2007.
Romsdorfer, H. and Pfister, B. , Text analysis and language identification for polyglot text-to-speech synthesis , in: Speech Communication (Elsevier), 2007.
Rytsar, R. and Pun, T. , Computational aspects of the eeg forward problem solution for real head model using finite element , in: 29th Annual Int. Conf. IEEE Engineering in Medicine and Biology Society, 2007.
Schindler, K. , Suter, D. and H. Wang, , A model-selection framework for multibody structure-and-motion of image sequences , in: International Journal of Computer Vision, volume 79, number 2, pages 159-177, 2007.
Schlapbach, A. and Bunke, H. , A writer identification and verification system using HMM based recognizers , in: Pattern Analysis and Applications, volume 10, number 1, pages 33-43, 2007.
Schlapbach, A. and Bunke, H. , Fusing asynchronous feature streams for on-line writer identification , in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 103-107, 2007.
Shriberg, E. , Higher level features in speaker recognition , in: Speaker Classification I, Lecture Notes in Computer Science, Springer, 2007.
Smith, K. , Bayesian methods for visual multi-object tracking with applications to human activity recognition , École Polytechnique Fédérale de Lausanne, 2007.
Sorci, M. , Antonini, G. and Thiran, J. -Ph. , Fisher's Discriminant and Relevant Component Analysis for static facial expression classification , in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, Poznan, Poland, 2007.
Starlander, M. , Using a wizard of oz as a baseline to determine which system architecture is the best for a spoken language translation system , in: Proceedings of Nodalida 2007, pages 161-164, Tartu, Estonia, 2007.
Stolcke, A. , Kajarekar, S. , Ferrer, L. and Shriberg, E. , Speaker recognition with session variability normalization based on mllr adaptation transforms , in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, pages 1987-1998, 2007.
Stolcke, A. , Kajarekar, S. , Ferrer, L. and Shriberg, E. , Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms , in: IEEE Transactions on Audio, Speech, and Language Processing, special issue on speaker and language recognition, 2007.
Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Janin, A. , Magimai-Doss, M. , Wooters, C. and Zheng, J. , The sri-icsi spring 2007 meeting and lecture recognition system , in: Lecture Notes in Computer Science, 2007.
Stoll, L. , Frankel, J. and Mirghafori, N. , Speaker Recognition Via Nonlinear Discriminant Features , in: Proceedings of NOLISP, Paris, France,, 2007.
Szekely, E. , Bruno, E. and Marchand-Maillet, S. , Clustered multidimensional scaling for exploration in information retrieval , in: International Conference on the Theory of Information Retrieval, 2007.
Thomas, A. , Ferrari, V. , Leibe, B. , Tuytelaars, T. and van Gool, L. , Depth-from-recognition: inferring metadata by cognitive feedback , in: ICCV'07 Workshop on 3D Representations for Recognition, 2007.
Uldry, L. , Ferrez, P. W. and del R. Millán, J. , Feature selection methods on distributed linear inverse solutions for a non-invasive brain-machine interface , number 04, 2007.
Valente, F. , Bourlard, H. and Deepu, V. , Agglomerative information bottleneck for speaker diarization of meetings data , number 31, 2007.
Valente, F. and Hermansky, H. , Combination of acoustic classifiers based on dempster-shafer theory of evidence , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
Valente, F. , Vepa, J. , Plahl, C. , Gollan, C. , Hermansky, H. and Schlüter, R. , Hierarchical neural networks feature extraction for lvcsr system , in: Interspeech 2007, 2007.
Valente, F. , Vepa, J. and Hermansky, H. , Multi-stream features combination based on dempster-shafer rule for lvcsr system , in: Interspeech 2007, 2007.
Vanacker, G. , Millán, J. del R. , Lew, E. , Ferrez, P. W. , Galán, F. , Philips, J. , van Brussel, H. and Nuttin, M. , Context-based filtering for assisted brain-actuated wheelchair driving , in: Computational Intelligence and Neuroscience, volume 2007, pages 3, ISSN 1687-5265, 2007.
Villán, R. , Voloshynovskiy, S. , Koval, O. , Deguillaume, F. and Pun, T. , Tamper-proofing of Electronic and Printed Text Documents via Robust Hashing and Data-Hiding , in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
Vinciarelli, A. and Favre, S. , Broadcast news story segmentation using social network analysis and hidden markov models , in: ACM International Conference on Multimedia, pages 261-264, 2007.
Vinciarelli, A. , Mapping nonverbal communication into social status: automatic recognition of journalists and non-journalists in radio news , number 33, 2007.
Vinciarelli, A. , Role recognition in broadcast news using social network analysis and duration distribution modeling , in: IEEE Transactions on Multimedia, 2007.
Vinciarelli, A. and Favre, S. , Role recognition in radio programs using social affiliation networks and mixtures of discrete distributions: an approach inspired by social cognition , number Idiap-RR-40-2007, 2007.
Vinciarelli, A. , Fernàndez, F. and Favre, S. , Semantic segmentation of radio programs using social network analysis and duration distribution modeling , in: IEEE International Conference on Multimedia and Expo (ICME), 2007.
Vinyals, O. , Friedland, G. and Mirghafori, N. , Revisiting a basic function on current CPUs: A fast logarithm implementation with adjustable accuracy , in: ICSI Technical Report number TR-07-002, 2007.
Weise, T. , Leibe, B. and van Gool, L. , Fast 3d scanning with automatic motion compensation , in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
Wooters, C. and Huijbregts, M. , The ICSI RT07s Speaker Diarization System , in: to appear in Lecture Notes in Computer Science, 2007.
Yao, J. and Odobez, J. -M. , Multi-layer background subtraction based on color and texture , in: CVPR 2007 Workshop on Visual Surveillance (VS2007), pages 1-8, 2007. [DOI]
Zacharie, D. G. and Pinto, J. P. , Keyword spotting on word lattices , number 22, 2007.
Zheng, J. , Cetin, O. , Hwang, M. -Y. , Lei, X. , Stolcke, A. and Morgan, N. , Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition , in: Proc. ICASSP, Honolulu., 2007.
Peralta Menendez, R. Grave de , González Andino, S. L. , Ferrez, P. W. and Millán, J. del R. , Non-invasive estimates of local field potentials for brain-computer interfaces , in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
Fasel, B. and van Gool, L. , Interactive museum guide: accurate retrieval of object descriptions , in: Adaptive Multimedia Retrieval: User, Context, and Feedback, pages 179-191, Springer, 2007.
van Gool, L. , Zeng, G. , van den Borre, F. and Müller, P. , Towards mass-produced building models , in: Photogrammetric Image Analysis, pages 209-220, Institute of Photogrammetry and Cartography, Technische Universitaet Muenchen, 2007.
Alecu, T. I. , Voloshynovskiy, S. and Pun, T. , The gaussian transform of distributions: definition, computation and application , in: IEEE Trans. on Signal Processing, volume 54, number 8, pages 2976-2995, 2006.
Andreani, G. , Di Fabbrizio, G. , Gilbert, M. , Gillick, D. , Hakkani-Tur, D. and Lemon, O. , Lets DiSCoH: Collecting an Annotated Open Corpus with Dialog Acts and Reward Signals for Natural Language Helpdesks , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Ba, S. and Odobez, J. -M. , A study on visual focus of attention recognition from head pose in a meeting room , in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006.
Ba, S. and Odobez, J. -M. , Recognizing people's focus of attention from head poses: a study , number 42, 2006.
Barber, D. and Chiappa, S. , Unified inference for variational bayesian linear gaussian state-space models , in: NIPS, 2006.
BenZeghiba, M. F. and Bourlard, H. , User-customized password speaker verification using multiple reference and background models , in: Speech Communication, volume 8, pages 1200-1213, 2006.
Bertolami, R. , Halter, B. and Bunke, H. , Combination of multiple handwritten text line recognition systems with a recursive approach , in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 61-65, 2006.
Buttfield, A. and del R. Millán, J. , Online classifier adaptation in brain-computer interfaces , number 16, 2006.
Buttfield, A. , Ferrez, P. W. and del R. Millán, J. , Towards a robust bci: error potentials and online learning , in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 14, number 2, pages 164-168, 2006.
Cattin, P. C. , Bay, H. , van Gool, L. and Székely, G. , Retina mosaicing using local features , in: Medical Image Computing and Computer-Assisted Intervention (MICCAI), pages 185-192, 2006.
Chanel, G. , Kronegg, J. , Grandjean, D. and Pun, T. , Emotion assessment: arousal evaluation using eeg's and peripheral physiological signals , in: Proc. Int. Workshop Multimedia Content Representation, Classification and Security (MRCS), pages 530-537, Lecture Notes in Computer Science, Springer, 2006.
Cheng, O. , Dines, J. and Magimai-Doss, M. , A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition , number 62, 2006.
Chiappa, S. , Analysis and classification of eeg signals using probabilistic models for brain computer interfaces , École Polytechnique Fédérale de Lausanne, 2006.
Chiquet, H. , Evéquoz, F. and Lalanne, D. , Elcano, a tangible multimedia browser (demo). , in: Symposium on User Interface Software and Technology (UIST 2006), pages 51-52, 2006.
Cuendet, S. , Hakkani-Tur, D. and Tur, G. , Model Adaptation for Sentence Segmentation from Speech , in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
Cuendet, S. , Model adaptation for sentence unit segmentation from speech , number 64, 2006.
Dimitrakakis, C. , Ensembles for sequence learning , École Polytechnique Fédérale de Lausanne, 2006.
Everingham, M. , Zisserman, A. , Williams, C. , van Gool, L. , Allan, M. , Bishop, C. , Chapelle, O. , Dalal, N. , Deselaers, T. , Dorko, G. , Duffner, S. , Eichhorn, J. , Farquhar, J. , Fritz, M. , Garcia, C. , Griffiths, T. , Jurie, F. , Keysers, D. , Koskela, M. , Laaksonen, J. , Larlus, D. , Leibe, B. , Meng, H. , Ney, H. , Schiele, B. , Schmid, C. , Seemann, E. , Shawe-Taylor, J. , Storkey, A. , Szedmak, S. , Triggs, B. , Ulusoy, I. , Viitaniemi, V. and Zhang, J. , The 2005 pascal visual object class challenge , in: Selected Proceedings of the 1st PASCAL Challenges Workshop, Lecture Notes in AI, Springer, 2006.
Hannani, A. , Toledano, D. , Petrovska, D. , Montero-Asenjo, A. and Hennebert, J. , Using data-driven and phonetic units for speaker verification , in: IEEE Speaker and Language Recognition Workshop (Odyssey 2006), Puerto Rico, 2006.
Hemptinne, C. , Master thesis: integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts) , number 69, 2006.
Hillard, D. , Huang, Z. , Ji, H. , Grishman, R. , Hakkani-Tur, D. , Harper, M. , Ostendorf, M. and Wang, W. , Impact of Automatic Comma Prediction on POS/Name Tagging of Speech , in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
Janin, A. , Stolcke, A. , Anguera, X. , Boakye, K. , Cetin, O. , Frankel, J. and Zheng, J. , The ICSI-SRI Spring 2006 Meeting Evaluation System , in: In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006); Lecture Notes in Computer Science. Springer, 2006.
Janvier, B. , Bruno, E. , Marchand-Maillet, S. and Pun, T. , Handling temporal heterogeneous data for content-based management of large video collections , in: Multimedia Tools and Applications, volume 30, pages 273-288, 2006.
Just, A. , Two-handed gestures for human-computer interaction , École Polytechnique Fédérale de Lausanne, 2006.
Keller, M. and Bengio, S. , A multitask learning approach to document representation using unlabeled data , number 44, 2006.
Keller, M. , Machine learning approaches to text representation using unlabeled data , Ecole Polytechnique Fédérale de Lausanne, 2006.
Ketabdar, H. and Hermansky, H. , Identifying unexpected words using in-context and out-of-context phoneme posteriors , number 68, 2006.
Kosinov, S. , Marchand-Maillet, S. , Kozintsev, I. , Dulong, C. and Pun, T. , Dual diffusion model of spreading activation for content-based image retrieval , in: 8th ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2006.
Koval, O. , Voloshynovskiy, S. , Holotyak, T. and Pun, T. , Information-theoretic analysis of steganalysis in real images , in: ACM Multimedia and Security Workshop 2006, 2006.
Lathoud, G. , Observations on multi-band asynchrony in distant speech recordings , number 74, 2006.
Lathoud, G. , Spatio-temporal analysis of spontaneous speech with microphone arrays , École Polytechnique Fédérale de Lausanne, 2006.
Lathoud, G. , Magimai-Doss, M. and Bourlard, H. , Unsupervised spectral subtraction for noise-robust asr on unknown transmission channels , number 09, 2006.
Leibe, B. , Mikolajczyk, K. and Schiele, B. , Efficient clustering and matching for object class recognition , in: British Machine Vision Conference (BMVC, 2006.
Leibe, B. , Cornelis, N. , Cornelis, K. and van Gool, L. , Integrating recognition and reconstruction for cognitive traffic scene analysis from a moving vehicle , in: DAGM Annual Pattern Recognition Symposium, pages 192-201, Springer, 2006.
Leibe, B. , Mikolajczyk, K. and Schiele, B. , Segmentation based multi-cue integration for object detection , in: British Machine Vision Conference (BMVC, 2006.
Liwicki, M. and Bunke, H. , HMM-based on-line recognition of handwritten whiteboard notes , in: Proceedings 10th International Workshop Frontiers in Handwriting Recognition, pages 595-599, 2006.
Luo, J. , Pronobis, A. , Caputo, B. and Jensfelt, P. , Incremental learning for place recognition in dynamic environments , number 52, 2006.
Luo, J. , Pronobis, A. and Caputo, B. , Svm-based transfer of visual knowledge across robotic platforms , number 65, 2006.
Maganti, H. K. , Motlicek, P. and Gatica-Perez, D. , Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms , number 57, 2006.
Marcel, S. , Rodriguez, Y. , Guillemot, M. and Popescu-Belis, A. , Annotation of face detection: description of xml format and files , number 06, 2006.
Marcel, S. , Keomany, J. and Rodriguez, Y. , Robust-to-illumination face localisation using active shape models and local binary patterns , number 47, 2006.
Mariéthoz, J. , Discrmininant models for text-independent speaker verification , number 70, 2006.
Melichar, M. , Cenek, P. , Ailomaa, M. , Lisowska, A. and Rajman, M. , From Vocal to Multimodal Dialogue Management , in: Eighth International Conference on Multimodal Interfaces (ICMI'06), Banff, Canada, 2006.
Mendels, F. , Thiran, J. -Ph. and Vandergheynst, P. , Matching pursuit-based shape representation and recognition using scale-space , in: International Journal of Imaging Systems and Technology, volume 6, number 15, pages 162-180, 2006. [DOI]
Mesot, B. and Barber, D. , A bayesian alternative to gain adaptation in autoregressive hidden markov models , number 55, 2006.
Mesot, B. and Barber, D. , Switching linear dynamical systems for noise robust speech recognition , number 08, 2006.
Moore, D. , The juicer lvcsr decoder - user manual for juicer version 0.5.0 , number 03, 2006.
Motlicek, P. , Hermansky, H. , Garudadri, H. and Srinivasamurthy, N. , Audio coding based on long temporal contexts , number 30, 2006.
Motlicek, P. , Ullal, V. and Hermansky, H. , Wide-band perceptual audio coding based on frequency-domain linear prediction , number 58, 2006.
Moënne-Loccoz, N. , Janvier, B. , Marchand-Maillet, S. and Bruno, E. , Handling temporal heterogeneous data for content-based management of large video collections , in: Multimedia Tools and Applications, volume 31, pages 309-325, 2006.
Müller, P. , Wonka, P. , Haegler, S. , Ulmer, A. and van Gool, L. , Procedural modeling of buildings , in: Proceedings of ACM SIGGRAPH 2006 / ACM Transactions on Graphics, pages 614-623, ACM Press, 2006.
Müller, M. , Evéquoz, F. and Lalanne, D. , Tjass, a smart board for augmenting card game playing and learning (demo) , in: Symposium on User Interface Software and Technology (UIST 2006), pages 67-68, 2006.
Poh, N. and Bengio, S. , Estimating the confidence interval of expected performance curve in biometric authentication using joint bootstrap , number 25, 2006.
Poh, N. , Multi-system biometric authentication: optimal fusion and user-specific information , École Polytechnique Fédérale de Lausanne, 2006.
Poh, N. and Bengio, S. , Using chimeric users to construct fusion classifiers in biometric authentication tasks: an investigation , in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
Pozdnoukhov, A. , Prior knowledge in kernel methods , École Polytechnique Fédérale de Lausanne, 2006.
Pun, T. , Alecu, T. I. , Chanel, G. , Kronegg, J. and Voloshynovskiy, S. , Brain-computer interaction research at the computer vision and multimedia laboratory, university of geneva , in: IEEE Trans. Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interaction, volume 14, number 2, pages 210-213, 2006.
Pérez-Freire, L. , Pérez-González, F. and Voloshynovskiy, S. , An Accurate Analysis of Scalar Quantization-Based Data Hiding , in: IEEE Trans. on Information Forensics and Security, volume 1, number 1, pages 80-86, 2006.
Quelhas, P. and Odobez, J. -M. , Natural scene image modeling using color and texture visterms. , in: Conference on Image and Video Retrieval CIVR, 2006.
del R. Millán, J. , Renkens, F. , Mouriño, J. and Gerstner, W. , Non-invasive brain-actuated control of a mobile robot by human eeg , in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006.
Radgohar, M. , Evéquoz, F. and Lalanne, D. , Phong, augmenting virtual and real gaming experience (demo) , in: Symposium on User Interface Software and Technology (UIST 2006), pages 71-72, 2006.
Richiardi, J. and Drygajlo, A. , Applying biometrics to identity documents: estimating and coping with errors , 2006.
Richiardi, J. and Drygajlo, A. , Applying biometrics to identity documents: implementation issues , 2006.
Rienks, R. , Zhang, D. , Gatica-Perez, D. and Post, W. , Detection and application of influence rankings in small group meetings , in: ICMI '06: Proceedings of the 8th international conference on Multimodal interfaces, pages 257-264, ACM Press, Banff, Alberta, Canada, 2006. [DOI]
Rodriguez, Y. , Face detection and verification using local binary patterns , École Polytechnique Fédérale de Lausanne, 2006.
Schlapbach, A. and Bunke, H. , Off-line writer verification: a comparison of a hidden Markov model (HMM) and a Gaussian mixture model (GMM) based system , in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 275-280, 2006.
Smith, K. , Schreiber, S. , Beran, V. , Potúcek, I. , Rigoll, G. and Gatica-Perez, D. , Multi-person tracking in meetings: a comparative study , in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006.
Smith, K. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Tracking attention for multiple people: wandering visual focus of attention estimation , number 40, 2006.
Spindler, T. , Wartmann, C. , Roth, D. , Steffen, A. , Hovestadt, L. and van Gool, L. , Privacy in video surveilled areas , in: International Conference on Privacy, Security and Trust (PST 2006), 2006.
Torre, E. L. , Caputo, B. and Tommasi, T. , Melanoma recognition using kernel classifiers , number 53, 2006.
Tur, G. , Guz, U. and Hakkani-Tur, D. , Model Adaptation for Dialog Act Tagging , in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
Ullal, V. and Motlicek, P. , Audio coding based on long temporal segments: experiments with quantization of excitation signal , number 46, 2006.
Vepa, J. and King, S. , Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis , in: IEEE Trans. on Audio, Speech and Language Processing, volume 14, number 5, pages 1763-1771, 2006.
Vila-Forcén, J. E. , Voloshynovskiy, S. , Koval, O. and Pun, T. , Costa problem under channel ambiguity , in: Proceedings of 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
Vila-Forcén, J. E. , Voloshynovskiy, S. , Koval, O. and Pun, T. , Facial Image Compression Based on Structured Codebooks in Overcomplete Domain , in: EURASIP Journal on Applied Signal Processing, Frames and overcomplete representations in signal processing, communications, and information theory special issue, volume 2006, number Article ID 69042, pages 1-11, 2006.
Voloshynovskiy, S. , Koval, O. , Topak, E. , Forcen, J. E. V. and Pun, T. , On reversibility of random binning based data-hiding techniques: security perspectives , in: ACM Multimedia and Security Workshop 2006, 2006.
Voloshynovskiy, S. , Koval, O. , Mihcak, M. K. and Pun, T. , The edge process model and its application to information hiding capacity analysis , in: IEEE Trans. on Signal Processing, volume 54, number 5, pages 1813-1825, 2006.
Wey, P. , Fischer, B. , Bay, H. and Buhmann, J. M. , Dense stereo by triangular meshing and cross validation , in: DAGM-Symposium, pages 708-717, 2006.
Zhang, D. , Gatica-Perez, D. and Bengio, S. , Exploring contextual information in a layered framework for group action recognition , in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006.
Zhang, D. , Probabilistic graphical models for human interaction analysis , École Polytechnique Fédérale de Lausanne, 2006.
A. Peregoudov, , Vinciarelli, A. and Bourlard, H. , Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations , number 56, 2006.
Brodbeck, D. , Mazza, R. and Lalanne, D. , Interactive visualization - a survey , 0000.
Dumas, B. , Lalanne, D. and Oviatt, S. , Multimodal interfaces: a survey of principles, models and frameworks , 0000.
Gatica-Perez, D. , Modeling interest in face-to-face conversations from multimodal nonverbal behavior , in: In J.-P. Thiran, H. Bourlard, and F. Marques, (Eds.), Multimodal Signal Processing, Academic Press, in press, 0000.
Gatica-Perez, D. and Odobez, J. -M. , Visual attention, speaking activity, and group conversational analysis in multi-sensor environments , in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, in press, 0000.
Goldmann, L. , Samour, A. , Ebrahimi, T. and Sikora, T. , Multimodal person search combining information fusion and relevance feedback , in: IEEE International Workshop on Multimedia Signal Processing (MMSP 2009), Rio de Janeiro, Brazil, 0000.
Lee, J. -S. , De Simone, F. and Ebrahimi, T. , Influence of audio-visual attention on perceived quality of standard definition multimedia content , in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
Lee, J. -S. and Ebrahimi, T. , Two-level bimodal association for audio-visual speech recognition , in: International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVSâ09), Bordeaux, France, 0000.
Li, N. , Mubin, O. , Kaplan, F. and Dilllenbourg, P. , A Tabletop Environment for Augmenting Meetings with Background Search , 0000.
Mugellini, E. , Lalanne, D. , Dumas, B. , Evéquoz, F. , Gerardi, S. , Le Calvé, A. , Boder, A. , Ingold, R. and Khaled, O. , Memodules as tangible shortcuts to multimedia information , 0000.
Noris, B. , Benmachiche, K. and Billard, A. , Calibration-free eye gaze direction detection with gaussian processes , in: International Conference on Computer Vision Theory and Applications (VISAPP 08), 0000.
De Simone, F. , Naccari, M. , Tagliasacchi, M. , Dufaux, F. , Tubaro, S. and Ebrahimi, T. , Subjective assessment of H.264/AVC video sequences transmitted over a noisy channel , in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
Popescu-Belis, A. , Multimodal database annotation formats and standards, software architecture for multimodal interfaces , in: Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, Academic Press, 0000.
Powered by Agaion