Guide:
  • If you want to have the list of publications issued from a specific Individual Project (IP), write in the search field (IM2.IP). IP can have the following value: DMA, AP, VP, MPR, MCA, HMI, ISD, BMI

  • If you want to find joint publications between IPs, write in the search field (joint), click on search and then click on Keywords

  • If you want to display all the publications for a specific author, use the shortcut called -Authors- located in the main menu
 

All publications in the database, sorted on year



2009

Ali, K., Fleuret, F., Hasler, D. and Fua, P., Joint learning of pose estimators and features for object detection, in: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009.
 
Aradilla, G., Bourlard, H. and Magimai-Doss, M., Posterior features applied to speech recognition tasks with user-defined vocabulary, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
 
Ba, S. and Odobez, J. -M., Recognizing human visual focus of attention from head pose in meetings, in: IEEE Trans. on System, Man and Cybernetics: part B, Man, volume 39, number 1, pages 16-34, 2009.
 
Ba, S., Hung, H. and Odobez, J. -M., Visual activity context for focus of attention estimation in dynamic meetings, in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2009.
 
Baechler, M., Bloechle, J. -L., Humm, A., Ingold, R. and Hennebert, J., Labeled images verification using gaussian mixture models, in: Proceedings of 24th Annual ACM Symposium on Applied Computing (ACM SAC'09), pages 1331-1336, 2009.
 
Baker, J., Deng, L., Glass, J., Khudanpur, S., Lee, C. -H., Morgan, N. and O'Shgughnessy, D., Research developments and directions in speech recognition and understanding, in: IEEE Signal Processing Magazine, volume 26, number 4, pages 78-85, 2009.
 
Baker, J., Deng, L., Glass, J., Khudanpur, S., Lee, C. -H., Morgan, N. and O'Shgughnessy, D., Research developments and directions in speech recognition and understanding, in: IEEE Signal Processing Magazine, volume 26, number 3, pages 75-80, 2009.
 
Beekhof, F., Voloshynovskiy, S., Koval, O. and Holotyak, T., Multi-class classifiers based on binary classifiers: performance, efficiency, and minimum coding matrix distances, in: MLSP 2009, 2009.
 
Berclaz, J., Fleuret, F. and Fua, P., Multiple object tracking using flow linear programming, number 10-2009, 2009.
 
Bertini, E., Lalanne, D. and Rigamonti, M., Extended excentric labeling, in: International Journal of the Eurographics Association, volume 28, 2009.
 
Bertini, E. and Lalanne, D., Surveying the complementary roles of automatic data analysis and visualization in knowledge discovery, in: Proceedings of ACM SIGKDD Workshop on Visual Analytics and Knowledge Discovery, VAKD '09, 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (VAKD 2009), pages 12-20, 2009.
 
Bloechle, J. -L., Lalanne, D. and Ingold, R., Ocd: an optimized and canonical document format, in: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR 2009), pages 236-240, 2009.
 
Bologna, G., Deville, B. and Pun, T., Blind navigation along a sinuous path by means of the see color interface, in: IWINAC2009, 3rd International Work-conference on the Interplay between Natural and Artificial Computation, Santiago de Compostela, Spain, June 22--27, 2009.
 
Bologna, G., Deville, B. and Pun, T., On the use of the auditory pathway to represent image scenes in real-time, in: Neurocomputing, volume 72, pages 839-849, 2009.
 
Bologna, G., Malandain, S., Deville, B. and Pun, T., The multi-touch see color interface, in: ICTA 2009, The 2nd International Conference on Information and Communication Technologies and Accessibility, Hammamet, Tunisia, May 7--9, 2009.
 
Bruno, E. and Marchand-Maillet, S., Multimodal preference aggregation for multimedia information retrieval, in: To appear in Journal of Multimedia, 2009.
 
Bruno, E. and Marchand-Maillet, S., multiview clustering: a late fusion approach using latent models, in: Proceedings of the 32nd ACM Special Interest Group on Information Retrieval Conference, SIGIR 09, 2009.
 
Caputo, B., Hayman, E., Fritz, M. and Ekluhnd, J. -O, Classifying Material in the Real World, in: Image and vision Computing, volume accepted for pub, 2009.
 
Chanel, G., Kierkels, J., Soleymani, M. and Pun, T., short-term emotion assessment in a recall paradigm, in: International Journal of Human-Computer Studies, volume 67, number 8, pages 607-627, 2009.
 
Dines, J., Yamagishi, J. and King, S., Measuring the gap between HMM-based ASR and TTS, in: Proceedings of Interspeech, Brighton, U.K., 2009.
 
Dines, J., Saheer, L. and Liang, H., Speech recognition with speech synthesis models by marginalising over decision tree leaves, in: Proceedings of Interspeech, Brighton, U.K., 2009.
 
Drygajlo, A., Li, W. and Zhu, K., Q-stack aging model for face verification, in: 17th European Signal Processing Conference, 2009.
 
Duffner, S., Odobez, J. -M. and Ricci, E., Dynamic Partitioned Sampling For Tracking With Discriminative Features, in: Proceedings of the British Maschine Vision Conference, London, 2009.
 
Dumas, B., Lalanne, D. and Ingold, R., Benchmarking fusion engines of multimodal interactive systems, in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
 
Favre, S., Dielmann, A. and Vinciarelli, A., Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, in: ACM International Conference on Multimedia, To Appear, 2009.
 
Fleuret, F., Multi-layer boosting for pattern recognition, in: Pattern Recognition Letters (PRL), volume 30, pages 237-241, 2009.
 
Friedland, G., Vinyals, O., Huang, Y. and Muller, C., Fusion of short-term and long-term features for improved speaker diarization, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4077-4080, 2009.
 
Friedland, G., Hung, H. and Yeo, C., Multi-modal speaker diarization of real-world meetings using compressed-domain video features, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4069-4072, 2009.
 
Friedland, G., Vinyals, O., Huang, Y. and Muller, C., Prosodic and other long-term features for speaker diarization, in: IEEE Transactions on Audio, Speech and Language Processing, volume 17, number 5, pages 985-993, 2009.
 
Friedland, G. and van Leeuwen, D., Speaker diarization and identification, IEEE Press/Wiley, 2009.
 
Friedland, G., Yeo, C. and Hung, H., Visual Speaker Localization Aided by Acoustic Models, in: ACM Multimedia, 2009.
 
Friedland, G., Yeo, C. and Hung, H., Visual speaker localization aided by acoustic models (full paper), in: Proceedings of ACM Multimedia, Beijing, China, 2009.
 
Frinken, V. and Bunke, H., Evaluating retraining rules for semi-supervised learning in neural network based cursive word recognition, in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 31-35, 2009.
 
Frinken, V., Riesen, K. and Bunke, H., Improving graph classification by isomap, in: Graph-Based Representations in Pattern Recognition, pages 205-214, Springer, 2009.
 
Frinken, V. and Bunke, H., Self-training strategies for handwriting word recognition, in: Proc. Industrial Conf. Advances in Data Mining. Applications and Theoretical Aspects, pages 291-300, Springer, 2009.
 
Galbally, J., McCool, C., Fierrez, J., Marcel, S. and Ortega-Garcia, J., Hill-Climbing Attack to an Eigenface-Based Face Verification System, in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, pages 355-362, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
 
Garau, G., Ba, S., Bourlard, H. and Odobez, J. -M., Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009.
 
Garg, N., Favre, B., Riedhammer, K. and Hakkani-Tur, D., Clusterrank: a graph based method for meeting summarization, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Garg, N., Co-occurrence Models for Image Annotation and Retrieval, number Idiap-RR-22-2009, 2009.
 
Garg, N. and Gatica-Perez, D., Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, number Idiap-RR-21-2009, 2009.
 
Garner, P. N., A MAP Approach to Noise Compensation of Speech, number Idiap-RR-08-2009, 2009.
 
Garner, P. N., Dines, J., Hain, T., El Hannani, A., Karafiat, M., Korchagin, D., Lincoln, M., Wan, V. and Zhang, L., Real-Time ASR from Meetings, in: Proceedings of Interspeech, Brighton, UK., 2009.
 
Garner, P. N., SNR Features for Automatic Speech Recognition, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
 
Gatica-Perez, D., Automatic nonverbal analysis of social interaction in small groups: a review, in: Image and Vision Computing, Special Issue on Human Naturalistic Behavior, in press, 2009.
 
Gelbart, D., Morgan, N. and Tsymbal, A., Hill-climbing feature selection for multi-stream asr, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Gillick, D., Riedhammer, K., Favre, B. and Hakkani-Tur, D., A global optimization framework for meeting summarization, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
 
Gonzalez, G., Fleuret, F. and Fua, P., Learning rotational features for filament detection, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2009.
 
Gonzalez, G., Aguet, F., Fleuret, F., Unser, M. and Fua, P., Steerable features for statistical 3d dendrite detection, in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2009.
 
Gottlieb, L. and Friedland, G., On the use of artificial conversation data for speaker recognition in cars, in: IEEE International Conference for Semantic Computing, Berkeley, USA, 2009.
 
Graves, A., Liwicki, M., Fernandez, S., Bertolami, R., Bunke, H. and Schmidhuber, J., A novel connectionist system for unconstrained handwriting recognition, in: IEEE Trans. PAMI, volume 31, number 5, pages 855-869, ISSN 0162-8828, 2009.
 
Gurban, M. and Thiran, J. -Ph., Information theoretic feature extraction for audio-visual speech recognition, in: IEEE Trans. on Signal Processing, volume in press, 2009.
 
Hakkani-Tur, D., Towards automatic argument diagramming of multiparty meetings, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
 
Heusch, G. and Marcel, S., Bayesian Networks to Combine Intensity and Color Information in Face Recognition, number Idiap-RR-27-2009, 2009.
 
Humm, A., Hennebert, J. and Ingold, R., Combined handwriting and speech modalities for user authentication, in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 39, 2009.
 
Humm, A., Ingold, R. and Hennebert, J., Spoken handwriting for user authentication using joint modelling systems, in: Proceedings of 6th International Symposium on Image and Signal Processing and Analysis (ISPA'09), 2009.
 
Hung, H. and Ba, S., Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, number Idiap-RR-20-2009, 2009.
 
Imseng, D., Novel initialization methods for Speaker Diarization, number Idiap-RR-07-2009, 2009.
 
Imseng, D. and Friedland, G., Robust Speaker Diarization for Short Speech Recordings, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
 
Indermühle, E., Liwicki, M. and Bunke, H., Combining alignment results for historical handwritten document analysis, in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 1186-1190, 2009.
 
Ivanov, I., Dufaux, F., Ha, T. M. and Ebrahimi, T., Towards Generic Detection of Unusual Events in Video Surveillance, in: 6th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSSâ09), Genoa, Italy, 2009.
 
Jayagopi, D., Bogdan, R. and Gatica-Perez, D., Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, in: Proceedings ICME 2009, 2009.
 
Jayagopi, D. and Gatica-Perez, D., Discovering group nonverbal conversational patterns with topics, in: accepted for publication in Proc. ICMI-MLMI, 2009.
 
Jayagopi, D., Modeling dominance in group conversations using nonverbal activity cues, in: IEEE Trans. on Audio, Speech, and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, volume 17, pages 501-513, 2009.
 
Keshet, J., Grangier, D. and Bengio, S., Discriminative Keyword Spotting, in: Speech Communication, volume 51, number 4, pages 317-329, 2009.
 
Koval, O., Voloshynovskiy, S., Caire, F. and Bas, P., On security threats for robust perceptual hashin, in: Electronic Imaging 2009, 2009.
 
Kryszczuk, K. and Drygajlo, A., Improving biometric verification with class-independent quality information, pages 310-321, 2009.
 
Kryszczuk, K. and Drygajlo, A., Improving biometric verification with class-independent quality information, in: IET Signal Processing, Special Issue on Biometric Recognition, volume 3, number 4, pages 310-321, 2009.
 
Kumatani, K., McDonough, J., Rauch, B., Garner, P. N., Li, W. and Dines, J., Maximum kurtosis beamforming with the generalized sidelobe canceller, in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2009.
 
Lalanne, D., Nigay, L., Palanque, P., Robinson, P., Vanderdonckt, J. and Ladry, J. -F., Fusion engines for multimodal interfaces: a survey, in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
 
Lalanne, D. and Kholas, J., Human machine interaction, 2009.
 
Le, Q. A. and Popescu-Belis, A., Automatic vs. human question answering over multimedia meeting recordings, in: Interspeech 2009 (10th Annual Conference of the International Speech Communication Association), 2009.
 
Lee, J. -S., De Simone, F. and Ebrahimi, T., Video coding based on audio-visual attention, in: IEEE International Conference on Multimedia and Expo (ICME'09), New York, USA, 2009.
 
Lefèvre, S. and Odobez, J. -M., Structure and appearance features for robust 3d facial actions tracking, in: International Conference on Multimedia and Expo (ICME), 2009.
 
Li, W., Dines, J., Magimai-Doss, M. and Bourlard, H., Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
 
Luo, J., Orabona, F. and Caputo, B., An online framework for learning novel concepts over multiple cues, in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009.
 
Magimai-Doss, M., Aradilla, G. and Bourlard, H., On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, number Idiap-RR-24-2009, 2009.
 
Marchand-Maillet, S., Szekely, E. and Bruno, E., Optimizing strategies for the exploration of social-networks and associated data collections, in: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'09) - Special session on "People, Pixels, Peers: Interactive Content in Social Networks", 2009.
 
McCool, C. and Marcel, S., Parts-Based Face Verification using Local Frequency Bands, in: in Proceedings of IEEE/IAPR International Conference on Biometrics, 2009.
 
Monay, F., Quelhas, P., Odobez, J. -M. and Gatica-Perez, D., Contextual classification of image patches with latent aspect models, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009.
 
Morrison, D., Bruno, E. and Marchand-Maillet, S., capturing the semantics of user interaction: a review and case study, in: Emergent Web Intelligence, Springer, 2009.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Modelling long-term relevance feedback, in: Proceedings of the ECIR Workshop on Information Retrieval over Social Networks, 2009.
 
Motlicek, P., Ganapathy, S. and Hermansky, H., Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
 
Motlicek, P., Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, in: 10thAnnual Conference of the International Speech Communication Association, pages 1215-1218, ISCA, Brighton, England, 2009.
 
Motlicek, P., Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, in: 10thAnnual Conference of the International Speech Communication Association, ISCA, 2009.
 
Negoescu, R. -A., Gatica-Perez, D., Adams, B., Phung, D. and Venkatesh, S., Flickr Hypergroups, number Idiap-Internal-RR-73-2009, 2009.
 
Noceti, N., Caputo, B., Castellini, C., Baldassarre, L., Barla, A., Rosasco, L., Odone, F. and Sandini, G., Towards a theoretical framework for learning multi-modal patterns for embodied agents, in: International Conference on Image Analysis and Processing, 2009.
 
Orabona, F., Caputo, B., Fillbrandt, A. and Ohl, F., A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, in: International Conference on Developmental Learning, 2009.
 
Orabona, F., Keshet, J. and Caputo, B., Bounded kernel-based perceptrons, in: Journal of Machine Learning Research, volume Accepted for pub, 2009.
 
Orabona, F., Castellini, C., Caputo, B., Fiorilla, A. E. and Sandini, G., Model adaptation with least-square SVM for adaptive hand prosthetics, in: IEEE International conference on Robotics and Automation, 2009.
 
Orabona, F., Castellini, C., Caputo, B., Luo, J. and Sandini, G., Towards Life-long Learning for Cognitive Systems: Online Independent Support Vector Machine, in: Pattern Recognition, volume Accepted for Pub, 2009.
 
Ortega-Garcia, J., Fierrez, J., Alonso-Fernandez, F., Galbally, J., M. R. Freire, , Gonzalez-Rodriguez, J., Garcia-Mateo, C., Alba-Castro, J. -L., E. Gonzalez-Agulla, , E. Otero-Muras, , S. Garcia-Salicetti, , L. Allano, , B. Ly-Van, , B. Dorizzi, , Kittler, J., Bourlai, T., Poh, N., Deravi, F., M. W. R. Ng, , M. Fairhurst, , Hennebert, J., Humm, A., M. Tistarelli, , L. Brodo, , Richiardi, J., Drygajlo, A., H. Ganster, , F. M. Sukno, , Pavani, S. -K., A. Frangi, , L. Akarun, and A. Savran, , The multi-scenario multi-environment biosecure multimodal database (bmdb), in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 2009.
 
Pantic, M. and Vinciarelli, A., Implicit Human Centered Tagging, in: IEEE Signal Processing Magazine, volume 26, 2009.
 
Parthasarathi, S. H. K., Magimai-Doss, M., Bourlard, H. and Gatica-Perez, D., Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, in: Proceedings of Interspeech 2009, 2009.
 
Parthasarathi, S. H. K., Magimai-Doss, M., Gatica-Perez, D. and Bourlard, H., Speaker Change Detection with Privacy-Preserving Audio Cues, in: Proceedings of ICMI-MLMI 2009, 2009.
 
Perrin, X., Chavarriaga, R., Pradalier, C., Millán, J. del R. and Siegwart, R., Dialog Management Technique for Brain-Computer Interfaces, 2009.
 
Perrin, X., Colas, F., Pradalier, C. and Siegwart, R., Learning human habits and reactions to external events with a dynamic Bayesian network, 2009.
 
Perrin, X., Colas, F., Pradalier, C. and Siegwart, R., Learning to identify users and predict their destination in a robotic guidance application, in: Field and Service Robotics (FSR), 2009.
 
Picart, B., Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, number Idiap-RR-18-2009, 2009.
 
Pinto, J. P., Sivaram, G. S. V. S., Hermansky, H. and Magimai-Doss, M., Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
 
Popescu-Belis, A., Poller, P., Kilgour, J., Boertjes, E., Carletta, J., Castronovo, S., Fapso, M., Flynn, M., Nanchen, A., Wilson, T., Wit, J. de and Yazdani, M., A multimedia retrieval system using speech input, in: ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), 2009.
 
Popescu-Belis, A., Carletta, J., Kilgour, J. and Poller, P., Accessing a large multimodal corpus using an automatic content linking device, in: Multimodal Corpora, Springer-Verlag, 2009.
 
Popescu-Belis, A., Comparing meeting browsers using a task-based evaluation method, number Idiap-RR-11-2009, 2009.
 
Popescu-Belis, A. and Vinciarelli, A., Multimedia meeting processing and retrieval at the idiap research institute, in: Informer (Newsletter of the BCS Information Retrieval Specialist Group), volume 29, pages 14-16, 2009.
 
Pronobis, A. and Caputo, B., COLD: The COsy Localization Database, in: International Journal of Robotics Research, volume 28, number 5, pages 588-594, 2009.
 
Raducanu, B. and Gatica-Perez, D., You are fired! Nonverbal role analysis in competitive meetings, in: Proc. ICASSP, Taiwan, 2009.
 
Rajan, P., Parthasarathi, S. H. K. and Murthy, H., Robustness of Phase based Features for Speaker Recognition, in: Proceedings of Interspeech, 2009.
 
Ricci, E. and Odobez, J. -M., Real-time simultaneous head tracking and pose estimation, in: IEEE International Conference on Image Processing (ICIP), 2009.
 
Richiardi, J., Drygajlo, A. and Kryszczuk, K., Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification, pages 140-149, 2009.
 
Richiardi, J., Kryszczuk, K. and Drygajlo, A., Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification, in: Advances in Biometrics, Lecture Notes in Computer Science 5558, pages 1200-1208, 2009.
 
Roy, A. and Marcel, S., Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, number Idiap-RR-28-2009, 2009.
 
Salamin, H., Favre, S. and Vinciarelli, A., Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction, in: IEEE Transactions on Multimedia, To Appear, 2009.
 
Scaringella, N., On the design of audio features robust to the album-effect for music information retrieval., Ecole Polytechnique Fédérale de Lausanne, 2009.
 
De Simone, F., Dufaux, F., Ebrahimi, T., Delogu, C. and Baroncini, V., A subjective study of the influence of color information on visual quality assessment of high resolution pictures, in: Fourth International Workshop on Video Processing and Quality Metrics for Consumer Electronics (VPQM-09), Scottsdale, Arizona, USA, 2009.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., affective characterization of movie scenes based on content analysis and physiological changes, in: To appear in International Journal of Semantic Computing, 2009.
 
Thomas, S., Ganapathy, S. and Hermansky, H., Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, number Idiap-RR-04-2009, 2009.
 
Tommasi, T. and Caputo, B., The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, in: BMVC, 2009.
 
Ullah, M. M., Orabona, F. and Caputo, B., You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism, in: International Conference on Robotic and Systems, 2009.
 
Valente, F., A Novel Criterion for Classifiers Combination in Multistream Speech Recognition, in: IEEE Signal Processing Letters, volume 16, number 7, pages 561-564, ISSN 1070-9908, 2009. [DOI]
 
Valente, F., Magimai-Doss, M., Plahl, C. and Suman, R., Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009.
 
Vijayasenan, D., Valente, F. and Bourlard, H., An Information Theoretic Approach to Speaker Diarization of Meeting Data, in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 7, pages 1382-1393, 2009. [DOI]
 
Vijayasenan, D., Valente, F. and Bourlard, H., KL Realignment for Speaker Diarization with Multiple Feature Streams, in: 10th Annual Conference of the International Speech Communication Association, 2009.
 
Vijayasenan, D., Valente, F. and Bourlard, H., MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009.
 
Vijayasenan, D., Valente, F. and Bourlard, H., Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, in: Proceedings of International conference on acoustics speech and signal processing, 2009.
 
Vinciarelli, A., Capturing Order in Social Interactions, in: IEEE Signal Processing Magazine, 2009.
 
Vinciarelli, A., Suditu, N. and Pantic, M., Implicit Human Centered Tagging, in: Proceedings of IEEE Conference on Multimedia and Expo, pages 1428-1431, 2009.
 
Vinciarelli, A., Pantic, M. and Bourlard, H., Social Signal Processing: Survey of an Emerging Domain, in: Image and Vision Computing, 2009.
 
Voloshynovskiy, S., Koval, O., Beekhof, F. and Holotyak, T., Binary robust hashing based on probabilistic bit reliability, in: IEEE Workshop on Statistical Signal Processing 2009, 2009.
 
Voloshynovskiy, S., Koval, O., Beekhof, F. and Pun, T., Random projections based item authentication, in: Electronic Imaging 2009, 2009.
 
Wuthrich, M., Liwicki, M., Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G. and Stolz, M., Language model integration for the recognition of handwritten medieval documents, in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 211-215, 2009.
 
Wöllmer, M., Eyben, F., Keshet, J., Graves, A., Schuller, B. and Rigoll, G., Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009.
 
Xie, S., Favre, B., Hakkani-Tur, D. and Liu, Y., Leveraging sentence weights in a concept-based optimization framework for extractive meeting summarization, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Yao, J. and Odobez, J. -M., Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, number Idiap-RR-19-2009, 2009.
 
Yao, J. and Odobez, J. -M., Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios, in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2009.
 
Zhao, S. Y., Ravuri, R. and Morgan, N., Multi-stream to many-stream: using spectro-temporal features for asr, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Zhu, K., Drygajlo, A. and Li, W., Q-stack aging model for face verification, 2009.
 
Keshet, J. and Chazan, D., A Kernel Wrapper for Phoneme Sequence Recognition, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keshet, J., Shalev-Shwartz, S., Singer, Y. and Chazan, D., A Large Margin Algorithm for Forced Alignment, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keshet, J., A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Grangier, D., Keshet, J. and Bengio, S., Discriminative Keyword Spotting, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Deville, B., Bologna, G., Vinckenbosch, M. and Pun, T., See color: seeing colours with an orchestra, in: Human Machine Interaction: Research Results of the MMI Program, pages 251-279, Springer, 2009.
 

2008

Anemuller, J., Back, J. -H., Caputo, B., Luo, J., Ohl, F., Orabona, F., Vogels, R., Weinshall, D. and Zweig, A., Biologically Motivated Audio-Visual Cue Integration for Object, in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008.
 
Anemuller, J., Back, J. -H., Caputo, B., Havlena, M., Luo, J., Kayser, H., Leibe, B., Motlicek, P., Pajdla, T., Pavel, M., Torii, A., van Gool, L., Zweig, A. and Hermansky, H., The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, in: Proceedings of the International Conference on Multimodal Interfaces, 2008.
 
Aradilla, G., Acoustic models for posterior features in speech recognition, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Aradilla, G., Bourlard, H. and Magimai-Doss, M., Posterior features applied to speech recognition tasks with limited training data, number Idiap-RR-15-2008, 2008.
 
Aradilla, G., Bourlard, H. and Magimai-Doss, M., Using kl-based acoustic models in a large vocabulary recognition task, number Idiap-RR-14-2008, 2008.
 
Ba, S. and Odobez, J. -M., Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
 
Ba, S. and Odobez, J. -M., Multi-person visual focus of attention from head pose and meeting contextual cues, number Idiap-RR-47-2008, 2008.
 
Ba, S. and Odobez, J. -M., Multi-person visual focus of attention from head pose and meeting contextual cues, number 47, 2008.
 
Ba, S. and Odobez, J. -M., Recognizing visual focus of attention from head pose in natural meetings, in: accepted for publication in IEEE Trans. on System, Man and Cybernetics: Part B, Man,, 2008.
 
Ba, S. and Odobez, J. -M., Visual focus of attention estimation from head pose posterior probability distributions, in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2008.
 
Beekhof, F., Voloshynovskiy, S., Koval, O. and Villán, R., Secure surface identification codes, in: Steganography, and Watermarking of Multimedia Contents X, 2008. [DOI]
 
Berclaz, J., Fleuret, F. and Fua, P., Multi-camera tracking and atypical motion detection with behavioral maps, in: The 10th European Conference on Computer Vision (ECCV 2008), Marseille, France, 2008.
 
Berclaz, J., Fleuret, F. and Fua, P., Multi-camera tracking and atypical motion detection with behavioral maps, in: Proceedings of the European Conference on Computer Vision (ECCV), pages 112-125, 2008.
 
Berclaz, J., Fleuret, F. and Fua, P., Principled Detection-by-classification from Multiple Views, in: proceedings of the International Conference on Computer Vision Theory and Applications, pages 375-382, 2008.
 
Bertolami, R. and Bunke, H., Ensemble methods to improve the performance of an english handwritten text line recognizer, in: Arabic and Chinese Handwriting Recognition, pages 265-277, Springer, 2008.
 
Bertolami, R. and Bunke, H., Hidden Markov model based ensemble methods for offline handwritten text line recognition, in: Pattern Recognition, volume 41, number 11, pages 3452-3460, 2008.
 
Bertolami, R. and Bunke, H., Including language model information in the combination of handwritten text line recognizers, in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 25-30, 2008.
 
Bertolami, R. and Bunke, H., Integration of n-gram language models in multiple classifier systems for offline handwritten text line recognition, in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 22, number 7, pages 1301-1321, 2008.
 
Bertolami, R., Gutmann, C., Spitz, L. and Bunke, H., Shape code based lexicon reduction for offline handwriting recognition, in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 158-163, 2008.
 
Besson, P., Popovici, V., Vesin, J. M., Thiran, J. -Ph. and Kunt, M., Extraction of audio features specific to speech production for multimodal speaker detection, in: IEEE Transactions on Multimedia, volume 10, number 1, pages 63-73, 2008. [DOI]
 
Boakye, K., Trueba-Hornero, B., Vinyals, O. and Friedland, G., Overlapped speech detection for improved speaker diarization in multiparty meetings, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Boakye, K., Vinyals, O. and Friedland, G., Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech, in: Interspeech, 2008.
 
Boakye, K., Vinyals, O. and Friedland, G., Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech, in: Interspeech 2008, Brisbane, Australia, pages 32-35, 2008.
 
Bologna, G., Deville, B., Vinckenbosch, M. and Pun, T., a perceptual interface for vision substitution in a color matching experiment, in: Proceeding on IEEE IJCNN, IEEE World congress on computational intelligence, 2008.
 
Bologna, G., Deville, B., Vinckenbosch, M. and Pun, T., Pairing colored socks and following a red serpentine with sounds of musical instruments, in: ICAD 08, International Conference on Auditory Displays, Paris, France, June 24--27, 2008.
 
Bourlard, H., Chavarriaga, R., Galán, F. and Millán, J. del R., Characterizing the eeg correlates of exploratory behavior, in: IEEE Transactions on Neural Systems & Rehabilitation Engineering, 2008.
 
Bourlard, H. and Renals, S., Recognition and understanding of meetings overview of the european ami and amida projects, in: LangTech 2008, Rome, 2008.
 
Breitenstein, M. D., Kuettel, D., Weise, T., van Gool, L. and Pfister, H., Real-time face pose estimation from single range images, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
 
Bruno, E., Moënne-Loccoz, N. and Marchand-Maillet, S., Design of multimodal dissimilarity spaces for retrieval of multimedia documents, in: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 
Bunke, H., Dickinson, P., Neuhaus, M. and Stettler, M., Matching of hypergraphs -- algorithms, applications, and experiments, in: Applied Pattern Recognition, pages 131-154, Springer, 2008.
 
Camastra, F. and Vinciarelli, A., Machine learning for audio, image and video analysis, Advanced Information and Knowledge Processing, volume XVI, Springer Verlag, ISBN 978-1-84800-006-3, 2008.
 
Caputo, B., Class specific object recognition using kernel Gibbs distributions, in: ELectronic Letters on Computer vision and Image Analysis, volume 7, number 2, pages 96-109, 2008.
 
Carincotte, C., Naturel, X., Hick, M., Odobez, J. -M., Yao, J., Bastide, A. and Corbucci, B., Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Bejing, 2008.
 
Carreras, A., Cordara, G., Delgado, J., Dufaux, F., Francini, G., Ha, T. M., Rodriguez, E. and Tous, R., A search and retrieval framework for the management of copyrighted audiovisual content, in: 50th International Symposium ELMAR 2008, Zadar, Croatia, 2008.
 
Chanel, G., Rebetez, C., Betrancourt, M. and Pun, T., boredom, engagement and anxiety as indicators for adaptation to difficulty in games, in: ACM Mindtrek conference, 2008.
 
Chavarriaga, R., Galán, F. and Millán, J. del R., Asynchronous detection and classification of oscillatory brain activity, in: 16 European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
 
Cornelis, N., Leibe, B., Cornelis, K. and van Gool, L., 3d urban scene modeling integrating recognition and reconstruction, in: International Journal of Computer Vision, volume 78, number 2-3, pages 121-141, 2008.
 
van den Berg, M., Koller-Meier, E. and van Gool, L., Fast body posture estimation using volumetric features, in: IEEE Visual Motion Computing (MOTION), 2008.
 
Deville, B., Bologna, G., Vinckenbosch, M. and Pun, T., Guiding the focus of attention of blind people with visual saliency, in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), Satellite Workshop of theEuropean Conference on Computer Vision (ECCV 2008), Marseille, France, October 18, 2008.
 
Deville, B., Bologna, G., Vinckenbosch, M. and Pun, T., guiding the focus of attention of blind people with visual saliency, in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), 2008.
 
Dollé, L., Khamassi, M., Girard, B., Guillot, A. and Chavarriaga, R., Analyzing interactions between navigation strategies using a computational model of action selection, in: Spatial Cognition 2008 (SC '08), pages 71-86, Freiburg, Germany, 2008.
 
Dufaux, F. and Ebrahimi, T., H.264/AVC Video Scrambling for Privacy Protection, in: IEEE International Conference on Image Processing (ICIP2008), San Diego, 2008.
 
Dumas, B., Lalanne, D. and Ingold, R., Démonstration : hephaistk, une bo\^\ite à outils pour le prototypage d'interfaces multimodales, 2008.
 
Dumas, B., Lalanne, D. and Ingold, R., Demonstration : hephaistk, une bo\^\ite à outils pour le prototypage d'interfaces multimodales, in: Proceedings of 20e Conférence sur l'Interaction Homme-Machine (IHM 08), pages 215-216, 2008.
 
Dumas, B., Lalanne, D. and Ingold, R., Prototyping multimodal interfaces with smuiml modeling language, in: Proceedings of CHI 2008 Workshop on UIDLs for Next Generation User Interfaces (CHI 2008 workshop), pages 63-66, 2008.
 
Dumas, B., Lalanne, D. and Ingold, R., Prototyping multimodal interfaces with smuiml modeling language, pages 63-66, 2008.
 
Dumas, B., Lalanne, D., Guinard, D., Koenig, R. and Ingold, R., Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces, in: Proceedings of 2nd international conference on Tangible and Embedded Interaction (TEI 2008), pages 47-54, 2008.
 
Dumas, B., Lalanne, D., Guinard, D., Koenig, R. and Ingold, R., Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces, pages 47-54, 2008.
 
Dutoit, T., Couvreur, L. and Bourlard, H., How does a dictation machine recognize speech ?, in: Applied Signal Processing--A MATLAB approach, pages 104-148, Springer MA, 2008.
 
Ess, A., Leibe, B., Schindler, K. and van Gool, L., A mobile vision system for robust multi-person tracking, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
 
Estrella, P., Popescu-Belis, A. and King, M., Improving contextual quality models for mt evaluation based on evaluators' feedback., in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
 
Faria, A. and Morgan, N., Corrected tandem features for acoustic model training, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Faria, A. and Morgan, N., Corrected Tandem Features for Acoustic Model Training, in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
 
Faria, A. and Morgan, N., When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features, in: Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, 2008.
 
Favre, B., Grishman, R., Hillard, D., Ji, H., Hakkani-Tur, D. and Ostendorf, M., Punctuating speech for information extraction, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Favre, S., Salamin, H., Vinciarelli, A., Hakkani-Tur, D. and Garg, N., Role recognition for meeting participants: an approach based on lexical information and social network analysis, in: ACM International Conference on Multimedia, Vancouver, Canada, 2008.
 
Favre, S., Salamin, H. and Vinciarelli, A., Role recognition in multiparty recordings using social affiliation networks and discrete distributions, in: The Tenth International Conference on Multimodal Interfaces (ICMI 2008), Chania, Greece, 2008.
 
Ferrez, P. W. and Millán, J. del R., Eeg-based brain-computer interaction: improved accuracy by automatic single-trial error detection, in: Advances in Neural Information Processing Systems 20, pages 441-448, Cambridge, MA, 2008.
 
Ferrez, P. W. and Millán, J. del R., Error-related eeg potentials generated during simulated brain-computer interaction, in: IEEE Transactions on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008. [DOI]
 
Ferrez, P. W. and Millán, J. del R., Error-Related EEG Potentials Generated During Simulated Brain-Computer Interaction, in: IEEE Trans. on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008.
 
Ferrez, P. W. and Millán, J. del R., Simultaneous real-time detection of motor imagery and error-related potentials for improved bci accuracy, in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008.
 
Fleuret, F., Berclaz, J., Lengagne, R. and Fua, P., Multi-Camera People Tracking with a Probabilistic Occupancy Map, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 30, number 2, pages 267-282, 2008.
 
Fleuret, F. and Geman, D., Stationary features and cat detection, in: Journal of Machine Learning Research, 2008.
 
Fleuret, F. and Geman, D., Stationary features and cat detection, in: Journal of Machine Learning Research (JMLR), volume 9, pages 2549-2578, 2008.
 
Friedland, G. and Vinyals, O., Live speaker identification in conversations, in: ACM Multimedia 2008, Vancouver, Canada, pages 1017-1018, 2008.
 
Galán, F., Nuttin, M., Lew, E., Ferrez, P. W., Vanacker, G., Philips, J. and Millán, J. del R., A brain-actuated wheelchair: asynchronous and non-invasive brain-computer interfaces for continuous control of robots, in: Clinical Neurophysiology, number 119, pages 2159-2169, 2008.
 
Galán, F., Nuttin, M., Vanhooydonck, D., Lew, E., Ferrez, P. W., Philips, J. and Millán, J. del R., Continuous brain-actuated control of an intelligent wheelchair by human eeg, in: 4th International Brain-Computer Interface Workshop & Training Course, Graz University of Technology, Graz, Austria, 2008.
 
Galán, F., Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs, University of Barcelona, 2008.
 
Gammeter, S., Ess, A., Jaeggli, T., Leibe, B., Schindler, K. and van Gool, L., Articulated multibody tracking under egomotion, in: European Conference on Computer Vision (ECCV'08), Springer, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Autoregressive modelling of hilbert envelopes for wide-band audio coding, in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
 
Ganapathy, S., Thomas, A. and Hermansky, H., Front-end for far-field speech recognition based on frequency domain linear prediction, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, number Idiap-RR-75-2008, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, number Idiap-RR-74-2008, 2008.
 
Ganapathy, S., Thomas, S. and Hermansky, H., Modulation Frequency Features For Phoneme Recognition In Noisy Speech, in: Journal of Acoustical Society of America - Express Letters, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain, in: INTERSPEECH 2008, Brisbane, Australia, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008. [DOI]
 
Garg, N. and Hakkani-Tur, D., Speaker role detection in meetings using lexical information and social network analysis, in: Technical Report TR-08-004, International Computer Science Institute, Berkeley, CA, 2008.
 
Garipelli, G., Chavarriaga, R. and Millán, J. del R., Fast recognition of anticipation related potentials, in: IEEE Transactions on Biomedical Engineering, 2008.
 
Garipelli, G., Chavarriaga, R. and Millán, J. del R., Recognition of anticipatory behavior from human eeg, in: 4th Intl. Brain-Computer Interface Workshop and Training Course, Graz University, Austria, 2008.
 
Garner, P. N., A weighted finite state transducer tutorial, number Idiap-Com-03-2008, 2008.
 
Garner, P. N., Silence models in weighted finite-state transducers, in: Interspeech, Brisbane, Australia, 2008.
 
Gatica-Perez, D. and Farrahi, K., Daily routine classification from mobile phone data, in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), Utrecht, The Netherlands, 2008.
 
Gatica-Perez, D. and Farrahi, K., Discovering human routines from cell phone data with topic models, in: IEEE International Symposium on Wearable Computers (ISWC), Pittsburgh, Pennsylvania, 2008.
 
Gatica-Perez, D. and Farrahi, K., What did you do today? discovering daily routines from large-scale mobile data, in: ACM International Conference on Multimedia (ACMMM), Vancouver, 2008.
 
Gillick, D., Hakkani-Tur, D. and Levit, M., Unsupervised learning of edit parameters for matching name variants, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Goldmann, L., Adamek, T., Vajda, P., Karaman, M., Mörzinger, R., Galmar, E., Sikora, T., O'Connor, N., Ha-Minh, T., Ebrahimi, T., Schallauer, P. and Huet, B., Towards Fully Automatic Image Segmentation Evaluation, in: Advanced Concepts for Intelligent Vision Systems (ACIVS), Springer, Juan-les-Pins, 2008.
 
Gonzalez, G., Fleuret, F. and Fua, P., Automated delineation of dendritic networks in noisy image stacks, in: Proceedings of the European Conference on Computer Vision (ECCV), pages 214-227, 2008.
 
Gonzalez, G., Fleuret, F. and Fua, P., Automated delineation of dendritic networks in noisy image stacks, in: The 10th European Conference on Computer Vision, Marseille, France, 2008.
 
Grandjean, D. and Pun, T., Multimodality in emotions and for their assessment, 2008.
 
Grandvalet, Y., Rakotomamonjy, A., Keshet, J. and Canu, S., Support Vector Machines with a Reject Option, in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008.
 
Grangier, D. and Bengio, S., A discriminative kernel-based model to rank images from text queries, in: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2008.
 
Grangier, D., Machine Learning for Information Retrieval, École Polytechnique Fédérale de Lausanne, 2008.
 
Grossmann, E., Gaspar, J. -A. and Orabona, F., Calibration from statistical properties of the visual world, in: European Conf. on Computer Vision, 2008.
 
Gui, L., Thiran, J. -Ph. and Paragios, N., Cooperative object segmentation and behavior inference in image sequences, in: International Journal of Computer Vision, ISSN 0920-5691, 2008. [DOI]
 
Gurban, M., Thiran, J. -Ph., Drugman, T. and Dutoit, T., Dynamic modality weighting for multi-stream HMMs in Audio-Visual Speech Recognition, in: 10th International Conference on Multimodal Interfaces, Chania, Greece, 2008.
 
Gurban, M. and Thiran, J. -Ph., Using entropy as a stream reliability estimate for audio-visual speech recognition, in: 16th European Signal Processing Conference, Lausanne, Switzerland, 2008.
 
Hoffmann, U., Vesin, J. M., Ebrahimi, T. and Diserens, K., An efficient p300-based brain-computer interface for disabled subjects, in: Journal of Neuroscience Methods, volume 167, number 1, pages 115-125, 2008. [DOI]
 
Hoffmann, U., Yazdani, A., Vesin, J. M. and Ebrahimi, T., Bayesian feature selection applied in a p300 brain- computer interface, in: 16th European Signal Processing Conference, Lausanne, 2008.
 
Hoffmann, U., Naruniec, J., Yazdani, A. and Ebrahimi, T., Face Detection Using Discrete Gabor Jets And Color Information, in: SIGMAP 2008 - International Conference on Signal Processing and Multimedia Applications, Porto, 2008.
 
Humm, A., Hennebert, J. and Ingold, R., Combined handwriting and speech modalities for user authentication, in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 38, 2008.
 
Humm, A., Modelling combined handwriting and speech modalities for user authentication, University of Fribourg, Switzerland, 2008.
 
Humm, A., Hennebert, J. and Ingold, R., Spoken signature for user authentication, in: SPIE Journal of Electronic Imaging, volume 17, 2008.
 
Humm, A., Hennebert, J. and Ingold, R., Spoken signature for user authentication, in: SPIE Journal of Electronic Imaging, volume 17, 2008.
 
Hung, H., Huang, Y., Yeo, C. and Gatica-Perez, D., Associating audio-visual activity cues in a dominance estimation framework, in: CVPR Workshop on Human Communicative Behavior, 2008.
 
Hung, H., Huang, Y., Friedland, G. and Gatica-Perez, D., Estimating the dominant person in multi-party conversations using speaker diarization strategies, in: ICASSP 08, 2008.
 
Hung, H., Huang, Y., Friedland, G. and Gatica-Perez, D., Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Hung, H. and Gatica-Perez, D., Identifying dominant people in meetings from audio-visual sensors, in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition, Special Session on Multimodal HCI for Smart Environments, 2008.
 
Hung, H. and Gatica-Perez, D., Identifying dominant people in meetings from audio-visual sensors, in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition (FG), Special Session on Multi-Sensor HCI for Smart Environments, 2008.
 
Hung, H., Jayagopi, D., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Investigating automatic dominance estimation in groups from visual attention and speaking activity, in: International Conference on Multimodal Interfaces (ICMI), 2008.
 
Hung, H., Jayagopi, D., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Investigating automatic dominance estimation in groups from visual attention and speaking activity, in: Proc. ICMI, 2008.
 
Hung, H. and Friedland, G., Towards audio-visual on-line diarization of participants in group meetings, in: European Conference on Computer Vision (ECCV) 2008, Marseille, France, 2008.
 
Indermühle, E., Liwicki, M. and Bunke, H., Recognition of handwritten historical documents: hmm -adaptation vs. writer specific training, in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 186-191, 2008.
 
Jayagopi, D., Raducanu, B. and Gatica-Perez, D., Characterizing conversational group dynamics using nonverbal behavior, in: Proc. IEEE Int. Conf. on Multimedia (ICME), 2008.
 
Jayagopi, D., Hung, H., Yeo, C. and Gatica-Perez, D., Modeling dominance in group conversations from nonverbal activity cues, in: IEEE Trans. on Audio, Speech and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, accepted for publication, 2008.
 
Jayagopi, D., Predicting the dominant clique in meetings through fusion of nonverbal cues, in: Proc. ACM Vancouver, Canada, 2008.
 
Jayagopi, D., Hung, H., Yeo, C. and Gatica-Perez, D., Predicting the dominant clique in meetings through fusion of nonverbal cues, in: ACM MM 2008, Vancouver, Canada, 2008.
 
Jayagopi, D., Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues, in: Proc. ICMI, 2008.
 
Jayagopi, D., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), Special Session on Social Signal Processing, 2008.
 
Kamangar, K., Hakkani-Tur, D., Tur, G. and Levit, M., An iterative unsupervised learning method for information distillation, in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
 
Keshet, J. and Bengio, S., Automatic speech and speaker recognition: large margin and kernel methods, John Wiley & Sons, 2008.
 
Ketabdar, H. and Bourlard, H., Enhanced phone posteriors for improving speech recognition systems, number Idiap-RR-39-2008, 2008.
 
Ketabdar, H., Enhancing posterior based speech recognition systems, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Ketabdar, H. and Bourlard, H., Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Ketabdar, H. and Bourlard, H., In-context phone posteriors as complementary features for tandem asr, in: ICSLP'08, Brisbane, Australia,, 2008.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Can feature information interaction help for information fusion in multimedia problems?, in: First International Workshop on Metadata Mining for Image Understanding, pages 23-33, 2008.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Can feature information interaction help for information fusion in multimedia problems?, in: To appear in Multimedia Tools and Applications Journal special issue on "Metadata Mining for Image Understanding", 2008.
 
Kludas, J., Marchand-Maillet, S. and Bruno, E., Exploiting document feature interactions for efficient information fusion in high dimensional spaces, in: Proceedings of the First International Workshops on Image Processing Theory, Tools and Applications (IPTA'2008), 2008.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Exploiting synergistic and redundant features for multimedia document classification, in: 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), 2008.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Exploiting synergistic and redundant features for multimedia document classification, in: 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), 2008.
 
Knox, M., Morgan, N. and Mirghafori, N., Getting the last laugh: automatic laughter segmentation in meetings, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Knox, M., Morgan, N. and Mirghafori, N., Getting the last laugh: automatic laughter segmentation in meetings, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 797-800, 2008.
 
Kokiopoulou, E., Frossard, P. and Verscheure, O., Fast keyword detection with sparse time-frequency models, in: IEEE Int. Conf. on Multimedia & Expo (ICME), 2008.
 
Kokiopoulou, E., Pirillos, S. and Frossard, P., Graph-based classification for multiple observations of transformed patterns, in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
 
Kokiopoulou, E. and Frossard, P., Minimum distance between pattern transformation manifolds: algorithm and applications, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 
Kokiopoulou, E., Frossard, P. and Gkorou, D., Optimal polynomial filtering for accelerating distributed consensus, in: IEEE Int. Symp. on Information Theory (ISIT), 2008.
 
Kokiopoulou, E. and Frossard, P., Semantic coding by supervised dimensionality reduction, in: IEEE Transactions on Multimedia, volume 10, number 2, 2008.
 
Kosinov, S. and Pun, T., Distance-based discriminant analysis method and its applications, in: Pattern Analysis and Applications, volume 11, number 3-4, pages 227-246, 2008.
 
Kosinov, S., Bruno, E. and Marchand-Maillet, S., Spatially-consistent partial matching for intra- and inter-image prototype selection, in: To appear in Signal Processing: Image Communication special issue on "Semantic Analysis for Interactive Multimedia Services", 2008.
 
Koval, O., Voloshynovskiy, S., Beekhof, F. and Pun, T., Analysis of physical unclonable identification based on reference list decoding, in: Steganography, and Watermarking of Multimedia Contents X, 2008.
 
Koval, O., Voloshynovskiy, S. and Pun, T., Privacy-preserving multimodal person and object identification, in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
 
Koval, O., Voloshynovskiy, S., Caire, F. and Bas, P., Privacy-preserving multimodal person and object identification, in: MM&Sec 2008, 2008.
 
Koval, O., Voloshynovskiy, S., Beekhof, F. and Pun, T., Security analysis of robust perceptual hashing, in: Steganography, and Watermarking of Multimedia Contents X, 2008.
 
Kryszczuk, K. and Drygajlo, A., Credence estimation and error prediction in biometric identity verification, in: Signal Processing, volume 88, number 4, pages 916-925, 2008.
 
Kryszczuk, K. and Drygajlo, A., Impact of feature correlations on separation between bivariate normal distributions, 2008.
 
Kryszczuk, K. and Drygajlo, A., Impact of feature correlations on separation between bivariate normal distributions, in: 19th International Conference on Pattern Recognition, 2008.
 
Kryszczuk, K. and Drygajlo, A., On quality of quality measures for classification, in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 19-28, 2008.
 
Kryszczuk, K. and Drygajlo, A., On quality of quality measures for classification, pages 19-28, Springer, 2008.
 
Kryszczuk, K. and Drygajlo, A., What do quality measures predict in biometrics, pages -,-29, 2008.
 
Kryszczuk, K. and Drygajlo, A., What do quality measures predict in biometrics, in: 16th European Signal Processing Conference, 2008.
 
Kumatani, K., McDonough, J., Klakow, D., Garner, P. N. and Li, W., Adaptive beamforming with a maximum negentropy criterion,, in: The Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2008.
 
Kumatani, K., McDonough, J., Rauch, B., Klakow, D., Garner, P. N. and Li, W., Beamforming with a Maximum Negentropy Criterion, in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 5, pages 994-1008, 2008.
 
Kumatani, K., McDonough, J., Schacht, S., Klakow, D., Garner, P. N. and Li, W., Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming, in: International Conferance on Acoustics Speech and Signal Processing, 2008.
 
Kumatani, K., McDonough, J., Schacht, S., Klakow, D., Garner, P. N. and Li, W., Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, number Idiap-RR-02-2008, 2008.
 
Kumatani, K., McDonough, J., Klakow, D., Garner, P. N. and Li, W., Maximum negentropy beamforming, number Idiap-RR-07-2008, 2008.
 
Lalanne, D., Rigamonti, M., Ingold, R., Evéquoz, F. and Dumas, B., An ego-centric and tangible approach to meeting indexing and browsing, Lecture Notes in Computer Science, volume Volume 4892, Springer Berlin / Heidelberg, ISBN 978-3-540-78154-7, 2008. [DOI]
 
Leibe, B., Schindler, K., Cornelis, N. and van Gool, L., Coupled object detection and tracking from static cameras and moving vehicles, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 
Leibe, B., Ettlin, A. and Schiele, B., Learning semantic object parts for object categorization, in: Image and Vision Computing, volume 26, number 1, pages 15-26, 2008.
 
Leibe, B., Leonardis, A. and Schiele, B., Robust object detection with interleaved categorization and segmentation, in: International Journal of Computer Vision, volume 77, number 1-3, pages 259-289, 2008.
 
Li, W., Kumatani, K., Dines, J., Magimai-Doss, M. and Bourlard, H., A neural network based regression approach for recogninizing simultaneous speech, in: Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
 
Li, W., Kumatani, K., Dines, J., Magimai-Doss, M. and Bourlard, H., A neural network based regression approach for recognizing simultaneous speech, number Idiap-RR-10-2008, 2008.
 
Li, W., Effective post-processing for single-channel frequency-domain speech enhancement, pages 149-152, 2008. [DOI]
 
Li, W., Effective post-processing of single-channel frequency-domain speech enhancement, in: IEEE conference on multimedia and expo, 2008.
 
Li, W., Doss, M. M., Dines, J. and Bourlard, H., Mlp-based log spectral energy mapping for robust overlapping speech recognition, in: European Signal Processing Conference, 2008.
 
Li, W., Dines, J., Magimai-Doss, M. and Bourlard, H., Neural network based regression for robust overlapping speech recognition using microphone arrays, in: Interspeech, 2008.
 
Liwicki, M. and Bunke, H., Combining on-line and off-line blstm networks for handwritten text line recognition, in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 31-36, 2008.
 
Liwicki, M. and Bunke, H., Recognition of whiteboard notes -- online, offline and combination, World Scientific, ISBN 978-9812814531, 2008.
 
Liwicki, M., Schlapbach, A. and Bunke, H., Writer-dependent recognition of handwritten whiteboard notes in smart meeting room environments, in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 151-157, 2008.
 
Llonch, R. Sala, Kokiopoulou, E., Tosic, I. and Frossard, P., 3d face recognition using sparse spherical representations, in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
 
Luo, J., Caputo, B., Zweig, A., Back, J. -H. and Anemuller, J., Object category detection using audio-visual cues, in: International Conference on Computer Vision Systems (ICVS08), 2008.
 
Mariéthoz, J., Bengio, S. and Grandvalet, Y., Kernel Based Text-Independnent Speaker Verification, number Idiap-RR-68-2008, 2008.
 
Matena, L., Jaimes, A. and Popescu-Belis, A., Graphical representation of meetings on mobile devices, in: MobileHCI 2008 Demonstrations (10th ACM International Conference on Human-Computer Interaction with Mobile Devices and Services), 2008.
 
Mesot, B., Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Mesot, B., Switching linear dynamical systems for noise robust speech recognition of isolated degits, STI School of Engineering, EPFL, 2008.
 
Meynet, J. and Thiran, J. -Ph., Ensembles of SVMs using an Information Theoretic Criterion, in: Pattern Recognition Letters, 2008.
 
Meynet, J., Arsan, T., Cruz Mota, J. and Thiran, J. -Ph., Fast multi-view face tracking with pose estimation, in: 16th European Signal Processing Conference, Lausanne, 2008.
 
Meynet, J. and Thiran, J. -Ph., Information Theoretic Combination of Classifiers, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008. [DOI]
 
Millán, J. del R., Brain-controlled robots, in: IEEE International Conference on Robotics and Automation (ICRA 2008), Pasadena, CA, USA,, 2008. [DOI]
 
Millán, J. del R., Brain-Controlled Robots, in: IEEE Intelligent Systems, 2008.
 
Millán, J. del R., Ferrez, P. W., Galán, F., Lew, E. and Chavarriaga, R., Non-invasive brain-machine interaction, in: International Journal of Pattern Recognition and Artificial Intelligence, 2008.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Semantic clustering of images using patterns of relevance feedback, in: Proceedings of the 6th International Workshop on Content-based Multimedia Indexing (CBMI'2008), 2008.
 
Motlicek, P., Ganapathy, S. and Hermansky, H., Entropy coding of Quantized Spectral Components in FDLP audio codec, number Idiap-RR-71-2008, 2008.
 
Motlicek, P., Ganapathy, S., Hermansky, H., Garudadri, H. and Athineos, M., Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, in: Text, Speech and Dialogue, pages 435-442, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
 
Naturel, X. and Odobez, J. -M., Detecting queues at vending machines: a statistical layered approach, in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008.
 
Negoescu, R. -A. and Gatica-Perez, D., Analyzing flickr groups, in: Proceedings of the 2008 international conference on Content-based image and video retrieval (CIVR '08), Sheraton Fallsview Hotel, Niagara Falls, Canada, 2008.
 
Negoescu, R. -A. and Gatica-Perez, D., Topickr: Flickr Groups and Users Reloaded, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008.
 
Nijholt, A., Tan, D., Allison, B., Millán, J. del R., Moore, M. and Graimann, B., Brain-computer interfaces for hci and games, in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008.
 
Noris, B., Benmachiche, K. and Billard, A., Calibration-free eye gaze direction detection with gaussian processes, in: International Conference on Computer Vision Theory and Applications (VISAPP 2008), Funchal, Portugal, 2008.
 
Orabona, F., Keshet, J. and Caputo, B., The Projectron: a Bounded Kernel-Based Perceptron, in: Int. Conf. on Machine Learning, 2008.
 
Ouaret, M., Dufaux, F. and Ebrahimi, T., Enabling Privacy For Distributed Video Coding by Transform Domain Scrambling, in: 2008 SPIE Visual Communications and Image Processing, San Diego, USA, 2008.
 
Paiement, J. -F., Grandvalet, Y., Bengio, S. and Eck, D., A Distance Model for Rhythms, in: 25th International Conference on Machine Learning (ICML), 2008.
 
Paiement, J. -F., Grandvalet, Y. and Bengio, S., Predictive Models for Music, number Idiap-RR-51-2008, 2008.
 
Paiement, J. -F., Bengio, S. and Eck, D., Probabilistic Models for Melodic Prediction, number Idiap-RR-50-2008, 2008.
 
Paiement, J. -F., Probabilistic models for music, École Polytechnique Fédérale de Lausanne, 2008.
 
Parthasarathi, S. H. K. and Hermansky, H., A data-driven approach to speech/non-speech detection, number Idiap-RR-23-2008, 2008.
 
Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting Contextual Information for Speech/Non-Speech Detection, in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
 
Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting temporal context for speech/non-speech detection, number Idiap-RR-21-2008, 2008.
 
Pellegrini, S., Schindler, K. and D. Nardi, , A generalization of the icp algorithm for articulated bodies, in: British Machine Vision Conference (BMVC'08), 2008.
 
Perrin, X., Chavarriaga, R., Ray, C., Siegwart, R. and Millán, J. del R., A comparative psychophysical and eeg study of different feedback modalities for hri, in: Human-Robot Interaction (HRI08), 2008.
 
Perruchoud, L., The Anterior Cingulate Cortex, number Idiap-Com-02-2008, 2008.
 
Pinto, J. P. and Hermansky, H., Combining evidence from a generative and a discriminative model in phoneme recognition, in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Pinto, J. P., Hermansky, H., Yegnanarayana, B. and Magimai-Doss, M., Exploiting contextual information for improved phoneme recognition, in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
 
Pinto, J. P., Szoke, I., Prasanna, S. R. Mahadeva and Hermansky, H., Fast approximate spoken term detection from sequence of phonemes, in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore,, 2008.
 
Pinto, J. P., Sivaram, G. S. V. S. and Hermansky, H., Reverse correlation for analyzing mlp posterior features in asr, in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008. [DOI]
 
Popescu-Belis, A., Dimensionality of dialogue act tagsets: an empirical analysis of large corpora, in: Language Resources and Evaluation, volume 42, number 1, pages 99-107, 2008. [DOI]
 
Popescu-Belis, A., Bourlard, H. and Renals, S., Machine learning for multimodal interaction iv, LNCS, volume 4892, Springer-Verlag, ISBN 978-3-540-78154-7, 2008.
 
Popescu-Belis, A. and Stiefelhagen, R., Machine learning for multimodal interaction v, LNCS, volume 5237, Springer-Verlag, ISBN 978-3-540-85852-2, 2008.
 
Popescu-Belis, A., Reference-based vs. task-based evaluation of human language technology, in: LREC 2008 ELRA Workshop on Evaluation: "Looking into the Future of Evaluation: When automatic metrics meet task-based and performance-based approaches", pages 12-16, ELRA, 2008.
 
Popescu-Belis, A., Flynn, M., Wellner, P. and Baudrion, P., Task-based evaluation of meeting browsers: from bet task elicitation to user behavior analysis, in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
 
Prodanov, P., Drygajlo, A., Richiardi, J. and Alexander, A., Low-level grounding in a multimodal mobile service robot conversational system using graphical models, in: Intelligent Service Robotics, volume 1, pages 3-26, 2008. [DOI]
 
Pronobis, M. and Magimai-Doss, M., Integrating audio and vision for robust automatic gender recognition, number Idiap-RR-73-2008, 2008.
 
Pronobis, A., Martinez Monos, O. and Caputo, B., SVM-based Discriminative Accumulation Scheme for Place Recognition, in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008.
 
Quack, T., Bay, H. and van Gool, L., Object recognition for the internet of things, in: Internet of Things 2008, 2008.
 
Quack, T., Leibe, B. and van Gool, L., World-scale mining of objects and events from community photo collections, in: Conference on Image and Video Retrieval (CIVR'08), ACM, 2008.
 
Rakotomamonjy, A., Bach, F., Canu, S. and Grandvalet, Y., SimpleMKL, in: Journal of Machine Learning Research, volume 9, pages 2491-2521, 2008.
 
Rayner, M., Tsourakis, N., Georgescul, M. and Bouillon, P., Building mobile spoken dialogue applications using regulus, in: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), 2008.
 
Richiardi, J., Drygajlo, A. and Todesco, L., Promoting diversity in gaussian mixture ensembles: an application to signature verification, pages 140-149, Springer, 2008.
 
Richiardi, J., Drygajlo, A. and Todesco, L., Promoting diversity in gaussian mixture ensembles: an application to signature verification, in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 140-149, 2008.
 
Riedhammer, K., Gillick, D., Favre, B. and Hakkani-Tur, D., Packing the meeting summarization knapsack, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Rigamonti, M., A framework for structuring multimedia archives and for browsing efficiently through multimodal links, University of Fribourg, Switzerland, 2008.
 
Rigamonti, M., A framework for structuring multimedia archives and for browsing efficiently through multimodal links, University of Fribourg, Switzerland, 2008.
 
Roth, D., Koller-Meier, E., Rowe, D., Moeslund, T. B. and van Gool, L., Event-based tracking evaluation metric, in: IEEE Workshop on Motion and Video Computing (WMVC), 2008.
 
Scaringella, N., Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, in: "Int. Conf. on Music Information Retrieval (ISMIR)", 2008.
 
Schindler, K. and van Gool, L., Action snippets: how many frames does human action recognition require?, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
 
Schindler, K. and van Gool, L., Combining densely sampled form and motion for human action recognition, in: DAGM Annual Pattern Recognition Symposium, Springer, 2008.
 
Schindler, K. and Suter, D., Object detection by global contour shape, in: Pattern Recognition, 2008.
 
Schindler, K., van Gool, L. and B. de Gelder, , Recognizing emotions expressed by body pose: a biologically inspired neural model, in: Neural Networks, 2008.
 
Schlapbach, A., Liwicki, M. and Bunke, H., A writer identification system for on-line whiteboard data, in: Pattern Recognition, volume 41, pages 2381-2397, 2008.
 
Schlapbach, A., Wettstein, F. and Bunke, H., Automatic estimation of the readability of handwritten text, in: Proc. 16th European Signal Processing Conference, 2008.
 
Schlapbach, A., Bunke, H. and Wettstein, F., Estimating the readability of handwritten text -- a support vector regression based approach, in: Proc. 19th Int. Conf. on Pattern Recognition, IEEE, 2008.
 
Schlapbach, A. and Bunke, H., Off-line writer identification and verification using gaussian mixture models, in: Machine Learning in Document Analysis and Recognition, pages 409-428, Springer, 2008.
 
Schlapbach, A., Writer identification and verification, volume 311, IOS Press, ISBN 978-1-58603-825-0, 2008.
 
Schouten, B., Juul, N., Drygajlo, A. and Tistarelli, M., Biometrics and identity management, Springer, 2008.
 
Schouten, B., Juul, N., Drygajlo, A. and Tistarelli, M., Biometrics and identity management, Springer, 2008.
 
Shahrokni, A., Drummond, T., Fleuret, F. and Fua, P., Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 
Shriberg, E., Higher level features in speaker recognition, in: in C. Muller (Ed.) Speaker Classification I. Springer-Verlag, New York, 2008.
 
De Simone, F., Ticca, D., Dufaux, F., Ansorge, M. and Ebrahimi, T., A comparative study of color image compression standards using perceptually driven quality metrics, in: SPIE Optics and Photonics, San Diego, CA USA, 2008.
 
De Simone, F., Ansorge, M. and Ebrahimi, T., A multi-channel objective model for the full-reference assessment of color pictures, in: 2nd K-space Jamboree Workshop, Paris, 2008.
 
Singla, A. and Hakkani-Tur, D., Cross-lingual sentence extraction for information distillation, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Sivaram, G. S. V. S. and Hermansky, H., Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition, in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
 
Sivaram, G. S. V. S. and Hermansky, H., Introducing temporal asymmetries in feature extraction for automatic speech recognition, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Smith, K., Ba, S., Gatica-Perez, D. and Odobez, J. -M., Tracking the visual focus of attention for a varying number of wandering people, in: IEEE Trans. on Pattern Analysis and Machine Intelligence,, volume 30, number 7, pages 1212-1229, 2008.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., affective characterization of movie scenes based on multimedia content analysis and user's physiological emotional responses, in: IEEE International Symposium on Multimedia, 2008.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., affective ranking of movie scenes using physiological signals and content analysis, in: 2nd ACM Workshop on the Many Faces of Multimedia Semantics, ACM MM08, 2008.
 
Soleymani, M., Kierkels, J., Chanel, G., Bruno, E., Marchand-Maillet, S. and T. Pun, , Estimating emotions and tracking interest during movie watching based on multimedia content and physiological responses, in: Joint (IM)2-Interactive Multimodal Information Management and Affective Sciences NCCRs meeting, 2008.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., Valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses, in: MLMI 2008, 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses, 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
 
Sorci, M., Antonini, G., Cerretani, B., Cruz Mota, J., Rubin, T., Bierlaire, M. and Thiran, J. -Ph., Modelling human perception of static facial expressions, in: Face and Gesture Recognition 2008, Amsterdam, 2008.
 
Spindler, T., Wartmann, C., Hovestadt, L., Roth, D., van Gool, L. and Steffen, A., Privacy in video surveilled spaces, in: Journal of Computer Security, volume 16, number 2, pages 199-222, 2008.
 
Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Janin, A., Magimai-Doss, M., Wooters, C. and Zheng, J., The SRI-ICSI spring 2007 meeting and lecture recognition system, in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
 
Stoyanchev, S., Tur, G. and Hakkani-Tur, D., Name-aware speech recognition for interactive question answering, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Szafranski, M., Grandvalet, Y. and Rakotomamonjy, A., Composite Kernel Learning, in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), pages 1040-1047, Omnipress, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based features for far-field speech recognition, in: MLMI 2008, Utrecht, The Netherlands, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Recognition of reverberant speech using frequency domain linear prediction, in: IEEE Signal Processing Letters, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain, in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
 
Thomas, A., Ferrari, V., Leibe, B., Tuytelaars, T. and van Gool, L., Using recognition to guide a robot's attention, in: Robotics Science and Systems, 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, number Idiap-RR-77-2008, 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., Cue Integration for Medical Image Annotation, in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., Discriminative cue integration for medical image annotation, in: Pattern Recognition Letters, 2008.
 
Torii, A., Havlena, M., Pajdla, T. and B. Leibe, , Measuring camera translation by the dominant apical angle, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
 
Tous, R., Carreras, A., Delgado, J., Cordara, G., Gianluca, F., Peig, E., Dufaux, F. and Galinski, G., An Architecture for TV Content Distributed Search and Retrieval Using the MPEG Query Format (MPQF), in: International Workshop on Ambient Media Delivery and Interactive Television (AMDIT 2008), Quebec City, Canada, 2008.
 
Tsourakis, N., Lisowska, A., Bouillon, P. and Rayner, M., From desktop to mobile: adapting a successful voice interaction platform for use in mobile devices, in: Third ACM MobileHCI Workshop on Speech in Mobile and Pervasive Environments (SiMPE), Amsterdam, the Netherlands., 2008.
 
Ullah, M. M., Pronobis, A., Caputo, B., Luo, J., Jensfelt, P. and Christensen, H. I., Towards Robust Place Recognition for Robot Localization, in: IEEE International Conference on Robotics ad Automation, 2008.
 
Valente, F. and Hermansky, H., Hierarchical and parallel processing of modulation spectrum for asr applications, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
 
Valente, F. and Hermansky, H., On the combination of auditory and modulation frequency channels for asr applications, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Vergyri, D., Mandal, A., Wang, W., Stolcke, A., Zheng, J., Graciarena, M., Rybach, D., Gollan, C., Schlater, R., Kirchoff, K., Faria, A. and Morgan, N., Development of the sri/nightingale arabic asr system, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 1437-1440, 2008.
 
Vergyri, D., Mandal, A., Wang, W., Stolcke, A., Zheng, J., Graciarena, M., Rybach, D., Gollan, C., Schlater, R., Kirchoff, K., Faria, A. and Morgan, N., Development of the sri/nightingale arabic asr system, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Vijayasenan, D., Valente, F. and Bourlard, H., Combination of agglomerative and sequential clustering for speaker diarization, in: International Conference on Acoustics, Speech and Signal Processing, 2008.
 
Vijayasenan, D., Valente, F. and Bourlard, H., Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, in: Interspeech 2008, 2008.
 
Vinciarelli, A., Pantic, M., Bourlard, H. and Pentland, A., Social signal processing: state-of-the-art and future perspectives of an emerging domain, in: Proceedings of the ACM International Conference on Multimedia, 2008.
 
Vinciarelli, A., Pantic, M., Bourlard, H. and Pentland, A., Social signals, their function, and automatic analysis: a survey, in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008.
 
Vinyals, O. and Friedland, G., A hardware-independent fast logarithm approximation with adjustable accuracy, in: 10th IEEE International Symposium on Multimedia, Berkeley, CA, USA, pages 61-65, 2008.
 
Vinyals, O. and Friedland, G., Live speaker identification in meetings: "who is speaking now?", in: Technical Report TR-08-001, International Computer Science Institute, Berkeley, CA, 2008.
 
Vinyals, O. and Friedland, G., Modulation spectrogram features for speaker diarization, in: to appear in proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Vinyals, O. and Friedland, G., Modulation spectrogram features for speaker diarization, in: Interspeech 2008, Brisbane, Australia, pages 630-633, 2008.
 
Vinyals, O. and Friedland, G., Towards semantic analysis of conversations: a system for the live identification of speakers in meetings, in: to appear in Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, CA, 2008.
 
Voloshynovskiy, S., Koval, O., Villán, R., Beekhof, F. and Pun, T., Authentication of biometric identification documents via mobile devices, in: Journal of Electronic Imaging, 2008.
 
Voloshynovskiy, S., Koval, O. and Pun, T., Multimodal authentication based on random projections and distributed coding, in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
 
Voloshynovskiy, S., Koval, O., Beekhof, F. and Pun, T., Multimodal authentication based on random projections and distributed coding, in: MM&Sec 2008, 2008.
 
Weinshall, D., Hermansky, H., Zweig, A., Luo, J., Jimison, H., Ohl, F. and Pavel, M., Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, in: Advances in Neural Information Processing Systems 21, 2008.
 
Weise, T., Leibe, B. and van Gool, L., Accurate and robust registration for in-hand modeling, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
 
Wooters, C. and Huijbregts, M., The ICSI RT07s speaker diarization system, in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
 
Yao, J. and Odobez, J. -M., Fast human detection from videos using covariance features, in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008.
 
Yao, J. and Odobez, J. -M., Multi-camera 3d person tracking with particle filter in a surveillance environment, in: 16th European Signal processing Conference (EUSIPCO), 2008.
 
Zeng, G. and van Gool, L., Multi-label image segmentation via point-wise repetition, in: International Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
 
Zhao, S. and Morgan, N., Multi-stream spectro-temporal features for robust speech recognition, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Zhao, S. Y. and Morgan, N., Multi-stream spectro-temporal features for robust speech recognition, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 898-901, 2008.
 
I. Bogdanova, , A. Bur, and Hügli, H., The spherical approach to omnidirectional visual attention, in: XVI European Signal Processing Conference (EUSIPCO 2008), 2008.
 
I. Bogdanova, , A. Bur, and Hügli, H., Visual attention on the sphere [in press], in: IEEE Transactios on Image Processing, 2008.
 
Varga, T. and Bunke, H., Perturbation models for generating synthetic training data in handwriting recognition, in: Machine Learning in Document Analysis and Recognition, pages 333-360, Springer, 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., An SVM Confidence-Based Approach to Medical Image Annotation, in: Evaluating Systems for Multilingual and Multimodal Information Access -- 9th Workshop of the Cross-Language Evaluation Forum, 2008.
 
Popescu-Belis, A., Bourlard, H. and Renals, S., Machine learning for multimodal interaction iv (revised selected papers from mlmi 2007, brno, 28-30 june 2007), LNCS 4892, Springer-Verlag, 2008.
 
Popescu-Belis, A. and Stiefelhagen, R., Machine learning for multimodal interaction v (proceedings of mlmi 2008, utrecht, 8-10 september 2008), LNCS 5237, Springer-Verlag, 2008.
 
Popescu-Belis, A., Boertjes, E., Kilgour, J., Poller, P., Castronovo, S., Wilson, T., Jaimes, A. and Carletta, J., The amida automatic content linking device: just-in-time document retrieval in meetings, in: Machine Learning for Multimodal Interaction V (Proceedings of MLMI 2008, Utrecht, 8-10 September 2008), pages 273-284, Springer-Verlag, 2008.
 
Popescu-Belis, A., Boertjes, E., Kilgour, J., Poller, P., Castronovo, S., Wilson, T., Jaimes, A. and Carletta, J., The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, in: Machine Learning for Multimodal Interaction V, pages 272-283, Springer-Verlag, Utrecht, 2008. [DOI]
 
Popescu-Belis, A., Baudrion, P., Flynn, M. and Wellner, P., Towards an objective test for meeting browsers: the bet4tqb pilot experiment, in: Machine Learning for Multimodal Interaction IV, pages 108-119, Springer-Verlag, 2008. [DOI]
 

2007

Aloise, F., Caporusso, N., Mattia, D., Babiloni, F., Kauhanen, L., Millán, J. del R., Nuttin, M., Marciani, M. G. and Cincotti, F., Brain-machine interfaces through control of electroencephalographic signals and vibrotactile feedback, in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007.
 
Anguera, X., Wooters, C. and Hernando, J., Acoustic Beamforming for Speaker Diarization of Meetings, in: to appear in IEEE Transactions on Audio, Speech and Language Processing, 2007.
 
Anguera, X., Wooters, C., Pardo, J. M. and Hernando, J., Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings, in: Proc. ICASSP, Honolulu, 2007.
 
Anguera, X., Shinozaki, T., Wooters, C. and Hernando, J., Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization, in: Proc. ICASSP, Honolulu, 2007.
 
Ansari-Asl, K., Chanel, G. and Pun, T., A channel selection method for eeg classification in emotion assessment based on synchronization likelihoo, in: Eusipco 2007, 15th Eur. Signal Proc. Conf., 2007.
 
Aradilla, G., Vepa, J. and Bourlard, H., An acoustic model based on kullback-leibler divergence for posterior features, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
 
Aradilla, G. and Ajmera, J., Detection and recognition of number sequences within spoken utterances, in: 2nd Workshop on Speech in Mobile and Pervasive Environments, 2007.
 
Aradilla, G. and Bourlard, H., Posterior-based features and distances in template matching for speech recognition, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 204-214, 2007. [DOI]
 
Ba, S., Joint head tracking and pose estimation for visual focus of attention recognition, École Polytechnique Fédérale de Lausanne, 2007.
 
Ba, S. and Odobez, J. -M., Probabilistic head pose tracking evaluation in single and multiple camera setups, in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007.
 
Bay, H., Ess, A., Tuytelaars, T. and van Gool, L., Speeded-up robust features (surf), in: Computer Vision and Image Understanding (CVIU), 2007.
 
Behera, A., Lalanne, D. and Ingold, R., Docmir: an automatic document-based indexing system for meeting retrieval, in: Multimedia Tools and Applications, volume 37, number 2, 2007.
 
Bengio, S. and Mariéthoz, J., Biometric person authentication is a multiple classifier problem, in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007.
 
Bertini, E., Hertzog, P. and Lalanne, D., Spiralview: a visual tool to improve monitoring and understanding of security data in corporate, in: IEEE Symposium on Visual Analytics Science and Technology 2007 (VAST'07), pages to appear, 2007.
 
Bertolami, R. and Bunke, H., Multiple classifier methods for offline handwritten text line recognition, in: Multiple Classifier Systems, pages 72-81, Springer, 2007.
 
Bertolami, R., Uchida, S., Zimmermann, M. and Bunke, H., Non-uniform slant correction for handwritten text line recognition, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 18-22, 2007.
 
Besson, P., Popovici, V., Vesin, J. M., Thiran, J. -Ph. and Kunt, M., Extraction of audio features specific to speech production for multimodal speaker detection, in: IEEE Transactions on Multimedia, 2007. [DOI]
 
Bogdanova, I., Bresson, X., Thiran, J. -Ph. and Vandergheynst, P., Scale-space analysis and active contours for omnidirectional images, in: IEEE Transactions on Image Processing, volume 16, number 7, pages 1888-1901, 2007. [DOI]
 
Bologna, G., Deville, B., Pun, T. and Vinckenbosch, M., Identifying major components of pictures by audio encoding of colors, in: IWINAC2007, 2nd. Int. Work-conf. on the Interplay between Natural and Artificial Computation, 2007.
 
Bologna, G., Deville, B., Pun, T. and Vinckenbosch, M., Transforming 3d coloured pixels into musical instrument notes for vision substitution applications, in: Eurasip J. of Image and Video Processing, Special Issue: Image and Video Processing for Disability, accepted for publication, 2007.
 
Bouillon, P., Flores, G., Starlander, M., Chatzichrisafis, N., Santaholma, M., Tsourakis, N., Rayner, M. and Hockey, B. A., A bidirectional grammar-based medical speech translator, in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 41-48, ACL 2007, Prague, Czech Republic, 2007.
 
Bouillon, P., Chatzichrisafis, N., Halimi, S., Hockey, B. A., Isahara, H., Kanzaki, K., Nakao, Y., Novellas Vall, B., Rayner, M., Santaholma, M. and Starlander, M., Medslt: a multi-lingual grammar-based medical speech translator, in: Proceedings of First International Workshop on Intercultural Collaboration, IWIC2007, Kyoto, Japan, 2007.
 
Bouillon, P., Rayner, M., Novellas Vall, B., Starlander, M., Santaholma, M., Nakao, Y. and Chatzichrisafis, N., Une grammaire partagée multi-tâche pour le traitement de la parole : application aux langues romanes, in: TAL (Traitement Automatique des Langues), volume 47, number 3, 2007.
 
Bray, M., Koller-Meier, E. and van Gool, L., Smart particle filtering for high-dimensional tracking, in: Computer Vision and Image Understanding, 2007.
 
Bresson, X., Esedoglu, S., Vandergheynst, P., Thiran, J. -Ph. and Osher, S., Fast Global Minimization of the Active Contour/Snake Model, in: Journal of Mathematical Imaging and Vision, volume 28, number 2, pages 151-167, 2007. [DOI]
 
Broschart, M., de Negueruela, C., Millán, J. del R. and Menon, C., Augmenting astronaut's capabilities through brain-machine interfaces, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007.
 
Bruno, E., Kludas, J. and Marchand-Maillet, S., Combining multimodal preferences for multimedia information retrieval, in: ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2007.
 
Bruno, E., Kludas, J. and Marchand-Maillet, S., Combining multimodal preferences for multimedia information retrieval, in: Proc. of International Workshop on Multimedia Information Retrieval, 2007.
 
Bunke, H. and Neuhaus, M., Graph matching -- exact and error-tolerant methods and the automatic learning of edit costs, in: Mining Graph Data, pages 17-34, Wiley, 2007.
 
Bunke, H., Dickinson, P., Humm, A., Irniger, C. and Kraetzl, M., Graph sequence visualisation and its application to computer network monitoring and abnormal event detection, in: Applied Graph Theory in Computer Vision and Pattern Recognition, pages 227-245, Springer, 2007.
 
Bunke, H. and Varga, T., Off-line Roman cursive handwriting recognition, in: Digital Document Processing: Major Directions and Recent Advances, volume 20, pages 165-173, 2007.
 
Cetin, O., Kantor, A., King, S., Bartels, C., Magimai-Doss, M., Frankel, J. and Livescu, K., An Articulatory Feature-based Tandem Approach and Factored Observation Modeling, in: Proc. ICASSP, Honolulu, 2007.
 
Chanel, G., Ansari-Asl, K. and Pun, T., Valence-arousal evaluation using physiological signals in an emotion recall paradigm, in: 2007 IEEE SMC, Int. Conf. on Systems, Man and Cybernetics, Smart cooperative systems and cybernetics: advancing knowledge and security for humanity, 2007.
 
Chavarriaga, R., Ferrez, P. W. and Millán, J. del R., To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
 
Chavarriaga, R., Ferrez, P. W. and del R. Millán, J., To err is human: learning from error potentials in brain-computer interfaces, in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
 
Chen, L., Barber, D. and Odobez, J. -M., Dynamical dirichlet mixture model, number 02, 2007.
 
Chiappa, S. and Barber, D., Bayesian factorial linear gaussian state-space models for biosignal decomposition, in: IEEE Signal Processing Letters, 2007.
 
Cincotti, F., Mattia, D., Aloise, F., Bufalari, S., Astolfi, L., Fallani, F. De Vico, Tocci, A., Bianchi, L., Marciani, M. G., Gao, S., Millán, J. del R. and Babiloni, F., High-resolution eeg techniques for brain-computer interface applications, in: Journal of Neuroscience Methods, volume 167, pages 31-42, ISSN 0165-0270, 2007.
 
Cincotti, F., Kauhanen, L. and Aloise, F., Vibrotactile feedback for brain-computer interface operation, in: Computational Intelligence and Neuroscience, volume 2007, pages Article ID, 2007.
 
Cuendet, S., Shriberg, E., Favre, B., Fung, J. and Hakkani-Tur, D., An analysis of sentence segmentation features for broadcast news, broadcast conversations, and meetings, in: SIGIR Workshop on Searching Conversational Spontaneous Speech, 2007.
 
Cuendet, S., Hakkani-Tur, D. and Shriberg, E., Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech, in: to appear in Proceedings of MLMI, Brno, Czech Republic, 2007.
 
Cuendet, S., Hakkani-Tur, D., Shriberg, E., Fung, J. and Favre, B., Cross-Genre Feature Comparisons for Spoken Sentence Segmentation, in: International Conference on Semantic Computing (ICSC), Irvine, CA, 2007.
 
Dessimoz, D., Richiardi, J., Champod, C. and Drygajlo, A., Multimodal biometrics for identity documents (MBioID), in: Forensic Science International, volume 167, pages 154-159, 2007. [DOI]
 
Dines, J. and Magimai-Doss, M., A study of phoneme and grapheme based context-dependent asr systems, number 12, 2007.
 
Dines, J. and Vepa, J., Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, number 13, 2007.
 
Dornhege, G., del R. Millán, J., Hinterberger, T., McFarland, D. and Müller, K. -R., Towards brain-computer interfacing, The MIT Press, 2007.
 
Drugman, T., Gurban, M. and Thiran, J. -Ph., Relevant Feature Selection for Audio-Visual Speech Recognition, in: 9th International Workshop on Multimedia Signal Processing (MMSP), Chania, Crete, Greece, 2007.
 
Drygajlo, A., Man-machine voice communication, pages 433-461, EPFL Press, 2007. [DOI]
 
Drygajlo, A., Multimodal biometrics for identity documents and smart cards european challenge, in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
 
Einsele, F., Hennebert, J. and Ingold, R., Towards identification of very low resolution, anti-aliased characters, in: IEEE International Symposium on Signal Processing and its Applications (ISSPA'07), Sharjah, United Arab Emirates, 2007.
 
Ess, A., Leibe, B. and van Gool, L., Depth and appearance for mobile scene analysis, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Ess, A., Neubeck, A. and van Gool, L., Generalised linear pose estimation, in: BMVC, 2007.
 
Evéquoz, F. and Lalanne, D., Indexing and visualizing digital memories through personal email archive, pages 21-24, 2007.
 
Evéquoz, F. and Lalanne, D., Personal information management through interactive visualizations, pages 158-160, 2007.
 
Ferrez, P. W., Error-related eeg potentials in brain-computer interfaces, École Polytechnique Fédérale de Lausanne, 2007.
 
Ferrez, P. W. and Millán, J. del R., Error-related eeg potentials in brain-computer interfaces, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
Frapolli, F., Hirsbrunner, B. and Lalanne, D., Dynamic rules: towards interactive games intelligence, in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 29-32, 2007.
 
Galán, F., Nuttin, M., Lew, E., Ferrez, P. W., Vanacker, G., Philips, J., van Brussel, H. and Millán, J. del R., An asynchronous and non-invasive brain-actuated wheelchair, in: Proceedings of the 13th International Symposium on Robotics Research, 2007.
 
Galán, F., Ferrez, P. W., Oliva, F., Guàrdia, J. and del R. Millán, J., Feature extraction for multi-class bci using canonical variates analysis, number 23, 2007.
 
Galán, F., Palix, J., Chavarriaga, R., Ferrez, P. W., Lew, E., Hauert, C. -A. and Millán, J. del R., Visuo-spatial attention frame recognition for brain-computer interfaces, in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007.
 
Gaudard, C., Aradilla, G. and Bourlard, H., Speech recognition based on template matching and phone posterior probabilities, number 02, 2007.
 
Georgescul, M., Clark, A. and Armstrong, S., Exploiting structural meeting-specific features for topic segmentation, in: Actes de la 14ème Conférence sur le Traitement Automatique des Langues Naturelles, Toulouse, France, 2007.
 
Gerber, M., Kaufmann, T. and Pfister, B., Perceptron-based class verification, in: Proceedings of NOLISP (ISCA Workshop on non linear speech processing), 2007.
 
Gerber, M., Beutler, R. and Pfister, B., Quasi text-independent speaker verification based on pattern matching, in: Proceedings of Interspeech, ISCA, 2007.
 
Germann, M., Breitenstein, M. D., Park, I. K. and Pfister, H., Automatic pose estimation for range images on the gpu, in: Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007), pages 81-90, IEEE Computer Society, 2007.
 
Grangier, D. and Bengio, S., Learning the inter-frame distance for discriminative template-based keyword detection, in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007.
 
Graves, A., Liwicki, M. and Bunke, H., Unconstrained on-line handwriting recognition with recurrent neural networks, in: Advances in Neural Information Processing, 2007.
 
Gurban, M., Valles, A. and Thiran, J. -Ph., Low-Dimensional Motion Features for Audio-Visual Speech Recognition, in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, Poznan, Poland, 2007.
 
Guz, U., Cuendet, S., Hakkani-Tur, D. and Tur, G., Co-training Using Prosodic and Lexical Information for Sentence Segmentation, in: to appear in Proceedings of Interspeech, Antwerp, 2007.
 
Hakkani-Tur, D. and Tur, G., Statistical Sentence Extraction for Information Distillation, in: Proc. ICASSP, Honolulu, 2007.
 
Hennebert, J., Loeffel, R., Humm, A. and Ingold, R., A new forgery scenario based on regaining dynamics of signature, in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
 
Hennebert, J., Humm, A. and Ingold, R., Modelling spoken signatures with gaussian mixture model adaptation, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07), 2007.
 
Hennebert, J., Please repeat: my voice is my password. from the basics to real-life implementations of speaker verification technologies, in: Invited lecture at the Information Security Summit (IS2 2007), Prague, 2007.
 
Heusch, G. and Marcel, S., A novel statistical generative model dedicated to face recognition, number Idiap-RR-39-2007, 2007.
 
Heusch, G. and Marcel, S., Face authentication with salient local features and static bayesian network, in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007.
 
Hoffmann, U., Vesin, J. M. and Ebrahimi, T., Recent advances in brain-computer interfaces, in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
 
Huang, Y., Vinyals, O., Friedland, G., Müller, C., Mirghafori, N. and Wooters, C., A Fast-Match approach for robust, faster than real-time Speaker Diarization, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Huang, Y., Robust and rapid speaker diarization, in: Master Thesis, University of California, Berkeley, 2007.
 
Huang, Y., Friedland, G., Müller, C. and Mirghafori, N., Speeding up speaker diarization by using prosodic features, in: Technical Report TR-07-004, International Computer Science Institute, Berkeley, California, 2007.
 
Huijbregts, M., Wooters, C. and Ordelman, R., Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections, in: to appear in Proceedings of Interspeech, Antwerp, 2007.
 
Huijbregts, M. and Wooters, C., The Blame Game: Performance Analysis of Speaker Diarization System Components, in: to appear in Proc. Interspeech, Antwerp., 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Database and evaluation protocols for user authentication using combined handwriting and speech modalities, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Hidden markov models for spoken signature verification, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Modelling combined handwriting and speech modalities, in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Spoken handwriting verification using statistical models, in: Accepted for publication, International Conference on Document Analysis and Recognition (ICDAR 07), Curitiba Brazil, 2007.
 
Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in a group meeting, 2007.
 
Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in a group meeting multi-layer background subtraction based on color and texture, in: Proc. ACM Multi Media, Augsburg, Germany, 2007.
 
Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in meetings, in: Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany, 2007.
 
Hwang, M. -Y., Peng, G., Wang, W., Faria, A., Heidel, A. and Ostendorf, M., Building a Highly Accurate Mandarin Speech Recognizer, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Hérault, R. and Grandvalet, Y., Sparse probabilistic classifiers, in: International Conference on Machine Learning (ICML), 2007.
 
Jaeggli, T., Koller-Meier, E. and van Gool, L., Learning generative models for monocular body pose estimation, in: ACCV, 2007.
 
Jaeggli, T., Koller-Meier, E. and van Gool, L., Multi-activity tracking in lle body pose space, in: 2nd Workshop on HUMAN MOTION Understanding, Modeling, Capture and Animation, ICCV, 2007.
 
Jaimes, A., Gatica-Perez, D., Sebe, N. and Huang, T. S., Guest Editors' Introduction: Human-Centered Computing-Toward a Human Revolution, in: Computer, volume 40, number 5, pages 30-34, 2007.
 
Jaimes, A., Gatica-Perez, D., Sebe, N. and Huang, T. S., Human-centered computing: toward a human revolution, in: IEEE Computer, volume 40, number 5, 2007. [DOI]
 
Kaufmann, T. and Pfister, B., An HPSG parser supporting discontinuous licenser rules, in: International Conference on HPSG, 2007.
 
Kaufmann, T. and Pfister, B., Applying licenser rules to a grammar with continuous constituents, in: The Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, 2007.
 
Keshet, J., Theoretical foundations for large-margin kernel-based continuous speech recognition, number Idiap-RR-44-2007, 2007.
 
Kittler, J., Poh, N., Fatukasi, O., Messer, K., Kryszczuk, K., Richiardi, J. and Drygajlo, A., Quality dependent fusion of intramodal and multimodal biometric experts, in: Proc. SPIE Defense and Security Symposium, 2007.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Information fusion in multimedia information retrieval, in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
 
Knox, M. and Mirghafori, N., Automatic Laughter Detection Using Neural Networks, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Kokiopoulou, E. and Frossard, P., Accelarating distributed consensus using extrapolation, in: IEEE Signal Processing Letters, volume 14, number 10, pages 665-668, 2007.
 
Kokiopoulou, E. and Frossard, P., Accelerating Distributed Consensus Using Extrapolation, in: IEEE Signal Processing Letters, volume 14, number 10, 2007. [DOI]
 
Kokiopoulou, E. and Frossard, P., Dimensionality Reduction with Adaptive Approximation, in: IEEE Int. Conf. on Multimedia & Expo (ICME), Beijing, China, 2007.
 
Kokiopoulou, E. and Frossard, P., Image alignment with rotation manifolds built on sparse geometric expansions, in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
 
Kolar, J., Liu, Y. and Shriberg, E., Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Koval, O., Voloshynovskiy, S. and Pun, T., Analysis of multimodal binary detection systems based on dependent/independent modalities, in: Proceedings of the IEEE 2007 International Workshop on Multimedia Signal Processing, 2007.
 
Koval, O., Voloshynovskiy, S. and Pun, T., Error exponent analysis of person identification based on fusion of dependent/independent modalities, in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
 
Kron, E., Rayner, M., Santaholma, M. and Bouillon, P., A development environment for building grammar-based speech-enabled applications, in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 49-52, ACL 2007, Prague, Czech Republic, 2007.
 
Kronegg, J., Chanel, G., Voloshynovskiy, S. and Pun, T., Eeg-based synchronized brain-computer interfaces: a model for optimizing the number of mental tasks, in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 15, number 1, pages 50-58, 2007.
 
Kryszczuk, K. and Drygajlo, A., Improving classification with class-independent quality measures: q-stack in face verification, in: Proc. 2nd Int. Conference in Biometrics (ICB 2007), 2007.
 
Kryszczuk, K. and Drygajlo, A., Q-stack: uni- and multimodal classifier stacking with quality measures, in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
 
Kryszczuk, K., Richiardi, J. and Drygajlo, A., Reliability estimation for multimodal error prediction and fusion, in: Proc. 7th Int. Workshop on Pattern Recognition in Information Systems (PRIS 2007), 2007.
 
Kryszczuk, K., Richiardi, J., Prodanov, P. and Drygajlo, A., Reliability-based decision fusion in multimodal biometric verification systems, in: EURASIP Journal of Advances in Signal Processing, 2007.
 
Kumatani, K., Mayer, H., Gehrig, T., Stoimenov, E., McDonough, J. and Wölfel, M., Adaptive beamforming with a minimum mutual information criterion, pages 2527--2541, 2007. [DOI]
 
Kumatani, K., Mayer, H., Gehrig, T., Stoimenov, E., McDonough, J. and Wölfel, M., Minimum mutual information beamforming for simultaneous active speakers, in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 71-76, Kyoto, 2007. [DOI]
 
Lalanne, D., Evéquoz, F., Rigamonti, M., Dumas, B. and Ingold, R., An ego-centric and tangible approach to meeting indexing and browsing, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI'07), pages to appear, 2007.
 
Lalanne, D., Evéquoz, F., Chiquet, H., Müller, M., Radgohar, M. and Ingold, R., Going through digital versus physical augmented gaming, in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 41-44, 2007.
 
Lalanne, D. and van den Hoven, E., Supporting human memory with interactive systems, pages 215-216, 2007.
 
Lalanne, D., Bertini, E., Hertzog, P. and Bados, P., Visual analysis of corporate network intelligence: abstracting and reasoning on yesterdays for acting today, 2007.
 
Laptev, I., Caputo, B. and Lindberg, T., Local velocity-adapted motion events for spatio-temporal recognition, in: Computer Vision and Image Undertanding, volume 108, number 3, pages 207-229, ISSN 1077-3142, 2007.
 
Lathoud, G. and Odobez, J. -M., Short-term spatio-temporal clustering applied to multiple moving speakers, in: IEEE Transactions on Audio, Speech and Language Processing, 2007.
 
Lei, H. and Mirghafori, N., Word-Conditioned HMM Supervectors for Speaker Recognition, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Lei, H. and Mirghafori, N., Word-conditioned phone N-grams for speaker recognition, in: Proc. ICASSP, Honolulu, 2007.
 
Leibe, B., Schindler, K. and van Gool, L., Coupled detection and trajectory estimation for multi-object tracking, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Leibe, B., Cornelis, N., Cornelis, K. and van Gool, L., Dynamic 3d scene analysis from a moving vehicle, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
 
Levit, M., Hakkani-Tur, D., Tur, G. and Gillick, D., Integrating several annotation layers for statistical information distillation, in: Workshop on Automatic Speech Recognition and Understanding, 2007.
 
Levit, M., Hakkani-Tur, D., Tur, G. and Gillick, D., Integrating Several Annotation Layers for Statistical Information Distillation, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Li, W. and Bourlard, H., Non-linear spectral stretching for in-car speech recognition, in: Interspeech, 2007.
 
Li, W., Dines, J. and Magimai-Doss, M., Robust overlapping speech recognition based on neural networks, number Idiap-RR-55-2007, 2007.
 
Lisowska, A., Betrancourt, M., Armstrong, S. and Rajman, M., Minimizing modality bias when exploring input preference for multimodal systems in new domains: the archivus case study, in: CHI' 07, San José, California, 2007.
 
Lisowska, A., Armstrong, S., Melichar, M., Ailomaa, M. and Rajman, M., The wizard of oz meets multimodal language-enabled gui interfaces: new challenges, in: Proceedings of CHI' 07, San José, California, 2007.
 
Liu, Y. and Shriberg, E., Comparing Evaluation Metrics for Sentence Boundary Detection, in: Proc. ICASSP, Honolulu, 2007.
 
Livescu, K., Cetin, O., Hasegawa-Johnson, M., King, S., Bartels, C., Borges, N., Kantor, A., Lal, P., Yung, L., Bezman, A., Dawson-Haggerty, S., Woods, B., Frankel, J., Magimai-Doss, M. and Saenko, K., Articulatory Feature-based Methods for Acoustic and Audio-visual speech Recognition: Summary from the 2006 JHU Summer Workshop, in: Proc. ICASSP, Honolulu, 2007.
 
Livescu, K., Bezman, A., Borges, N., Yung, L., Cetin, O., Frankel, J., King, S., Magimai-Doss, M., Chi, X. and Lavoie, L., Manual Transcription of Conversational Speech at the Articulatory Feature Level, in: Proc. ICASSP, Honolulu, 2007.
 
Liwicki, M., Graves, A., Bunke, H. and Schmidhuber, J., A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 367-371, 2007.
 
Liwicki, M., Schlapbach, A., Loretan, P. and Bunke, H., Automatic detection of gender and handedness from on-line handwriting, in: Proc. 13th Conf. of the Graphonomics Society, pages 179-183, 2007.
 
Liwicki, M. and Bunke, H., Combining on-line and off-line systems for handwriting recognition, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 372-376, 2007.
 
Liwicki, M. and Bunke, H., Feature selection for on-line handwriting recognition of whiteboard notes, in: Proc. 13th Conf. of the Graphonomics Society, pages 101-105, 2007.
 
Liwicki, M. and Bunke, H., Handwriting recognition of whiteboard notes -- studying the influence of training set size and type, in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 21, number 1, pages 83-98, 2007.
 
Liwicki, M., Indermühle, E. and Bunke, H., On-line handwritten text line detection using dynamic programming, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 447-451, 2007.
 
Lovitt, A., Correcting confusion matrices for phone recognizers, number 03, 2007.
 
Lovitt, A., Pinto, J. P. and Hermansky, H., On confusions in a phoneme recognizer, 2007.
 
Lovitt, A., Truncation confusion patterns in onset consonants, in: Interspeech 2007, 2007.
 
Lüthy, F., Varga, T. and Bunke, H., Using hidden Markov models as a tool for handwritten text line segmentation, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 8-12, 2007.
 
Magimai-Doss, M., Hakkani-Tur, D., Cetin, O., Shriberg, E., Fung, J. and Mirghafori, N., Entropy Based Classifier Combination for Sentence Segmentation,, in: Proc. ICASSP, Honolulu, 2007.
 
Marcel, S., Abbet, P. and Guillemot, M., Google portrait, number Idiap-Com-07-2007, 2007.
 
Marcel, S., Joint bi-modal face and speaker authentication using explicit polynomial expansion, number 14, 2007.
 
Marcel, S., Rodriguez, Y. and Heusch, G., On the recent use of local binary patterns for face authentication, in: International Journal on Image and Video Processing Special Issue on Facial Image Processing, 2007.
 
Marcel, S. and del R. Millán, J., Person authentication using brainwaves (eeg) and maximum a posteriori model adaptation, in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics, 2007.
 
Marchand-Maillet, S., Bruno, E., Nürnberger, A. and Detyniecki, M., Adaptive multimedia retrieval: user, context and feedback, Springer, 2007.
 
Mariéthoz, J. and Bengio, S., A kernel trick for sequences applied to text-independent speaker verification systems, in: Pattern Recognition, volume 40, number 8, ISSN 0031-3203, 2007.
 
McCowan, I., Maganti, H. K. and Gatica-Perez, D., Speech enhancement and recognition in meetings with an audio-visual sensor array, in: IEEE Trans. on Audio, Speech, and Language Processing, volume 15, number 8, pages 2257-2269, 2007.
 
Mesot, B. and Barber, D., A bayesian switching linear dynamical system for scale-invariant robust speech extraction, 2007.
 
Mesot, B. and Barber, D., A gaussian sum smoother for inference in switching linear dynamical systems, 2007.
 
Meynet, J., Popovici, V. and Thiran, J. -Ph., Face Detection with Boosted Gaussian Features, in: Pattern Recognition, volume 40, number 8, pages 2283-2291, 2007. [DOI]
 
Meynet, J. and Thiran, J. -Ph., Information Theoretic Combination of Classifiers with Application to AdaBoost, in: 7th international Workshop on Multiple Classifier Systems (MCS), Prague, Prague, 2007.
 
Meynet, J., Popovici, V. and Thiran, J. -Ph., Mixtures of Boosted Classifiers for Frontal Face Detection, in: Signal, Image and Video Processing, volume 1, number 1, pages 29-38, 2007. [DOI]
 
Millán, J. del R., Buttfield, A., Vidaurre, C., Krauledat, M., Schlögl, A., Shenoy, P., Blankertz, B., Rao, R. P. N., Cabeza, R., Pfurtscheller, G. and Müller, K. -R., Adaptation in brain-computer interfaces, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
Millán, J. del R., Ferrez, P. W., Galán, F., Lew, E. and Chavarriaga, R., Non-invasive brain-actuated interaction, in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007. [DOI]
 
Millán, J. del R., Ferrez, P. W. and Buttfield, A., The idiap brain-computer interface: an asynchronous multi-class approach, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
Monay, F., Learning the structure of image collections with latent aspect models, in: ., 2007.
 
Monay, F. and Gatica-Perez, D., Modeling semantic aspects for cross-media image indexing, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, pages 1802-1817, ISSN 0162-8828, 2007. [DOI]
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Automatic image annotation with relevance feedback and latent semantic analysis, in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Hierarchical long-term learning for automatic image, in: International Conference on Semantics And digital Media Technologies (SAMT 2007), 2007.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Hierarchical long-term learning for automatic image annotation, in: Proceedings 2nd International Conference on Semantic and Digital Media Technologies, 2007.
 
Motlicek, P., Hermansky, H., Ganapathy, S. and Garudadri, H., Frequency domain linear prediction for qmf sub-bands and applications to audio coding, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
 
Motlicek, P., Hermansky, H., Ganapathy, S., Garudadri, H. and Srinivasamurthy, N., Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes, in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
 
Motlicek, P., Ganapathy, S., Hermansky, H. and Garudadri, H., Scalable wide-band audio codec based on frequency domain linear prediction, number 16, 2007.
 
Müller, C. and Burkhardt, F., Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Müller, P., Zeng, G., Wonka, P. and van Gool, L., Image-based procedural modeling of facades, in: Proceedings of ACM SIGGRAPH 2007 / ACM Transactions on Graphics, ACM Press, 2007.
 
Neuhaus, M. and Bunke, H., A quadratic programming approach to the graph edit distance problem, in: Graph-Based Representations in Pattern Recognition, pages 92-102, Springer, 2007.
 
Neuhaus, M. and Bunke, H., Bridging the gap between graph edit distance and kernel machines, Machine Perception and Artificial Intelligence, volume 68, World Scientific, ISBN 978-981-270-817-5, 2007.
 
Noris, B., Benmachiche, K., Meynet, J., Thiran, J. -Ph. and Billard, A., Analysis of Head Mounted Wireless Camera Videos for Early Diagnosis of Autism, in: International Conference on Recognition Systems, 2007.
 
Odobez, J. -M. and Ba, S., A cognitive and unsupervised map adaptation approach to the recognition of the focus of attention from head pose, in: International Conference on Multi-Media & Expo (ICME07), 2007.
 
Orabona, F., Castellini, C., Caputo, B., Luo, J. and Sandini, G., Indoor place recognition using online independent support vector machines, in: 18th British Machine Vision Conference (BMVC07), pages 1090-1099, Warwick, UK, 2007.
 
Orabona, F., Castellini, C., Caputo, B., Luo, J. and Sandini, G., On-line independent support vector machines for cognitive systems, number Idiap-RR-63-2007, 2007.
 
Ozden, K. E., Schindler, K. and van Gool, L., Simultaneous segmentation and 3d reconstruction of monocular image sequences, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Pallotta, V., Seretan, V. and Ailomaa, M., User requirement analysis for meeting information retrieval based on query elicitation, in: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), pages 1008-1015, Association for Computational Linguistics, 2007.
 
Pardo, J. M., Anguera, X. and Wooters, C., Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information, in: to appear in IEEE Transactions on Computers, 2007.
 
Paugam-Moisy, H., Martinez, R. and Bengio, S., A supervised learning approach based on stdp and polychronization in spiking neuron networks, in: European Symposium on Artificial Neural Networks, ESANN, 2007.
 
Perrin, X., Chavarriaga, R., Siegwart, R. and del R. Millán, J., Bayesian controller for a novel semi-autonomous navigation concept, in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007.
 
Philips, J., Millán, J. del R., Vanacker, G., Lew, E., Galán, F., Ferrez, P. W., van Brussel, H. and Nuttin, M., Adaptive shared control of a brain-actuated simulated wheelchair, in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, pages 408-414, 2007. [DOI]
 
Piccardi, L., Noris, B., Barbey, O., Schiavone, G., Keller, F., Von Hofsten, C. and Billard, A., Wearcam: a head mounted wireless camera for monitoring gaze attention and for the diagnosis of developmental disorders in young children, in: 16th IEEE International Symposium on Robot & Human Interactive Communication, RO-MAN, 2007.
 
Pinto, J. P., Bourlard, H., Graves, A. and Hermansky, H., Comparing different word lattice rescoring approaches towards keyword spotting, number 32, 2007.
 
Pinto, J. P., Lovitt, A. and Hermansky, H., Exploiting phoneme similarities in hybrid hmm-ann keyword spotting, in: Proceedings of Interspeech, 2007.
 
Pinto, J. P., R. M., P., Yegnanarayana, B. and Hermansky, H., Significance of contextual information in phoneme recognition, 2007.
 
Plauché, M., Cetin, O. and Uhdaykumar, N., How to build a spoken dialog system with limited (or no) resources, in: AI in ICT for Development Workshop of the Twentieth Intl. Joint Conf. on AI, Hyderabad, India, 2007.
 
Popescu-Belis, A. and Zufferey, S., Contrasting the automatic identification of two discourse markers in multiparty dialogues, in: Proceedings of SIGDIAL 2007, pages 10, Antwerp, Belgium, 2007.
 
Popescu-Belis, A., Evaluation of nlg: some analogies and differences with mt and reference resolution, in: MT Summit XI Workshop on Using Corpora for NLG and MT (UCNLG MT), pages 66-68, 2007.
 
Popescu-Belis, A. and Estrella, P., Generating usable formats for metadata and annotations in a large meeting corpus, in: ACL 2007, pages 93-96, ACL 2007, Prague, Czech Republic, 2007.
 
Popescu-Belis, A., Le rôle des métriques d'évaluation dans le processus de recherche en tal, in: TAL (Traitement Automatique des Langues), volume 47, number 2, 2007.
 
Prasanna, S. R. Mahadeva, Yegnanarayana, B., Pinto, J. P. and Hermansky, H., Analysis of confusion matrix to combine evidence for phoneme recognition, number 27, 2007.
 
Pronobis, A. and Caputo, B., Confidence-based cue integration for visual place recognition, number 17, 2007.
 
Quack, T., Ferrari, V., Leibe, B. and van Gool, L., Efficient mining of frequent and distinctive feature configurations, in: accepted for ICCV'07, 2007.
 
Quack, T., Ferrari, V., Leibe, B. and van Gool, L., Efficient mining of frequent and distinctive feature configurations, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Quelhas, P., Odobez, J. -M., Gatica-Perez, D. and Tuytelaars, T., A thousand words in a scene, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, number 9, pages 151575-1589, 2007. [DOI]
 
del R. Millán, J., Tapping the mind or resonating minds?, in: European Visions for the Knowledge Age, Cheshire Henbury, 2007.
 
Rakotomamonjy, A., Bach, F., Canu, S. and Grandvalet, Y., More efficiency in multiple kernel learning, in: International Conference on Machine Learning (ICML), 2007.
 
Renals, S., Hain, T. and Bourlard, H., Recognition and understanding of meetings the ami and amida projects, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, pages 238-247, Kyoto, 2007. [DOI]
 
Richiardi, J., Kryszczuk, K. and Drygajlo, A., Quality measures in unimodal and multimodal biometric verification, in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
 
Richiardi, J. and Drygajlo, A., Reliability-based voting schemes using modality-independent features in multi-classifier biometric authentication, in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
 
Riesen, K., Neuhaus, M. and Bunke, H., Bipartite graph matching for computing the edit distance of graphs, in: Graph-Based Representations in Pattern Recognition, pages 1-12, Springer, 2007.
 
Riesen, K., Neuhaus, M. and Bunke, H., Graph embedding in vector spaces by means of prototype selection, in: Graph-Based Representations in Pattern Recognition, pages 383-393, Springer, 2007.
 
Rigamonti, M., Lalanne, D. and Ingold, R., Faericworld: browsing multimedia events through static documents and links, in: In proc. of INTERACT 2007, pages to appear, Springer-Verlag, 2007.
 
Romsdorfer, H. and Pfister, B., Text analysis and language identification for polyglot text-to-speech synthesis, in: Speech Communication (Elsevier), 2007.
 
Rytsar, R. and Pun, T., Computational aspects of the eeg forward problem solution for real head model using finite element, in: 29th Annual Int. Conf. IEEE Engineering in Medicine and Biology Society, 2007.
 
Schindler, K., Suter, D. and H. Wang, , A model-selection framework for multibody structure-and-motion of image sequences, in: International Journal of Computer Vision, volume 79, number 2, pages 159-177, 2007.
 
Schlapbach, A. and Bunke, H., A writer identification and verification system using HMM based recognizers, in: Pattern Analysis and Applications, volume 10, number 1, pages 33-43, 2007.
 
Schlapbach, A. and Bunke, H., Fusing asynchronous feature streams for on-line writer identification, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 103-107, 2007.
 
Shriberg, E., Higher level features in speaker recognition, in: Speaker Classification I, Lecture Notes in Computer Science, Springer, 2007.
 
Smith, K., Bayesian methods for visual multi-object tracking with applications to human activity recognition, École Polytechnique Fédérale de Lausanne, 2007.
 
Sorci, M., Antonini, G. and Thiran, J. -Ph., Fisher's Discriminant and Relevant Component Analysis for static facial expression classification, in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, Poznan, Poland, 2007.
 
Starlander, M., Using a wizard of oz as a baseline to determine which system architecture is the best for a spoken language translation system, in: Proceedings of Nodalida 2007, pages 161-164, Tartu, Estonia, 2007.
 
Stolcke, A., Kajarekar, S., Ferrer, L. and Shriberg, E., Speaker recognition with session variability normalization based on mllr adaptation transforms, in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, pages 1987-1998, 2007.
 
Stolcke, A., Kajarekar, S., Ferrer, L. and Shriberg, E., Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms, in: IEEE Transactions on Audio, Speech, and Language Processing, special issue on speaker and language recognition, 2007.
 
Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Janin, A., Magimai-Doss, M., Wooters, C. and Zheng, J., The sri-icsi spring 2007 meeting and lecture recognition system, in: Lecture Notes in Computer Science, 2007.
 
Stoll, L., Frankel, J. and Mirghafori, N., Speaker Recognition Via Nonlinear Discriminant Features, in: Proceedings of NOLISP, Paris, France,, 2007.
 
Szekely, E., Bruno, E. and Marchand-Maillet, S., Clustered multidimensional scaling for exploration in information retrieval, in: International Conference on the Theory of Information Retrieval, 2007.
 
Thomas, A., Ferrari, V., Leibe, B., Tuytelaars, T. and van Gool, L., Depth-from-recognition: inferring metadata by cognitive feedback, in: ICCV'07 Workshop on 3D Representations for Recognition, 2007.
 
Uldry, L., Ferrez, P. W. and del R. Millán, J., Feature selection methods on distributed linear inverse solutions for a non-invasive brain-machine interface, number 04, 2007.
 
Valente, F., Bourlard, H. and Deepu, V., Agglomerative information bottleneck for speaker diarization of meetings data, number 31, 2007.
 
Valente, F. and Hermansky, H., Combination of acoustic classifiers based on dempster-shafer theory of evidence, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
 
Valente, F., Vepa, J., Plahl, C., Gollan, C., Hermansky, H. and Schlüter, R., Hierarchical neural networks feature extraction for lvcsr system, in: Interspeech 2007, 2007.
 
Valente, F., Vepa, J. and Hermansky, H., Multi-stream features combination based on dempster-shafer rule for lvcsr system, in: Interspeech 2007, 2007.
 
Vanacker, G., Millán, J. del R., Lew, E., Ferrez, P. W., Galán, F., Philips, J., van Brussel, H. and Nuttin, M., Context-based filtering for assisted brain-actuated wheelchair driving, in: Computational Intelligence and Neuroscience, volume 2007, pages 3, ISSN 1687-5265, 2007.
 
Villán, R., Voloshynovskiy, S., Koval, O., Deguillaume, F. and Pun, T., Tamper-proofing of Electronic and Printed Text Documents via Robust Hashing and Data-Hiding, in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
 
Vinciarelli, A. and Favre, S., Broadcast news story segmentation using social network analysis and hidden markov models, in: ACM International Conference on Multimedia, pages 261-264, 2007.
 
Vinciarelli, A., Mapping nonverbal communication into social status: automatic recognition of journalists and non-journalists in radio news, number 33, 2007.
 
Vinciarelli, A., Role recognition in broadcast news using social network analysis and duration distribution modeling, in: IEEE Transactions on Multimedia, 2007.
 
Vinciarelli, A. and Favre, S., Role recognition in radio programs using social affiliation networks and mixtures of discrete distributions: an approach inspired by social cognition, number Idiap-RR-40-2007, 2007.
 
Vinciarelli, A., Fernàndez, F. and Favre, S., Semantic segmentation of radio programs using social network analysis and duration distribution modeling, in: IEEE International Conference on Multimedia and Expo (ICME), 2007.
 
Vinyals, O., Friedland, G. and Mirghafori, N., Revisiting a basic function on current CPUs: A fast logarithm implementation with adjustable accuracy, in: ICSI Technical Report number TR-07-002, 2007.
 
Weise, T., Leibe, B. and van Gool, L., Fast 3d scanning with automatic motion compensation, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
 
Wooters, C. and Huijbregts, M., The ICSI RT07s Speaker Diarization System, in: to appear in Lecture Notes in Computer Science, 2007.
 
Yao, J. and Odobez, J. -M., Multi-layer background subtraction based on color and texture, in: CVPR 2007 Workshop on Visual Surveillance (VS2007), pages 1-8, 2007. [DOI]
 
Zacharie, D. G. and Pinto, J. P., Keyword spotting on word lattices, number 22, 2007.
 
Zheng, J., Cetin, O., Hwang, M. -Y., Lei, X., Stolcke, A. and Morgan, N., Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition, in: Proc. ICASSP, Honolulu., 2007.
 
Peralta Menendez, R. Grave de, González Andino, S. L., Ferrez, P. W. and Millán, J. del R., Non-invasive estimates of local field potentials for brain-computer interfaces, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
Fasel, B. and van Gool, L., Interactive museum guide: accurate retrieval of object descriptions, in: Adaptive Multimedia Retrieval: User, Context, and Feedback, pages 179-191, Springer, 2007.
 
van Gool, L., Zeng, G., van den Borre, F. and Müller, P., Towards mass-produced building models, in: Photogrammetric Image Analysis, pages 209-220, Institute of Photogrammetry and Cartography, Technische Universitaet Muenchen, 2007.
 

2006

Alecu, T. I., Voloshynovskiy, S. and Pun, T., The gaussian transform of distributions: definition, computation and application, in: IEEE Trans. on Signal Processing, volume 54, number 8, pages 2976-2995, 2006.
 
Andreani, G., Di Fabbrizio, G., Gilbert, M., Gillick, D., Hakkani-Tur, D. and Lemon, O., Lets DiSCoH: Collecting an Annotated Open Corpus with Dialog Acts and Reward Signals for Natural Language Helpdesks, in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
 
Ba, S. and Odobez, J. -M., A study on visual focus of attention recognition from head pose in a meeting room, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006.
 
Ba, S. and Odobez, J. -M., Recognizing people's focus of attention from head poses: a study, number 42, 2006.
 
Barber, D. and Chiappa, S., Unified inference for variational bayesian linear gaussian state-space models, in: NIPS, 2006.
 
BenZeghiba, M. F. and Bourlard, H., User-customized password speaker verification using multiple reference and background models, in: Speech Communication, volume 8, pages 1200-1213, 2006.
 
Bertolami, R., Halter, B. and Bunke, H., Combination of multiple handwritten text line recognition systems with a recursive approach, in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 61-65, 2006.
 
Buttfield, A. and del R. Millán, J., Online classifier adaptation in brain-computer interfaces, number 16, 2006.
 
Buttfield, A., Ferrez, P. W. and del R. Millán, J., Towards a robust bci: error potentials and online learning, in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 14, number 2, pages 164-168, 2006.
 
Cattin, P. C., Bay, H., van Gool, L. and Székely, G., Retina mosaicing using local features, in: Medical Image Computing and Computer-Assisted Intervention (MICCAI), pages 185-192, 2006.
 
Chanel, G., Kronegg, J., Grandjean, D. and Pun, T., Emotion assessment: arousal evaluation using eeg's and peripheral physiological signals, in: Proc. Int. Workshop Multimedia Content Representation, Classification and Security (MRCS), pages 530-537, Lecture Notes in Computer Science, Springer, 2006.
 
Cheng, O., Dines, J. and Magimai-Doss, M., A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition, number 62, 2006.
 
Chiappa, S., Analysis and classification of eeg signals using probabilistic models for brain computer interfaces, École Polytechnique Fédérale de Lausanne, 2006.
 
Chiquet, H., Evéquoz, F. and Lalanne, D., Elcano, a tangible multimedia browser (demo)., in: Symposium on User Interface Software and Technology (UIST 2006), pages 51-52, 2006.
 
Cuendet, S., Hakkani-Tur, D. and Tur, G., Model Adaptation for Sentence Segmentation from Speech, in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
 
Cuendet, S., Model adaptation for sentence unit segmentation from speech, number 64, 2006.
 
Dimitrakakis, C., Ensembles for sequence learning, École Polytechnique Fédérale de Lausanne, 2006.
 
Everingham, M., Zisserman, A., Williams, C., van Gool, L., Allan, M., Bishop, C., Chapelle, O., Dalal, N., Deselaers, T., Dorko, G., Duffner, S., Eichhorn, J., Farquhar, J., Fritz, M., Garcia, C., Griffiths, T., Jurie, F., Keysers, D., Koskela, M., Laaksonen, J., Larlus, D., Leibe, B., Meng, H., Ney, H., Schiele, B., Schmid, C., Seemann, E., Shawe-Taylor, J., Storkey, A., Szedmak, S., Triggs, B., Ulusoy, I., Viitaniemi, V. and Zhang, J., The 2005 pascal visual object class challenge, in: Selected Proceedings of the 1st PASCAL Challenges Workshop, Lecture Notes in AI, Springer, 2006.
 
Hannani, A., Toledano, D., Petrovska, D., Montero-Asenjo, A. and Hennebert, J., Using data-driven and phonetic units for speaker verification, in: IEEE Speaker and Language Recognition Workshop (Odyssey 2006), Puerto Rico, 2006.
 
Hemptinne, C., Master thesis: integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts), number 69, 2006.
 
Hillard, D., Huang, Z., Ji, H., Grishman, R., Hakkani-Tur, D., Harper, M., Ostendorf, M. and Wang, W., Impact of Automatic Comma Prediction on POS/Name Tagging of Speech, in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
 
Janin, A., Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Frankel, J. and Zheng, J., The ICSI-SRI Spring 2006 Meeting Evaluation System, in: In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006); Lecture Notes in Computer Science. Springer, 2006.
 
Janvier, B., Bruno, E., Marchand-Maillet, S. and Pun, T., Handling temporal heterogeneous data for content-based management of large video collections, in: Multimedia Tools and Applications, volume 30, pages 273-288, 2006.
 
Just, A., Two-handed gestures for human-computer interaction, École Polytechnique Fédérale de Lausanne, 2006.
 
Keller, M. and Bengio, S., A multitask learning approach to document representation using unlabeled data, number 44, 2006.
 
Keller, M., Machine learning approaches to text representation using unlabeled data, Ecole Polytechnique Fédérale de Lausanne, 2006.
 
Ketabdar, H. and Hermansky, H., Identifying unexpected words using in-context and out-of-context phoneme posteriors, number 68, 2006.
 
Kosinov, S., Marchand-Maillet, S., Kozintsev, I., Dulong, C. and Pun, T., Dual diffusion model of spreading activation for content-based image retrieval, in: 8th ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2006.
 
Koval, O., Voloshynovskiy, S., Holotyak, T. and Pun, T., Information-theoretic analysis of steganalysis in real images, in: ACM Multimedia and Security Workshop 2006, 2006.
 
Lathoud, G., Observations on multi-band asynchrony in distant speech recordings, number 74, 2006.
 
Lathoud, G., Spatio-temporal analysis of spontaneous speech with microphone arrays, École Polytechnique Fédérale de Lausanne, 2006.
 
Lathoud, G., Magimai-Doss, M. and Bourlard, H., Unsupervised spectral subtraction for noise-robust asr on unknown transmission channels, number 09, 2006.
 
Leibe, B., Mikolajczyk, K. and Schiele, B., Efficient clustering and matching for object class recognition, in: British Machine Vision Conference (BMVC, 2006.
 
Leibe, B., Cornelis, N., Cornelis, K. and van Gool, L., Integrating recognition and reconstruction for cognitive traffic scene analysis from a moving vehicle, in: DAGM Annual Pattern Recognition Symposium, pages 192-201, Springer, 2006.
 
Leibe, B., Mikolajczyk, K. and Schiele, B., Segmentation based multi-cue integration for object detection, in: British Machine Vision Conference (BMVC, 2006.
 
Liwicki, M. and Bunke, H., HMM-based on-line recognition of handwritten whiteboard notes, in: Proceedings 10th International Workshop Frontiers in Handwriting Recognition, pages 595-599, 2006.
 
Luo, J., Pronobis, A., Caputo, B. and Jensfelt, P., Incremental learning for place recognition in dynamic environments, number 52, 2006.
 
Luo, J., Pronobis, A. and Caputo, B., Svm-based transfer of visual knowledge across robotic platforms, number 65, 2006.
 
Maganti, H. K., Motlicek, P. and Gatica-Perez, D., Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms, number 57, 2006.
 
Marcel, S., Rodriguez, Y., Guillemot, M. and Popescu-Belis, A., Annotation of face detection: description of xml format and files, number 06, 2006.
 
Marcel, S., Keomany, J. and Rodriguez, Y., Robust-to-illumination face localisation using active shape models and local binary patterns, number 47, 2006.
 
Mariéthoz, J., Discrmininant models for text-independent speaker verification, number 70, 2006.
 
Melichar, M., Cenek, P., Ailomaa, M., Lisowska, A. and Rajman, M., From Vocal to Multimodal Dialogue Management, in: Eighth International Conference on Multimodal Interfaces (ICMI'06), Banff, Canada, 2006.
 
Mendels, F., Thiran, J. -Ph. and Vandergheynst, P., Matching pursuit-based shape representation and recognition using scale-space, in: International Journal of Imaging Systems and Technology, volume 6, number 15, pages 162-180, 2006. [DOI]
 
Mesot, B. and Barber, D., A bayesian alternative to gain adaptation in autoregressive hidden markov models, number 55, 2006.
 
Mesot, B. and Barber, D., Switching linear dynamical systems for noise robust speech recognition, number 08, 2006.
 
Moore, D., The juicer lvcsr decoder - user manual for juicer version 0.5.0, number 03, 2006.
 
Motlicek, P., Hermansky, H., Garudadri, H. and Srinivasamurthy, N., Audio coding based on long temporal contexts, number 30, 2006.
 
Motlicek, P., Ullal, V. and Hermansky, H., Wide-band perceptual audio coding based on frequency-domain linear prediction, number 58, 2006.
 
Moënne-Loccoz, N., Janvier, B., Marchand-Maillet, S. and Bruno, E., Handling temporal heterogeneous data for content-based management of large video collections, in: Multimedia Tools and Applications, volume 31, pages 309-325, 2006.
 
Müller, P., Wonka, P., Haegler, S., Ulmer, A. and van Gool, L., Procedural modeling of buildings, in: Proceedings of ACM SIGGRAPH 2006 / ACM Transactions on Graphics, pages 614-623, ACM Press, 2006.
 
Müller, M., Evéquoz, F. and Lalanne, D., Tjass, a smart board for augmenting card game playing and learning (demo), in: Symposium on User Interface Software and Technology (UIST 2006), pages 67-68, 2006.
 
Poh, N. and Bengio, S., Estimating the confidence interval of expected performance curve in biometric authentication using joint bootstrap, number 25, 2006.
 
Poh, N., Multi-system biometric authentication: optimal fusion and user-specific information, École Polytechnique Fédérale de Lausanne, 2006.
 
Poh, N. and Bengio, S., Using chimeric users to construct fusion classifiers in biometric authentication tasks: an investigation, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
 
Pozdnoukhov, A., Prior knowledge in kernel methods, École Polytechnique Fédérale de Lausanne, 2006.
 
Pun, T., Alecu, T. I., Chanel, G., Kronegg, J. and Voloshynovskiy, S., Brain-computer interaction research at the computer vision and multimedia laboratory, university of geneva, in: IEEE Trans. Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interaction, volume 14, number 2, pages 210-213, 2006.
 
Pérez-Freire, L., Pérez-González, F. and Voloshynovskiy, S., An Accurate Analysis of Scalar Quantization-Based Data Hiding, in: IEEE Trans. on Information Forensics and Security, volume 1, number 1, pages 80-86, 2006.
 
Quelhas, P. and Odobez, J. -M., Natural scene image modeling using color and texture visterms., in: Conference on Image and Video Retrieval CIVR, 2006.
 
del R. Millán, J., Renkens, F., Mouriño, J. and Gerstner, W., Non-invasive brain-actuated control of a mobile robot by human eeg, in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006.
 
Radgohar, M., Evéquoz, F. and Lalanne, D., Phong, augmenting virtual and real gaming experience (demo), in: Symposium on User Interface Software and Technology (UIST 2006), pages 71-72, 2006.
 
Richiardi, J. and Drygajlo, A., Applying biometrics to identity documents: estimating and coping with errors, 2006.
 
Richiardi, J. and Drygajlo, A., Applying biometrics to identity documents: implementation issues, 2006.
 
Rienks, R., Zhang, D., Gatica-Perez, D. and Post, W., Detection and application of influence rankings in small group meetings, in: ICMI '06: Proceedings of the 8th international conference on Multimodal interfaces, pages 257-264, ACM Press, Banff, Alberta, Canada, 2006. [DOI]
 
Rodriguez, Y., Face detection and verification using local binary patterns, École Polytechnique Fédérale de Lausanne, 2006.
 
Schlapbach, A. and Bunke, H., Off-line writer verification: a comparison of a hidden Markov model (HMM) and a Gaussian mixture model (GMM) based system, in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 275-280, 2006.
 
Smith, K., Schreiber, S., Beran, V., Potúcek, I., Rigoll, G. and Gatica-Perez, D., Multi-person tracking in meetings: a comparative study, in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006.
 
Smith, K., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Tracking attention for multiple people: wandering visual focus of attention estimation, number 40, 2006.
 
Spindler, T., Wartmann, C., Roth, D., Steffen, A., Hovestadt, L. and van Gool, L., Privacy in video surveilled areas, in: International Conference on Privacy, Security and Trust (PST 2006), 2006.
 
Torre, E. L., Caputo, B. and Tommasi, T., Melanoma recognition using kernel classifiers, number 53, 2006.
 
Tur, G., Guz, U. and Hakkani-Tur, D., Model Adaptation for Dialog Act Tagging, in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
 
Ullal, V. and Motlicek, P., Audio coding based on long temporal segments: experiments with quantization of excitation signal, number 46, 2006.
 
Vepa, J. and King, S., Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis, in: IEEE Trans. on Audio, Speech and Language Processing, volume 14, number 5, pages 1763-1771, 2006.
 
Vila-Forcén, J. E., Voloshynovskiy, S., Koval, O. and Pun, T., Costa problem under channel ambiguity, in: Proceedings of 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
 
Vila-Forcén, J. E., Voloshynovskiy, S., Koval, O. and Pun, T., Facial Image Compression Based on Structured Codebooks in Overcomplete Domain, in: EURASIP Journal on Applied Signal Processing, Frames and overcomplete representations in signal processing, communications, and information theory special issue, volume 2006, number Article ID 69042, pages 1-11, 2006.
 
Voloshynovskiy, S., Koval, O., Topak, E., Forcen, J. E. V. and Pun, T., On reversibility of random binning based data-hiding techniques: security perspectives, in: ACM Multimedia and Security Workshop 2006, 2006.
 
Voloshynovskiy, S., Koval, O., Mihcak, M. K. and Pun, T., The edge process model and its application to information hiding capacity analysis, in: IEEE Trans. on Signal Processing, volume 54, number 5, pages 1813-1825, 2006.
 
Wey, P., Fischer, B., Bay, H. and Buhmann, J. M., Dense stereo by triangular meshing and cross validation, in: DAGM-Symposium, pages 708-717, 2006.
 
Zhang, D., Gatica-Perez, D. and Bengio, S., Exploring contextual information in a layered framework for group action recognition, in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006.
 
Zhang, D., Probabilistic graphical models for human interaction analysis, École Polytechnique Fédérale de Lausanne, 2006.
 
A. Peregoudov, , Vinciarelli, A. and Bourlard, H., Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, number 56, 2006.
 

Unknown year

Brodbeck, D., Mazza, R. and Lalanne, D., Interactive visualization - a survey, 0000.
 
Dumas, B., Lalanne, D. and Oviatt, S., Multimodal interfaces: a survey of principles, models and frameworks, 0000.
 
Gatica-Perez, D., Modeling interest in face-to-face conversations from multimodal nonverbal behavior, in: In J.-P. Thiran, H. Bourlard, and F. Marques, (Eds.), Multimodal Signal Processing, Academic Press, in press, 0000.
 
Gatica-Perez, D. and Odobez, J. -M., Visual attention, speaking activity, and group conversational analysis in multi-sensor environments, in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, in press, 0000.
 
Goldmann, L., Samour, A., Ebrahimi, T. and Sikora, T., Multimodal person search combining information fusion and relevance feedback, in: IEEE International Workshop on Multimedia Signal Processing (MMSP 2009), Rio de Janeiro, Brazil, 0000.
 
Lee, J. -S., De Simone, F. and Ebrahimi, T., Influence of audio-visual attention on perceived quality of standard definition multimedia content, in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
 
Lee, J. -S. and Ebrahimi, T., Two-level bimodal association for audio-visual speech recognition, in: International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVSâ09), Bordeaux, France, 0000.
 
Mugellini, E., Lalanne, D., Dumas, B., Evéquoz, F., Gerardi, S., Le Calvé, A., Boder, A., Ingold, R. and Khaled, O., Memodules as tangible shortcuts to multimedia information, 0000.
 
Noris, B., Benmachiche, K. and Billard, A., Calibration-free eye gaze direction detection with gaussian processes, in: International Conference on Computer Vision Theory and Applications (VISAPP 08), 0000.
 
De Simone, F., Naccari, M., Tagliasacchi, M., Dufaux, F., Tubaro, S. and Ebrahimi, T., Subjective assessment of H.264/AVC video sequences transmitted over a noisy channel, in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
 
Popescu-Belis, A., Multimodal database annotation formats and standards, software architecture for multimodal interfaces, in: Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, Academic Press, 0000.
 
Powered by Agaion