Guide:
  • If you want to have the list of publications issued from a specific Individual Project (IP), write in the search field (IM2.IP). IP can have the following value: DMA, AP, VP, MPR, MCA, HMI, ISD, BMI

  • If you want to find joint publications between IPs, write in the search field (joint), click on search and then click on Keywords

  • If you want to display all the publications for a specific author, use the shortcut called -Authors- located in the main menu
 

All publications in the database, sorted on type



Publications of type: Article

Dumas, B., Lalanne, D. and Ingold, R., Démonstration : hephaistk, une bo\^\ite à outils pour le prototypage d'interfaces multimodales, 2008.
 
Dumas, B., Lalanne, D. and Ingold, R., Prototyping multimodal interfaces with smuiml modeling language, pages 63-66, 2008.
 
Dumas, B., Lalanne, D., Guinard, D., Koenig, R. and Ingold, R., Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces, pages 47-54, 2008.
 
Evéquoz, F. and Lalanne, D., Indexing and visualizing digital memories through personal email archive, pages 21-24, 2007.
 
Evéquoz, F. and Lalanne, D., Personal information management through interactive visualizations, pages 158-160, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Hidden markov models for spoken signature verification, 2007.
 
Lalanne, D. and van den Hoven, E., Supporting human memory with interactive systems, pages 215-216, 2007.
 
Lalanne, D., Bertini, E., Hertzog, P. and Bados, P., Visual analysis of corporate network intelligence: abstracting and reasoning on yesterdays for acting today, 2007.
 

accepted for IEEE ICASSP, Las Vegas, NV

Faria, A. and Morgan, N., Corrected Tandem Features for Acoustic Model Training, in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
 
Kamangar, K., Hakkani-Tur, D., Tur, G. and Levit, M., An iterative unsupervised learning method for information distillation, in: accepted for IEEE ICASSP, Las Vegas, NV, 2008.
 

accepted for publication in IEEE Trans. on System, Man and Cybernetics: Part B, Man,

Ba, S. and Odobez, J. -M., Recognizing visual focus of attention from head pose in natural meetings, in: accepted for publication in IEEE Trans. on System, Man and Cybernetics: Part B, Man,, 2008.
 

AI in ICT for Development Workshop of the Twentieth Intl. Joint Conf. on AI, Hyderabad, India

Plauché, M., Cetin, O. and Uhdaykumar, N., How to build a spoken dialog system with limited (or no) resources, in: AI in ICT for Development Workshop of the Twentieth Intl. Joint Conf. on AI, Hyderabad, India, 2007.
 

Clinical Neurophysiology

Galán, F., Nuttin, M., Lew, E., Ferrez, P. W., Vanacker, G., Philips, J. and Millán, J. del R., A brain-actuated wheelchair: asynchronous and non-invasive brain-computer interfaces for continuous control of robots, in: Clinical Neurophysiology, number 119, pages 2159-2169, 2008.
 

Computational Intelligence and Neuroscience

Cincotti, F., Kauhanen, L. and Aloise, F., Vibrotactile feedback for brain-computer interface operation, in: Computational Intelligence and Neuroscience, volume 2007, pages Article ID, 2007.
 
Vanacker, G., Millán, J. del R., Lew, E., Ferrez, P. W., Galán, F., Philips, J., van Brussel, H. and Nuttin, M., Context-based filtering for assisted brain-actuated wheelchair driving, in: Computational Intelligence and Neuroscience, volume 2007, pages 3, ISSN 1687-5265, 2007.
 

Computer

Jaimes, A., Gatica-Perez, D., Sebe, N. and Huang, T. S., Guest Editors' Introduction: Human-Centered Computing-Toward a Human Revolution, in: Computer, volume 40, number 5, pages 30-34, 2007.
 

Computer Vision and Image Understanding

Bray, M., Koller-Meier, E. and van Gool, L., Smart particle filtering for high-dimensional tracking, in: Computer Vision and Image Understanding, 2007.
 

Computer Vision and Image Understanding (CVIU)

Bay, H., Ess, A., Tuytelaars, T. and van Gool, L., Speeded-up robust features (surf), in: Computer Vision and Image Understanding (CVIU), 2007.
 

Computer Vision and Image Undertanding

Laptev, I., Caputo, B. and Lindberg, T., Local velocity-adapted motion events for spatio-temporal recognition, in: Computer Vision and Image Undertanding, volume 108, number 3, pages 207-229, ISSN 1077-3142, 2007.
 

Digital Document Processing: Major Directions and Recent Advances

Bunke, H. and Varga, T., Off-line Roman cursive handwriting recognition, in: Digital Document Processing: Major Directions and Recent Advances, volume 20, pages 165-173, 2007.
 

ELectronic Letters on Computer vision and Image Analysis

Caputo, B., Class specific object recognition using kernel Gibbs distributions, in: ELectronic Letters on Computer vision and Image Analysis, volume 7, number 2, pages 96-109, 2008.
 

Eurasip J. of Image and Video Processing, Special Issue: Image and Video Processing for Disability, accepted for publication

Bologna, G., Deville, B., Pun, T. and Vinckenbosch, M., Transforming 3d coloured pixels into musical instrument notes for vision substitution applications, in: Eurasip J. of Image and Video Processing, Special Issue: Image and Video Processing for Disability, accepted for publication, 2007.
 

EURASIP Journal of Advances in Signal Processing

Kryszczuk, K., Richiardi, J., Prodanov, P. and Drygajlo, A., Reliability-based decision fusion in multimodal biometric verification systems, in: EURASIP Journal of Advances in Signal Processing, 2007.
 

EURASIP Journal on Applied Signal Processing, Frames and overcomplete representations in signal processing, communications, and information theory special issue

Vila-Forcén, J. E., Voloshynovskiy, S., Koval, O. and Pun, T., Facial Image Compression Based on Structured Codebooks in Overcomplete Domain, in: EURASIP Journal on Applied Signal Processing, Frames and overcomplete representations in signal processing, communications, and information theory special issue, volume 2006, number Article ID 69042, pages 1-11, 2006.
 

EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision

Monay, F., Quelhas, P., Odobez, J. -M. and Gatica-Perez, D., Contextual classification of image patches with latent aspect models, in: EURASIP Journal on Image and Video Processing, Special Issue on Patches in Vision, 2009.
 

Forensic Science International

Dessimoz, D., Richiardi, J., Champod, C. and Drygajlo, A., Multimodal biometrics for identity documents (MBioID), in: Forensic Science International, volume 167, pages 154-159, 2007. [DOI]
 

ICSI Technical Report number TR-07-002

Vinyals, O., Friedland, G. and Mirghafori, N., Revisiting a basic function on current CPUs: A fast logarithm implementation with adjustable accuracy, in: ICSI Technical Report number TR-07-002, 2007.
 

IEEE Computer

Jaimes, A., Gatica-Perez, D., Sebe, N. and Huang, T. S., Human-centered computing: toward a human revolution, in: IEEE Computer, volume 40, number 5, 2007. [DOI]
 

IEEE ICASSP, Las Vegas, NV

Favre, B., Grishman, R., Hillard, D., Ji, H., Hakkani-Tur, D. and Ostendorf, M., Punctuating speech for information extraction, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Hung, H., Huang, Y., Friedland, G. and Gatica-Perez, D., Estimating the Dominant Person in Multi-Party Conversations Using Speaker Diarization Strategies, in: IEEE ICASSP, Las Vegas, NV, 2008.
 
Stoyanchev, S., Tur, G. and Hakkani-Tur, D., Name-aware speech recognition for interactive question answering, in: IEEE ICASSP, Las Vegas, NV, 2008.
 

IEEE Int. Conf. on Multimedia & Expo (ICME)

Kokiopoulou, E., Frossard, P. and Verscheure, O., Fast keyword detection with sparse time-frequency models, in: IEEE Int. Conf. on Multimedia & Expo (ICME), 2008.
 

IEEE Int. Conf. Pattern Recognition (ICPR)

Kokiopoulou, E., Pirillos, S. and Frossard, P., Graph-based classification for multiple observations of transformed patterns, in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
 
Llonch, R. Sala, Kokiopoulou, E., Tosic, I. and Frossard, P., 3d face recognition using sparse spherical representations, in: IEEE Int. Conf. Pattern Recognition (ICPR), 2008.
 

IEEE Int. Symp. on Information Theory (ISIT)

Kokiopoulou, E., Frossard, P. and Gkorou, D., Optimal polynomial filtering for accelerating distributed consensus, in: IEEE Int. Symp. on Information Theory (ISIT), 2008.
 

IEEE Intelligent Systems

Millán, J. del R., Brain-Controlled Robots, in: IEEE Intelligent Systems, 2008.
 

IEEE Signal Processing Letters

Valente, F., A Novel Criterion for Classifiers Combination in Multistream Speech Recognition, in: IEEE Signal Processing Letters, volume 16, number 7, pages 561-564, ISSN 1070-9908, 2009. [DOI]
 
Thomas, A., Ganapathy, S. and Hermansky, H., Recognition of reverberant speech using frequency domain linear prediction, in: IEEE Signal Processing Letters, 2008.
 
Chiappa, S. and Barber, D., Bayesian factorial linear gaussian state-space models for biosignal decomposition, in: IEEE Signal Processing Letters, 2007.
 
Kokiopoulou, E. and Frossard, P., Accelarating distributed consensus using extrapolation, in: IEEE Signal Processing Letters, volume 14, number 10, pages 665-668, 2007.
 
Kokiopoulou, E. and Frossard, P., Accelerating Distributed Consensus Using Extrapolation, in: IEEE Signal Processing Letters, volume 14, number 10, 2007. [DOI]
 

IEEE Signal Processing Magazine

Baker, J., Deng, L., Glass, J., Khudanpur, S., Lee, C. -H., Morgan, N. and O'Shgughnessy, D., Research developments and directions in speech recognition and understanding, in: IEEE Signal Processing Magazine, volume 26, number 4, pages 78-85, 2009.
 
Baker, J., Deng, L., Glass, J., Khudanpur, S., Lee, C. -H., Morgan, N. and O'Shgughnessy, D., Research developments and directions in speech recognition and understanding, in: IEEE Signal Processing Magazine, volume 26, number 3, pages 75-80, 2009.
 
Pantic, M. and Vinciarelli, A., Implicit Human Centered Tagging, in: IEEE Signal Processing Magazine, volume 26, 2009.
 
Vinciarelli, A., Capturing Order in Social Interactions, in: IEEE Signal Processing Magazine, 2009.
 

IEEE Trans. Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interaction

Pun, T., Alecu, T. I., Chanel, G., Kronegg, J. and Voloshynovskiy, S., Brain-computer interaction research at the computer vision and multimedia laboratory, university of geneva, in: IEEE Trans. Neural Systems and Rehabilitation Engineering, Special Issue on Brain-Computer Interaction, volume 14, number 2, pages 210-213, 2006.
 

IEEE Trans. on Audio, Speech and Language Processing

Vepa, J. and King, S., Subjective evaluation of join cost and smoothing methods for unit selection speech synthesis, in: IEEE Trans. on Audio, Speech and Language Processing, volume 14, number 5, pages 1763-1771, 2006.
 

IEEE Trans. on Audio, Speech and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, accepted for publication

Jayagopi, D., Hung, H., Yeo, C. and Gatica-Perez, D., Modeling dominance in group conversations from nonverbal activity cues, in: IEEE Trans. on Audio, Speech and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, accepted for publication, 2008.
 

IEEE Trans. on Audio, Speech, and Language Processing

McCowan, I., Maganti, H. K. and Gatica-Perez, D., Speech enhancement and recognition in meetings with an audio-visual sensor array, in: IEEE Trans. on Audio, Speech, and Language Processing, volume 15, number 8, pages 2257-2269, 2007.
 

IEEE Trans. on Audio, Speech, and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions

Jayagopi, D., Modeling dominance in group conversations using nonverbal activity cues, in: IEEE Trans. on Audio, Speech, and Language Processing, Special Issue on Multimodal Processing for Speech-based Interactions, volume 17, pages 501-513, 2009.
 

IEEE Trans. on Biomedical Engineering

Ferrez, P. W. and Millán, J. del R., Error-Related EEG Potentials Generated During Simulated Brain-Computer Interaction, in: IEEE Trans. on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008.
 

IEEE Trans. on Information Forensics and Security

Pérez-Freire, L., Pérez-González, F. and Voloshynovskiy, S., An Accurate Analysis of Scalar Quantization-Based Data Hiding, in: IEEE Trans. on Information Forensics and Security, volume 1, number 1, pages 80-86, 2006.
 

IEEE Trans. on Neural Systems and Rehabilitation Engineering

Kronegg, J., Chanel, G., Voloshynovskiy, S. and Pun, T., Eeg-based synchronized brain-computer interfaces: a model for optimizing the number of mental tasks, in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 15, number 1, pages 50-58, 2007.
 
Buttfield, A., Ferrez, P. W. and del R. Millán, J., Towards a robust bci: error potentials and online learning, in: IEEE Trans. on Neural Systems and Rehabilitation Engineering, volume 14, number 2, pages 164-168, 2006.
 

IEEE Trans. on Pattern Analysis and Machine Intelligence

Ortega-Garcia, J., Fierrez, J., Alonso-Fernandez, F., Galbally, J., M. R. Freire, , Gonzalez-Rodriguez, J., Garcia-Mateo, C., Alba-Castro, J. -L., E. Gonzalez-Agulla, , E. Otero-Muras, , S. Garcia-Salicetti, , L. Allano, , B. Ly-Van, , B. Dorizzi, , Kittler, J., Bourlai, T., Poh, N., Deravi, F., M. W. R. Ng, , M. Fairhurst, , Hennebert, J., Humm, A., M. Tistarelli, , L. Brodo, , Richiardi, J., Drygajlo, A., H. Ganster, , F. M. Sukno, , Pavani, S. -K., A. Frangi, , L. Akarun, and A. Savran, , The multi-scenario multi-environment biosecure multimodal database (bmdb), in: IEEE Trans. on Pattern Analysis and Machine Intelligence, 2009.
 

IEEE Trans. on Pattern Analysis and Machine Intelligence,

Smith, K., Ba, S., Gatica-Perez, D. and Odobez, J. -M., Tracking the visual focus of attention for a varying number of wandering people, in: IEEE Trans. on Pattern Analysis and Machine Intelligence,, volume 30, number 7, pages 1212-1229, 2008.
 

IEEE Trans. on Signal Processing

Gurban, M. and Thiran, J. -Ph., Information theoretic feature extraction for audio-visual speech recognition, in: IEEE Trans. on Signal Processing, volume in press, 2009.
 
Alecu, T. I., Voloshynovskiy, S. and Pun, T., The gaussian transform of distributions: definition, computation and application, in: IEEE Trans. on Signal Processing, volume 54, number 8, pages 2976-2995, 2006.
 
Voloshynovskiy, S., Koval, O., Mihcak, M. K. and Pun, T., The edge process model and its application to information hiding capacity analysis, in: IEEE Trans. on Signal Processing, volume 54, number 5, pages 1813-1825, 2006.
 

IEEE Trans. on System, Man and Cybernetics: part B, Man

Ba, S. and Odobez, J. -M., Recognizing human visual focus of attention from head pose in meetings, in: IEEE Trans. on System, Man and Cybernetics: part B, Man, volume 39, number 1, pages 16-34, 2009.
 

IEEE Trans. PAMI

Graves, A., Liwicki, M., Fernandez, S., Bertolami, R., Bunke, H. and Schmidhuber, J., A novel connectionist system for unconstrained handwriting recognition, in: IEEE Trans. PAMI, volume 31, number 5, pages 855-869, ISSN 0162-8828, 2009.
 

IEEE Transactions on Audio Speech and Language Processing

Vijayasenan, D., Valente, F. and Bourlard, H., An Information Theoretic Approach to Speaker Diarization of Meeting Data, in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 7, pages 1382-1393, 2009. [DOI]
 
Kumatani, K., McDonough, J., Rauch, B., Klakow, D., Garner, P. N. and Li, W., Beamforming with a Maximum Negentropy Criterion, in: IEEE Transactions on Audio Speech and Language Processing, volume 17, number 5, pages 994-1008, 2008.
 

IEEE Transactions on Audio, Speech and Language Processing

Friedland, G., Vinyals, O., Huang, Y. and Muller, C., Prosodic and other long-term features for speaker diarization, in: IEEE Transactions on Audio, Speech and Language Processing, volume 17, number 5, pages 985-993, 2009.
 
Lathoud, G. and Odobez, J. -M., Short-term spatio-temporal clustering applied to multiple moving speakers, in: IEEE Transactions on Audio, Speech and Language Processing, 2007.
 

IEEE Transactions on Audio, Speech, and Language Processing

Stolcke, A., Kajarekar, S., Ferrer, L. and Shriberg, E., Speaker recognition with session variability normalization based on mllr adaptation transforms, in: IEEE Transactions on Audio, Speech, and Language Processing, volume 15, pages 1987-1998, 2007.
 

IEEE Transactions on Audio, Speech, and Language Processing, special issue on speaker and language recognition

Stolcke, A., Kajarekar, S., Ferrer, L. and Shriberg, E., Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms, in: IEEE Transactions on Audio, Speech, and Language Processing, special issue on speaker and language recognition, 2007.
 

IEEE Transactions on Biomedical Engineering

Ferrez, P. W. and Millán, J. del R., Error-related eeg potentials generated during simulated brain-computer interaction, in: IEEE Transactions on Biomedical Engineering, volume 55, number 3, pages 923-929, 2008. [DOI]
 
Garipelli, G., Chavarriaga, R. and Millán, J. del R., Fast recognition of anticipation related potentials, in: IEEE Transactions on Biomedical Engineering, 2008.
 

IEEE Transactions on Image Processing

Bogdanova, I., Bresson, X., Thiran, J. -Ph. and Vandergheynst, P., Scale-space analysis and active contours for omnidirectional images, in: IEEE Transactions on Image Processing, volume 16, number 7, pages 1888-1901, 2007. [DOI]
 

IEEE Transactions on Multimedia

Besson, P., Popovici, V., Vesin, J. M., Thiran, J. -Ph. and Kunt, M., Extraction of audio features specific to speech production for multimodal speaker detection, in: IEEE Transactions on Multimedia, volume 10, number 1, pages 63-73, 2008. [DOI]
 
Kokiopoulou, E. and Frossard, P., Semantic coding by supervised dimensionality reduction, in: IEEE Transactions on Multimedia, volume 10, number 2, 2008.
 
Besson, P., Popovici, V., Vesin, J. M., Thiran, J. -Ph. and Kunt, M., Extraction of audio features specific to speech production for multimodal speaker detection, in: IEEE Transactions on Multimedia, 2007. [DOI]
 
Vinciarelli, A., Role recognition in broadcast news using social network analysis and duration distribution modeling, in: IEEE Transactions on Multimedia, 2007.
 

IEEE Transactions on Multimedia, To Appear

Salamin, H., Favre, S. and Vinciarelli, A., Automatic Role Recognition in Multiparty Recordings: Using Social Affiliation Networks for Feature Extraction, in: IEEE Transactions on Multimedia, To Appear, 2009.
 

IEEE Transactions on Neural Systems & Rehabilitation Engineering

Bourlard, H., Chavarriaga, R., Galán, F. and Millán, J. del R., Characterizing the eeg correlates of exploratory behavior, in: IEEE Transactions on Neural Systems & Rehabilitation Engineering, 2008.
 

IEEE Transactions on Pattern Analysis and Machine Intelligence

Fleuret, F., Berclaz, J., Lengagne, R. and Fua, P., Multi-Camera People Tracking with a Probabilistic Occupancy Map, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 30, number 2, pages 267-282, 2008.
 
Kokiopoulou, E. and Frossard, P., Minimum distance between pattern transformation manifolds: algorithm and applications, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 
Leibe, B., Schindler, K., Cornelis, N. and van Gool, L., Coupled object detection and tracking from static cameras and moving vehicles, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 
Meynet, J. and Thiran, J. -Ph., Information Theoretic Combination of Classifiers, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008. [DOI]
 
Shahrokni, A., Drummond, T., Fleuret, F. and Fua, P., Classification-based Probabilistic Modeling of Texture Transition for Fast Line Search Tracking and Delineation, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 
Monay, F. and Gatica-Perez, D., Modeling semantic aspects for cross-media image indexing, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, pages 1802-1817, ISSN 0162-8828, 2007. [DOI]
 
Quelhas, P., Odobez, J. -M., Gatica-Perez, D. and Tuytelaars, T., A thousand words in a scene, in: IEEE Transactions on Pattern Analysis and Machine Intelligence, volume 29, number 9, pages 151575-1589, 2007. [DOI]
 

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

Grangier, D. and Bengio, S., A discriminative kernel-based model to rank images from text queries, in: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2008.
 

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics

Marcel, S. and del R. Millán, J., Person authentication using brainwaves (eeg) and maximum a posteriori model adaptation, in: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE Special Issue on Biometrics, 2007.
 

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans

Humm, A., Hennebert, J. and Ingold, R., Combined handwriting and speech modalities for user authentication, in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 39, 2009.
 
Humm, A., Hennebert, J. and Ingold, R., Combined handwriting and speech modalities for user authentication, in: IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, volume 38, 2008.
 

IEEE Transactios on Image Processing

I. Bogdanova, , A. Bur, and Hügli, H., Visual attention on the sphere [in press], in: IEEE Transactios on Image Processing, 2008.
 

IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto

Huang, Y., Vinyals, O., Friedland, G., Müller, C., Mirghafori, N. and Wooters, C., A Fast-Match approach for robust, faster than real-time Speaker Diarization, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Hwang, M. -Y., Peng, G., Wang, W., Faria, A., Heidel, A. and Ostendorf, M., Building a Highly Accurate Mandarin Speech Recognizer, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 
Levit, M., Hakkani-Tur, D., Tur, G. and Gillick, D., Integrating Several Annotation Layers for Statistical Information Distillation, in: IEEE workshop on Automatic Speech Recognition and Understanding (ASRU 07), Kyoto, 2007.
 

IET Signal Processing, Special Issue on Biometric Recognition

Kryszczuk, K. and Drygajlo, A., Improving biometric verification with class-independent quality information, in: IET Signal Processing, Special Issue on Biometric Recognition, volume 3, number 4, pages 310-321, 2009.
 

Image and vision Computing

Caputo, B., Hayman, E., Fritz, M. and Ekluhnd, J. -O, Classifying Material in the Real World, in: Image and vision Computing, volume accepted for pub, 2009.
 

Image and Vision Computing

Vinciarelli, A., Pantic, M. and Bourlard, H., Social Signal Processing: Survey of an Emerging Domain, in: Image and Vision Computing, 2009.
 
Leibe, B., Ettlin, A. and Schiele, B., Learning semantic object parts for object categorization, in: Image and Vision Computing, volume 26, number 1, pages 15-26, 2008.
 

in C. Muller (Ed.) Speaker Classification I. Springer-Verlag, New York

Shriberg, E., Higher level features in speaker recognition, in: in C. Muller (Ed.) Speaker Classification I. Springer-Verlag, New York, 2008.
 

In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006); Lecture Notes in Computer Science. Springer

Janin, A., Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Frankel, J. and Zheng, J., The ICSI-SRI Spring 2006 Meeting Evaluation System, in: In S. Renals and S. Bengio, editors, Machine Learning for Multimodal Interaction: Third International Workshop (MLMI 2006); Lecture Notes in Computer Science. Springer, 2006.
 

Informer (Newsletter of the BCS Information Retrieval Specialist Group)

Popescu-Belis, A. and Vinciarelli, A., Multimedia meeting processing and retrieval at the idiap research institute, in: Informer (Newsletter of the BCS Information Retrieval Specialist Group), volume 29, pages 14-16, 2009.
 

Int. Journal of Pattern Recognition and Art. Intelligence

Bertolami, R. and Bunke, H., Integration of n-gram language models in multiple classifier systems for offline handwritten text line recognition, in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 22, number 7, pages 1301-1321, 2008.
 
Liwicki, M. and Bunke, H., Handwriting recognition of whiteboard notes -- studying the influence of training set size and type, in: Int. Journal of Pattern Recognition and Art. Intelligence, volume 21, number 1, pages 83-98, 2007.
 

Intelligent Service Robotics

Prodanov, P., Drygajlo, A., Richiardi, J. and Alexander, A., Low-level grounding in a multimodal mobile service robot conversational system using graphical models, in: Intelligent Service Robotics, volume 1, pages 3-26, 2008. [DOI]
 

International Conference on Semantic Computing (ICSC), Irvine, CA

Cuendet, S., Hakkani-Tur, D., Shriberg, E., Fung, J. and Favre, B., Cross-Genre Feature Comparisons for Spoken Sentence Segmentation, in: International Conference on Semantic Computing (ICSC), Irvine, CA, 2007.
 

International Journal of Computer Vision

Cornelis, N., Leibe, B., Cornelis, K. and van Gool, L., 3d urban scene modeling integrating recognition and reconstruction, in: International Journal of Computer Vision, volume 78, number 2-3, pages 121-141, 2008.
 
Gui, L., Thiran, J. -Ph. and Paragios, N., Cooperative object segmentation and behavior inference in image sequences, in: International Journal of Computer Vision, ISSN 0920-5691, 2008. [DOI]
 
Leibe, B., Leonardis, A. and Schiele, B., Robust object detection with interleaved categorization and segmentation, in: International Journal of Computer Vision, volume 77, number 1-3, pages 259-289, 2008.
 
Schindler, K., Suter, D. and H. Wang, , A model-selection framework for multibody structure-and-motion of image sequences, in: International Journal of Computer Vision, volume 79, number 2, pages 159-177, 2007.
 

International Journal of Human-Computer Studies

Chanel, G., Kierkels, J., Soleymani, M. and Pun, T., short-term emotion assessment in a recall paradigm, in: International Journal of Human-Computer Studies, volume 67, number 8, pages 607-627, 2009.
 

International Journal of Imaging Systems and Technology

Mendels, F., Thiran, J. -Ph. and Vandergheynst, P., Matching pursuit-based shape representation and recognition using scale-space, in: International Journal of Imaging Systems and Technology, volume 6, number 15, pages 162-180, 2006. [DOI]
 

International Journal of Pattern Recognition and Artificial Intelligence

Millán, J. del R., Ferrez, P. W., Galán, F., Lew, E. and Chavarriaga, R., Non-invasive brain-machine interaction, in: International Journal of Pattern Recognition and Artificial Intelligence, 2008.
 

International Journal of Robotics Research

Pronobis, A. and Caputo, B., COLD: The COsy Localization Database, in: International Journal of Robotics Research, volume 28, number 5, pages 588-594, 2009.
 

International Journal of the Eurographics Association

Bertini, E., Lalanne, D. and Rigamonti, M., Extended excentric labeling, in: International Journal of the Eurographics Association, volume 28, 2009.
 

International Journal on Image and Video Processing Special Issue on Facial Image Processing

Marcel, S., Rodriguez, Y. and Heusch, G., On the recent use of local binary patterns for face authentication, in: International Journal on Image and Video Processing Special Issue on Facial Image Processing, 2007.
 

Journal of Acoustical Society of America - Express Letters

Ganapathy, S., Thomas, S. and Hermansky, H., Modulation Frequency Features For Phoneme Recognition In Noisy Speech, in: Journal of Acoustical Society of America - Express Letters, 2008.
 

Journal of Computer Security

Spindler, T., Wartmann, C., Hovestadt, L., Roth, D., van Gool, L. and Steffen, A., Privacy in video surveilled spaces, in: Journal of Computer Security, volume 16, number 2, pages 199-222, 2008.
 

Journal of Electronic Imaging

Voloshynovskiy, S., Koval, O., Villán, R., Beekhof, F. and Pun, T., Authentication of biometric identification documents via mobile devices, in: Journal of Electronic Imaging, 2008.
 

Journal of Machine Learning Research

Orabona, F., Keshet, J. and Caputo, B., Bounded kernel-based perceptrons, in: Journal of Machine Learning Research, volume Accepted for pub, 2009.
 
Fleuret, F. and Geman, D., Stationary features and cat detection, in: Journal of Machine Learning Research, 2008.
 
Rakotomamonjy, A., Bach, F., Canu, S. and Grandvalet, Y., SimpleMKL, in: Journal of Machine Learning Research, volume 9, pages 2491-2521, 2008.
 

Journal of Machine Learning Research (JMLR)

Fleuret, F. and Geman, D., Stationary features and cat detection, in: Journal of Machine Learning Research (JMLR), volume 9, pages 2549-2578, 2008.
 

Journal of Mathematical Imaging and Vision

Bresson, X., Esedoglu, S., Vandergheynst, P., Thiran, J. -Ph. and Osher, S., Fast Global Minimization of the Active Contour/Snake Model, in: Journal of Mathematical Imaging and Vision, volume 28, number 2, pages 151-167, 2007. [DOI]
 

Journal of Neuroscience Methods

Hoffmann, U., Vesin, J. M., Ebrahimi, T. and Diserens, K., An efficient p300-based brain-computer interface for disabled subjects, in: Journal of Neuroscience Methods, volume 167, number 1, pages 115-125, 2008. [DOI]
 
Cincotti, F., Mattia, D., Aloise, F., Bufalari, S., Astolfi, L., Fallani, F. De Vico, Tocci, A., Bianchi, L., Marciani, M. G., Gao, S., Millán, J. del R. and Babiloni, F., High-resolution eeg techniques for brain-computer interface applications, in: Journal of Neuroscience Methods, volume 167, pages 31-42, ISSN 0165-0270, 2007.
 

Language Resources and Evaluation

Popescu-Belis, A., Dimensionality of dialogue act tagsets: an empirical analysis of large corpora, in: Language Resources and Evaluation, volume 42, number 1, pages 99-107, 2008. [DOI]
 

Lecture Notes in Computer Science

Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Janin, A., Magimai-Doss, M., Wooters, C. and Zheng, J., The sri-icsi spring 2007 meeting and lecture recognition system, in: Lecture Notes in Computer Science, 2007.
 

Master Thesis, University of California, Berkeley

Huang, Y., Robust and rapid speaker diarization, in: Master Thesis, University of California, Berkeley, 2007.
 

Multimedia Tools and Applications

Behera, A., Lalanne, D. and Ingold, R., Docmir: an automatic document-based indexing system for meeting retrieval, in: Multimedia Tools and Applications, volume 37, number 2, 2007.
 
Janvier, B., Bruno, E., Marchand-Maillet, S. and Pun, T., Handling temporal heterogeneous data for content-based management of large video collections, in: Multimedia Tools and Applications, volume 30, pages 273-288, 2006.
 
Moënne-Loccoz, N., Janvier, B., Marchand-Maillet, S. and Bruno, E., Handling temporal heterogeneous data for content-based management of large video collections, in: Multimedia Tools and Applications, volume 31, pages 309-325, 2006.
 

Neural Networks

Schindler, K., van Gool, L. and B. de Gelder, , Recognizing emotions expressed by body pose: a biologically inspired neural model, in: Neural Networks, 2008.
 

Neurocomputing

Bologna, G., Deville, B. and Pun, T., On the use of the auditory pathway to represent image scenes in real-time, in: Neurocomputing, volume 72, pages 839-849, 2009.
 

Pattern Analysis and Applications

Kosinov, S. and Pun, T., Distance-based discriminant analysis method and its applications, in: Pattern Analysis and Applications, volume 11, number 3-4, pages 227-246, 2008.
 
Schlapbach, A. and Bunke, H., A writer identification and verification system using HMM based recognizers, in: Pattern Analysis and Applications, volume 10, number 1, pages 33-43, 2007.
 

Pattern Recognition

Orabona, F., Castellini, C., Caputo, B., Luo, J. and Sandini, G., Towards Life-long Learning for Cognitive Systems: Online Independent Support Vector Machine, in: Pattern Recognition, volume Accepted for Pub, 2009.
 
Bertolami, R. and Bunke, H., Hidden Markov model based ensemble methods for offline handwritten text line recognition, in: Pattern Recognition, volume 41, number 11, pages 3452-3460, 2008.
 
Schindler, K. and Suter, D., Object detection by global contour shape, in: Pattern Recognition, 2008.
 
Schlapbach, A., Liwicki, M. and Bunke, H., A writer identification system for on-line whiteboard data, in: Pattern Recognition, volume 41, pages 2381-2397, 2008.
 
Mariéthoz, J. and Bengio, S., A kernel trick for sequences applied to text-independent speaker verification systems, in: Pattern Recognition, volume 40, number 8, ISSN 0031-3203, 2007.
 
Meynet, J., Popovici, V. and Thiran, J. -Ph., Face Detection with Boosted Gaussian Features, in: Pattern Recognition, volume 40, number 8, pages 2283-2291, 2007. [DOI]
 

Pattern Recognition Letters

Meynet, J. and Thiran, J. -Ph., Ensembles of SVMs using an Information Theoretic Criterion, in: Pattern Recognition Letters, 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., Discriminative cue integration for medical image annotation, in: Pattern Recognition Letters, 2008.
 

Pattern Recognition Letters (PRL)

Fleuret, F., Multi-layer boosting for pattern recognition, in: Pattern Recognition Letters (PRL), volume 30, pages 237-241, 2009.
 

Proc. ICASSP, Honolulu

Anguera, X., Wooters, C., Pardo, J. M. and Hernando, J., Automatic Weighting for the Combination of TDOA and Acoustic Features in Speaker Diarization for Meetings, in: Proc. ICASSP, Honolulu, 2007.
 
Anguera, X., Shinozaki, T., Wooters, C. and Hernando, J., Model Complexity Selection and Cross-validation EM Training for Robust Speaker Diarization, in: Proc. ICASSP, Honolulu, 2007.
 
Cetin, O., Kantor, A., King, S., Bartels, C., Magimai-Doss, M., Frankel, J. and Livescu, K., An Articulatory Feature-based Tandem Approach and Factored Observation Modeling, in: Proc. ICASSP, Honolulu, 2007.
 
Hakkani-Tur, D. and Tur, G., Statistical Sentence Extraction for Information Distillation, in: Proc. ICASSP, Honolulu, 2007.
 
Lei, H. and Mirghafori, N., Word-conditioned phone N-grams for speaker recognition, in: Proc. ICASSP, Honolulu, 2007.
 
Liu, Y. and Shriberg, E., Comparing Evaluation Metrics for Sentence Boundary Detection, in: Proc. ICASSP, Honolulu, 2007.
 
Livescu, K., Cetin, O., Hasegawa-Johnson, M., King, S., Bartels, C., Borges, N., Kantor, A., Lal, P., Yung, L., Bezman, A., Dawson-Haggerty, S., Woods, B., Frankel, J., Magimai-Doss, M. and Saenko, K., Articulatory Feature-based Methods for Acoustic and Audio-visual speech Recognition: Summary from the 2006 JHU Summer Workshop, in: Proc. ICASSP, Honolulu, 2007.
 
Livescu, K., Bezman, A., Borges, N., Yung, L., Cetin, O., Frankel, J., King, S., Magimai-Doss, M., Chi, X. and Lavoie, L., Manual Transcription of Conversational Speech at the Articulatory Feature Level, in: Proc. ICASSP, Honolulu, 2007.
 
Magimai-Doss, M., Hakkani-Tur, D., Cetin, O., Shriberg, E., Fung, J. and Mirghafori, N., Entropy Based Classifier Combination for Sentence Segmentation,, in: Proc. ICASSP, Honolulu, 2007.
 

Proc. ICASSP, Honolulu.

Zheng, J., Cetin, O., Hwang, M. -Y., Lei, X., Stolcke, A. and Morgan, N., Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition, in: Proc. ICASSP, Honolulu., 2007.
 

Proc. IEEE/ACL Workshop on Spoken Language Technology

Andreani, G., Di Fabbrizio, G., Gilbert, M., Gillick, D., Hakkani-Tur, D. and Lemon, O., Lets DiSCoH: Collecting an Annotated Open Corpus with Dialog Acts and Reward Signals for Natural Language Helpdesks, in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
 
Tur, G., Guz, U. and Hakkani-Tur, D., Model Adaptation for Dialog Act Tagging, in: Proc. IEEE/ACL Workshop on Spoken Language Technology, 2006.
 

Proc. IEEE/ACL Workshop on Spoken Language Technology,

Cuendet, S., Hakkani-Tur, D. and Tur, G., Model Adaptation for Sentence Segmentation from Speech, in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
 
Hillard, D., Huang, Z., Ji, H., Grishman, R., Hakkani-Tur, D., Harper, M., Ostendorf, M. and Wang, W., Impact of Automatic Comma Prediction on POS/Name Tagging of Speech, in: Proc. IEEE/ACL Workshop on Spoken Language Technology,, 2006.
 

Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany

Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in meetings, in: Proceedings of ACM Multimedia 2007, pp. 835-838, Augsburg, Germany, 2007.
 

Proceedings of NOLISP, Paris, France,

Stoll, L., Frankel, J. and Mirghafori, N., Speaker Recognition Via Nonlinear Discriminant Features, in: Proceedings of NOLISP, Paris, France,, 2007.
 

Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil

Faria, A. and Morgan, N., When a mismatch can be good: large vocabulary speech recognition trained with idealized tandem features, in: Proceedings of the ACM Symposium on Applied Computing, Fortaleza, Brazil, 2008.
 

Signal Processing

Kryszczuk, K. and Drygajlo, A., Credence estimation and error prediction in biometric identity verification, in: Signal Processing, volume 88, number 4, pages 916-925, 2008.
 

Signal, Image and Video Processing

Meynet, J., Popovici, V. and Thiran, J. -Ph., Mixtures of Boosted Classifiers for Frontal Face Detection, in: Signal, Image and Video Processing, volume 1, number 1, pages 29-38, 2007. [DOI]
 

Speech Communication

Keshet, J., Grangier, D. and Bengio, S., Discriminative Keyword Spotting, in: Speech Communication, volume 51, number 4, pages 317-329, 2009.
 
BenZeghiba, M. F. and Bourlard, H., User-customized password speaker verification using multiple reference and background models, in: Speech Communication, volume 8, pages 1200-1213, 2006.
 

Speech Communication (Elsevier)

Romsdorfer, H. and Pfister, B., Text analysis and language identification for polyglot text-to-speech synthesis, in: Speech Communication (Elsevier), 2007.
 

SPIE Journal of Electronic Imaging

Humm, A., Hennebert, J. and Ingold, R., Spoken signature for user authentication, in: SPIE Journal of Electronic Imaging, volume 17, 2008.
 
Humm, A., Hennebert, J. and Ingold, R., Spoken signature for user authentication, in: SPIE Journal of Electronic Imaging, volume 17, 2008.
 

TAL (Traitement Automatique des Langues)

Bouillon, P., Rayner, M., Novellas Vall, B., Starlander, M., Santaholma, M., Nakao, Y. and Chatzichrisafis, N., Une grammaire partagée multi-tâche pour le traitement de la parole : application aux langues romanes, in: TAL (Traitement Automatique des Langues), volume 47, number 3, 2007.
 
Popescu-Belis, A., Le rôle des métriques d'évaluation dans le processus de recherche en tal, in: TAL (Traitement Automatique des Langues), volume 47, number 2, 2007.
 

Technical Report TR-07-004, International Computer Science Institute, Berkeley, California

Huang, Y., Friedland, G., Müller, C. and Mirghafori, N., Speeding up speaker diarization by using prosodic features, in: Technical Report TR-07-004, International Computer Science Institute, Berkeley, California, 2007.
 

Technical Report TR-08-001, International Computer Science Institute, Berkeley, CA

Vinyals, O. and Friedland, G., Live speaker identification in meetings: "who is speaking now?", in: Technical Report TR-08-001, International Computer Science Institute, Berkeley, CA, 2008.
 

Technical Report TR-08-004, International Computer Science Institute, Berkeley, CA

Garg, N. and Hakkani-Tur, D., Speaker role detection in meetings using lexical information and social network analysis, in: Technical Report TR-08-004, International Computer Science Institute, Berkeley, CA, 2008.
 

to appear in IEEE Transactions on Audio, Speech and Language Processing

Anguera, X., Wooters, C. and Hernando, J., Acoustic Beamforming for Speaker Diarization of Meetings, in: to appear in IEEE Transactions on Audio, Speech and Language Processing, 2007.
 

to appear in IEEE Transactions on Computers

Pardo, J. M., Anguera, X. and Wooters, C., Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information, in: to appear in IEEE Transactions on Computers, 2007.
 

To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

Bruno, E., Moënne-Loccoz, N. and Marchand-Maillet, S., Design of multimodal dissimilarity spaces for retrieval of multimedia documents, in: To appear in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
 

To appear in International Journal of Semantic Computing

Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., affective characterization of movie scenes based on content analysis and physiological changes, in: To appear in International Journal of Semantic Computing, 2009.
 

To appear in Journal of Multimedia

Bruno, E. and Marchand-Maillet, S., Multimodal preference aggregation for multimedia information retrieval, in: To appear in Journal of Multimedia, 2009.
 

to appear in Lecture Notes in Computer Science

Wooters, C. and Huijbregts, M., The ICSI RT07s Speaker Diarization System, in: to appear in Lecture Notes in Computer Science, 2007.
 

To appear in Multimedia Tools and Applications Journal special issue on "Metadata Mining for Image Understanding"

Kludas, J., Bruno, E. and Marchand-Maillet, S., Can feature information interaction help for information fusion in multimedia problems?, in: To appear in Multimedia Tools and Applications Journal special issue on "Metadata Mining for Image Understanding", 2008.
 

to appear in Proc. Interspeech, Antwerp.

Huijbregts, M. and Wooters, C., The Blame Game: Performance Analysis of Speaker Diarization System Components, in: to appear in Proc. Interspeech, Antwerp., 2007.
 

to appear in Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, CA

Vinyals, O. and Friedland, G., Towards semantic analysis of conversations: a system for the live identification of speakers in meetings, in: to appear in Proceedings of IEEE International Conference on Semantic Computing, Santa Clara, CA, 2008.
 

to appear in Proceedings of Interspeech 2008, Brisbane, Australia

Gillick, D., Hakkani-Tur, D. and Levit, M., Unsupervised learning of edit parameters for matching name variants, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Knox, M., Morgan, N. and Mirghafori, N., Getting the last laugh: automatic laughter segmentation in meetings, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Riedhammer, K., Gillick, D., Favre, B. and Hakkani-Tur, D., Packing the meeting summarization knapsack, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Singla, A. and Hakkani-Tur, D., Cross-lingual sentence extraction for information distillation, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Vergyri, D., Mandal, A., Wang, W., Stolcke, A., Zheng, J., Graciarena, M., Rybach, D., Gollan, C., Schlater, R., Kirchoff, K., Faria, A. and Morgan, N., Development of the sri/nightingale arabic asr system, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 

to appear in proceedings of Interspeech 2008, Brisbane, Australia

Vinyals, O. and Friedland, G., Modulation spectrogram features for speaker diarization, in: to appear in proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 

to appear in Proceedings of Interspeech 2008, Brisbane, Australia

Zhao, S. and Morgan, N., Multi-stream spectro-temporal features for robust speech recognition, in: to appear in Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 

to appear in Proceedings of Interspeech, Antwerp

Guz, U., Cuendet, S., Hakkani-Tur, D. and Tur, G., Co-training Using Prosodic and Lexical Information for Sentence Segmentation, in: to appear in Proceedings of Interspeech, Antwerp, 2007.
 
Huijbregts, M., Wooters, C. and Ordelman, R., Filtering the Unknown: Speech Activity Detection in Heterogeneous Video Collections, in: to appear in Proceedings of Interspeech, Antwerp, 2007.
 

to appear in Proceedings of Interspeech, Antwerp.

Knox, M. and Mirghafori, N., Automatic Laughter Detection Using Neural Networks, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Kolar, J., Liu, Y. and Shriberg, E., Speaker Adaptation of Language Models for Automatic Dialog Act Segmentation of Meetings, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Lei, H. and Mirghafori, N., Word-Conditioned HMM Supervectors for Speaker Recognition, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 
Müller, C. and Burkhardt, F., Combining Short-term Cepstral and Long-term Pitch Features for Automatic Recognition of Speaker Age, in: to appear in Proceedings of Interspeech, Antwerp., 2007.
 

to appear in Proceedings of MLMI, Brno, Czech Republic

Cuendet, S., Hakkani-Tur, D. and Shriberg, E., Automatic Labeling Inconsistencies Detection And Correction For Sentence Unit Segmentation In Conversational Speech, in: to appear in Proceedings of MLMI, Brno, Czech Republic, 2007.
 

To appear in Signal Processing: Image Communication special issue on "Semantic Analysis for Interactive Multimedia Services"

Kosinov, S., Bruno, E. and Marchand-Maillet, S., Spatially-consistent partial matching for intra- and inter-image prototype selection, in: To appear in Signal Processing: Image Communication special issue on "Semantic Analysis for Interactive Multimedia Services", 2008.
 

Publications of type: Book

2009

Lalanne, D. and Kholas, J., Human machine interaction, 2009.
 

2008

Camastra, F. and Vinciarelli, A., Machine learning for audio, image and video analysis, Advanced Information and Knowledge Processing, volume XVI, Springer Verlag, ISBN 978-1-84800-006-3, 2008.
 
Keshet, J. and Bengio, S., Automatic speech and speaker recognition: large margin and kernel methods, John Wiley & Sons, 2008.
 
Lalanne, D., Rigamonti, M., Ingold, R., Evéquoz, F. and Dumas, B., An ego-centric and tangible approach to meeting indexing and browsing, Lecture Notes in Computer Science, volume Volume 4892, Springer Berlin / Heidelberg, ISBN 978-3-540-78154-7, 2008. [DOI]
 
Liwicki, M. and Bunke, H., Recognition of whiteboard notes -- online, offline and combination, World Scientific, ISBN 978-9812814531, 2008.
 
Popescu-Belis, A., Bourlard, H. and Renals, S., Machine learning for multimodal interaction iv, LNCS, volume 4892, Springer-Verlag, ISBN 978-3-540-78154-7, 2008.
 
Popescu-Belis, A. and Stiefelhagen, R., Machine learning for multimodal interaction v, LNCS, volume 5237, Springer-Verlag, ISBN 978-3-540-85852-2, 2008.
 
Schlapbach, A., Writer identification and verification, volume 311, IOS Press, ISBN 978-1-58603-825-0, 2008.
 
Schouten, B., Juul, N., Drygajlo, A. and Tistarelli, M., Biometrics and identity management, Springer, 2008.
 
Popescu-Belis, A., Bourlard, H. and Renals, S., Machine learning for multimodal interaction iv (revised selected papers from mlmi 2007, brno, 28-30 june 2007), LNCS 4892, Springer-Verlag, 2008.
 
Popescu-Belis, A. and Stiefelhagen, R., Machine learning for multimodal interaction v (proceedings of mlmi 2008, utrecht, 8-10 september 2008), LNCS 5237, Springer-Verlag, 2008.
 

2007

Dornhege, G., del R. Millán, J., Hinterberger, T., McFarland, D. and Müller, K. -R., Towards brain-computer interfacing, The MIT Press, 2007.
 
Marchand-Maillet, S., Bruno, E., Nürnberger, A. and Detyniecki, M., Adaptive multimedia retrieval: user, context and feedback, Springer, 2007.
 
Neuhaus, M. and Bunke, H., Bridging the gap between graph edit distance and kernel machines, Machine Perception and Artificial Intelligence, volume 68, World Scientific, ISBN 978-981-270-817-5, 2007.
 

Publications of type: Inbook

2009

Friedland, G. and van Leeuwen, D., Speaker diarization and identification, IEEE Press/Wiley, 2009.
 

2007

Drygajlo, A., Man-machine voice communication, pages 433-461, EPFL Press, 2007. [DOI]
 

Unknown year

Brodbeck, D., Mazza, R. and Lalanne, D., Interactive visualization - a survey, 0000.
 
Dumas, B., Lalanne, D. and Oviatt, S., Multimodal interfaces: a survey of principles, models and frameworks, 0000.
 
Mugellini, E., Lalanne, D., Dumas, B., Evéquoz, F., Gerardi, S., Le Calvé, A., Boder, A., Ingold, R. and Khaled, O., Memodules as tangible shortcuts to multimedia information, 0000.
 

Publications of type: Incollection

2009

Morrison, D., Bruno, E. and Marchand-Maillet, S., capturing the semantics of user interaction: a review and case study, in: Emergent Web Intelligence, Springer, 2009.
 
Popescu-Belis, A., Carletta, J., Kilgour, J. and Poller, P., Accessing a large multimodal corpus using an automatic content linking device, in: Multimodal Corpora, Springer-Verlag, 2009.
 
Keshet, J. and Chazan, D., A Kernel Wrapper for Phoneme Sequence Recognition, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keshet, J., Shalev-Shwartz, S., Singer, Y. and Chazan, D., A Large Margin Algorithm for Forced Alignment, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Keshet, J., A Proposal for a Kernel-based Algorithm for Large Vocabulary Continuous Speech Recognition, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Grangier, D., Keshet, J. and Bengio, S., Discriminative Keyword Spotting, in: Automatic Speech and Speaker Recognition: Large Margin and Kernel Methods, John Wiley and Sons, 2009.
 
Deville, B., Bologna, G., Vinckenbosch, M. and Pun, T., See color: seeing colours with an orchestra, in: Human Machine Interaction: Research Results of the MMI Program, pages 251-279, Springer, 2009.
 

2008

Bertolami, R. and Bunke, H., Ensemble methods to improve the performance of an english handwritten text line recognizer, in: Arabic and Chinese Handwriting Recognition, pages 265-277, Springer, 2008.
 
Bunke, H., Dickinson, P., Neuhaus, M. and Stettler, M., Matching of hypergraphs -- algorithms, applications, and experiments, in: Applied Pattern Recognition, pages 131-154, Springer, 2008.
 
Dutoit, T., Couvreur, L. and Bourlard, H., How does a dictation machine recognize speech ?, in: Applied Signal Processing--A MATLAB approach, pages 104-148, Springer MA, 2008.
 
Schlapbach, A. and Bunke, H., Off-line writer identification and verification using gaussian mixture models, in: Machine Learning in Document Analysis and Recognition, pages 409-428, Springer, 2008.
 
Stolcke, A., Anguera, X., Boakye, K., Cetin, O., Janin, A., Magimai-Doss, M., Wooters, C. and Zheng, J., The SRI-ICSI spring 2007 meeting and lecture recognition system, in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
 
Wooters, C. and Huijbregts, M., The ICSI RT07s speaker diarization system, in: Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, 2008.
 
Varga, T. and Bunke, H., Perturbation models for generating synthetic training data in handwriting recognition, in: Machine Learning in Document Analysis and Recognition, pages 333-360, Springer, 2008.
 
Popescu-Belis, A., Boertjes, E., Kilgour, J., Poller, P., Castronovo, S., Wilson, T., Jaimes, A. and Carletta, J., The amida automatic content linking device: just-in-time document retrieval in meetings, in: Machine Learning for Multimodal Interaction V (Proceedings of MLMI 2008, Utrecht, 8-10 September 2008), pages 273-284, Springer-Verlag, 2008.
 
Popescu-Belis, A., Baudrion, P., Flynn, M. and Wellner, P., Towards an objective test for meeting browsers: the bet4tqb pilot experiment, in: Machine Learning for Multimodal Interaction IV, pages 108-119, Springer-Verlag, 2008. [DOI]
 

2007

Bunke, H. and Neuhaus, M., Graph matching -- exact and error-tolerant methods and the automatic learning of edit costs, in: Mining Graph Data, pages 17-34, Wiley, 2007.
 
Ferrez, P. W. and Millán, J. del R., Error-related eeg potentials in brain-computer interfaces, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
Millán, J. del R., Buttfield, A., Vidaurre, C., Krauledat, M., Schlögl, A., Shenoy, P., Blankertz, B., Rao, R. P. N., Cabeza, R., Pfurtscheller, G. and Müller, K. -R., Adaptation in brain-computer interfaces, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
Millán, J. del R., Ferrez, P. W. and Buttfield, A., The idiap brain-computer interface: an asynchronous multi-class approach, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
del R. Millán, J., Tapping the mind or resonating minds?, in: European Visions for the Knowledge Age, Cheshire Henbury, 2007.
 
Shriberg, E., Higher level features in speaker recognition, in: Speaker Classification I, Lecture Notes in Computer Science, Springer, 2007.
 
Peralta Menendez, R. Grave de, González Andino, S. L., Ferrez, P. W. and Millán, J. del R., Non-invasive estimates of local field potentials for brain-computer interfaces, in: Towards Brain-Computer Interfacing, The MIT Press, 2007.
 
Fasel, B. and van Gool, L., Interactive museum guide: accurate retrieval of object descriptions, in: Adaptive Multimedia Retrieval: User, Context, and Feedback, pages 179-191, Springer, 2007.
 

2006

Everingham, M., Zisserman, A., Williams, C., van Gool, L., Allan, M., Bishop, C., Chapelle, O., Dalal, N., Deselaers, T., Dorko, G., Duffner, S., Eichhorn, J., Farquhar, J., Fritz, M., Garcia, C., Griffiths, T., Jurie, F., Keysers, D., Koskela, M., Laaksonen, J., Larlus, D., Leibe, B., Meng, H., Ney, H., Schiele, B., Schmid, C., Seemann, E., Shawe-Taylor, J., Storkey, A., Szedmak, S., Triggs, B., Ulusoy, I., Viitaniemi, V. and Zhang, J., The 2005 pascal visual object class challenge, in: Selected Proceedings of the 1st PASCAL Challenges Workshop, Lecture Notes in AI, Springer, 2006.
 
del R. Millán, J., Renkens, F., Mouriño, J. and Gerstner, W., Non-invasive brain-actuated control of a mobile robot by human eeg, in: 2006 IMIA Yearbook of Medical Informatics, Schattauer Verlag, 2006.
 

Unknown year

Popescu-Belis, A., Multimodal database annotation formats and standards, software architecture for multimodal interfaces, in: Multimodal Signal Processing: Methods and Techniques to Build Multimodal Interactive Systems, Academic Press, 0000.
 

Publications of type: Inproceedings

2009

Ali, K., Fleuret, F., Hasler, D. and Fua, P., Joint learning of pose estimators and features for object detection, in: Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2009.
 
Aradilla, G., Bourlard, H. and Magimai-Doss, M., Posterior features applied to speech recognition tasks with user-defined vocabulary, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
 
Ba, S., Hung, H. and Odobez, J. -M., Visual activity context for focus of attention estimation in dynamic meetings, in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2009.
 
Baechler, M., Bloechle, J. -L., Humm, A., Ingold, R. and Hennebert, J., Labeled images verification using gaussian mixture models, in: Proceedings of 24th Annual ACM Symposium on Applied Computing (ACM SAC'09), pages 1331-1336, 2009.
 
Beekhof, F., Voloshynovskiy, S., Koval, O. and Holotyak, T., Multi-class classifiers based on binary classifiers: performance, efficiency, and minimum coding matrix distances, in: MLSP 2009, 2009.
 
Bertini, E. and Lalanne, D., Surveying the complementary roles of automatic data analysis and visualization in knowledge discovery, in: Proceedings of ACM SIGKDD Workshop on Visual Analytics and Knowledge Discovery, VAKD '09, 15th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (VAKD 2009), pages 12-20, 2009.
 
Bloechle, J. -L., Lalanne, D. and Ingold, R., Ocd: an optimized and canonical document format, in: Proceedings of 10th IEEE International Conference on Document Analysis and Recognition (ICDAR 2009), pages 236-240, 2009.
 
Bologna, G., Deville, B. and Pun, T., Blind navigation along a sinuous path by means of the see color interface, in: IWINAC2009, 3rd International Work-conference on the Interplay between Natural and Artificial Computation, Santiago de Compostela, Spain, June 22--27, 2009.
 
Bologna, G., Malandain, S., Deville, B. and Pun, T., The multi-touch see color interface, in: ICTA 2009, The 2nd International Conference on Information and Communication Technologies and Accessibility, Hammamet, Tunisia, May 7--9, 2009.
 
Bruno, E. and Marchand-Maillet, S., multiview clustering: a late fusion approach using latent models, in: Proceedings of the 32nd ACM Special Interest Group on Information Retrieval Conference, SIGIR 09, 2009.
 
Dines, J., Yamagishi, J. and King, S., Measuring the gap between HMM-based ASR and TTS, in: Proceedings of Interspeech, Brighton, U.K., 2009.
 
Dines, J., Saheer, L. and Liang, H., Speech recognition with speech synthesis models by marginalising over decision tree leaves, in: Proceedings of Interspeech, Brighton, U.K., 2009.
 
Drygajlo, A., Li, W. and Zhu, K., Q-stack aging model for face verification, in: 17th European Signal Processing Conference, 2009.
 
Duffner, S., Odobez, J. -M. and Ricci, E., Dynamic Partitioned Sampling For Tracking With Discriminative Features, in: Proceedings of the British Maschine Vision Conference, London, 2009.
 
Dumas, B., Lalanne, D. and Ingold, R., Benchmarking fusion engines of multimodal interactive systems, in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
 
Favre, S., Dielmann, A. and Vinciarelli, A., Automatic Role Recognition in Multiparty Recordings Using Social Networks and Probabilistic Sequential Models, in: ACM International Conference on Multimedia, To Appear, 2009.
 
Friedland, G., Vinyals, O., Huang, Y. and Muller, C., Fusion of short-term and long-term features for improved speaker diarization, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4077-4080, 2009.
 
Friedland, G., Hung, H. and Yeo, C., Multi-modal speaker diarization of real-world meetings using compressed-domain video features, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, pages 4069-4072, 2009.
 
Friedland, G., Yeo, C. and Hung, H., Visual Speaker Localization Aided by Acoustic Models, in: ACM Multimedia, 2009.
 
Friedland, G., Yeo, C. and Hung, H., Visual speaker localization aided by acoustic models (full paper), in: Proceedings of ACM Multimedia, Beijing, China, 2009.
 
Frinken, V. and Bunke, H., Evaluating retraining rules for semi-supervised learning in neural network based cursive word recognition, in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 31-35, 2009.
 
Frinken, V., Riesen, K. and Bunke, H., Improving graph classification by isomap, in: Graph-Based Representations in Pattern Recognition, pages 205-214, Springer, 2009.
 
Frinken, V. and Bunke, H., Self-training strategies for handwriting word recognition, in: Proc. Industrial Conf. Advances in Data Mining. Applications and Theoretical Aspects, pages 291-300, Springer, 2009.
 
Galbally, J., McCool, C., Fierrez, J., Marcel, S. and Ortega-Garcia, J., Hill-Climbing Attack to an Eigenface-Based Face Verification System, in: Proceedings of the First IEEE International Conference on Biometrics, Identity and Security (BIdS), 2009.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, pages 355-362, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, in: 12th International Conference on Text, Speech and Dialogue, TSD 2009, Springer - Verlag, Berlin Heidelberg 2009, Pilsen, Czech Republic, 2009.
 
Garau, G., Ba, S., Bourlard, H. and Odobez, J. -M., Investigating the use of Visual Focus of Attention for Audio-Visual Speaker Diarisation, in: Proceedings of the ACM International Conference on Multimedia, Beijing, China, 2009.
 
Garg, N., Favre, B., Riedhammer, K. and Hakkani-Tur, D., Clusterrank: a graph based method for meeting summarization, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Garner, P. N., Dines, J., Hain, T., El Hannani, A., Karafiat, M., Korchagin, D., Lincoln, M., Wan, V. and Zhang, L., Real-Time ASR from Meetings, in: Proceedings of Interspeech, Brighton, UK., 2009.
 
Garner, P. N., SNR Features for Automatic Speech Recognition, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
 
Gatica-Perez, D., Automatic nonverbal analysis of social interaction in small groups: a review, in: Image and Vision Computing, Special Issue on Human Naturalistic Behavior, in press, 2009.
 
Gelbart, D., Morgan, N. and Tsymbal, A., Hill-climbing feature selection for multi-stream asr, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Gillick, D., Riedhammer, K., Favre, B. and Hakkani-Tur, D., A global optimization framework for meeting summarization, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
 
Gonzalez, G., Fleuret, F. and Fua, P., Learning rotational features for filament detection, in: Proceedings of the IEEE international conference on Computer Vision and Pattern Recognition (CVPR), 2009.
 
Gonzalez, G., Aguet, F., Fleuret, F., Unser, M. and Fua, P., Steerable features for statistical 3d dendrite detection, in: Proceedings of the International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), 2009.
 
Gottlieb, L. and Friedland, G., On the use of artificial conversation data for speaker recognition in cars, in: IEEE International Conference for Semantic Computing, Berkeley, USA, 2009.
 
Hakkani-Tur, D., Towards automatic argument diagramming of multiparty meetings, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Taipei, Taiwan, 2009.
 
Humm, A., Ingold, R. and Hennebert, J., Spoken handwriting for user authentication using joint modelling systems, in: Proceedings of 6th International Symposium on Image and Signal Processing and Analysis (ISPA'09), 2009.
 
Imseng, D. and Friedland, G., Robust Speaker Diarization for Short Speech Recordings, in: Proceedings of the IEEE workshop on Automatic Speech Recognition and Understanding, Merano, Italy, 2009.
 
Indermühle, E., Liwicki, M. and Bunke, H., Combining alignment results for historical handwritten document analysis, in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 1186-1190, 2009.
 
Ivanov, I., Dufaux, F., Ha, T. M. and Ebrahimi, T., Towards Generic Detection of Unusual Events in Video Surveillance, in: 6th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSSâ09), Genoa, Italy, 2009.
 
Jayagopi, D., Bogdan, R. and Gatica-Perez, D., Characterising Conversationsal Group Dynamics Using Nonverbal Behaviour, in: Proceedings ICME 2009, 2009.
 
Jayagopi, D. and Gatica-Perez, D., Discovering group nonverbal conversational patterns with topics, in: accepted for publication in Proc. ICMI-MLMI, 2009.
 
Koval, O., Voloshynovskiy, S., Caire, F. and Bas, P., On security threats for robust perceptual hashin, in: Electronic Imaging 2009, 2009.
 
Kumatani, K., McDonough, J., Rauch, B., Garner, P. N., Li, W. and Dines, J., Maximum kurtosis beamforming with the generalized sidelobe canceller, in: Proceedings of INTERSPEECH, September 2008, Brisbane, Australia, 2009.
 
Lalanne, D., Nigay, L., Palanque, P., Robinson, P., Vanderdonckt, J. and Ladry, J. -F., Fusion engines for multimodal interfaces: a survey, in: Proceedings of International Conference on Multimodal Interfaces and Workshop on Machine Learning for Multi-modal Interaction (ICMI-MLMI 2009), 2009.
 
Le, Q. A. and Popescu-Belis, A., Automatic vs. human question answering over multimedia meeting recordings, in: Interspeech 2009 (10th Annual Conference of the International Speech Communication Association), 2009.
 
Lee, J. -S., De Simone, F. and Ebrahimi, T., Video coding based on audio-visual attention, in: IEEE International Conference on Multimedia and Expo (ICME'09), New York, USA, 2009.
 
Lefèvre, S. and Odobez, J. -M., Structure and appearance features for robust 3d facial actions tracking, in: International Conference on Multimedia and Expo (ICME), 2009.
 
Li, W., Dines, J., Magimai-Doss, M. and Bourlard, H., Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition, in: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2009.
 
Luo, J., Orabona, F. and Caputo, B., An online framework for learning novel concepts over multiple cues, in: Proceeding of The 9th Asian Conference on Computer Vision, Xi'an, China, 2009.
 
Marchand-Maillet, S., Szekely, E. and Bruno, E., Optimizing strategies for the exploration of social-networks and associated data collections, in: Proceedings of the International Workshop on Image Analysis for Multimedia Interactive Services (WIAMIS'09) - Special session on "People, Pixels, Peers: Interactive Content in Social Networks", 2009.
 
McCool, C. and Marcel, S., Parts-Based Face Verification using Local Frequency Bands, in: in Proceedings of IEEE/IAPR International Conference on Biometrics, 2009.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Modelling long-term relevance feedback, in: Proceedings of the ECIR Workshop on Information Retrieval over Social Networks, 2009.
 
Motlicek, P., Ganapathy, S. and Hermansky, H., Arithmetic Coding of Sub-Band Residuals in FDLP Speech/Audio Codec, in: 10th Annual Conference of the International Speech Communication Association, pages 2591-2594, ISCA 2009, ISCA, Brighton, England, 2009.
 
Motlicek, P., Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, in: 10thAnnual Conference of the International Speech Communication Association, pages 1215-1218, ISCA, Brighton, England, 2009.
 
Motlicek, P., Automatic Out-of-Language Detection Based on Confidence Measures Derived fromLVCSR Word and Phone Lattices, in: 10thAnnual Conference of the International Speech Communication Association, ISCA, 2009.
 
Noceti, N., Caputo, B., Castellini, C., Baldassarre, L., Barla, A., Rosasco, L., Odone, F. and Sandini, G., Towards a theoretical framework for learning multi-modal patterns for embodied agents, in: International Conference on Image Analysis and Processing, 2009.
 
Orabona, F., Caputo, B., Fillbrandt, A. and Ohl, F., A theoretical framework for transfer of knowledge across modalities in artificial and cognitive systems, in: International Conference on Developmental Learning, 2009.
 
Orabona, F., Castellini, C., Caputo, B., Fiorilla, A. E. and Sandini, G., Model adaptation with least-square SVM for adaptive hand prosthetics, in: IEEE International conference on Robotics and Automation, 2009.
 
Parthasarathi, S. H. K., Magimai-Doss, M., Bourlard, H. and Gatica-Perez, D., Investigating Privacy-Sensitive Features for Speech Detection in Multiparty Conversations, in: Proceedings of Interspeech 2009, 2009.
 
Parthasarathi, S. H. K., Magimai-Doss, M., Gatica-Perez, D. and Bourlard, H., Speaker Change Detection with Privacy-Preserving Audio Cues, in: Proceedings of ICMI-MLMI 2009, 2009.
 
Perrin, X., Colas, F., Pradalier, C. and Siegwart, R., Learning to identify users and predict their destination in a robotic guidance application, in: Field and Service Robotics (FSR), 2009.
 
Pinto, J. P., Sivaram, G. S. V. S., Hermansky, H. and Magimai-Doss, M., Volterra Series for Analyzing MLP based Phoneme Posterior Probability Estimator, in: Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2009.
 
Popescu-Belis, A., Poller, P., Kilgour, J., Boertjes, E., Carletta, J., Castronovo, S., Fapso, M., Flynn, M., Nanchen, A., Wilson, T., Wit, J. de and Yazdani, M., A multimedia retrieval system using speech input, in: ICMI-MLMI 2009 (11th International Conference on Multimodal Interfaces and 6th Workshop on Machine Learning for Multimodal Interaction), 2009.
 
Raducanu, B. and Gatica-Perez, D., You are fired! Nonverbal role analysis in competitive meetings, in: Proc. ICASSP, Taiwan, 2009.
 
Rajan, P., Parthasarathi, S. H. K. and Murthy, H., Robustness of Phase based Features for Speaker Recognition, in: Proceedings of Interspeech, 2009.
 
Ricci, E. and Odobez, J. -M., Real-time simultaneous head tracking and pose estimation, in: IEEE International Conference on Image Processing (ICIP), 2009.
 
Richiardi, J., Kryszczuk, K. and Drygajlo, A., Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification, in: Advances in Biometrics, Lecture Notes in Computer Science 5558, pages 1200-1208, 2009.
 
De Simone, F., Dufaux, F., Ebrahimi, T., Delogu, C. and Baroncini, V., A subjective study of the influence of color information on visual quality assessment of high resolution pictures, in: Fourth International Workshop on Video Processing and Quality Metrics for Consumer Electronics (VPQM-09), Scottsdale, Arizona, USA, 2009.
 
Tommasi, T. and Caputo, B., The more you know, the less you learn: from knowledge transfer to one-shot learning of object categories, in: BMVC, 2009.
 
Ullah, M. M., Orabona, F. and Caputo, B., You live, you learn, you forget: continuous learning of visual places with a forgetting mechanism, in: International Conference on Robotic and Systems, 2009.
 
Valente, F., Magimai-Doss, M., Plahl, C. and Suman, R., Hierarchical Processing of the Modulation Spectrum for GALE Mandarin LVCSR system, in: Proceedings of the 10thAnnual Conference of the International Speech Communication Association (Interspeech), Brighton, 2009.
 
Vijayasenan, D., Valente, F. and Bourlard, H., KL Realignment for Speaker Diarization with Multiple Feature Streams, in: 10th Annual Conference of the International Speech Communication Association, 2009.
 
Vijayasenan, D., Valente, F. and Bourlard, H., MUTUAL INFORMATION BASED CHANNEL SELECTION FOR SPEAKER DIARIZATION OF MEETINGS DATA, in: Proceedings of International Conference on Acoustics, Speech and Signal Processing, 2009.
 
Vijayasenan, D., Valente, F. and Bourlard, H., Mutual Information based Channel Selection for Speaker Diarization of Meetings Data, in: Proceedings of International conference on acoustics speech and signal processing, 2009.
 
Vinciarelli, A., Suditu, N. and Pantic, M., Implicit Human Centered Tagging, in: Proceedings of IEEE Conference on Multimedia and Expo, pages 1428-1431, 2009.
 
Voloshynovskiy, S., Koval, O., Beekhof, F. and Holotyak, T., Binary robust hashing based on probabilistic bit reliability, in: IEEE Workshop on Statistical Signal Processing 2009, 2009.
 
Voloshynovskiy, S., Koval, O., Beekhof, F. and Pun, T., Random projections based item authentication, in: Electronic Imaging 2009, 2009.
 
Wuthrich, M., Liwicki, M., Fischer, A., Indermühle, E., Bunke, H., Viehhauser, G. and Stolz, M., Language model integration for the recognition of handwritten medieval documents, in: Proc. 10th Int. Conf. on Document Analysis and Recognition, pages 211-215, 2009.
 
Wöllmer, M., Eyben, F., Keshet, J., Graves, A., Schuller, B. and Rigoll, G., Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech using Bidirectional LSTM Networks, in: IEEE International Conference on Acoustic, Speech, and Signal Processing, 2009.
 
Xie, S., Favre, B., Hakkani-Tur, D. and Liu, Y., Leveraging sentence weights in a concept-based optimization framework for extractive meeting summarization, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 
Yao, J. and Odobez, J. -M., Multi-camera multi-person 3d space tracking with mcmc in surveillance scenarios, in: European Conference on Computer Vision, workshop on Multi Camera and Multi-modal Sensor Fusion Algorithms and Applications (ECCV-M2SFA2), Marseille, 2009.
 
Zhao, S. Y., Ravuri, R. and Morgan, N., Multi-stream to many-stream: using spectro-temporal features for asr, in: 10th International Conference of the International Speech Communication Association, Brighton, UK, 2009.
 

2008

Anemuller, J., Back, J. -H., Caputo, B., Luo, J., Ohl, F., Orabona, F., Vogels, R., Weinshall, D. and Zweig, A., Biologically Motivated Audio-Visual Cue Integration for Object, in: Proceedings of the first Internatinal Conference on Cognitive Systems, 2008.
 
Anemuller, J., Back, J. -H., Caputo, B., Havlena, M., Luo, J., Kayser, H., Leibe, B., Motlicek, P., Pajdla, T., Pavel, M., Torii, A., van Gool, L., Zweig, A. and Hermansky, H., The DIRAC AWEAR Audio-Visual Platform for Detection of Unexpected and Incongruent Events, in: Proceedings of the International Conference on Multimodal Interfaces, 2008.
 
Ba, S. and Odobez, J. -M., Multi-party focus of attention recognition in meetings from head pose and multimodal contextual cues, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2008.
 
Ba, S. and Odobez, J. -M., Visual focus of attention estimation from head pose posterior probability distributions, in: IEEE Proc. Int. Conf. on Multimedia and Expo (ICME), 2008.
 
Beekhof, F., Voloshynovskiy, S., Koval, O. and Villán, R., Secure surface identification codes, in: Steganography, and Watermarking of Multimedia Contents X, 2008. [DOI]
 
Berclaz, J., Fleuret, F. and Fua, P., Multi-camera tracking and atypical motion detection with behavioral maps, in: The 10th European Conference on Computer Vision (ECCV 2008), Marseille, France, 2008.
 
Berclaz, J., Fleuret, F. and Fua, P., Multi-camera tracking and atypical motion detection with behavioral maps, in: Proceedings of the European Conference on Computer Vision (ECCV), pages 112-125, 2008.
 
Berclaz, J., Fleuret, F. and Fua, P., Principled Detection-by-classification from Multiple Views, in: proceedings of the International Conference on Computer Vision Theory and Applications, pages 375-382, 2008.
 
Bertolami, R. and Bunke, H., Including language model information in the combination of handwritten text line recognizers, in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 25-30, 2008.
 
Bertolami, R., Gutmann, C., Spitz, L. and Bunke, H., Shape code based lexicon reduction for offline handwriting recognition, in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 158-163, 2008.
 
Boakye, K., Trueba-Hornero, B., Vinyals, O. and Friedland, G., Overlapped speech detection for improved speaker diarization in multiparty meetings, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Boakye, K., Vinyals, O. and Friedland, G., Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech, in: Interspeech, 2008.
 
Boakye, K., Vinyals, O. and Friedland, G., Two's a crowd: improving speaker diarization by automatically identifying and excluding overlapped speech, in: Interspeech 2008, Brisbane, Australia, pages 32-35, 2008.
 
Bologna, G., Deville, B., Vinckenbosch, M. and Pun, T., a perceptual interface for vision substitution in a color matching experiment, in: Proceeding on IEEE IJCNN, IEEE World congress on computational intelligence, 2008.
 
Bologna, G., Deville, B., Vinckenbosch, M. and Pun, T., Pairing colored socks and following a red serpentine with sounds of musical instruments, in: ICAD 08, International Conference on Auditory Displays, Paris, France, June 24--27, 2008.
 
Bourlard, H. and Renals, S., Recognition and understanding of meetings overview of the european ami and amida projects, in: LangTech 2008, Rome, 2008.
 
Breitenstein, M. D., Kuettel, D., Weise, T., van Gool, L. and Pfister, H., Real-time face pose estimation from single range images, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
 
Carincotte, C., Naturel, X., Hick, M., Odobez, J. -M., Yao, J., Bastide, A. and Corbucci, B., Understanding Metro Station Usage using Closed Circuit Television Cameras Analysis, in: 11th International IEEE Conference on Intelligent Transportation Systems (ITSC), Bejing, 2008.
 
Carreras, A., Cordara, G., Delgado, J., Dufaux, F., Francini, G., Ha, T. M., Rodriguez, E. and Tous, R., A search and retrieval framework for the management of copyrighted audiovisual content, in: 50th International Symposium ELMAR 2008, Zadar, Croatia, 2008.
 
Chanel, G., Rebetez, C., Betrancourt, M. and Pun, T., boredom, engagement and anxiety as indicators for adaptation to difficulty in games, in: ACM Mindtrek conference, 2008.
 
Chavarriaga, R., Galán, F. and Millán, J. del R., Asynchronous detection and classification of oscillatory brain activity, in: 16 European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
 
van den Berg, M., Koller-Meier, E. and van Gool, L., Fast body posture estimation using volumetric features, in: IEEE Visual Motion Computing (MOTION), 2008.
 
Deville, B., Bologna, G., Vinckenbosch, M. and Pun, T., Guiding the focus of attention of blind people with visual saliency, in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), Satellite Workshop of theEuropean Conference on Computer Vision (ECCV 2008), Marseille, France, October 18, 2008.
 
Deville, B., Bologna, G., Vinckenbosch, M. and Pun, T., guiding the focus of attention of blind people with visual saliency, in: Workshop on Computer Vision Applications for the Visually Impaired (CVAVI 08), 2008.
 
Dollé, L., Khamassi, M., Girard, B., Guillot, A. and Chavarriaga, R., Analyzing interactions between navigation strategies using a computational model of action selection, in: Spatial Cognition 2008 (SC '08), pages 71-86, Freiburg, Germany, 2008.
 
Dufaux, F. and Ebrahimi, T., H.264/AVC Video Scrambling for Privacy Protection, in: IEEE International Conference on Image Processing (ICIP2008), San Diego, 2008.
 
Dumas, B., Lalanne, D. and Ingold, R., Demonstration : hephaistk, une bo\^\ite à outils pour le prototypage d'interfaces multimodales, in: Proceedings of 20e Conférence sur l'Interaction Homme-Machine (IHM 08), pages 215-216, 2008.
 
Dumas, B., Lalanne, D. and Ingold, R., Prototyping multimodal interfaces with smuiml modeling language, in: Proceedings of CHI 2008 Workshop on UIDLs for Next Generation User Interfaces (CHI 2008 workshop), pages 63-66, 2008.
 
Dumas, B., Lalanne, D., Guinard, D., Koenig, R. and Ingold, R., Strengths and weaknesses of software architectures for the rapid creation of tangible and multimodal interfaces, in: Proceedings of 2nd international conference on Tangible and Embedded Interaction (TEI 2008), pages 47-54, 2008.
 
Ess, A., Leibe, B., Schindler, K. and van Gool, L., A mobile vision system for robust multi-person tracking, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
 
Estrella, P., Popescu-Belis, A. and King, M., Improving contextual quality models for mt evaluation based on evaluators' feedback., in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
 
Faria, A. and Morgan, N., Corrected tandem features for acoustic model training, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Favre, S., Salamin, H., Vinciarelli, A., Hakkani-Tur, D. and Garg, N., Role recognition for meeting participants: an approach based on lexical information and social network analysis, in: ACM International Conference on Multimedia, Vancouver, Canada, 2008.
 
Favre, S., Salamin, H. and Vinciarelli, A., Role recognition in multiparty recordings using social affiliation networks and discrete distributions, in: The Tenth International Conference on Multimodal Interfaces (ICMI 2008), Chania, Greece, 2008.
 
Ferrez, P. W. and Millán, J. del R., Eeg-based brain-computer interaction: improved accuracy by automatic single-trial error detection, in: Advances in Neural Information Processing Systems 20, pages 441-448, Cambridge, MA, 2008.
 
Ferrez, P. W. and Millán, J. del R., Simultaneous real-time detection of motor imagery and error-related potentials for improved bci accuracy, in: Proceedings of the 4th International Brain-Computer Interface Workshop and Training Course, 2008.
 
Friedland, G. and Vinyals, O., Live speaker identification in conversations, in: ACM Multimedia 2008, Vancouver, Canada, pages 1017-1018, 2008.
 
Galán, F., Nuttin, M., Vanhooydonck, D., Lew, E., Ferrez, P. W., Philips, J. and Millán, J. del R., Continuous brain-actuated control of an intelligent wheelchair by human eeg, in: 4th International Brain-Computer Interface Workshop & Training Course, Graz University of Technology, Graz, Austria, 2008.
 
Gammeter, S., Ess, A., Jaeggli, T., Leibe, B., Schindler, K. and van Gool, L., Articulated multibody tracking under egomotion, in: European Conference on Computer Vision (ECCV'08), Springer, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Autoregressive modelling of hilbert envelopes for wide-band audio coding, in: AES 124th Convention, Audio Engineering Society, Amsterdam, 2008.
 
Ganapathy, S., Thomas, A. and Hermansky, H., Front-end for far-field speech recognition based on frequency domain linear prediction, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Spectral noise shaping: improvements in speech/audio codec based on linear prediction in spectral domain, in: INTERSPEECH 2008, Brisbane, Australia, 2008.
 
Ganapathy, S., Motlicek, P., Hermansky, H. and Garudadri, H., Temporal masking for bit-rate reduction in audio codec based on frequency domain linear prediction, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4781-4784, Las Vegas, NV, 2008. [DOI]
 
Garipelli, G., Chavarriaga, R. and Millán, J. del R., Recognition of anticipatory behavior from human eeg, in: 4th Intl. Brain-Computer Interface Workshop and Training Course, Graz University, Austria, 2008.
 
Garner, P. N., Silence models in weighted finite-state transducers, in: Interspeech, Brisbane, Australia, 2008.
 
Gatica-Perez, D. and Farrahi, K., Daily routine classification from mobile phone data, in: Workshop on Machine Learning and Multimodal Interaction (MLMI08), Utrecht, The Netherlands, 2008.
 
Gatica-Perez, D. and Farrahi, K., Discovering human routines from cell phone data with topic models, in: IEEE International Symposium on Wearable Computers (ISWC), Pittsburgh, Pennsylvania, 2008.
 
Gatica-Perez, D. and Farrahi, K., What did you do today? discovering daily routines from large-scale mobile data, in: ACM International Conference on Multimedia (ACMMM), Vancouver, 2008.
 
Goldmann, L., Adamek, T., Vajda, P., Karaman, M., Mörzinger, R., Galmar, E., Sikora, T., O'Connor, N., Ha-Minh, T., Ebrahimi, T., Schallauer, P. and Huet, B., Towards Fully Automatic Image Segmentation Evaluation, in: Advanced Concepts for Intelligent Vision Systems (ACIVS), Springer, Juan-les-Pins, 2008.
 
Gonzalez, G., Fleuret, F. and Fua, P., Automated delineation of dendritic networks in noisy image stacks, in: Proceedings of the European Conference on Computer Vision (ECCV), pages 214-227, 2008.
 
Gonzalez, G., Fleuret, F. and Fua, P., Automated delineation of dendritic networks in noisy image stacks, in: The 10th European Conference on Computer Vision, Marseille, France, 2008.
 
Grandvalet, Y., Rakotomamonjy, A., Keshet, J. and Canu, S., Support Vector Machines with a Reject Option, in: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems, 2008.
 
Grossmann, E., Gaspar, J. -A. and Orabona, F., Calibration from statistical properties of the visual world, in: European Conf. on Computer Vision, 2008.
 
Gurban, M., Thiran, J. -Ph., Drugman, T. and Dutoit, T., Dynamic modality weighting for multi-stream HMMs in Audio-Visual Speech Recognition, in: 10th International Conference on Multimodal Interfaces, Chania, Greece, 2008.
 
Gurban, M. and Thiran, J. -Ph., Using entropy as a stream reliability estimate for audio-visual speech recognition, in: 16th European Signal Processing Conference, Lausanne, Switzerland, 2008.
 
Hoffmann, U., Yazdani, A., Vesin, J. M. and Ebrahimi, T., Bayesian feature selection applied in a p300 brain- computer interface, in: 16th European Signal Processing Conference, Lausanne, 2008.
 
Hoffmann, U., Naruniec, J., Yazdani, A. and Ebrahimi, T., Face Detection Using Discrete Gabor Jets And Color Information, in: SIGMAP 2008 - International Conference on Signal Processing and Multimedia Applications, Porto, 2008.
 
Hung, H., Huang, Y., Yeo, C. and Gatica-Perez, D., Associating audio-visual activity cues in a dominance estimation framework, in: CVPR Workshop on Human Communicative Behavior, 2008.
 
Hung, H., Huang, Y., Friedland, G. and Gatica-Perez, D., Estimating the dominant person in multi-party conversations using speaker diarization strategies, in: ICASSP 08, 2008.
 
Hung, H. and Gatica-Perez, D., Identifying dominant people in meetings from audio-visual sensors, in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition, Special Session on Multimodal HCI for Smart Environments, 2008.
 
Hung, H. and Gatica-Perez, D., Identifying dominant people in meetings from audio-visual sensors, in: Proc. IEEE Int. Conf. on Automatic Face and Gesture Recognition (FG), Special Session on Multi-Sensor HCI for Smart Environments, 2008.
 
Hung, H., Jayagopi, D., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Investigating automatic dominance estimation in groups from visual attention and speaking activity, in: International Conference on Multimodal Interfaces (ICMI), 2008.
 
Hung, H., Jayagopi, D., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Investigating automatic dominance estimation in groups from visual attention and speaking activity, in: Proc. ICMI, 2008.
 
Hung, H. and Friedland, G., Towards audio-visual on-line diarization of participants in group meetings, in: European Conference on Computer Vision (ECCV) 2008, Marseille, France, 2008.
 
Indermühle, E., Liwicki, M. and Bunke, H., Recognition of handwritten historical documents: hmm -adaptation vs. writer specific training, in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 186-191, 2008.
 
Jayagopi, D., Raducanu, B. and Gatica-Perez, D., Characterizing conversational group dynamics using nonverbal behavior, in: Proc. IEEE Int. Conf. on Multimedia (ICME), 2008.
 
Jayagopi, D., Predicting the dominant clique in meetings through fusion of nonverbal cues, in: Proc. ACM Vancouver, Canada, 2008.
 
Jayagopi, D., Hung, H., Yeo, C. and Gatica-Perez, D., Predicting the dominant clique in meetings through fusion of nonverbal cues, in: ACM MM 2008, Vancouver, Canada, 2008.
 
Jayagopi, D., Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues, in: Proc. ICMI, 2008.
 
Jayagopi, D., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Predicting two facets of social verticality in meetings from five-minute time slices and nonverbal cues, in: Proc. Int. Conf. on Multimodal Interfaces (ICMI), Special Session on Social Signal Processing, 2008.
 
Ketabdar, H. and Bourlard, H., Hierarchical integration of phonetic and lexical knowledge in phone posterior estimation, in: International Conference on Acoustics, Speech, and Signal Processing, 2008.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Can feature information interaction help for information fusion in multimedia problems?, in: First International Workshop on Metadata Mining for Image Understanding, pages 23-33, 2008.
 
Kludas, J., Marchand-Maillet, S. and Bruno, E., Exploiting document feature interactions for efficient information fusion in high dimensional spaces, in: Proceedings of the First International Workshops on Image Processing Theory, Tools and Applications (IPTA'2008), 2008.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Exploiting synergistic and redundant features for multimedia document classification, in: 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), 2008.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Exploiting synergistic and redundant features for multimedia document classification, in: 32nd Annual Conference of the German Classification Society - Advances in Data Analysis, Data Handling and Business Intelligence (GfKl 2008), 2008.
 
Knox, M., Morgan, N. and Mirghafori, N., Getting the last laugh: automatic laughter segmentation in meetings, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 797-800, 2008.
 
Koval, O., Voloshynovskiy, S., Beekhof, F. and Pun, T., Analysis of physical unclonable identification based on reference list decoding, in: Steganography, and Watermarking of Multimedia Contents X, 2008.
 
Koval, O., Voloshynovskiy, S. and Pun, T., Privacy-preserving multimodal person and object identification, in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
 
Koval, O., Voloshynovskiy, S., Caire, F. and Bas, P., Privacy-preserving multimodal person and object identification, in: MM&Sec 2008, 2008.
 
Koval, O., Voloshynovskiy, S., Beekhof, F. and Pun, T., Security analysis of robust perceptual hashing, in: Steganography, and Watermarking of Multimedia Contents X, 2008.
 
Kryszczuk, K. and Drygajlo, A., Impact of feature correlations on separation between bivariate normal distributions, in: 19th International Conference on Pattern Recognition, 2008.
 
Kryszczuk, K. and Drygajlo, A., On quality of quality measures for classification, in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 19-28, 2008.
 
Kryszczuk, K. and Drygajlo, A., What do quality measures predict in biometrics, in: 16th European Signal Processing Conference, 2008.
 
Kumatani, K., McDonough, J., Klakow, D., Garner, P. N. and Li, W., Adaptive beamforming with a maximum negentropy criterion,, in: The Joint Workshop on Hands-free Speech Communication and Microphone Arrays, 2008.
 
Kumatani, K., McDonough, J., Schacht, S., Klakow, D., Garner, P. N. and Li, W., Filter bank design based on minimization of individual aliasing terms for minimum mutual information subband adaptive beamforming, in: International Conferance on Acoustics Speech and Signal Processing, 2008.
 
Li, W., Kumatani, K., Dines, J., Magimai-Doss, M. and Bourlard, H., A neural network based regression approach for recogninizing simultaneous speech, in: Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
 
Li, W., Effective post-processing for single-channel frequency-domain speech enhancement, pages 149-152, 2008. [DOI]
 
Li, W., Effective post-processing of single-channel frequency-domain speech enhancement, in: IEEE conference on multimedia and expo, 2008.
 
Li, W., Doss, M. M., Dines, J. and Bourlard, H., Mlp-based log spectral energy mapping for robust overlapping speech recognition, in: European Signal Processing Conference, 2008.
 
Li, W., Dines, J., Magimai-Doss, M. and Bourlard, H., Neural network based regression for robust overlapping speech recognition using microphone arrays, in: Interspeech, 2008.
 
Liwicki, M. and Bunke, H., Combining on-line and off-line blstm networks for handwritten text line recognition, in: Proc. 11th Int. Conf. on Frontiers in Handwriting Recognition, pages 31-36, 2008.
 
Liwicki, M., Schlapbach, A. and Bunke, H., Writer-dependent recognition of handwritten whiteboard notes in smart meeting room environments, in: Proc. 8th IAPR Int. Workshop on Document Analysis Systems, pages 151-157, 2008.
 
Luo, J., Caputo, B., Zweig, A., Back, J. -H. and Anemuller, J., Object category detection using audio-visual cues, in: International Conference on Computer Vision Systems (ICVS08), 2008.
 
Matena, L., Jaimes, A. and Popescu-Belis, A., Graphical representation of meetings on mobile devices, in: MobileHCI 2008 Demonstrations (10th ACM International Conference on Human-Computer Interaction with Mobile Devices and Services), 2008.
 
Meynet, J., Arsan, T., Cruz Mota, J. and Thiran, J. -Ph., Fast multi-view face tracking with pose estimation, in: 16th European Signal Processing Conference, Lausanne, 2008.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Semantic clustering of images using patterns of relevance feedback, in: Proceedings of the 6th International Workshop on Content-based Multimedia Indexing (CBMI'2008), 2008.
 
Motlicek, P., Ganapathy, S., Hermansky, H., Garudadri, H. and Athineos, M., Perceptually motivated Sub-band Decomposition for FDLP Audio Coding, in: Text, Speech and Dialogue, pages 435-442, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
 
Naturel, X. and Odobez, J. -M., Detecting queues at vending machines: a statistical layered approach, in: Proc. Int. Conf. on Pattern Recognition (ICPR), Tampa, 2008.
 
Negoescu, R. -A. and Gatica-Perez, D., Analyzing flickr groups, in: Proceedings of the 2008 international conference on Content-based image and video retrieval (CIVR '08), Sheraton Fallsview Hotel, Niagara Falls, Canada, 2008.
 
Negoescu, R. -A. and Gatica-Perez, D., Topickr: Flickr Groups and Users Reloaded, in: MM '08: Proc. of the 16th ACM Intl. Conf. on Multimedia, ACM, 2008.
 
Nijholt, A., Tan, D., Allison, B., Millán, J. del R., Moore, M. and Graimann, B., Brain-computer interfaces for hci and games, in: Proceedings of the 26th Annual CHI Conference on Human Factors in Computing Systems, Extended Abstracts, 2008.
 
Noris, B., Benmachiche, K. and Billard, A., Calibration-free eye gaze direction detection with gaussian processes, in: International Conference on Computer Vision Theory and Applications (VISAPP 2008), Funchal, Portugal, 2008.
 
Orabona, F., Keshet, J. and Caputo, B., The Projectron: a Bounded Kernel-Based Perceptron, in: Int. Conf. on Machine Learning, 2008.
 
Ouaret, M., Dufaux, F. and Ebrahimi, T., Enabling Privacy For Distributed Video Coding by Transform Domain Scrambling, in: 2008 SPIE Visual Communications and Image Processing, San Diego, USA, 2008.
 
Paiement, J. -F., Grandvalet, Y., Bengio, S. and Eck, D., A Distance Model for Rhythms, in: 25th International Conference on Machine Learning (ICML), 2008.
 
Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting Contextual Information for Speech/Non-Speech Detection, in: Text, Speech and Dialogue, pages 451-459, Springer-Verlag Berlin, Heidelberg, Brno, Czech Republic, 2008.
 
Pellegrini, S., Schindler, K. and D. Nardi, , A generalization of the icp algorithm for articulated bodies, in: British Machine Vision Conference (BMVC'08), 2008.
 
Perrin, X., Chavarriaga, R., Ray, C., Siegwart, R. and Millán, J. del R., A comparative psychophysical and eeg study of different feedback modalities for hri, in: Human-Robot Interaction (HRI08), 2008.
 
Pinto, J. P. and Hermansky, H., Combining evidence from a generative and a discriminative model in phoneme recognition, in: Proceedings of Interspeech 2008, Brisbane, Australia, 2008.
 
Pinto, J. P., Hermansky, H., Yegnanarayana, B. and Magimai-Doss, M., Exploiting contextual information for improved phoneme recognition, in: IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2008), pages 4449-4452, Las Vegas, NV, 2008. [DOI]
 
Pinto, J. P., Szoke, I., Prasanna, S. R. Mahadeva and Hermansky, H., Fast approximate spoken term detection from sequence of phonemes, in: The 31st Annual International ACM SIGIR Conference 20-24 July 2008, pages 28-33, Singapore,, 2008.
 
Pinto, J. P., Sivaram, G. S. V. S. and Hermansky, H., Reverse correlation for analyzing mlp posterior features in asr, in: 11th International Conference on Text, Speech and Dialogue (TSD), pages 469-476, Brno, Czech Republic, 2008. [DOI]
 
Popescu-Belis, A., Reference-based vs. task-based evaluation of human language technology, in: LREC 2008 ELRA Workshop on Evaluation: "Looking into the Future of Evaluation: When automatic metrics meet task-based and performance-based approaches", pages 12-16, ELRA, 2008.
 
Popescu-Belis, A., Flynn, M., Wellner, P. and Baudrion, P., Task-based evaluation of meeting browsers: from bet task elicitation to user behavior analysis, in: LREC 2008 (6th International Conference on Language Resources and Evaluation), 2008.
 
Pronobis, A., Martinez Monos, O. and Caputo, B., SVM-based Discriminative Accumulation Scheme for Place Recognition, in: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA08), 2008.
 
Quack, T., Bay, H. and van Gool, L., Object recognition for the internet of things, in: Internet of Things 2008, 2008.
 
Quack, T., Leibe, B. and van Gool, L., World-scale mining of objects and events from community photo collections, in: Conference on Image and Video Retrieval (CIVR'08), ACM, 2008.
 
Rayner, M., Tsourakis, N., Georgescul, M. and Bouillon, P., Building mobile spoken dialogue applications using regulus, in: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), 2008.
 
Richiardi, J., Drygajlo, A. and Todesco, L., Promoting diversity in gaussian mixture ensembles: an application to signature verification, in: Biometrics and Identity Management, Lecture Notes in Computer Science 5372, pages 140-149, 2008.
 
Roth, D., Koller-Meier, E., Rowe, D., Moeslund, T. B. and van Gool, L., Event-based tracking evaluation metric, in: IEEE Workshop on Motion and Video Computing (WMVC), 2008.
 
Scaringella, N., Timbre and Rhythmic TRAP-TANDEM features for music information retrieval, in: "Int. Conf. on Music Information Retrieval (ISMIR)", 2008.
 
Schindler, K. and van Gool, L., Action snippets: how many frames does human action recognition require?, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), IEEE Press, 2008.
 
Schindler, K. and van Gool, L., Combining densely sampled form and motion for human action recognition, in: DAGM Annual Pattern Recognition Symposium, Springer, 2008.
 
Schlapbach, A., Wettstein, F. and Bunke, H., Automatic estimation of the readability of handwritten text, in: Proc. 16th European Signal Processing Conference, 2008.
 
Schlapbach, A., Bunke, H. and Wettstein, F., Estimating the readability of handwritten text -- a support vector regression based approach, in: Proc. 19th Int. Conf. on Pattern Recognition, IEEE, 2008.
 
De Simone, F., Ticca, D., Dufaux, F., Ansorge, M. and Ebrahimi, T., A comparative study of color image compression standards using perceptually driven quality metrics, in: SPIE Optics and Photonics, San Diego, CA USA, 2008.
 
De Simone, F., Ansorge, M. and Ebrahimi, T., A multi-channel objective model for the full-reference assessment of color pictures, in: 2nd K-space Jamboree Workshop, Paris, 2008.
 
Sivaram, G. S. V. S. and Hermansky, H., Emulating temporal receptive fields of auditory mid-brain neurons for automatic speech recognition, in: Proc. 16th European Signal Processing Conference (EUSIPCO), Lausanne, 2008.
 
Sivaram, G. S. V. S. and Hermansky, H., Introducing temporal asymmetries in feature extraction for automatic speech recognition, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., affective characterization of movie scenes based on multimedia content analysis and user's physiological emotional responses, in: IEEE International Symposium on Multimedia, 2008.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., affective ranking of movie scenes using physiological signals and content analysis, in: 2nd ACM Workshop on the Many Faces of Multimedia Semantics, ACM MM08, 2008.
 
Soleymani, M., Kierkels, J., Chanel, G., Bruno, E., Marchand-Maillet, S. and T. Pun, , Estimating emotions and tracking interest during movie watching based on multimedia content and physiological responses, in: Joint (IM)2-Interactive Multimodal Information Management and Affective Sciences NCCRs meeting, 2008.
 
Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., Valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses, in: MLMI 2008, 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
 
Sorci, M., Antonini, G., Cerretani, B., Cruz Mota, J., Rubin, T., Bierlaire, M. and Thiran, J. -Ph., Modelling human perception of static facial expressions, in: Face and Gesture Recognition 2008, Amsterdam, 2008.
 
Szafranski, M., Grandvalet, Y. and Rakotomamonjy, A., Composite Kernel Learning, in: Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008), pages 1040-1047, Omnipress, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based features for far-field speech recognition, in: MLMI 2008, Utrecht, The Netherlands, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Hilbert envelope based spectro-temporal features for phoneme recognition in telephone speech, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Thomas, A., Ganapathy, S. and Hermansky, H., Spectro-temporal features for automatic speech recognition using linear prediction in spectral domain, in: 16th European Signal Processing Conference (EUSIPCO 2008), Lausanne, 2008.
 
Thomas, A., Ferrari, V., Leibe, B., Tuytelaars, T. and van Gool, L., Using recognition to guide a robot's attention, in: Robotics Science and Systems, 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., Cue Integration for Medical Image Annotation, in: Advances in Multilingual and Multimodal Information Retrieval: 8th Workshop of the Cross-Language Evaluation Forum, CLEF 2007, Budapest, Hungary, September 19-21, 2007, Revised Selected Papers, Springer-Verlag, 2008.
 
Torii, A., Havlena, M., Pajdla, T. and B. Leibe, , Measuring camera translation by the dominant apical angle, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
 
Tous, R., Carreras, A., Delgado, J., Cordara, G., Gianluca, F., Peig, E., Dufaux, F. and Galinski, G., An Architecture for TV Content Distributed Search and Retrieval Using the MPEG Query Format (MPQF), in: International Workshop on Ambient Media Delivery and Interactive Television (AMDIT 2008), Quebec City, Canada, 2008.
 
Tsourakis, N., Lisowska, A., Bouillon, P. and Rayner, M., From desktop to mobile: adapting a successful voice interaction platform for use in mobile devices, in: Third ACM MobileHCI Workshop on Speech in Mobile and Pervasive Environments (SiMPE), Amsterdam, the Netherlands., 2008.
 
Ullah, M. M., Pronobis, A., Caputo, B., Luo, J., Jensfelt, P. and Christensen, H. I., Towards Robust Place Recognition for Robot Localization, in: IEEE International Conference on Robotics ad Automation, 2008.
 
Valente, F. and Hermansky, H., Hierarchical and parallel processing of modulation spectrum for asr applications, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pages 4165-4168, 2008. [DOI]
 
Valente, F. and Hermansky, H., On the combination of auditory and modulation frequency channels for asr applications, in: Interspeech 2008, Brisbane, Australia, 2008.
 
Vergyri, D., Mandal, A., Wang, W., Stolcke, A., Zheng, J., Graciarena, M., Rybach, D., Gollan, C., Schlater, R., Kirchoff, K., Faria, A. and Morgan, N., Development of the sri/nightingale arabic asr system, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 1437-1440, 2008.
 
Vijayasenan, D., Valente, F. and Bourlard, H., Combination of agglomerative and sequential clustering for speaker diarization, in: International Conference on Acoustics, Speech and Signal Processing, 2008.
 
Vijayasenan, D., Valente, F. and Bourlard, H., Integration of TDOA Features in Information Bottleneck Framework for Fast Speaker Diarization, in: Interspeech 2008, 2008.
 
Vinciarelli, A., Pantic, M., Bourlard, H. and Pentland, A., Social signal processing: state-of-the-art and future perspectives of an emerging domain, in: Proceedings of the ACM International Conference on Multimedia, 2008.
 
Vinciarelli, A., Pantic, M., Bourlard, H. and Pentland, A., Social signals, their function, and automatic analysis: a survey, in: Proceedings of International Conference on Multimodal Interfaces (to appear), 2008.
 
Vinyals, O. and Friedland, G., A hardware-independent fast logarithm approximation with adjustable accuracy, in: 10th IEEE International Symposium on Multimedia, Berkeley, CA, USA, pages 61-65, 2008.
 
Vinyals, O. and Friedland, G., Modulation spectrogram features for speaker diarization, in: Interspeech 2008, Brisbane, Australia, pages 630-633, 2008.
 
Voloshynovskiy, S., Koval, O. and Pun, T., Multimodal authentication based on random projections and distributed coding, in: Proceedings of the 10th ACM Workshop on Multimedia & Security, 2008.
 
Voloshynovskiy, S., Koval, O., Beekhof, F. and Pun, T., Multimodal authentication based on random projections and distributed coding, in: MM&Sec 2008, 2008.
 
Weinshall, D., Hermansky, H., Zweig, A., Luo, J., Jimison, H., Ohl, F. and Pavel, M., Beyond Novelty Detection: Incongruent Events, when General and Specific Classifiers Disagree, in: Advances in Neural Information Processing Systems 21, 2008.
 
Weise, T., Leibe, B. and van Gool, L., Accurate and robust registration for in-hand modeling, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'08), 2008.
 
Yao, J. and Odobez, J. -M., Fast human detection from videos using covariance features, in: European Conference on Computer Vision, workshop on Visual Surveillance (ECCV-VS), Marseille, 2008.
 
Yao, J. and Odobez, J. -M., Multi-camera 3d person tracking with particle filter in a surveillance environment, in: 16th European Signal processing Conference (EUSIPCO), 2008.
 
Zeng, G. and van Gool, L., Multi-label image segmentation via point-wise repetition, in: International Conference on Computer Vision and Pattern Recognition (CVPR), 2008.
 
Zhao, S. Y. and Morgan, N., Multi-stream spectro-temporal features for robust speech recognition, in: 9th International Conference of the ISCA (Interspeech 2008), Brisbane, Australia, pages 898-901, 2008.
 
I. Bogdanova, , A. Bur, and Hügli, H., The spherical approach to omnidirectional visual attention, in: XVI European Signal Processing Conference (EUSIPCO 2008), 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., An SVM Confidence-Based Approach to Medical Image Annotation, in: Evaluating Systems for Multilingual and Multimodal Information Access -- 9th Workshop of the Cross-Language Evaluation Forum, 2008.
 
Popescu-Belis, A., Boertjes, E., Kilgour, J., Poller, P., Castronovo, S., Wilson, T., Jaimes, A. and Carletta, J., The AMIDA Automatic Content Linking Device: Just-in-Time Document Retrieval in Meetings, in: Machine Learning for Multimodal Interaction V, pages 272-283, Springer-Verlag, Utrecht, 2008. [DOI]
 

2007

Aloise, F., Caporusso, N., Mattia, D., Babiloni, F., Kauhanen, L., Millán, J. del R., Nuttin, M., Marciani, M. G. and Cincotti, F., Brain-machine interfaces through control of electroencephalographic signals and vibrotactile feedback, in: Proceedings of the 12th International Conference on Human-Computer Interaction, 2007.
 
Ansari-Asl, K., Chanel, G. and Pun, T., A channel selection method for eeg classification in emotion assessment based on synchronization likelihoo, in: Eusipco 2007, 15th Eur. Signal Proc. Conf., 2007.
 
Aradilla, G., Vepa, J. and Bourlard, H., An acoustic model based on kullback-leibler divergence for posterior features, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
 
Aradilla, G. and Ajmera, J., Detection and recognition of number sequences within spoken utterances, in: 2nd Workshop on Speech in Mobile and Pervasive Environments, 2007.
 
Aradilla, G. and Bourlard, H., Posterior-based features and distances in template matching for speech recognition, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 204-214, 2007. [DOI]
 
Ba, S. and Odobez, J. -M., Probabilistic head pose tracking evaluation in single and multiple camera setups, in: Classification of Events, Activities and Relationship Evaluation and Workshop, 2007.
 
Bengio, S. and Mariéthoz, J., Biometric person authentication is a multiple classifier problem, in: 7th International Workshop on Multiple Classifier Systems, MCS, 2007.
 
Bertini, E., Hertzog, P. and Lalanne, D., Spiralview: a visual tool to improve monitoring and understanding of security data in corporate, in: IEEE Symposium on Visual Analytics Science and Technology 2007 (VAST'07), pages to appear, 2007.
 
Bertolami, R. and Bunke, H., Multiple classifier methods for offline handwritten text line recognition, in: Multiple Classifier Systems, pages 72-81, Springer, 2007.
 
Bertolami, R., Uchida, S., Zimmermann, M. and Bunke, H., Non-uniform slant correction for handwritten text line recognition, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 18-22, 2007.
 
Bologna, G., Deville, B., Pun, T. and Vinckenbosch, M., Identifying major components of pictures by audio encoding of colors, in: IWINAC2007, 2nd. Int. Work-conf. on the Interplay between Natural and Artificial Computation, 2007.
 
Bouillon, P., Flores, G., Starlander, M., Chatzichrisafis, N., Santaholma, M., Tsourakis, N., Rayner, M. and Hockey, B. A., A bidirectional grammar-based medical speech translator, in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 41-48, ACL 2007, Prague, Czech Republic, 2007.
 
Bouillon, P., Chatzichrisafis, N., Halimi, S., Hockey, B. A., Isahara, H., Kanzaki, K., Nakao, Y., Novellas Vall, B., Rayner, M., Santaholma, M. and Starlander, M., Medslt: a multi-lingual grammar-based medical speech translator, in: Proceedings of First International Workshop on Intercultural Collaboration, IWIC2007, Kyoto, Japan, 2007.
 
Broschart, M., de Negueruela, C., Millán, J. del R. and Menon, C., Augmenting astronaut's capabilities through brain-machine interfaces, in: Proceedings of the 20th International Joint Conference on Artificial Intelligence, Workshop on Artificial Intelligence for Space Applications, 2007.
 
Bruno, E., Kludas, J. and Marchand-Maillet, S., Combining multimodal preferences for multimedia information retrieval, in: ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2007.
 
Bruno, E., Kludas, J. and Marchand-Maillet, S., Combining multimodal preferences for multimedia information retrieval, in: Proc. of International Workshop on Multimedia Information Retrieval, 2007.
 
Bunke, H., Dickinson, P., Humm, A., Irniger, C. and Kraetzl, M., Graph sequence visualisation and its application to computer network monitoring and abnormal event detection, in: Applied Graph Theory in Computer Vision and Pattern Recognition, pages 227-245, Springer, 2007.
 
Chanel, G., Ansari-Asl, K. and Pun, T., Valence-arousal evaluation using physiological signals in an emotion recall paradigm, in: 2007 IEEE SMC, Int. Conf. on Systems, Man and Cybernetics, Smart cooperative systems and cybernetics: advancing knowledge and security for humanity, 2007.
 
Chavarriaga, R., Ferrez, P. W. and Millán, J. del R., To Err Is Human: Learning from Error Potentials in Brain-Computer Interfaces, in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
 
Chavarriaga, R., Ferrez, P. W. and del R. Millán, J., To err is human: learning from error potentials in brain-computer interfaces, in: 1st International Conference on Cognitive Neurodynamics (ICCN 2007), 2007.
 
Cuendet, S., Shriberg, E., Favre, B., Fung, J. and Hakkani-Tur, D., An analysis of sentence segmentation features for broadcast news, broadcast conversations, and meetings, in: SIGIR Workshop on Searching Conversational Spontaneous Speech, 2007.
 
Drugman, T., Gurban, M. and Thiran, J. -Ph., Relevant Feature Selection for Audio-Visual Speech Recognition, in: 9th International Workshop on Multimedia Signal Processing (MMSP), Chania, Crete, Greece, 2007.
 
Drygajlo, A., Multimodal biometrics for identity documents and smart cards european challenge, in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
 
Einsele, F., Hennebert, J. and Ingold, R., Towards identification of very low resolution, anti-aliased characters, in: IEEE International Symposium on Signal Processing and its Applications (ISSPA'07), Sharjah, United Arab Emirates, 2007.
 
Ess, A., Leibe, B. and van Gool, L., Depth and appearance for mobile scene analysis, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Ess, A., Neubeck, A. and van Gool, L., Generalised linear pose estimation, in: BMVC, 2007.
 
Frapolli, F., Hirsbrunner, B. and Lalanne, D., Dynamic rules: towards interactive games intelligence, in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 29-32, 2007.
 
Galán, F., Nuttin, M., Lew, E., Ferrez, P. W., Vanacker, G., Philips, J., van Brussel, H. and Millán, J. del R., An asynchronous and non-invasive brain-actuated wheelchair, in: Proceedings of the 13th International Symposium on Robotics Research, 2007.
 
Galán, F., Palix, J., Chavarriaga, R., Ferrez, P. W., Lew, E., Hauert, C. -A. and Millán, J. del R., Visuo-spatial attention frame recognition for brain-computer interfaces, in: Proceedings of the 1st International Conference on Cognitive Neurodynamics, 2007.
 
Georgescul, M., Clark, A. and Armstrong, S., Exploiting structural meeting-specific features for topic segmentation, in: Actes de la 14ème Conférence sur le Traitement Automatique des Langues Naturelles, Toulouse, France, 2007.
 
Gerber, M., Kaufmann, T. and Pfister, B., Perceptron-based class verification, in: Proceedings of NOLISP (ISCA Workshop on non linear speech processing), 2007.
 
Gerber, M., Beutler, R. and Pfister, B., Quasi text-independent speaker verification based on pattern matching, in: Proceedings of Interspeech, ISCA, 2007.
 
Germann, M., Breitenstein, M. D., Park, I. K. and Pfister, H., Automatic pose estimation for range images on the gpu, in: Sixth International Conference on 3-D Digital Imaging and Modeling (3DIM 2007), pages 81-90, IEEE Computer Society, 2007.
 
Grangier, D. and Bengio, S., Learning the inter-frame distance for discriminative template-based keyword detection, in: International Conference on Speech Communication and Technology (INTERSPEECH), 2007.
 
Graves, A., Liwicki, M. and Bunke, H., Unconstrained on-line handwriting recognition with recurrent neural networks, in: Advances in Neural Information Processing, 2007.
 
Gurban, M., Valles, A. and Thiran, J. -Ph., Low-Dimensional Motion Features for Audio-Visual Speech Recognition, in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, Poznan, Poland, 2007.
 
Hennebert, J., Loeffel, R., Humm, A. and Ingold, R., A new forgery scenario based on regaining dynamics of signature, in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
 
Hennebert, J., Humm, A. and Ingold, R., Modelling spoken signatures with gaussian mixture model adaptation, in: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 07), 2007.
 
Hennebert, J., Please repeat: my voice is my password. from the basics to real-life implementations of speaker verification technologies, in: Invited lecture at the Information Security Summit (IS2 2007), Prague, 2007.
 
Heusch, G. and Marcel, S., Face authentication with salient local features and static bayesian network, in: IEEE / IAPR Intl. Conf. On Biometrics (ICB), 2007.
 
Hoffmann, U., Vesin, J. M. and Ebrahimi, T., Recent advances in brain-computer interfaces, in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Modelling combined handwriting and speech modalities, in: Accepted for publication, International Conference on Biometrics (ICB 2007), Seoul Korea, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Spoken handwriting verification using statistical models, in: Accepted for publication, International Conference on Document Analysis and Recognition (ICDAR 07), Curitiba Brazil, 2007.
 
Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in a group meeting, 2007.
 
Hung, H., Jayagopi, D., Yeo, C., Friedland, G., Ba, S., Odobez, J. -M., Ramchandran, K., Mirghafori, N. and Gatica-Perez, D., Using audio and video features to classify the most dominant person in a group meeting multi-layer background subtraction based on color and texture, in: Proc. ACM Multi Media, Augsburg, Germany, 2007.
 
Hérault, R. and Grandvalet, Y., Sparse probabilistic classifiers, in: International Conference on Machine Learning (ICML), 2007.
 
Jaeggli, T., Koller-Meier, E. and van Gool, L., Learning generative models for monocular body pose estimation, in: ACCV, 2007.
 
Jaeggli, T., Koller-Meier, E. and van Gool, L., Multi-activity tracking in lle body pose space, in: 2nd Workshop on HUMAN MOTION Understanding, Modeling, Capture and Animation, ICCV, 2007.
 
Kaufmann, T. and Pfister, B., An HPSG parser supporting discontinuous licenser rules, in: International Conference on HPSG, 2007.
 
Kaufmann, T. and Pfister, B., Applying licenser rules to a grammar with continuous constituents, in: The Proceedings of the 14th International Conference on Head-Driven Phrase Structure Grammar, 2007.
 
Kittler, J., Poh, N., Fatukasi, O., Messer, K., Kryszczuk, K., Richiardi, J. and Drygajlo, A., Quality dependent fusion of intramodal and multimodal biometric experts, in: Proc. SPIE Defense and Security Symposium, 2007.
 
Kludas, J., Bruno, E. and Marchand-Maillet, S., Information fusion in multimedia information retrieval, in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
 
Kokiopoulou, E. and Frossard, P., Dimensionality Reduction with Adaptive Approximation, in: IEEE Int. Conf. on Multimedia & Expo (ICME), Beijing, China, 2007.
 
Kokiopoulou, E. and Frossard, P., Image alignment with rotation manifolds built on sparse geometric expansions, in: IEEE International Workshop on Multimedia Signal Processing, Chania, Crete, Greece, 2007.
 
Koval, O., Voloshynovskiy, S. and Pun, T., Analysis of multimodal binary detection systems based on dependent/independent modalities, in: Proceedings of the IEEE 2007 International Workshop on Multimedia Signal Processing, 2007.
 
Koval, O., Voloshynovskiy, S. and Pun, T., Error exponent analysis of person identification based on fusion of dependent/independent modalities, in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
 
Kron, E., Rayner, M., Santaholma, M. and Bouillon, P., A development environment for building grammar-based speech-enabled applications, in: Proceedings of workshop on Grammar-based approaches to spoken language processing, pages 49-52, ACL 2007, Prague, Czech Republic, 2007.
 
Kryszczuk, K. and Drygajlo, A., Improving classification with class-independent quality measures: q-stack in face verification, in: Proc. 2nd Int. Conference in Biometrics (ICB 2007), 2007.
 
Kryszczuk, K. and Drygajlo, A., Q-stack: uni- and multimodal classifier stacking with quality measures, in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
 
Kryszczuk, K., Richiardi, J. and Drygajlo, A., Reliability estimation for multimodal error prediction and fusion, in: Proc. 7th Int. Workshop on Pattern Recognition in Information Systems (PRIS 2007), 2007.
 
Kumatani, K., Mayer, H., Gehrig, T., Stoimenov, E., McDonough, J. and Wölfel, M., Adaptive beamforming with a minimum mutual information criterion, pages 2527--2541, 2007. [DOI]
 
Kumatani, K., Mayer, H., Gehrig, T., Stoimenov, E., McDonough, J. and Wölfel, M., Minimum mutual information beamforming for simultaneous active speakers, in: IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), pages 71-76, Kyoto, 2007. [DOI]
 
Lalanne, D., Evéquoz, F., Rigamonti, M., Dumas, B. and Ingold, R., An ego-centric and tangible approach to meeting indexing and browsing, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI'07), pages to appear, 2007.
 
Lalanne, D., Evéquoz, F., Chiquet, H., Müller, M., Radgohar, M. and Ingold, R., Going through digital versus physical augmented gaming, in: Tangible Play: Research and Design for Tangible and Tabletop Games. Workshop at the 2007 Intelligent User Interfaces Conference (IUI'07), pages 41-44, 2007.
 
Leibe, B., Schindler, K. and van Gool, L., Coupled detection and trajectory estimation for multi-object tracking, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Leibe, B., Cornelis, N., Cornelis, K. and van Gool, L., Dynamic 3d scene analysis from a moving vehicle, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
 
Levit, M., Hakkani-Tur, D., Tur, G. and Gillick, D., Integrating several annotation layers for statistical information distillation, in: Workshop on Automatic Speech Recognition and Understanding, 2007.
 
Li, W. and Bourlard, H., Non-linear spectral stretching for in-car speech recognition, in: Interspeech, 2007.
 
Lisowska, A., Betrancourt, M., Armstrong, S. and Rajman, M., Minimizing modality bias when exploring input preference for multimodal systems in new domains: the archivus case study, in: CHI' 07, San José, California, 2007.
 
Lisowska, A., Armstrong, S., Melichar, M., Ailomaa, M. and Rajman, M., The wizard of oz meets multimodal language-enabled gui interfaces: new challenges, in: Proceedings of CHI' 07, San José, California, 2007.
 
Liwicki, M., Graves, A., Bunke, H. and Schmidhuber, J., A novel approach to on-line handwriting recognition based on bidirectional long short-term memory networks, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 367-371, 2007.
 
Liwicki, M., Schlapbach, A., Loretan, P. and Bunke, H., Automatic detection of gender and handedness from on-line handwriting, in: Proc. 13th Conf. of the Graphonomics Society, pages 179-183, 2007.
 
Liwicki, M. and Bunke, H., Combining on-line and off-line systems for handwriting recognition, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 372-376, 2007.
 
Liwicki, M. and Bunke, H., Feature selection for on-line handwriting recognition of whiteboard notes, in: Proc. 13th Conf. of the Graphonomics Society, pages 101-105, 2007.
 
Liwicki, M., Indermühle, E. and Bunke, H., On-line handwritten text line detection using dynamic programming, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 447-451, 2007.
 
Lovitt, A., Pinto, J. P. and Hermansky, H., On confusions in a phoneme recognizer, 2007.
 
Lovitt, A., Truncation confusion patterns in onset consonants, in: Interspeech 2007, 2007.
 
Lüthy, F., Varga, T. and Bunke, H., Using hidden Markov models as a tool for handwritten text line segmentation, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 8-12, 2007.
 
Meynet, J. and Thiran, J. -Ph., Information Theoretic Combination of Classifiers with Application to AdaBoost, in: 7th international Workshop on Multiple Classifier Systems (MCS), Prague, Prague, 2007.
 
Millán, J. del R., Ferrez, P. W., Galán, F., Lew, E. and Chavarriaga, R., Non-invasive brain-actuated interaction, in: Proceedings of the 2nd International Symposium on Brain, Vision and Artificial Intelligence, 2007. [DOI]
 
Monay, F., Learning the structure of image collections with latent aspect models, in: ., 2007.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Automatic image annotation with relevance feedback and latent semantic analysis, in: Workshop on Adaptive Multimedia Retrieval (AMR 2007), 2007.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Hierarchical long-term learning for automatic image, in: International Conference on Semantics And digital Media Technologies (SAMT 2007), 2007.
 
Morrison, D., Marchand-Maillet, S. and Bruno, E., Hierarchical long-term learning for automatic image annotation, in: Proceedings 2nd International Conference on Semantic and Digital Media Technologies, 2007.
 
Motlicek, P., Hermansky, H., Ganapathy, S. and Garudadri, H., Frequency domain linear prediction for qmf sub-bands and applications to audio coding, in: 4th Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI), pages 248-258, 2007.
 
Motlicek, P., Hermansky, H., Ganapathy, S., Garudadri, H. and Srinivasamurthy, N., Non-uniform speech/audio coding exploiting predictability of temporal evolution of spectral envelopes, in: Tenth International Conference on TEXT, SPEECH and DIALOGUE (TSD), pages 350-357, 2007.
 
Müller, P., Zeng, G., Wonka, P. and van Gool, L., Image-based procedural modeling of facades, in: Proceedings of ACM SIGGRAPH 2007 / ACM Transactions on Graphics, ACM Press, 2007.
 
Neuhaus, M. and Bunke, H., A quadratic programming approach to the graph edit distance problem, in: Graph-Based Representations in Pattern Recognition, pages 92-102, Springer, 2007.
 
Noris, B., Benmachiche, K., Meynet, J., Thiran, J. -Ph. and Billard, A., Analysis of Head Mounted Wireless Camera Videos for Early Diagnosis of Autism, in: International Conference on Recognition Systems, 2007.
 
Odobez, J. -M. and Ba, S., A cognitive and unsupervised map adaptation approach to the recognition of the focus of attention from head pose, in: International Conference on Multi-Media & Expo (ICME07), 2007.
 
Orabona, F., Castellini, C., Caputo, B., Luo, J. and Sandini, G., Indoor place recognition using online independent support vector machines, in: 18th British Machine Vision Conference (BMVC07), pages 1090-1099, Warwick, UK, 2007.
 
Ozden, K. E., Schindler, K. and van Gool, L., Simultaneous segmentation and 3d reconstruction of monocular image sequences, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Pallotta, V., Seretan, V. and Ailomaa, M., User requirement analysis for meeting information retrieval based on query elicitation, in: Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics (ACL 2007), pages 1008-1015, Association for Computational Linguistics, 2007.
 
Paugam-Moisy, H., Martinez, R. and Bengio, S., A supervised learning approach based on stdp and polychronization in spiking neuron networks, in: European Symposium on Artificial Neural Networks, ESANN, 2007.
 
Perrin, X., Chavarriaga, R., Siegwart, R. and del R. Millán, J., Bayesian controller for a novel semi-autonomous navigation concept, in: 3rd European Conference on Mobile Robots (ECMR 2007), 2007.
 
Philips, J., Millán, J. del R., Vanacker, G., Lew, E., Galán, F., Ferrez, P. W., van Brussel, H. and Nuttin, M., Adaptive shared control of a brain-actuated simulated wheelchair, in: Proceedings of the 10th IEEE International Conference on Rehabilitation Robotics, pages 408-414, 2007. [DOI]
 
Piccardi, L., Noris, B., Barbey, O., Schiavone, G., Keller, F., Von Hofsten, C. and Billard, A., Wearcam: a head mounted wireless camera for monitoring gaze attention and for the diagnosis of developmental disorders in young children, in: 16th IEEE International Symposium on Robot & Human Interactive Communication, RO-MAN, 2007.
 
Pinto, J. P., Lovitt, A. and Hermansky, H., Exploiting phoneme similarities in hybrid hmm-ann keyword spotting, in: Proceedings of Interspeech, 2007.
 
Pinto, J. P., R. M., P., Yegnanarayana, B. and Hermansky, H., Significance of contextual information in phoneme recognition, 2007.
 
Popescu-Belis, A. and Zufferey, S., Contrasting the automatic identification of two discourse markers in multiparty dialogues, in: Proceedings of SIGDIAL 2007, pages 10, Antwerp, Belgium, 2007.
 
Popescu-Belis, A., Evaluation of nlg: some analogies and differences with mt and reference resolution, in: MT Summit XI Workshop on Using Corpora for NLG and MT (UCNLG MT), pages 66-68, 2007.
 
Popescu-Belis, A. and Estrella, P., Generating usable formats for metadata and annotations in a large meeting corpus, in: ACL 2007, pages 93-96, ACL 2007, Prague, Czech Republic, 2007.
 
Quack, T., Ferrari, V., Leibe, B. and van Gool, L., Efficient mining of frequent and distinctive feature configurations, in: accepted for ICCV'07, 2007.
 
Quack, T., Ferrari, V., Leibe, B. and van Gool, L., Efficient mining of frequent and distinctive feature configurations, in: International Conference on Computer Vision (ICCV'07), 2007.
 
Rakotomamonjy, A., Bach, F., Canu, S. and Grandvalet, Y., More efficiency in multiple kernel learning, in: International Conference on Machine Learning (ICML), 2007.
 
Renals, S., Hain, T. and Bourlard, H., Recognition and understanding of meetings the ami and amida projects, in: Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU'07, pages 238-247, Kyoto, 2007. [DOI]
 
Richiardi, J., Kryszczuk, K. and Drygajlo, A., Quality measures in unimodal and multimodal biometric verification, in: Proc. 15th European Signal Processing Conf. (EUSIPCO), 2007.
 
Richiardi, J. and Drygajlo, A., Reliability-based voting schemes using modality-independent features in multi-classifier biometric authentication, in: Proc. 7th Int. Workshop on Multiple Classifier Systems, Springer, 2007.
 
Riesen, K., Neuhaus, M. and Bunke, H., Bipartite graph matching for computing the edit distance of graphs, in: Graph-Based Representations in Pattern Recognition, pages 1-12, Springer, 2007.
 
Riesen, K., Neuhaus, M. and Bunke, H., Graph embedding in vector spaces by means of prototype selection, in: Graph-Based Representations in Pattern Recognition, pages 383-393, Springer, 2007.
 
Rigamonti, M., Lalanne, D. and Ingold, R., Faericworld: browsing multimedia events through static documents and links, in: In proc. of INTERACT 2007, pages to appear, Springer-Verlag, 2007.
 
Rytsar, R. and Pun, T., Computational aspects of the eeg forward problem solution for real head model using finite element, in: 29th Annual Int. Conf. IEEE Engineering in Medicine and Biology Society, 2007.
 
Schlapbach, A. and Bunke, H., Fusing asynchronous feature streams for on-line writer identification, in: Proc. 9th Int. Conf. on Document Analysis and Recognition, pages 103-107, 2007.
 
Sorci, M., Antonini, G. and Thiran, J. -Ph., Fisher's Discriminant and Relevant Component Analysis for static facial expression classification, in: 15th European Signal Processing Conference (EUSIPCO), Poznan, Poland, Poznan, Poland, 2007.
 
Starlander, M., Using a wizard of oz as a baseline to determine which system architecture is the best for a spoken language translation system, in: Proceedings of Nodalida 2007, pages 161-164, Tartu, Estonia, 2007.
 
Szekely, E., Bruno, E. and Marchand-Maillet, S., Clustered multidimensional scaling for exploration in information retrieval, in: International Conference on the Theory of Information Retrieval, 2007.
 
Thomas, A., Ferrari, V., Leibe, B., Tuytelaars, T. and van Gool, L., Depth-from-recognition: inferring metadata by cognitive feedback, in: ICCV'07 Workshop on 3D Representations for Recognition, 2007.
 
Valente, F. and Hermansky, H., Combination of acoustic classifiers based on dempster-shafer theory of evidence, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2007.
 
Valente, F., Vepa, J., Plahl, C., Gollan, C., Hermansky, H. and Schlüter, R., Hierarchical neural networks feature extraction for lvcsr system, in: Interspeech 2007, 2007.
 
Valente, F., Vepa, J. and Hermansky, H., Multi-stream features combination based on dempster-shafer rule for lvcsr system, in: Interspeech 2007, 2007.
 
Villán, R., Voloshynovskiy, S., Koval, O., Deguillaume, F. and Pun, T., Tamper-proofing of Electronic and Printed Text Documents via Robust Hashing and Data-Hiding, in: Proceedings of SPIE-IS&T Electronic Imaging 2007, Security, Steganography, and Watermarking of Multimedia Contents IX, 2007.
 
Vinciarelli, A. and Favre, S., Broadcast news story segmentation using social network analysis and hidden markov models, in: ACM International Conference on Multimedia, pages 261-264, 2007.
 
Vinciarelli, A., Fernàndez, F. and Favre, S., Semantic segmentation of radio programs using social network analysis and duration distribution modeling, in: IEEE International Conference on Multimedia and Expo (ICME), 2007.
 
Weise, T., Leibe, B. and van Gool, L., Fast 3d scanning with automatic motion compensation, in: IEEE Conference on Computer Vision and Pattern Recognition (CVPR'07), 2007.
 
Yao, J. and Odobez, J. -M., Multi-layer background subtraction based on color and texture, in: CVPR 2007 Workshop on Visual Surveillance (VS2007), pages 1-8, 2007. [DOI]
 
van Gool, L., Zeng, G., van den Borre, F. and Müller, P., Towards mass-produced building models, in: Photogrammetric Image Analysis, pages 209-220, Institute of Photogrammetry and Cartography, Technische Universitaet Muenchen, 2007.
 

2006

Ba, S. and Odobez, J. -M., A study on visual focus of attention recognition from head pose in a meeting room, in: 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI06), 2006.
 
Barber, D. and Chiappa, S., Unified inference for variational bayesian linear gaussian state-space models, in: NIPS, 2006.
 
Bertolami, R., Halter, B. and Bunke, H., Combination of multiple handwritten text line recognition systems with a recursive approach, in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 61-65, 2006.
 
Cattin, P. C., Bay, H., van Gool, L. and Székely, G., Retina mosaicing using local features, in: Medical Image Computing and Computer-Assisted Intervention (MICCAI), pages 185-192, 2006.
 
Chanel, G., Kronegg, J., Grandjean, D. and Pun, T., Emotion assessment: arousal evaluation using eeg's and peripheral physiological signals, in: Proc. Int. Workshop Multimedia Content Representation, Classification and Security (MRCS), pages 530-537, Lecture Notes in Computer Science, Springer, 2006.
 
Chiquet, H., Evéquoz, F. and Lalanne, D., Elcano, a tangible multimedia browser (demo)., in: Symposium on User Interface Software and Technology (UIST 2006), pages 51-52, 2006.
 
Hannani, A., Toledano, D., Petrovska, D., Montero-Asenjo, A. and Hennebert, J., Using data-driven and phonetic units for speaker verification, in: IEEE Speaker and Language Recognition Workshop (Odyssey 2006), Puerto Rico, 2006.
 
Kosinov, S., Marchand-Maillet, S., Kozintsev, I., Dulong, C. and Pun, T., Dual diffusion model of spreading activation for content-based image retrieval, in: 8th ACM SIGMM - International Workshop on Multimedia Information Retrieval, 2006.
 
Koval, O., Voloshynovskiy, S., Holotyak, T. and Pun, T., Information-theoretic analysis of steganalysis in real images, in: ACM Multimedia and Security Workshop 2006, 2006.
 
Leibe, B., Mikolajczyk, K. and Schiele, B., Efficient clustering and matching for object class recognition, in: British Machine Vision Conference (BMVC, 2006.
 
Leibe, B., Cornelis, N., Cornelis, K. and van Gool, L., Integrating recognition and reconstruction for cognitive traffic scene analysis from a moving vehicle, in: DAGM Annual Pattern Recognition Symposium, pages 192-201, Springer, 2006.
 
Leibe, B., Mikolajczyk, K. and Schiele, B., Segmentation based multi-cue integration for object detection, in: British Machine Vision Conference (BMVC, 2006.
 
Liwicki, M. and Bunke, H., HMM-based on-line recognition of handwritten whiteboard notes, in: Proceedings 10th International Workshop Frontiers in Handwriting Recognition, pages 595-599, 2006.
 
Melichar, M., Cenek, P., Ailomaa, M., Lisowska, A. and Rajman, M., From Vocal to Multimodal Dialogue Management, in: Eighth International Conference on Multimodal Interfaces (ICMI'06), Banff, Canada, 2006.
 
Müller, P., Wonka, P., Haegler, S., Ulmer, A. and van Gool, L., Procedural modeling of buildings, in: Proceedings of ACM SIGGRAPH 2006 / ACM Transactions on Graphics, pages 614-623, ACM Press, 2006.
 
Müller, M., Evéquoz, F. and Lalanne, D., Tjass, a smart board for augmenting card game playing and learning (demo), in: Symposium on User Interface Software and Technology (UIST 2006), pages 67-68, 2006.
 
Poh, N. and Bengio, S., Using chimeric users to construct fusion classifiers in biometric authentication tasks: an investigation, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
 
Quelhas, P. and Odobez, J. -M., Natural scene image modeling using color and texture visterms., in: Conference on Image and Video Retrieval CIVR, 2006.
 
Radgohar, M., Evéquoz, F. and Lalanne, D., Phong, augmenting virtual and real gaming experience (demo), in: Symposium on User Interface Software and Technology (UIST 2006), pages 71-72, 2006.
 
Rienks, R., Zhang, D., Gatica-Perez, D. and Post, W., Detection and application of influence rankings in small group meetings, in: ICMI '06: Proceedings of the 8th international conference on Multimodal interfaces, pages 257-264, ACM Press, Banff, Alberta, Canada, 2006. [DOI]
 
Schlapbach, A. and Bunke, H., Off-line writer verification: a comparison of a hidden Markov model (HMM) and a Gaussian mixture model (GMM) based system, in: Proc. 10th Int. Workshop Frontiers in Handwriting Recognition, pages 275-280, 2006.
 
Smith, K., Schreiber, S., Beran, V., Potúcek, I., Rigoll, G. and Gatica-Perez, D., Multi-person tracking in meetings: a comparative study, in: Multimodal Interaction and Related Machine Learning Algorithms (MLMI), 2006.
 
Spindler, T., Wartmann, C., Roth, D., Steffen, A., Hovestadt, L. and van Gool, L., Privacy in video surveilled areas, in: International Conference on Privacy, Security and Trust (PST 2006), 2006.
 
Vila-Forcén, J. E., Voloshynovskiy, S., Koval, O. and Pun, T., Costa problem under channel ambiguity, in: Proceedings of 2006 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2006.
 
Voloshynovskiy, S., Koval, O., Topak, E., Forcen, J. E. V. and Pun, T., On reversibility of random binning based data-hiding techniques: security perspectives, in: ACM Multimedia and Security Workshop 2006, 2006.
 
Wey, P., Fischer, B., Bay, H. and Buhmann, J. M., Dense stereo by triangular meshing and cross validation, in: DAGM-Symposium, pages 708-717, 2006.
 
Zhang, D., Gatica-Perez, D. and Bengio, S., Exploring contextual information in a layered framework for group action recognition, in: In the Eighth International Conference on Multimodal Interfaces (ICMI'06), 2006.
 

Unknown year

Gatica-Perez, D., Modeling interest in face-to-face conversations from multimodal nonverbal behavior, in: In J.-P. Thiran, H. Bourlard, and F. Marques, (Eds.), Multimodal Signal Processing, Academic Press, in press, 0000.
 
Gatica-Perez, D. and Odobez, J. -M., Visual attention, speaking activity, and group conversational analysis in multi-sensor environments, in: H. Nakashima, J. Augusto, H. Aghajan (Eds.), Handbook of Ambient Intelligence and Smart Environments, Springer, in press, 0000.
 
Goldmann, L., Samour, A., Ebrahimi, T. and Sikora, T., Multimodal person search combining information fusion and relevance feedback, in: IEEE International Workshop on Multimedia Signal Processing (MMSP 2009), Rio de Janeiro, Brazil, 0000.
 
Lee, J. -S., De Simone, F. and Ebrahimi, T., Influence of audio-visual attention on perceived quality of standard definition multimedia content, in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
 
Lee, J. -S. and Ebrahimi, T., Two-level bimodal association for audio-visual speech recognition, in: International Conference on Advanced Concepts for Intelligent Vision Systems (ACIVSâ09), Bordeaux, France, 0000.
 
Noris, B., Benmachiche, K. and Billard, A., Calibration-free eye gaze direction detection with gaussian processes, in: International Conference on Computer Vision Theory and Applications (VISAPP 08), 0000.
 
De Simone, F., Naccari, M., Tagliasacchi, M., Dufaux, F., Tubaro, S. and Ebrahimi, T., Subjective assessment of H.264/AVC video sequences transmitted over a noisy channel, in: First International Workshop on Quality of Multimedia Experience (QoMEX 2009), San Diego, CA, U.S.A., 0000.
 

2008

Kryszczuk, K. and Drygajlo, A., What do quality measures predict in biometrics, pages -,-29, 2008.
 
Kryszczuk, K. and Drygajlo, A., Impact of feature correlations on separation between bivariate normal distributions, 2008.
 

2009

Richiardi, J., Drygajlo, A. and Kryszczuk, K., Static models of derivative-coordinates phase spaces for multivariate time series classification: an application to signature verification, pages 140-149, 2009.
 
Zhu, K., Drygajlo, A. and Li, W., Q-stack aging model for face verification, 2009.
 

2008

Ketabdar, H. and Bourlard, H., In-context phone posteriors as complementary features for tandem asr, in: ICSLP'08, Brisbane, Australia,, 2008.
 
Millán, J. del R., Brain-controlled robots, in: IEEE International Conference on Robotics and Automation (ICRA 2008), Pasadena, CA, USA,, 2008. [DOI]
 

2009

Kryszczuk, K. and Drygajlo, A., Improving biometric verification with class-independent quality information, pages 310-321, 2009.
 

2008

Schouten, B., Juul, N., Drygajlo, A. and Tistarelli, M., Biometrics and identity management, Springer, 2008.
 
Kryszczuk, K. and Drygajlo, A., On quality of quality measures for classification, pages 19-28, Springer, 2008.
 
Richiardi, J., Drygajlo, A. and Todesco, L., Promoting diversity in gaussian mixture ensembles: an application to signature verification, pages 140-149, Springer, 2008.
 

Publications of type: Misc

Soleymani, M., Chanel, G., Kierkels, J. and Pun, T., valence-arousal representation of movie scenes based on multimedia content analysis and user's physiological emotional responses, 5th Joint Workshop on Machine Learning and Multimodal Interaction, 2008.
 

Publications of type: Phdthesis

2009

Scaringella, N., On the design of audio features robust to the album-effect for music information retrieval., Ecole Polytechnique Fédérale de Lausanne, 2009.
 

2008

Aradilla, G., Acoustic models for posterior features in speech recognition, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Galán, F., Methods for Asynchronous and Non-Invasive EEG-Based Brain-Computer Interfaces. Towards Intelligent Brain-Actuated Wheelchairs, University of Barcelona, 2008.
 
Grangier, D., Machine Learning for Information Retrieval, École Polytechnique Fédérale de Lausanne, 2008.
 
Humm, A., Modelling combined handwriting and speech modalities for user authentication, University of Fribourg, Switzerland, 2008.
 
Ketabdar, H., Enhancing posterior based speech recognition systems, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Mesot, B., Inference in switching linear dynamical systems applied to noise robust speech recognition of isolated digits, Ecole Polytechnique Fédérale de Lausanne, 2008.
 
Mesot, B., Switching linear dynamical systems for noise robust speech recognition of isolated degits, STI School of Engineering, EPFL, 2008.
 
Paiement, J. -F., Probabilistic models for music, École Polytechnique Fédérale de Lausanne, 2008.
 
Rigamonti, M., A framework for structuring multimedia archives and for browsing efficiently through multimodal links, University of Fribourg, Switzerland, 2008.
 
Rigamonti, M., A framework for structuring multimedia archives and for browsing efficiently through multimodal links, University of Fribourg, Switzerland, 2008.
 

2007

Ba, S., Joint head tracking and pose estimation for visual focus of attention recognition, École Polytechnique Fédérale de Lausanne, 2007.
 
Ferrez, P. W., Error-related eeg potentials in brain-computer interfaces, École Polytechnique Fédérale de Lausanne, 2007.
 
Smith, K., Bayesian methods for visual multi-object tracking with applications to human activity recognition, École Polytechnique Fédérale de Lausanne, 2007.
 

2006

Chiappa, S., Analysis and classification of eeg signals using probabilistic models for brain computer interfaces, École Polytechnique Fédérale de Lausanne, 2006.
 
Dimitrakakis, C., Ensembles for sequence learning, École Polytechnique Fédérale de Lausanne, 2006.
 
Just, A., Two-handed gestures for human-computer interaction, École Polytechnique Fédérale de Lausanne, 2006.
 
Keller, M., Machine learning approaches to text representation using unlabeled data, Ecole Polytechnique Fédérale de Lausanne, 2006.
 
Lathoud, G., Spatio-temporal analysis of spontaneous speech with microphone arrays, École Polytechnique Fédérale de Lausanne, 2006.
 
Poh, N., Multi-system biometric authentication: optimal fusion and user-specific information, École Polytechnique Fédérale de Lausanne, 2006.
 
Pozdnoukhov, A., Prior knowledge in kernel methods, École Polytechnique Fédérale de Lausanne, 2006.
 
Rodriguez, Y., Face detection and verification using local binary patterns, École Polytechnique Fédérale de Lausanne, 2006.
 
Zhang, D., Probabilistic graphical models for human interaction analysis, École Polytechnique Fédérale de Lausanne, 2006.
 

Publications of type: Proceedings

2008

Grandjean, D. and Pun, T., Multimodality in emotions and for their assessment, 2008.
 

Publications of type: Techreport

2009

Berclaz, J., Fleuret, F. and Fua, P., Multiple object tracking using flow linear programming, number 10-2009, 2009.
 
Garg, N., Co-occurrence Models for Image Annotation and Retrieval, number Idiap-RR-22-2009, 2009.
 
Garg, N. and Gatica-Perez, D., Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, number Idiap-RR-21-2009, 2009.
 
Garner, P. N., A MAP Approach to Noise Compensation of Speech, number Idiap-RR-08-2009, 2009.
 
Heusch, G. and Marcel, S., Bayesian Networks to Combine Intensity and Color Information in Face Recognition, number Idiap-RR-27-2009, 2009.
 
Hung, H. and Ba, S., Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, number Idiap-RR-20-2009, 2009.
 
Imseng, D., Novel initialization methods for Speaker Diarization, number Idiap-RR-07-2009, 2009.
 
Magimai-Doss, M., Aradilla, G. and Bourlard, H., On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, number Idiap-RR-24-2009, 2009.
 
Negoescu, R. -A., Gatica-Perez, D., Adams, B., Phung, D. and Venkatesh, S., Flickr Hypergroups, number Idiap-Internal-RR-73-2009, 2009.
 
Perrin, X., Chavarriaga, R., Pradalier, C., Millán, J. del R. and Siegwart, R., Dialog Management Technique for Brain-Computer Interfaces, 2009.
 
Perrin, X., Colas, F., Pradalier, C. and Siegwart, R., Learning human habits and reactions to external events with a dynamic Bayesian network, 2009.
 
Picart, B., Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, number Idiap-RR-18-2009, 2009.
 
Popescu-Belis, A., Comparing meeting browsers using a task-based evaluation method, number Idiap-RR-11-2009, 2009.
 
Roy, A. and Marcel, S., Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, number Idiap-RR-28-2009, 2009.
 
Thomas, S., Ganapathy, S. and Hermansky, H., Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, number Idiap-RR-04-2009, 2009.
 
Yao, J. and Odobez, J. -M., Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, number Idiap-RR-19-2009, 2009.
 

2008

Aradilla, G., Bourlard, H. and Magimai-Doss, M., Posterior features applied to speech recognition tasks with limited training data, number Idiap-RR-15-2008, 2008.
 
Aradilla, G., Bourlard, H. and Magimai-Doss, M., Using kl-based acoustic models in a large vocabulary recognition task, number Idiap-RR-14-2008, 2008.
 
Ba, S. and Odobez, J. -M., Multi-person visual focus of attention from head pose and meeting contextual cues, number Idiap-RR-47-2008, 2008.
 
Ba, S. and Odobez, J. -M., Multi-person visual focus of attention from head pose and meeting contextual cues, number 47, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, number Idiap-RR-75-2008, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, number Idiap-RR-74-2008, 2008.
 
Garner, P. N., A weighted finite state transducer tutorial, number Idiap-Com-03-2008, 2008.
 
Ketabdar, H. and Bourlard, H., Enhanced phone posteriors for improving speech recognition systems, number Idiap-RR-39-2008, 2008.
 
Kumatani, K., McDonough, J., Schacht, S., Klakow, D., Garner, P. N. and Li, W., Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, number Idiap-RR-02-2008, 2008.
 
Kumatani, K., McDonough, J., Klakow, D., Garner, P. N. and Li, W., Maximum negentropy beamforming, number Idiap-RR-07-2008, 2008.
 
Li, W., Kumatani, K., Dines, J., Magimai-Doss, M. and Bourlard, H., A neural network based regression approach for recognizing simultaneous speech, number Idiap-RR-10-2008, 2008.
 
Mariéthoz, J., Bengio, S. and Grandvalet, Y., Kernel Based Text-Independnent Speaker Verification, number Idiap-RR-68-2008, 2008.
 
Motlicek, P., Ganapathy, S. and Hermansky, H., Entropy coding of Quantized Spectral Components in FDLP audio codec, number Idiap-RR-71-2008, 2008.
 
Paiement, J. -F., Grandvalet, Y. and Bengio, S., Predictive Models for Music, number Idiap-RR-51-2008, 2008.
 
Paiement, J. -F., Bengio, S. and Eck, D., Probabilistic Models for Melodic Prediction, number Idiap-RR-50-2008, 2008.
 
Parthasarathi, S. H. K. and Hermansky, H., A data-driven approach to speech/non-speech detection, number Idiap-RR-23-2008, 2008.
 
Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting temporal context for speech/non-speech detection, number Idiap-RR-21-2008, 2008.
 
Perruchoud, L., The Anterior Cingulate Cortex, number Idiap-Com-02-2008, 2008.
 
Pronobis, M. and Magimai-Doss, M., Integrating audio and vision for robust automatic gender recognition, number Idiap-RR-73-2008, 2008.
 
Tommasi, T., Orabona, F. and Caputo, B., CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, number Idiap-RR-77-2008, 2008.
 

2007

Chen, L., Barber, D. and Odobez, J. -M., Dynamical dirichlet mixture model, number 02, 2007.
 
Dines, J. and Magimai-Doss, M., A study of phoneme and grapheme based context-dependent asr systems, number 12, 2007.
 
Dines, J. and Vepa, J., Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, number 13, 2007.
 
Galán, F., Ferrez, P. W., Oliva, F., Guàrdia, J. and del R. Millán, J., Feature extraction for multi-class bci using canonical variates analysis, number 23, 2007.
 
Gaudard, C., Aradilla, G. and Bourlard, H., Speech recognition based on template matching and phone posterior probabilities, number 02, 2007.
 
Heusch, G. and Marcel, S., A novel statistical generative model dedicated to face recognition, number Idiap-RR-39-2007, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Database and evaluation protocols for user authentication using combined handwriting and speech modalities, 2007.
 
Keshet, J., Theoretical foundations for large-margin kernel-based continuous speech recognition, number Idiap-RR-44-2007, 2007.
 
Li, W., Dines, J. and Magimai-Doss, M., Robust overlapping speech recognition based on neural networks, number Idiap-RR-55-2007, 2007.
 
Lovitt, A., Correcting confusion matrices for phone recognizers, number 03, 2007.
 
Marcel, S., Abbet, P. and Guillemot, M., Google portrait, number Idiap-Com-07-2007, 2007.
 
Marcel, S., Joint bi-modal face and speaker authentication using explicit polynomial expansion, number 14, 2007.
 
Mesot, B. and Barber, D., A bayesian switching linear dynamical system for scale-invariant robust speech extraction, 2007.
 
Motlicek, P., Ganapathy, S., Hermansky, H. and Garudadri, H., Scalable wide-band audio codec based on frequency domain linear prediction, number 16, 2007.
 
Orabona, F., Castellini, C., Caputo, B., Luo, J. and Sandini, G., On-line independent support vector machines for cognitive systems, number Idiap-RR-63-2007, 2007.
 
Pinto, J. P., Bourlard, H., Graves, A. and Hermansky, H., Comparing different word lattice rescoring approaches towards keyword spotting, number 32, 2007.
 
Prasanna, S. R. Mahadeva, Yegnanarayana, B., Pinto, J. P. and Hermansky, H., Analysis of confusion matrix to combine evidence for phoneme recognition, number 27, 2007.
 
Pronobis, A. and Caputo, B., Confidence-based cue integration for visual place recognition, number 17, 2007.
 
Uldry, L., Ferrez, P. W. and del R. Millán, J., Feature selection methods on distributed linear inverse solutions for a non-invasive brain-machine interface, number 04, 2007.
 
Valente, F., Bourlard, H. and Deepu, V., Agglomerative information bottleneck for speaker diarization of meetings data, number 31, 2007.
 
Vinciarelli, A., Mapping nonverbal communication into social status: automatic recognition of journalists and non-journalists in radio news, number 33, 2007.
 
Vinciarelli, A. and Favre, S., Role recognition in radio programs using social affiliation networks and mixtures of discrete distributions: an approach inspired by social cognition, number Idiap-RR-40-2007, 2007.
 
Zacharie, D. G. and Pinto, J. P., Keyword spotting on word lattices, number 22, 2007.
 

2006

Ba, S. and Odobez, J. -M., Recognizing people's focus of attention from head poses: a study, number 42, 2006.
 
Buttfield, A. and del R. Millán, J., Online classifier adaptation in brain-computer interfaces, number 16, 2006.
 
Cheng, O., Dines, J. and Magimai-Doss, M., A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition, number 62, 2006.
 
Cuendet, S., Model adaptation for sentence unit segmentation from speech, number 64, 2006.
 
Hemptinne, C., Master thesis: integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts), number 69, 2006.
 
Keller, M. and Bengio, S., A multitask learning approach to document representation using unlabeled data, number 44, 2006.
 
Ketabdar, H. and Hermansky, H., Identifying unexpected words using in-context and out-of-context phoneme posteriors, number 68, 2006.
 
Lathoud, G., Observations on multi-band asynchrony in distant speech recordings, number 74, 2006.
 
Lathoud, G., Magimai-Doss, M. and Bourlard, H., Unsupervised spectral subtraction for noise-robust asr on unknown transmission channels, number 09, 2006.
 
Luo, J., Pronobis, A., Caputo, B. and Jensfelt, P., Incremental learning for place recognition in dynamic environments, number 52, 2006.
 
Luo, J., Pronobis, A. and Caputo, B., Svm-based transfer of visual knowledge across robotic platforms, number 65, 2006.
 
Maganti, H. K., Motlicek, P. and Gatica-Perez, D., Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms, number 57, 2006.
 
Marcel, S., Rodriguez, Y., Guillemot, M. and Popescu-Belis, A., Annotation of face detection: description of xml format and files, number 06, 2006.
 
Marcel, S., Keomany, J. and Rodriguez, Y., Robust-to-illumination face localisation using active shape models and local binary patterns, number 47, 2006.
 
Mariéthoz, J., Discrmininant models for text-independent speaker verification, number 70, 2006.
 
Mesot, B. and Barber, D., A bayesian alternative to gain adaptation in autoregressive hidden markov models, number 55, 2006.
 
Mesot, B. and Barber, D., Switching linear dynamical systems for noise robust speech recognition, number 08, 2006.
 
Moore, D., The juicer lvcsr decoder - user manual for juicer version 0.5.0, number 03, 2006.
 
Motlicek, P., Hermansky, H., Garudadri, H. and Srinivasamurthy, N., Audio coding based on long temporal contexts, number 30, 2006.
 
Motlicek, P., Ullal, V. and Hermansky, H., Wide-band perceptual audio coding based on frequency-domain linear prediction, number 58, 2006.
 
Poh, N. and Bengio, S., Estimating the confidence interval of expected performance curve in biometric authentication using joint bootstrap, number 25, 2006.
 
Richiardi, J. and Drygajlo, A., Applying biometrics to identity documents: estimating and coping with errors, 2006.
 
Richiardi, J. and Drygajlo, A., Applying biometrics to identity documents: implementation issues, 2006.
 
Smith, K., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Tracking attention for multiple people: wandering visual focus of attention estimation, number 40, 2006.
 
Torre, E. L., Caputo, B. and Tommasi, T., Melanoma recognition using kernel classifiers, number 53, 2006.
 
Ullal, V. and Motlicek, P., Audio coding based on long temporal segments: experiments with quantization of excitation signal, number 46, 2006.
 
A. Peregoudov, , Vinciarelli, A. and Bourlard, H., Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, number 56, 2006.
 

2007

Mesot, B. and Barber, D., A gaussian sum smoother for inference in switching linear dynamical systems, 2007.
 
Powered by Agaion