Guide:
  • If you want to have the list of publications issued from a specific Individual Project (IP), write in the search field (IM2.IP). IP can have the following value: DMA, AP, VP, MPR, MCA, HMI, ISD, BMI

  • If you want to find joint publications between IPs, write in the search field (joint), click on search and then click on Keywords

  • If you want to display all the publications for a specific author, use the shortcut called -Authors- located in the main menu
 

IDIAP Research Report, sorted on year



2009

Roy, A. and Marcel, S., Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection, number Idiap-RR-28-2009, 2009.
 
Heusch, G. and Marcel, S., Bayesian Networks to Combine Intensity and Color Information in Face Recognition, number Idiap-RR-27-2009, 2009.
 
Magimai-Doss, M., Aradilla, G. and Bourlard, H., On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR, number Idiap-RR-24-2009, 2009.
 
Garg, N., Co-occurrence Models for Image Annotation and Retrieval, number Idiap-RR-22-2009, 2009.
 
Garg, N. and Gatica-Perez, D., Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr, number Idiap-RR-21-2009, 2009.
 
Hung, H. and Ba, S., Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features, number Idiap-RR-20-2009, 2009.
 
Yao, J. and Odobez, J. -M., Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets, number Idiap-RR-19-2009, 2009.
 
Picart, B., Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity, number Idiap-RR-18-2009, 2009.
 
Popescu-Belis, A., Comparing meeting browsers using a task-based evaluation method, number Idiap-RR-11-2009, 2009.
 
Garner, P. N., A MAP Approach to Noise Compensation of Speech, number Idiap-RR-08-2009, 2009.
 
Imseng, D., Novel initialization methods for Speaker Diarization, number Idiap-RR-07-2009, 2009.
 
Thomas, S., Ganapathy, S. and Hermansky, H., Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features, number Idiap-RR-04-2009, 2009.
 
Negoescu, R. -A., Gatica-Perez, D., Adams, B., Phung, D. and Venkatesh, S., Flickr Hypergroups, number Idiap-Internal-RR-73-2009, 2009.
 
Berclaz, J., Fleuret, F. and Fua, P., Multiple object tracking using flow linear programming, number 10-2009, 2009.
 
Perrin, X., Chavarriaga, R., Pradalier, C., Millán, J. del R. and Siegwart, R., Dialog Management Technique for Brain-Computer Interfaces, 2009.
 
Perrin, X., Colas, F., Pradalier, C. and Siegwart, R., Learning human habits and reactions to external events with a dynamic Bayesian network, 2009.
 

2008

Tommasi, T., Orabona, F. and Caputo, B., CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach, number Idiap-RR-77-2008, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes, number Idiap-RR-75-2008, 2008.
 
Ganapathy, S., Motlicek, P. and Hermansky, H., MODIFIED DISCRETE COSINE TRANSFORM FOR ENCODING RESIDUAL SIGNALS IN FREQUENCY DOMAIN LINEAR PREDICTION, number Idiap-RR-74-2008, 2008.
 
Pronobis, M. and Magimai-Doss, M., Integrating audio and vision for robust automatic gender recognition, number Idiap-RR-73-2008, 2008.
 
Motlicek, P., Ganapathy, S. and Hermansky, H., Entropy coding of Quantized Spectral Components in FDLP audio codec, number Idiap-RR-71-2008, 2008.
 
Mariéthoz, J., Bengio, S. and Grandvalet, Y., Kernel Based Text-Independnent Speaker Verification, number Idiap-RR-68-2008, 2008.
 
Paiement, J. -F., Grandvalet, Y. and Bengio, S., Predictive Models for Music, number Idiap-RR-51-2008, 2008.
 
Paiement, J. -F., Bengio, S. and Eck, D., Probabilistic Models for Melodic Prediction, number Idiap-RR-50-2008, 2008.
 
Ba, S. and Odobez, J. -M., Multi-person visual focus of attention from head pose and meeting contextual cues, number Idiap-RR-47-2008, 2008.
 
Ketabdar, H. and Bourlard, H., Enhanced phone posteriors for improving speech recognition systems, number Idiap-RR-39-2008, 2008.
 
Parthasarathi, S. H. K. and Hermansky, H., A data-driven approach to speech/non-speech detection, number Idiap-RR-23-2008, 2008.
 
Parthasarathi, S. H. K., Motlicek, P. and Hermansky, H., Exploiting temporal context for speech/non-speech detection, number Idiap-RR-21-2008, 2008.
 
Aradilla, G., Bourlard, H. and Magimai-Doss, M., Posterior features applied to speech recognition tasks with limited training data, number Idiap-RR-15-2008, 2008.
 
Aradilla, G., Bourlard, H. and Magimai-Doss, M., Using kl-based acoustic models in a large vocabulary recognition task, number Idiap-RR-14-2008, 2008.
 
Li, W., Kumatani, K., Dines, J., Magimai-Doss, M. and Bourlard, H., A neural network based regression approach for recognizing simultaneous speech, number Idiap-RR-10-2008, 2008.
 
Kumatani, K., McDonough, J., Klakow, D., Garner, P. N. and Li, W., Maximum negentropy beamforming, number Idiap-RR-07-2008, 2008.
 
Kumatani, K., McDonough, J., Schacht, S., Klakow, D., Garner, P. N. and Li, W., Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition, number Idiap-RR-02-2008, 2008.
 
Garner, P. N., A weighted finite state transducer tutorial, number Idiap-Com-03-2008, 2008.
 
Perruchoud, L., The Anterior Cingulate Cortex, number Idiap-Com-02-2008, 2008.
 
Ba, S. and Odobez, J. -M., Multi-person visual focus of attention from head pose and meeting contextual cues, number 47, 2008.
 

2007

Orabona, F., Castellini, C., Caputo, B., Luo, J. and Sandini, G., On-line independent support vector machines for cognitive systems, number Idiap-RR-63-2007, 2007.
 
Li, W., Dines, J. and Magimai-Doss, M., Robust overlapping speech recognition based on neural networks, number Idiap-RR-55-2007, 2007.
 
Keshet, J., Theoretical foundations for large-margin kernel-based continuous speech recognition, number Idiap-RR-44-2007, 2007.
 
Vinciarelli, A. and Favre, S., Role recognition in radio programs using social affiliation networks and mixtures of discrete distributions: an approach inspired by social cognition, number Idiap-RR-40-2007, 2007.
 
Heusch, G. and Marcel, S., A novel statistical generative model dedicated to face recognition, number Idiap-RR-39-2007, 2007.
 
Marcel, S., Abbet, P. and Guillemot, M., Google portrait, number Idiap-Com-07-2007, 2007.
 
Vinciarelli, A., Mapping nonverbal communication into social status: automatic recognition of journalists and non-journalists in radio news, number 33, 2007.
 
Pinto, J. P., Bourlard, H., Graves, A. and Hermansky, H., Comparing different word lattice rescoring approaches towards keyword spotting, number 32, 2007.
 
Valente, F., Bourlard, H. and Deepu, V., Agglomerative information bottleneck for speaker diarization of meetings data, number 31, 2007.
 
Prasanna, S. R. Mahadeva, Yegnanarayana, B., Pinto, J. P. and Hermansky, H., Analysis of confusion matrix to combine evidence for phoneme recognition, number 27, 2007.
 
Galán, F., Ferrez, P. W., Oliva, F., Guàrdia, J. and del R. Millán, J., Feature extraction for multi-class bci using canonical variates analysis, number 23, 2007.
 
Zacharie, D. G. and Pinto, J. P., Keyword spotting on word lattices, number 22, 2007.
 
Pronobis, A. and Caputo, B., Confidence-based cue integration for visual place recognition, number 17, 2007.
 
Motlicek, P., Ganapathy, S., Hermansky, H. and Garudadri, H., Scalable wide-band audio codec based on frequency domain linear prediction, number 16, 2007.
 
Marcel, S., Joint bi-modal face and speaker authentication using explicit polynomial expansion, number 14, 2007.
 
Dines, J. and Vepa, J., Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics, number 13, 2007.
 
Dines, J. and Magimai-Doss, M., A study of phoneme and grapheme based context-dependent asr systems, number 12, 2007.
 
Uldry, L., Ferrez, P. W. and del R. Millán, J., Feature selection methods on distributed linear inverse solutions for a non-invasive brain-machine interface, number 04, 2007.
 
Lovitt, A., Correcting confusion matrices for phone recognizers, number 03, 2007.
 
Chen, L., Barber, D. and Odobez, J. -M., Dynamical dirichlet mixture model, number 02, 2007.
 
Gaudard, C., Aradilla, G. and Bourlard, H., Speech recognition based on template matching and phone posterior probabilities, number 02, 2007.
 
Humm, A., Hennebert, J. and Ingold, R., Database and evaluation protocols for user authentication using combined handwriting and speech modalities, 2007.
 
Mesot, B. and Barber, D., A bayesian switching linear dynamical system for scale-invariant robust speech extraction, 2007.
 
Mesot, B. and Barber, D., A gaussian sum smoother for inference in switching linear dynamical systems, 2007.
 

2006

Lathoud, G., Observations on multi-band asynchrony in distant speech recordings, number 74, 2006.
 
Mariéthoz, J., Discrmininant models for text-independent speaker verification, number 70, 2006.
 
Hemptinne, C., Master thesis: integration of the harmonic plus noise model (hnm) into the hidden markov model-based speech synthesis system (hts), number 69, 2006.
 
Ketabdar, H. and Hermansky, H., Identifying unexpected words using in-context and out-of-context phoneme posteriors, number 68, 2006.
 
Luo, J., Pronobis, A. and Caputo, B., Svm-based transfer of visual knowledge across robotic platforms, number 65, 2006.
 
Cuendet, S., Model adaptation for sentence unit segmentation from speech, number 64, 2006.
 
Cheng, O., Dines, J. and Magimai-Doss, M., A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition, number 62, 2006.
 
Motlicek, P., Ullal, V. and Hermansky, H., Wide-band perceptual audio coding based on frequency-domain linear prediction, number 58, 2006.
 
Maganti, H. K., Motlicek, P. and Gatica-Perez, D., Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms, number 57, 2006.
 
A. Peregoudov, , Vinciarelli, A. and Bourlard, H., Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations, number 56, 2006.
 
Mesot, B. and Barber, D., A bayesian alternative to gain adaptation in autoregressive hidden markov models, number 55, 2006.
 
Torre, E. L., Caputo, B. and Tommasi, T., Melanoma recognition using kernel classifiers, number 53, 2006.
 
Luo, J., Pronobis, A., Caputo, B. and Jensfelt, P., Incremental learning for place recognition in dynamic environments, number 52, 2006.
 
Marcel, S., Keomany, J. and Rodriguez, Y., Robust-to-illumination face localisation using active shape models and local binary patterns, number 47, 2006.
 
Ullal, V. and Motlicek, P., Audio coding based on long temporal segments: experiments with quantization of excitation signal, number 46, 2006.
 
Keller, M. and Bengio, S., A multitask learning approach to document representation using unlabeled data, number 44, 2006.
 
Ba, S. and Odobez, J. -M., Recognizing people's focus of attention from head poses: a study, number 42, 2006.
 
Smith, K., Ba, S., Odobez, J. -M. and Gatica-Perez, D., Tracking attention for multiple people: wandering visual focus of attention estimation, number 40, 2006.
 
Motlicek, P., Hermansky, H., Garudadri, H. and Srinivasamurthy, N., Audio coding based on long temporal contexts, number 30, 2006.
 
Poh, N. and Bengio, S., Estimating the confidence interval of expected performance curve in biometric authentication using joint bootstrap, number 25, 2006.
 
Buttfield, A. and del R. Millán, J., Online classifier adaptation in brain-computer interfaces, number 16, 2006.
 
Lathoud, G., Magimai-Doss, M. and Bourlard, H., Unsupervised spectral subtraction for noise-robust asr on unknown transmission channels, number 09, 2006.
 
Mesot, B. and Barber, D., Switching linear dynamical systems for noise robust speech recognition, number 08, 2006.
 
Marcel, S., Rodriguez, Y., Guillemot, M. and Popescu-Belis, A., Annotation of face detection: description of xml format and files, number 06, 2006.
 
Moore, D., The juicer lvcsr decoder - user manual for juicer version 0.5.0, number 03, 2006.
 
Richiardi, J. and Drygajlo, A., Applying biometrics to identity documents: estimating and coping with errors, 2006.
 
Richiardi, J. and Drygajlo, A., Applying biometrics to identity documents: implementation issues, 2006.
 
Powered by Agaion