Roy, A. and Marcel, S. , Haar Local Binary Pattern Feature for Fast Illumination Invariant Face Detection , number Idiap-RR-28-2009, 2009.
Heusch, G. and Marcel, S. , Bayesian Networks to Combine Intensity and Color Information in Face Recognition , number Idiap-RR-27-2009, 2009.
Magimai-Doss, M. , Aradilla, G. and Bourlard, H. , On Joint Modelling of Grapheme and Phoneme Information using KL-HMM for ASR , number Idiap-RR-24-2009, 2009.
Garg, N. , Co-occurrence Models for Image Annotation and Retrieval , number Idiap-RR-22-2009, 2009.
Garg, N. and Gatica-Perez, D. , Tagging and Retrieving Images with Co-Occurrence Models: from Corel to Flickr , number Idiap-RR-21-2009, 2009.
Hung, H. and Ba, S. , Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features , number Idiap-RR-20-2009, 2009.
Yao, J. and Odobez, J. -M. , Fast Human Detection in Videos using Joint Appearance and Foreground Learning from Covariances of Image Feature Subsets , number Idiap-RR-19-2009, 2009.
Picart, B. , Improved Phone Posterior Estimation Through k-NN and MLP-Based Similarity , number Idiap-RR-18-2009, 2009.
Popescu-Belis, A. , Comparing meeting browsers using a task-based evaluation method , number Idiap-RR-11-2009, 2009.
Garner, P. N. , A MAP Approach to Noise Compensation of Speech , number Idiap-RR-08-2009, 2009.
Imseng, D. , Novel initialization methods for Speaker Diarization , number Idiap-RR-07-2009, 2009.
Thomas, S. , Ganapathy, S. and Hermansky, H. , Phoneme Recognition Using Spectral Envelope and Modulation Frequency Features , number Idiap-RR-04-2009, 2009.
Negoescu, R. -A. , Gatica-Perez, D. , Adams, B. , Phung, D. and Venkatesh, S. , Flickr Hypergroups , number Idiap-Internal-RR-73-2009, 2009.
Berclaz, J. , Fleuret, F. and Fua, P. , Multiple object tracking using flow linear programming , number 10-2009, 2009.
Perrin, X. , Chavarriaga, R. , Pradalier, C. , Millán, J. del R. and Siegwart, R. , Dialog Management Technique for Brain-Computer Interfaces , 2009.
Perrin, X. , Colas, F. , Pradalier, C. and Siegwart, R. , Learning human habits and reactions to external events with a dynamic Bayesian network , 2009.
Tommasi, T. , Orabona, F. and Caputo, B. , CLEF2008 Image Annotation Task: an SVM Confidence-Based Approach , number Idiap-RR-77-2008, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Low-Delay Error Resilient Speech Coding Using Sub-band Hilbert Envelopes , number Idiap-RR-75-2008, 2008.
Ganapathy, S. , Motlicek, P. and Hermansky, H. , Modified Discrete Cosine Transform for Encoding Residual Signals in Frequency Domain Linear Prediction , number Idiap-RR-74-2008, 2008.
Pronobis, M. and Magimai-Doss, M. , Integrating audio and vision for robust automatic gender recognition , number Idiap-RR-73-2008, 2008.
Motlicek, P. , Ganapathy, S. and Hermansky, H. , Entropy coding of Quantized Spectral Components in FDLP audio codec , number Idiap-RR-71-2008, 2008.
Mariéthoz, J. , Bengio, S. and Grandvalet, Y. , Kernel Based Text-Independent Speaker Verification , number Idiap-RR-68-2008, 2008.
Paiement, J. -F. , Grandvalet, Y. and Bengio, S. , Predictive Models for Music , number Idiap-RR-51-2008, 2008.
Paiement, J. -F. , Bengio, S. and Eck, D. , Probabilistic Models for Melodic Prediction , number Idiap-RR-50-2008, 2008.
Ba, S. and Odobez, J. -M. , Multi-person visual focus of attention from head pose and meeting contextual cues , number Idiap-RR-47-2008, 2008.
Ketabdar, H. and Bourlard, H. , Enhanced phone posteriors for improving speech recognition systems , number Idiap-RR-39-2008, 2008.
Parthasarathi, S. H. K. and Hermansky, H. , A data-driven approach to speech/non-speech detection , number Idiap-RR-23-2008, 2008.
Parthasarathi, S. H. K. , Motlicek, P. and Hermansky, H. , Exploiting temporal context for speech/non-speech detection , number Idiap-RR-21-2008, 2008.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Posterior features applied to speech recognition tasks with limited training data , number Idiap-RR-15-2008, 2008.
Aradilla, G. , Bourlard, H. and Magimai-Doss, M. , Using KL-based acoustic models in a large vocabulary recognition task , number Idiap-RR-14-2008, 2008.
Li, W. , Kumatani, K. , Dines, J. , Magimai-Doss, M. and Bourlard, H. , A neural network based regression approach for recognizing simultaneous speech , number Idiap-RR-10-2008, 2008.
Kumatani, K. , McDonough, J. , Klakow, D. , Garner, P. N. and Li, W. , Maximum negentropy beamforming , number Idiap-RR-07-2008, 2008.
Kumatani, K. , McDonough, J. , Schacht, S. , Klakow, D. , Garner, P. N. and Li, W. , Filter Bank Design for Subband Adaptive Beamforming and Application to Speech Recognition , number Idiap-RR-02-2008, 2008.
Garner, P. N. , A weighted finite state transducer tutorial , number Idiap-Com-03-2008, 2008.
Perruchoud, L. , The Anterior Cingulate Cortex , number Idiap-Com-02-2008, 2008.
Ba, S. and Odobez, J. -M. , Multi-person visual focus of attention from head pose and meeting contextual cues , number 47, 2008.
Orabona, F. , Castellini, C. , Caputo, B. , Luo, J. and Sandini, G. , On-line independent support vector machines for cognitive systems , number Idiap-RR-63-2007, 2007.
Li, W. , Dines, J. and Magimai-Doss, M. , Robust overlapping speech recognition based on neural networks , number Idiap-RR-55-2007, 2007.
Keshet, J. , Theoretical foundations for large-margin kernel-based continuous speech recognition , number Idiap-RR-44-2007, 2007.
Vinciarelli, A. and Favre, S. , Role recognition in radio programs using social affiliation networks and mixtures of discrete distributions: an approach inspired by social cognition , number Idiap-RR-40-2007, 2007.
Heusch, G. and Marcel, S. , A novel statistical generative model dedicated to face recognition , number Idiap-RR-39-2007, 2007.
Marcel, S. , Abbet, P. and Guillemot, M. , Google portrait , number Idiap-Com-07-2007, 2007.
Vinciarelli, A. , Mapping nonverbal communication into social status: automatic recognition of journalists and non-journalists in radio news , number 33, 2007.
Pinto, J. P. , Bourlard, H. , Graves, A. and Hermansky, H. , Comparing different word lattice rescoring approaches towards keyword spotting , number 32, 2007.
Valente, F. , Bourlard, H. and Deepu, V. , Agglomerative information bottleneck for speaker diarization of meetings data , number 31, 2007.
Prasanna, S. R. Mahadeva , Yegnanarayana, B. , Pinto, J. P. and Hermansky, H. , Analysis of confusion matrix to combine evidence for phoneme recognition , number 27, 2007.
Galán, F. , Ferrez, P. W. , Oliva, F. , Guàrdia, J. and Millán, J. del R. , Feature extraction for multi-class BCI using canonical variates analysis , number 23, 2007.
Zacharie, D. G. and Pinto, J. P. , Keyword spotting on word lattices , number 22, 2007.
Pronobis, A. and Caputo, B. , Confidence-based cue integration for visual place recognition , number 17, 2007.
Motlicek, P. , Ganapathy, S. , Hermansky, H. and Garudadri, H. , Scalable wide-band audio codec based on frequency domain linear prediction , number 16, 2007.
Marcel, S. , Joint bi-modal face and speaker authentication using explicit polynomial expansion , number 14, 2007.
Dines, J. and Vepa, J. , Direct optimisation of a multilayer perceptron for the estimation of cepstral mean and variance statistics , number 13, 2007.
Dines, J. and Magimai-Doss, M. , A study of phoneme and grapheme based context-dependent ASR systems , number 12, 2007.
Uldry, L. , Ferrez, P. W. and Millán, J. del R. , Feature selection methods on distributed linear inverse solutions for a non-invasive brain-machine interface , number 04, 2007.
Lovitt, A. , Correcting confusion matrices for phone recognizers , number 03, 2007.
Chen, L. , Barber, D. and Odobez, J. -M. , Dynamical Dirichlet mixture model , number 02, 2007.
Gaudard, C. , Aradilla, G. and Bourlard, H. , Speech recognition based on template matching and phone posterior probabilities , number 02, 2007.
Humm, A. , Hennebert, J. and Ingold, R. , Database and evaluation protocols for user authentication using combined handwriting and speech modalities , 2007.
Mesot, B. and Barber, D. , A Bayesian switching linear dynamical system for scale-invariant robust speech extraction , 2007.
Mesot, B. and Barber, D. , A Gaussian sum smoother for inference in switching linear dynamical systems , 2007.
Lathoud, G. , Observations on multi-band asynchrony in distant speech recordings , number 74, 2006.
Mariéthoz, J. , Discriminant models for text-independent speaker verification , number 70, 2006.
Hemptinne, C. , Master thesis: integration of the harmonic plus noise model (HNM) into the hidden Markov model-based speech synthesis system (HTS) , number 69, 2006.
Ketabdar, H. and Hermansky, H. , Identifying unexpected words using in-context and out-of-context phoneme posteriors , number 68, 2006.
Luo, J. , Pronobis, A. and Caputo, B. , SVM-based transfer of visual knowledge across robotic platforms , number 65, 2006.
Cuendet, S. , Model adaptation for sentence unit segmentation from speech , number 64, 2006.
Cheng, O. , Dines, J. and Magimai-Doss, M. , A generalized dynamic composition algorithm of weighted finite state transducers for large vocabulary speech recognition , number 62, 2006.
Motlicek, P. , Ullal, V. and Hermansky, H. , Wide-band perceptual audio coding based on frequency-domain linear prediction , number 58, 2006.
Maganti, H. K. , Motlicek, P. and Gatica-Perez, D. , Unsupervised speech/non-speech detection for automatic speech recognition in meeting rooms , number 57, 2006.
Peregoudov, A. , Vinciarelli, A. and Bourlard, H. , Assessing the effectiveness of slides as a mean to improve the automatic transcription of oral presentations , number 56, 2006.
Mesot, B. and Barber, D. , A Bayesian alternative to gain adaptation in autoregressive hidden Markov models , number 55, 2006.
Torre, E. L. , Caputo, B. and Tommasi, T. , Melanoma recognition using kernel classifiers , number 53, 2006.
Luo, J. , Pronobis, A. , Caputo, B. and Jensfelt, P. , Incremental learning for place recognition in dynamic environments , number 52, 2006.
Marcel, S. , Keomany, J. and Rodriguez, Y. , Robust-to-illumination face localisation using active shape models and local binary patterns , number 47, 2006.
Ullal, V. and Motlicek, P. , Audio coding based on long temporal segments: experiments with quantization of excitation signal , number 46, 2006.
Keller, M. and Bengio, S. , A multitask learning approach to document representation using unlabeled data , number 44, 2006.
Ba, S. and Odobez, J. -M. , Recognizing people's focus of attention from head poses: a study , number 42, 2006.
Smith, K. , Ba, S. , Odobez, J. -M. and Gatica-Perez, D. , Tracking attention for multiple people: wandering visual focus of attention estimation , number 40, 2006.
Motlicek, P. , Hermansky, H. , Garudadri, H. and Srinivasamurthy, N. , Audio coding based on long temporal contexts , number 30, 2006.
Poh, N. and Bengio, S. , Estimating the confidence interval of expected performance curve in biometric authentication using joint bootstrap , number 25, 2006.
Buttfield, A. and Millán, J. del R. , Online classifier adaptation in brain-computer interfaces , number 16, 2006.
Lathoud, G. , Magimai-Doss, M. and Bourlard, H. , Unsupervised spectral subtraction for noise-robust ASR on unknown transmission channels , number 09, 2006.
Mesot, B. and Barber, D. , Switching linear dynamical systems for noise robust speech recognition , number 08, 2006.
Marcel, S. , Rodriguez, Y. , Guillemot, M. and Popescu-Belis, A. , Annotation of face detection: description of XML format and files , number 06, 2006.
Moore, D. , The Juicer LVCSR decoder - user manual for Juicer version 0.5.0 , number 03, 2006.
Richiardi, J. and Drygajlo, A. , Applying biometrics to identity documents: estimating and coping with errors , 2006.
Richiardi, J. and Drygajlo, A. , Applying biometrics to identity documents: implementation issues , 2006.