E-Book, Englisch, Band 3869, 498 Seiten, eBook
Renals / Bengio Machine Learning for Multimodal Interaction
2006
ISBN: 978-3-540-32550-5
Verlag: Springer
Format: PDF
Kopierschutz: 1 - PDF Watermark
Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers
E-Book, Englisch, Band 3869, 498 Seiten, eBook
Reihe: Lecture Notes in Computer Science
ISBN: 978-3-540-32550-5
Verlag: Springer
Format: PDF
Kopierschutz: 1 - PDF Watermark
Zielgruppe
Research
Autoren/Hrsg.
Weitere Infos & Material
InvitedPapers.- Gesture, Gaze, and Ground.- Toward Adaptive Information Fusion in Multimodal Systems.- Multimodal Processing.- The AMI Meeting Corpus: A Pre-announcement.- VACE Multimodal Meeting Corpus.- Multimodal Integration for Meeting Group Action Segmentation and Recognition.- Detection and Resolution of References to Meeting Documents.- Dominance Detection in Meetings Using Easily Obtainable Features.- Can Chimeric Persons Be Used in Multimodal Biometric Authentication Experiments?.- HCI and Applications.- Analysing Meeting Records: An Ethnographic Study and Technological Implications.- Browsing Multimedia Archives Through Intra- and Multimodal Cross-Documents Links.- The “FAME” Interactive Space.- Development of Peripheral Feedback to Support Lectures.- Real-Time Feedback on Nonverbal Behaviour to Enhance Social Dynamics in Small Group Meetings.- Discourse and Dialogue.- A Multimodal Discourse Ontology for Meeting Understanding.- Generic Dialogue Modeling for Multi-application Dialogue Systems.- Toward Joint Segmentation and Classification of Dialog Acts in Multiparty Meetings.- Emotion.- Developing a Consistent View on Emotion-Oriented Computing.- Multimodal Authoring Tool for Populating a Database of Emotional Reactive Animations.- Visual Processing.- A Testing Methodology for Face Recognition Algorithms.- Estimating the Lecturer’s Head Pose in Seminar Scenarios – A Multi-view Approach.- Foreground Regions Extraction and Characterization Towards Real-Time Object Tracking.- Projective Kalman Filter: Multiocular Tracking of 3D Locations Towards Scene Understanding.- Speech and Audio Processing.- Least Squares Filtering of Speech Signals for Robust ASR.- A Variable-Scale Piecewise Stationary Spectral Analysis Technique Applied to ASR.- AccentClassification for Speech Recognition.- Hierarchical Multi-stream Posterior Based Speech Recognition System.- Variational Bayesian Methods for Audio Indexing.- Microphone Array Driven Speech Recognition: Influence of Localization on the Word Error Rate.- Automatic Speech Recognition and Speech Activity Detection in the CHIL Smart Room.- The Development of the AMI System for the Transcription of Speech in Meetings.- Improving the Performance of Acoustic Event Classification by Selecting and Combining Information Sources Using the Fuzzy Integral.- NIST Meeting Recognition Evaluation.- The Rich Transcription 2005 Spring Meeting Recognition Evaluation.- Linguistic Resources for Meeting Speech Recognition.- Robust Speaker Segmentation for Meetings: The ICSI-SRI Spring 2005 Diarization System.- Speech Activity Detection on Multichannels of Meeting Recordings.- NIST RT’05S Evaluation: Pre-processing Techniques and Speaker Diarization on Multiple Microphone Meetings.- The TNO Speaker Diarization System for NIST RT05s Meeting Data.- The 2005 AMI System for the Transcription of Speech in Meetings.- Further Progress in Meeting Recognition: The ICSI-SRI Spring 2005 Speech-to-Text Evaluation System.- Speaker Localization in CHIL Lectures: Evaluation Criteria and Results.